Gemma is Google’s contribution to the open-weights community. It is built from the same technology as Gemini.
High-speed inference on MacBooks and standard PCs.
This compact model by Stability AI is focused on being a "helpful assistant." It is suited to local chatbots that don't require a GPU.
8. Qwen-1.8B (Alibaba)
While not a model itself, this is the essential framework for the Tiny 10 movement. It allows users to run LLMs on consumer hardware using 4-bit quantization.
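As a rough illustration of what 4-bit quantization does to a weight tensor, the sketch below uses symmetric block-wise "absmax" scaling: each block of weights shares one floating-point scale, and every weight is stored as a small integer in the range -7 to 7. This is an assumption for illustration only; the function names `quantize_4bit` and `dequantize_4bit` are hypothetical, and the framework's actual storage format is not described here and may differ.

```python
def quantize_4bit(weights, block_size=32):
    """Quantize a flat list of floats block by block (illustrative scheme only).

    Each block is stored as (scale, codes), where codes are integers in [-7, 7],
    which fit in 4 bits.
    """
    blocks = []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        max_abs = max(abs(w) for w in block)
        # Map the largest magnitude in the block to 7; avoid dividing by zero.
        scale = max_abs / 7 if max_abs > 0 else 1.0
        codes = [max(-7, min(7, round(w / scale))) for w in block]
        blocks.append((scale, codes))
    return blocks


def dequantize_4bit(blocks):
    """Rebuild approximate float weights from (scale, codes) blocks."""
    return [scale * q for scale, codes in blocks for q in codes]


# Hypothetical example values, not taken from any real model.
example_weights = [0.7, -0.3, 0.1, 0.0]
example_blocks = quantize_4bit(example_weights, block_size=4)
example_restored = dequantize_4bit(example_blocks)
```

Storing 4 bits per weight instead of 16 cuts weight memory by roughly a factor of four, minus a small overhead for the per-block scales, which is what makes larger models fit in consumer RAM.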
The TinyLlama project aims to pretrain a 1.1B Llama model on 3 trillion tokens. It is fully open-source and highly compact.
The project on GitHub has become a cornerstone for developers, researchers, and hobbyists looking to push the boundaries of Minimalist AI. As Large Language Models (LLMs) grow in size, the "Tiny 10" represents a counter-movement focused on efficiency, portability, and "Edge AI" capabilities.
This series of ultra-small models (1.8B) is designed by H2O.ai. It is fine-tuned for chat and instruction following.
This GitHub project explores models where weights are just -1, 0, or 1.
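To make the idea of ternary weights concrete, here is a minimal sketch of one common recipe, "absmean" scaling: divide each weight by the mean absolute value of the tensor, round, and clip to -1, 0, or 1. Whether this project uses exactly this rule is an assumption, and the names `ternarize` and `ternary_dot` are hypothetical. The second function shows why such weights are attractive: a dot product needs only additions and subtractions, plus one multiplication by the shared scale.

```python
def ternarize(weights, eps=1e-8):
    """Map a non-empty list of floats to codes in {-1, 0, 1} plus one scale.

    Illustrative absmean rule: scale is the mean absolute weight.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    codes = [max(-1, min(1, round(w / (scale + eps)))) for w in weights]
    return scale, codes


def ternary_dot(codes, scale, inputs):
    """Dot product with ternary weights: add, subtract, or skip each input."""
    total = 0.0
    for q, x in zip(codes, inputs):
        if q == 1:
            total += x
        elif q == -1:
            total -= x
        # q == 0 contributes nothing, so the input is skipped.
    return scale * total


# Hypothetical example values, not taken from any real model.
example_scale, example_codes = ternarize([0.9, -0.8, 0.05, 0.4, -0.02])
example_output = ternary_dot(example_codes, example_scale, [1.0, 2.0, 3.0, 4.0, 5.0])
```

Small weights collapse to 0 and large ones saturate at -1 or 1, so the approximation is coarse for a single tensor; models of this kind are typically trained with the constraint in place rather than converted afterwards.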