Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
llama.cpp is a pure C/C++ LLM inference engine with no external dependencies, enabling high-performance local inference across a wide range of hardware. It supports Apple Silicon via Metal, NVIDIA CUDA, AMD ROCm, Intel SYCL, Vulkan, and ARM NEON — from Raspberry Pi 5 boards to multi-GPU servers. In March 2026, it crossed 100,000 GitHub stars, making it the fastest open-source AI project to reach that milestone, with over 700 contributors and 3,800+ merged pull requests in 2025 alone.
ollama
The simplest way to run LLMs locally with 165K+ GitHub stars. One-command deployment, 100+ models, REST API, and multi-platform support.
unslothai
2x faster LLM fine-tuning with 70% less VRAM via custom Triton kernels. Supports Llama, Qwen, DeepSeek, Gemma, and 500+ models.
SGLang (Structured Generation Language) is a high-throughput, low-latency inference engine for large language models and multimodal models, developed by the LMSYS team. With 26,600 GitHub stars and over 12,000 commits, it has become the de facto o…
sgl-project
SGLang is a high-performance open-source serving framework for LLMs with RadixAttention prefix caching, zero-overhead CPU scheduling, and prefill-decode disaggregation — deployed across 400,000+ GPUs at xAI, LinkedIn, and Cursor.