Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
Block Diffusion for Flash Speculative Decoding — a lightweight architecture enabling high-quality parallel drafting for LLM inference acceleration.
ollama
The simplest way to run LLMs locally with 165K+ GitHub stars. One-command deployment, 100+ models, REST API, and multi-platform support.
ggml-org
Pure C/C++ LLM inference engine supporting CPUs, Apple Silicon, CUDA, and Vulkan
unslothai
2x faster LLM fine-tuning with 70% less VRAM via custom Triton kernels. Supports Llama, Qwen, DeepSeek, Gemma, and 500+ models.
BerriAI
MIT-licensed Python SDK and self-hosted AI Gateway that exposes 100+ LLM providers — OpenAI, Anthropic, Gemini, Bedrock, Azure, VertexAI, vLLM, NVIDIA NIM, and more — through an OpenAI-compatible interface, with virtual keys, spend tracking, load balancing, fallbacks, guardrails, and observability callbacks for Lunary, MLflow, and Langfuse.