Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
MLC-LLM is a universal LLM deployment engine powered by ML compilation via Apache TVM. It compiles models once and runs them natively across NVIDIA, AMD, Apple, and Intel GPUs as well as mobile platforms including iOS and Android, with WebGPU support for browser-based inference. The unified MLCEngine provides an OpenAI-compatible REST API, Python, JavaScript, and mobile bindings from the same compiled artifact, enabling developers to deploy quantized LLMs from cloud to edge without platform-specific rewrites.
ollama
The simplest way to run LLMs locally with 165K+ GitHub stars. One-command deployment, 100+ models, REST API, and multi-platform support.
ggml-org
Pure C/C++ LLM inference engine supporting CPUs, Apple Silicon, CUDA, and Vulkan
unslothai
2x faster LLM fine-tuning with 70% less VRAM via custom Triton kernels. Supports Llama, Qwen, DeepSeek, Gemma, and 500+ models.
SGLang (Structured Generation Language) is a high-throughput, low-latency inference engine for large language models and multimodal models, developed by the LMSYS team. With 26,600 GitHub stars and over 12,000 commits, it has become the de facto o…