Open Source


Trending · Agent

GitHub

46.1K · 4.8K

Goose

aaif-goose

Apache-2.0 Rust-based open AI agent from Block (now Linux Foundation AAIF) with native desktop, CLI, and API surfaces, 70+ MCP extensions, and 15+ LLM providers including local Ollama.

Open-dLLM

pengzhangzhi

Apache-2.0 fully open diffusion language model for code, shipping data pipeline, pretraining code, evaluation suite, and 0.5B Open-dCoder checkpoints with representation-alignment-based 4x training speedup.

TokenSpeed

lightseekorg

LightSeek Foundation's MIT-licensed inference engine targeting TensorRT-LLM-level performance with vLLM-level usability, purpose-built for agentic workloads on Blackwell and Hopper GPUs.

ViMax

HKUDS

HKUDS's MIT-licensed multi-agent video generation framework with director, screenwriter, producer, and generator roles that assembles minute-scale narrative video from ideas, novels, or scripts.

Heretic

p-e-w

Automated directional ablation tool that removes refusal behavior from open LLMs while preserving capability through Optuna-tuned per-layer ablation kernels.

VoxCPM

OpenBMB

OpenBMB's tokenizer-free 2B-parameter TTS model emitting native 48kHz audio across 30 languages with voice design, controllable cloning, and an OpenAI-compatible endpoint.

Stable WorldModel

galilai-group

Open-source platform from galilai-group, with authors including Yann LeCun, that unifies data collection, training, and model-predictive control evaluation for world model research across 25+ environments.

LiteParse

LlamaIndex

Fast, local-first open-source document parser from run-llama with a Rust core, spatial text with bounding boxes, bundled OCR, and bindings for Python, Node.js, Rust, and WASM.

NVIDIA Eagle

NVlabs

NVIDIA's open vision-language model family using data-centric strategies, spanning Eagle, Eagle 2, Eagle 2.5 with 128K context, and the new LocateAnything generalist grounding model.

MOSS-TTS

OpenMOSS

Open-source speech and sound generation model family from OpenMOSS with multilingual TTS, dialogue, real-time voice agents, voice design, and sound effects under a unified audio tokenizer.

GPUStack

Open-source GPU cluster manager that turns heterogeneous accelerators into a self-hosted, OpenAI-compatible model-as-a-service platform powered by vLLM, SGLang, and llama.cpp.

Apache-2.0 · 119

625 projects

Trending · LLM

GitHub

3.8K · 521

Train LLM From Scratch

FareedKhan-dev

MIT-licensed PyTorch Jupyter notebook that builds a GPT-style transformer end to end, from The Pile data through tiktoken tokenization to text generation, scaling from 13M to 2B+ parameters.
