Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
InternVL-U is a 4B-parameter unified multimodal model (UMM) from OpenGVLab/Shanghai AI Laboratory that integrates multimodal understanding, reasoning, image generation, and image editing into a single framework. Released in March 2026, it democratizes omni-capable multimodal intelligence by combining state-of-the-art vision-language comprehension with generative capabilities, outperforming larger unified baselines at a compact parameter scale.
hacksider
Real-time AI face swap and one-click video deepfake with only a single image
harry0703
AI-powered short video generator that automates scripting, footage sourcing, subtitles, and composition — supporting 10+ LLM providers and batch production.
microsoft
Microsoft's official 1-bit LLM inference framework achieving 1.37x-6.17x speedup and up to 82% energy reduction, enabling 100B parameter models to run on consumer CPUs.
bytedance
ByteDance's open-source multimodal AI agent stack for GUI automation with vision-language model integration