Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
YOLOE (Real-Time Seeing Anything) is an efficient, unified, open object detection and segmentation model from Tsinghua University. It supports three prompting modes (text prompts, visual prompts, and a prompt-free mode) with zero inference and transfer overhead compared to closed-set YOLOs. It introduces Re-parameterizable Region-Text Alignment (RepRTA) for text prompts and a Semantic-Activated Visual Prompt Encoder (SAVPE) for visual prompts, achieving state-of-the-art open-vocabulary detection and segmentation in real time. Accepted at ICCV 2025, YOLOE surpasses YOLO-Worldv2-S by 3.5 AP with a 1.4× inference speedup.
ultralytics
Real-time computer vision framework with 54k+ GitHub stars, NMS-free inference in YOLO26, and multi-task support for detection, segmentation, and pose estimation
roboflow
Reusable, model-agnostic Python tools for building computer vision applications
QwenLM
Alibaba's most powerful open-source vision-language model with 256K context, spatial reasoning, and visual agent capabilities.
web-infra-dev
AI-powered, vision-driven UI automation for web, mobile, and desktop using natural language with 12.1K+ GitHub stars.