Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
unslothai
2x faster LLM fine-tuning with 70% less VRAM via custom Triton kernels. Supports Llama, Qwen, DeepSeek, Gemma, and 500+ models.
ollama
The simplest way to run LLMs locally, with 165K+ GitHub stars. One-command deployment, 100+ models, a REST API, and multi-platform support.
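As a minimal sketch of the REST API mentioned above: ollama's documented `/api/generate` endpoint listens on `localhost:11434` by default. The model name `llama3` here is an illustrative assumption; substitute any model you have pulled.

```python
import json
import urllib.request

# Assumed default: ollama's REST API served locally on port 11434.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_generate_request(prompt, model="llama3", stream=False):
    """Build the JSON body for ollama's /api/generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": stream}

def generate(prompt, model="llama3", url=OLLAMA_URL):
    """Send a non-streaming generate request; return the completion text."""
    body = json.dumps(build_generate_request(prompt, model)).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With `stream=False` the server returns one JSON object whose `response` field holds the full completion, which keeps the client to a single request/response pair.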
ggml-org
Pure C/C++ LLM inference engine supporting CPUs, Apple Silicon, CUDA, and Vulkan.
sgl-project
SGLang is a high-performance open-source serving framework for LLMs and multimodal models, with RadixAttention prefix caching, structured generation, zero-overhead CPU scheduling, and prefill-decode disaggregation. Deployed across 400,000+ GPUs at xAI, LinkedIn, and Cursor.