Open Source

Explore the latest AI open-source projects from GitHub and HuggingFace.

Evermx

Latest AI/LLM news and in-depth reviews.
We analyze usability, potential, and trade-offs.

info@evermx.com

LLM

Claude
Gemini
GPT
Llama
Other LLM

Official Sites

Anthropic (Claude)
Google AI (Gemini)
OpenAI (GPT)
Meta AI (Llama)
Hugging Face

About Editorial Policy Contact Privacy Policy Terms of Service

Reviews Tools Open Source Live Official Profile

TileLang - Open Source | Evermx | Evermx

Back to Open Source

TileLang

View on GitHub

Inference5.8K Stars524 Forks139 views

TileLang is a domain-specific language (DSL) for streamlining high-performance GPU/CPU/accelerator kernel development using Pythonic syntax with TVM compiler infrastructure. It supports GEMM, FlashAttention, LinearAttention, and sparse operations across CUDA, HIP, Metal, WebGPU, and Ascend backends.

Key Features

Pythonic DSL for GPU/CPU kernel development backed by TVM compiler infrastructure
Multi-backend support: CUDA (NVIDIA), HIP (AMD), Metal (Apple), WebGPU, Ascend (Huawei)
Built-in FlashAttention, GEMM, dequantization GEMM, and LinearAttention kernels
Auto-tuning, layout annotations, L2 cache swizzling, and pipelining support
Z3 theorem prover integration for kernel transformation correctness verification

Related Projects

TrendingInference

GitHub

165.0K15.0K

Ollama

ollama

The simplest way to run LLMs locally with 165K+ GitHub stars. One-command deployment, 100+ models, REST API, and multi-platform support.

llama.cpp

ggml-org

Pure C/C++ LLM inference engine supporting CPUs, Apple Silicon, CUDA, and Vulkan

vLLM

vLLM Project

A high-throughput, memory-efficient LLM inference and serving engine built around PagedAttention, with an OpenAI-compatible API and 200+ model support.

Apache-2.0101