Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
WorldGen is an open-source project from researcher Ziyang Xie that generates complete, explorable 3D scenes in seconds from either a text prompt or a single image. Since the codebase was open-sourced in April 2025 it has grown past 2,000 GitHub stars, and the maintainer has kept it moving with a steady stream of pipeline upgrades — the latest reworking depth estimation and Gaussian-splat alignment for more globally consistent worlds. Rather than producing a single object mesh, WorldGen targets full environments suitable for games, simulation, robotics, and virtual reality. ## Text-to-Scene and Image-to-Scene WorldGen's headline capability is turning a short description into a navigable 3D world with roughly two lines of Python: instantiate `WorldGen()` and call `generate_world("<prompt>")`. It supports both text-to-scene and image-to-scene generation, so a concept painting or a street photo can be lifted into a 3D environment just as easily as a written prompt. The project handles indoor and outdoor scenes and does not restrict itself to photorealism — stylized, painterly, and unrealistic scenes are explicitly supported. ## 360-Degree Exploration With Loop Closure What distinguishes WorldGen from single-view 3D generators is that its output is a genuinely explorable scene rather than a front-facing shell. It supports free 360-degree navigation with loop closure, meaning a viewer can turn all the way around and back to the start with the geometry staying consistent. Rendering is flexible: any resolution, any camera setting, and arbitrary trajectories can be rendered in real time, which makes the output directly usable as a game or simulation backdrop. ## An Actively Engineered Pipeline The project reads as a living research pipeline. Recent updates swapped the depth model from UniK3D to DA-2 for better 360-degree depth estimation, reworked the optional ml-sharp module to align per-face Gaussians using cubemap depth (cutting from eight-plus views down to six cubemap faces), and fixed projection-scale issues to improve Gaussian-splat quality. A low-VRAM mode brings generation down to roughly 10GB, widening the range of GPUs that can run it, and mesh-based scene generation is available as an alternative to splats. ## Trade-offs and Limitations WorldGen is early-stage research software (versioned v0.2.0) rather than a turnkey product: installation requires a CUDA GPU, conda, and a recursive clone with several model dependencies, and the technical report and video are still listed as to-do items. Invisible-region and background inpainting is flagged as an area still being improved, so occluded areas of a scene can show artifacts, and results — while fast — carry the usual generative-3D caveats around fine geometric fidelity. ## Who Should Use This WorldGen is a strong fit for game developers, simulation and robotics researchers, and VR creators who need to spin up 3D environments quickly for prototyping or synthetic data, and who value 360-degree explorability over pixel-perfect asset quality. Its two-line API and low-VRAM mode also make it an approachable entry point for anyone experimenting with generative 3D scene synthesis.
graphdeco-inria
Original reference implementation of 3D Gaussian Splatting for real-time radiance field rendering
ahujasid
Connect Blender to Claude AI via MCP for natural-language-driven 3D scene creation and manipulation.
Tencent Hunyuan
Tencent's open-source diffusion system that turns a single image or text prompt into high-resolution, textured 3D assets.