PersonaPlex

NVIDIA | MIT

Audio | 7.4K Stars | 320 Forks | 124 views

PersonaPlex is NVIDIA's open-source, full-duplex conversational speech AI system. It enables real-time voice interaction with controllable voice characteristics and conversational persona. Built on the Moshi speech foundation model with a Helium LLM backbone, PersonaPlex processes incoming audio and generates contextually appropriate spoken responses at the same time, which lets it handle natural interruptions and overlapping speech. It ships with 16 pre-configured voices (Natural and Variety categories), audio conditioning for custom voice profiles, and text-based role prompts for persona definition. Designed for customer service automation, educational tutoring, and companion AI, the system supports GPU acceleration with CPU offload for limited-VRAM environments. It can be deployed with Docker and a web UI, and it also provides offline evaluation tools and experimental out-of-distribution prompt testing. The codebase is MIT-licensed; model weights are released under NVIDIA's Open Model License.

Key Features

Full-duplex bidirectional audio: listen and speak simultaneously with natural interruption support
16 pre-configured voices across Natural and Variety categories (female and male)
Audio conditioning to target a custom voice profile from audio samples
Text-based role prompts for consistent conversational persona throughout sessions
Built on Moshi speech foundation model with Helium LLM backbone
GPU-accelerated low-latency inference with CPU offload option for limited VRAM
Docker deployment with web UI for rapid local prototyping
Offline evaluation tools for benchmarking and stress testing custom scenarios
Experimental out-of-distribution prompt support for edge case testing
MIT-licensed codebase with NVIDIA Open Model License for weights
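
To make the full-duplex item under Key Features more concrete, here is a toy, frame-synchronous sketch of the idea: on every tick the loop reads one input frame and emits one output frame, so the agent keeps listening while it speaks and yields as soon as the user barges in. This is not PersonaPlex's code or API. The names Frame, run_full_duplex, and reply_len are hypothetical, and the "speech"/"silence" labels stand in for real audio frames.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Frame:
    """Hypothetical stand-in for one incoming audio frame."""
    user_speaking: bool  # True if the frame contains user speech


def run_full_duplex(frames: List[Frame], reply_len: int = 3) -> List[str]:
    """Toy loop: one input frame in and one output frame out per tick.

    Assumption for illustration only: the agent starts a fixed-length
    reply when the user stops talking and drops it if the user talks again.
    """
    output: List[str] = []
    remaining = 0        # frames of agent speech still queued
    prev_user = False    # whether the user was speaking on the previous tick

    for frame in frames:
        if frame.user_speaking:
            # Barge-in: input is read even mid-reply, so the agent yields.
            remaining = 0
            output.append("silence")
        else:
            if prev_user:
                # The user just finished a turn: queue a reply.
                remaining = reply_len
            if remaining > 0:
                output.append("speech")
                remaining -= 1
            else:
                output.append("silence")
        prev_user = frame.user_speaking

    return output


if __name__ == "__main__":
    # User talks for 2 frames, pauses for 2, interrupts for 1, then stays quiet.
    pattern = [True, True, False, False, True, False, False, False, False]
    print(run_full_duplex([Frame(p) for p in pattern]))
```

In the example run, the agent's first reply is cut short at the fifth frame because the user starts talking again, and a new reply begins once the user stops. A half-duplex system would instead ignore input until its own reply had finished.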