Real-time AI avatars on a single GPU: When local beats the cloud API
Open-weights joint audio-video models hit a usable bar this spring, and a single local GPU now runs a real-time conversational avatar end to end. Here's where self-hosting with LTX 2.3 and Daydream Scope beats a third-party cloud API, and where it doesn't.