JSF Labs

JSF Labs · Insights

Engineering thoughts,
AI breakthroughs & voice stories.

Jawad Khan
Jawad KhanCTO & AI Engineer·Jan 2025
Engineering

How We Built Sub-200ms Latency TTS at Scale

Achieving real-time audio generation requires rethinking every layer of the stack — from model quantization to async chunk streaming. Here's exactly how we did it.

How We Built Sub-200ms Latency TTS at Scale
Jawad Khan
Jawad KhanCTO & AI Engineer·Feb 2025
Research

Emotional Speech Synthesis: Beyond Text

Modern TTS can produce grammatically correct speech that sounds emotionally flat. We spent six months solving this. The results surprised us.

Emotional Speech Synthesis: Beyond Text
Jawad Khan
Jawad KhanCTO & AI Engineer·Mar 2025
Voice AI

Voice Cloning in 2025: A Technical Deep Dive

Voice cloning has matured dramatically. We benchmarked 11 approaches and built our own. Here's what the data actually shows.

Voice Cloning in 2025: A Technical Deep Dive
Jawad Khan
Jawad KhanCTO & AI Engineer·Apr 2025
Product

Multilingual AI — The Alignment Problem

Supporting 15 languages in a single model without quality degradation requires more than just multilingual training data. The alignment problem is subtle.

Multilingual AI — The Alignment Problem
Jawad Khan
Jawad KhanCTO & AI Engineer·May 2025
Deep Learning

From Waveform to Words: Our Signal Pipeline

Our audio preprocessing pipeline handles everything from noise removal to prosody normalization. It runs in 8ms on average. Here's the architecture.

From Waveform to Words: Our Signal Pipeline