JSF Labs · Insights

Engineering thoughts,
AI breakthroughs & voice stories.

Jawad Khan · CTO & AI Engineer · Jan 2025 · 3 min read

Engineering

How We Built Sub-200ms Latency TTS at Scale

Achieving real-time audio generation requires rethinking every layer of the stack — from model quantization to async chunk streaming. Here's exactly how we did it.

Jawad Khan · CTO & AI Engineer · Feb 2025 · 4 min read

Research

Emotional Speech Synthesis: Beyond Text

Modern TTS can produce grammatically correct speech that sounds emotionally flat. We spent six months solving this. The results surprised us.

Jawad Khan · CTO & AI Engineer · Mar 2025 · 5 min read

Voice AI

Voice Cloning in 2025: A Technical Deep Dive

Voice cloning has matured dramatically. We benchmarked 11 approaches and built our own. Here's what the data actually shows.

Jawad Khan · CTO & AI Engineer · Apr 2025 · 6 min read

Product

Multilingual AI — The Alignment Problem

Supporting 15 languages in a single model without quality degradation requires more than just multilingual training data. The alignment problem is subtle.

Jawad Khan · CTO & AI Engineer · May 2025 · 7 min read

Deep Learning

From Waveform to Words: Our Signal Pipeline

Our audio preprocessing pipeline handles everything from noise removal to prosody normalization. It runs in 8 ms on average. Here's the architecture.