Hacker News

Accurate real-time speech recognition

by headson 2/1/2024, 3:57:04 AM with 1 comment

by headson 2/1/2024, 4:03:43 AM
This is a product update we’ve been working on. Our multi-headed model is accurate in 49 languages and the full pipeline has really good normalisation (“ten dollars” -> “$10”), abbreviations, and punctuation too. It makes a big difference when you’re feeding this, say, to an LLM in real time.
(Should we add oxford_comma as a config option?)