Accurate real-time speech recognition

  • This is a product update we’ve been working on. Our multi-headed model is accurate in 49 languages, and the full pipeline also handles normalisation (“ten dollars” -> “$10”), abbreviations, and punctuation. That makes a big difference when you’re feeding the transcript to, say, an LLM in real time.

    (Should we add oxford_comma as a config option?)
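
To make the normalisation step and the oxford_comma question concrete, here is a rough Python sketch. It is a toy, not our pipeline: the tiny number vocabulary, the FormatConfig class, and the function names normalise_currency and join_items are all made up for illustration, and oxford_comma is only the hypothetical option floated above.

```python
import re
from dataclasses import dataclass

# Toy vocabulary (assumption): a real system would cover far more than this.
NUMBER_WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}


@dataclass
class FormatConfig:
    # Hypothetical option named after the question above; not an existing setting.
    oxford_comma: bool = True


# Matches a spoken number word followed by "dollar" or "dollars".
_CURRENCY = re.compile(
    r"\b(" + "|".join(NUMBER_WORDS) + r") dollars?\b", re.IGNORECASE
)


def normalise_currency(text: str) -> str:
    """Rewrite spoken amounts such as 'ten dollars' as '$10'."""
    return _CURRENCY.sub(
        lambda m: f"${NUMBER_WORDS[m.group(1).lower()]}", text
    )


def join_items(items: list[str], config: FormatConfig) -> str:
    """Join items into a list phrase, with or without a comma before 'and'."""
    if len(items) < 3:
        return " and ".join(items)
    last_sep = ", and " if config.oxford_comma else " and "
    return ", ".join(items[:-1]) + last_sep + items[-1]
```

With this sketch, normalise_currency("it costs ten dollars") gives "it costs $10", and join_items(["a", "b", "c"], FormatConfig(oxford_comma=False)) gives "a, b and c". A flag like this would only change punctuation, so it could sit in the formatting stage without touching recognition.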