Show HN: Serverless speech recognition with WebAssembly

  • In this thread:

    The Zork game, written in WebAssembly, with serverless (offline) speech recognition provided by the Kaldi Speech Recognition Toolkit. Kaldi is written in C++, which (I guess) is then compiled to WebAssembly via Emscripten.

    https://github.com/kaldi-asr/kaldi

    Very very interesting.

    Additional comment: When I saw the Amazon Alexa API (I was at a small meeting which included defining your own "utterances"), I was disappointed that Alexa required an internet connection to perform the speech recognition, even more so given that the interaction with it was not very sophisticated (after all, it is geared to understanding predefined commands).

  • Deadly. Have always hated the privacy implications of server-side speech processing.

  • Super super cool. What data did you use to train the acoustic model?