Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference

  • Hi, I’m Jochen, one of the authors.

    We recently did a Show HN (https://news.ycombinator.com/item?id=41463916) which did not get much traction, so I’m posting this again here:

    We just released Mycelium, the library that powers Talaria’s graph viewer. You can check it out and play around with it here: https://apple.github.io/ml-mycelium

    I’m happy to answer any questions about Talaria or Mycelium!

  • Are inference metrics like latency and power measured live on the device? Which devices can Talaria be applied to?

  • How does this compare to TVM?

  • Could you give us a tl;dr on this project? And how could I use something like this for on-device applications, e.g. "smart home" style applications?