Are inference metrics like latency and power measured live from the device? Which devices can Talaria be applied to?
How does this compare to TVM?
Could you give us a tl;dr on this project? And how could I use something like this for on-device applications, think "smart home" style applications?
Hi, I’m Jochen, one of the authors.
We recently did a Show HN (https://news.ycombinator.com/item?id=41463916) which did not get much traction, so I’m posting this again here:
We just released Mycelium, the library that powers Talaria’s graph viewer. You can check it out and play around with it here: https://apple.github.io/ml-mycelium
I’m happy to answer any questions about Talaria or Mycelium!