Hacker News

Ask HN: Benchmarks for models other than LLMs

by caydenm on 4/24/2024, 6:00:54 AM with 1 comment

by luke-stanley on 4/24/2024, 11:53:10 AM
On PapersWithCode, different datasets have benchmarks: https://paperswithcode.com/datasets
You can also break down by task here: https://paperswithcode.com/sota
For churn, you might go to time series forecasting first: https://paperswithcode.com/task/time-series-forecasting
They have this subtask, which is a bit different because it's about novel products rather than continued sales, for example:
https://paperswithcode.com/task/new-product-sales-forecastin...
But you get the idea of how they organise by task. I'm curious about other benchmarks and interfaces too and would like to see others.
I think HuggingFace and Kaggle have some overlap with different tasks that have benchmarks.