parmesant's comments

parmesant · 2025-06-14T08:35:03 1749890103

Based on the feedback, we could have done a much better job with these results (lessons for our next experiment). But yes, the models were tested against the same dataset which was aggregated over different granularities (1 minute, 1 hour, 1 day)

parmesant · 2025-06-14T08:31:51 1749889911

We'll definitely include it in our next experiment (shaping up to be quite big!)

parmesant · 2025-06-14T08:08:36 1749888516

At the moment our focus is on observability, hence the narrow scope of our dataset. A pretty good benchmark for observability seems to be Datadog's BOOM- https://huggingface.co/datasets/Datadog/BOOM

But for general purpose time-series forecasting, benchmarks mentioned in other comments like GIFT or M4 might come in handy. We might include them in the follow-up experiment.

parmesant · 2025-06-13T12:12:03 1749816723

That's actually one of the use-cases that we set out to explore with these models. We'll release a head-to-head comparison soon!

CubsFan1060 · 2025-06-13T13:08:08 1749820088

That's the thing I'm most interested in out of these. Super interested to see what you find out.

Did you or do you plan to publish any of your code or data sets from this?

Debanitrkl · 2025-06-13T14:38:27 1749825507

Author here, we’re just getting started with these experiments and plan to apply them to more features on our roadmap. Future posts will be more detailed, based on the feedback we received here. Once we finish implementing these features, we’ll be happy to share the code and dataset.

parmesant · 2025-06-13T12:03:40 1749816220

This looks like a great benchmark! We've been thinking of doing a better and more detailed follow-up and this seems like the perfect dataset to do that with. Thanks!

parmesant · 2025-06-13T11:59:09 1749815949

Author here, we're trying these out for the first time for our use-cases so these are great points for us to improve upon!

mvATM99 · 2025-06-13T15:14:08 1749827648

Good to see positive reception to feedback! Sorry if my message came out as condescending, was not the intent. I recommend reading this piece on metrics https://openforecast.org/wp-content/uploads/2024/07/Svetunko.... It's easy to grasp, yet it contains great tips.

parmesant · 2025-06-14T07:55:04 1749887704

we're grateful for the honest feedback (and the awesome resource!), makes it easier to identify areas for improvement. Also, your point about using multiple metrics (based on use-cases, audience, etc) makes a lot of sense. Will incorporate this in our next experiment.