thanos: query: Staleness problem

Thanos, Prometheus and Golang version used: thanos: v0.12.0

Object Storage Provider: private CEPH (S3)

What happened: See the end_input time and resolution in the screenshots: [Screenshot 2020-05-14 at 16:34:52] [Screenshot 2020-05-14 at 16:35:02] The staleness handling in the Prometheus library drops some of the points returned from the Thanos stores.

What you expected to happen: All data from the store is returned for any time range.

How to reproduce it (as minimally and precisely as possible): see the screenshots.

Full logs to relevant components:

Anything else we need to know: I think there are a few ways to resolve the problem:

  1. Update the Prometheus library and set the LookbackDelta parameter to more than 5 min (needs checking).
  2. Update query to move/duplicate points to the needed timestamps (interpolate data for PromQL).
  3. Update the Prometheus library so that it returns all points from the stores.
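
To illustrate why option 1 could help, here is a rough Python sketch of how a lookback window decides whether an evaluation step gets a value. It is not the Prometheus implementation: `select_sample` and `evaluate_range` are hypothetical names, the window is assumed to be left-open (the exact boundary handling differs between Prometheus versions), and the hourly series is made-up data.

```python
from bisect import bisect_right


def select_sample(timestamps, values, eval_ts, lookback_delta):
    """Return the newest value in (eval_ts - lookback_delta, eval_ts], else None.

    timestamps must be sorted ascending; all times are in seconds.
    """
    i = bisect_right(timestamps, eval_ts) - 1  # newest sample at or before eval_ts
    if i < 0 or timestamps[i] <= eval_ts - lookback_delta:
        return None  # nothing recent enough: the series counts as stale here
    return values[i]


def evaluate_range(timestamps, values, start, end, step, lookback_delta):
    """Evaluate an instant selector at every step in [start, end]."""
    return [
        (t, select_sample(timestamps, values, t, lookback_delta))
        for t in range(start, end + 1, step)
    ]


if __name__ == "__main__":
    # Hypothetical series downsampled to one sample per hour.
    ts, vals = [0, 3600, 7200], [1.0, 2.0, 3.0]
    for lookback in (300, 3600):
        points = evaluate_range(ts, vals, 0, 7200, 600, lookback)
        kept = sum(1 for _, v in points if v is not None)
        print(f"lookback={lookback}s: {kept}/{len(points)} steps have a value")
```

With a 5 min lookback only the steps that land on a sample (3 of 13) get a value; with a lookback at least as long as the sample spacing every step does.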

About this issue

  • State: closed
  • Created 4 years ago
  • Comments: 22 (20 by maintainers)

Most upvoted comments

BTW, did you know that we can now configure the staleness lookback delta?

However, we might indeed want to adjust it for different resolutions.

Well, here’s my attempt at it: https://github.com/thanos-io/thanos/pull/3277

We just allow users to configure it on the Querier via a flag, that’s it.

So essentially, for each piece of downsampled data, you would expand it to have samples every 1m, i.e. at a fake interval?

Yes, something like that.

Why does it depend on the query?

E.g. sum_over_time needs different data than count_over_time to produce the desired result.
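
A small Python sketch of that point, under assumptions: each downsampled window is modelled as a handful of aggregates (count, sum, min, max), and `AggrWindow`, `downsample`, and `expand_with_average` are hypothetical helpers, not Thanos code. It compares answering the query from the matching aggregate with naively expanding a window into fake 1m samples.

```python
from dataclasses import dataclass


@dataclass
class AggrWindow:
    """Aggregates kept for one downsampled window (illustrative model)."""
    count: int
    total: float
    minimum: float
    maximum: float


def downsample(raw_values):
    """Collapse the raw samples of one window into its aggregates."""
    return AggrWindow(len(raw_values), sum(raw_values), min(raw_values), max(raw_values))


def sum_over_time(windows):
    # Needs the per-window sum aggregate.
    return sum(w.total for w in windows)


def count_over_time(windows):
    # Needs the per-window count aggregate.
    return sum(w.count for w in windows)


def expand_with_average(window, fake_samples):
    """Naive expansion: repeat the window average as evenly spaced fake samples."""
    return [window.total / window.count] * fake_samples


if __name__ == "__main__":
    # Hypothetical 5m window scraped every 15s: 20 raw samples of value 2.0.
    window = downsample([2.0] * 20)
    fake = expand_with_average(window, 5)  # one fake sample per minute

    print("sum_over_time   from aggregates:", sum_over_time([window]))
    print("sum_over_time   from fake samples:", sum(fake))
    print("count_over_time from aggregates:", count_over_time([window]))
    print("count_over_time from fake samples:", len(fake))
```

In this toy case the aggregates give a sum of 40.0 and a count of 20, while the five fake samples give 10.0 and 5, so a single expansion scheme cannot serve both functions.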