Acknowledgement based mechanism to mark metric timestamps as completed #147

xichen2020 · 2018-07-14T20:52:38Z

I mentioned this issue to you a while back. Currently the aggregator marks a timestamp as "flushed" and persists it in KV as soon as the metrics with that timestamp have either been flushed to the backends (m3msg ingesters, indexers, etc.) or been written out to the TCP connection to other aggregation servers as forwarded metrics. Without acknowledgements, however, there is no reliable way to know whether those metrics actually reached the receiving end. Marking timestamps as completed can therefore be premature, which in turn causes the followers to discard metrics too early and can lead to data loss during server deployments.

With the integration of m3msg into m3aggregator, this should be an achievable goal. When a timestamp is flushed, it should not be marked as completed until the metrics associated with it have been acked on the other side (or dropped locally because the buffer is full), so that we can mark metrics as written with confidence.
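
To make the idea concrete, here is a minimal sketch of how completion could be gated on acks. It is not the m3aggregator or m3msg API: `ackTracker`, `Flushed`, and `Resolved` are hypothetical names, and the sketch assumes each flushed message can be tied back to its flush timestamp when it is acked or dropped.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// ackTracker is a hypothetical helper (not part of m3aggregator). It counts
// the messages still unresolved for each flushed timestamp and only advances
// the "completed" watermark once every earlier timestamp is fully resolved.
type ackTracker struct {
	mu          sync.Mutex
	outstanding map[int64]int // flush timestamp (nanos) -> unresolved messages
	completed   int64         // highest timestamp that is safe to persist
}

func newAckTracker() *ackTracker {
	return &ackTracker{outstanding: make(map[int64]int)}
}

// Flushed records that n messages for timestampNanos were handed to the transport.
func (t *ackTracker) Flushed(timestampNanos int64, n int) {
	t.mu.Lock()
	defer t.mu.Unlock()
	t.outstanding[timestampNanos] += n
}

// Resolved records that one message for timestampNanos was either acked by
// the receiver or dropped locally, and returns the current completed watermark.
func (t *ackTracker) Resolved(timestampNanos int64) int64 {
	t.mu.Lock()
	defer t.mu.Unlock()
	if t.outstanding[timestampNanos] > 0 {
		t.outstanding[timestampNanos]--
	}
	// Walk the tracked timestamps in order and stop at the first one that
	// still has unresolved messages.
	pending := make([]int64, 0, len(t.outstanding))
	for ts := range t.outstanding {
		pending = append(pending, ts)
	}
	sort.Slice(pending, func(i, j int) bool { return pending[i] < pending[j] })
	for _, ts := range pending {
		if t.outstanding[ts] > 0 {
			break
		}
		delete(t.outstanding, ts)
		if ts > t.completed {
			t.completed = ts
		}
	}
	return t.completed
}

func main() {
	tr := newAckTracker()
	tr.Flushed(10, 2) // two messages flushed for timestamp 10
	tr.Flushed(20, 1) // one message flushed for timestamp 20

	fmt.Println(tr.Resolved(20)) // 0: timestamp 10 is still unresolved
	fmt.Println(tr.Resolved(10)) // 0: one message for timestamp 10 remains
	fmt.Println(tr.Resolved(10)) // 20: everything up to 20 is resolved
}
```

The watermark returned by `Resolved` is what would be persisted to KV in place of the timestamp that is currently written at flush time. Treating a local drop the same as an ack follows the "acked or dropped locally" rule above.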

In the short term, a workaround for forwarded metrics could be for the follower to use lastFlushedNanos - maxSingleDelay as the target timestamp for discarding its metrics, since forwarded metrics would be rejected after maxSingleDelay anyway. This is certainly not ideal, though, and m3msg-based acks would be a much cleaner solution.
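
The workaround amounts to a single subtraction on the follower side. A rough sketch follows, assuming `lastFlushedNanos` is a Unix timestamp in nanoseconds and `maxSingleDelay` is a `time.Duration`. The function name `discardTargetNanos` is hypothetical and the concrete values are only illustrative.

```go
package main

import (
	"fmt"
	"time"
)

// discardTargetNanos is a hypothetical helper. It returns the timestamp up to
// which a follower could discard forwarded metrics: the leader's last flushed
// timestamp, held back by maxSingleDelay.
func discardTargetNanos(lastFlushedNanos int64, maxSingleDelay time.Duration) int64 {
	return lastFlushedNanos - maxSingleDelay.Nanoseconds()
}

func main() {
	// Illustrative values only; real values come from KV and configuration.
	lastFlushed := time.Date(2018, 7, 14, 20, 52, 0, 0, time.UTC).UnixNano()
	maxSingleDelay := 5 * time.Second

	target := discardTargetNanos(lastFlushed, maxSingleDelay)
	fmt.Println(time.Unix(0, target).UTC()) // 2018-07-14 20:51:55 +0000 UTC
}
```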

xichen2020 mentioned this issue Jul 14, 2018

Revert eager forwarding logic #148

Merged
