Skip to content
This repository has been archived by the owner on Sep 18, 2020. It is now read-only.

operator: expose metrics #156

Open
lucab opened this issue Nov 10, 2017 · 0 comments
Open

operator: expose metrics #156

lucab opened this issue Nov 10, 2017 · 0 comments

Comments

@lucab
Copy link
Contributor

lucab commented Nov 10, 2017

update-operator is a long running Go process which supervises cluster-wide complex operations. As such it should expose metrics regarding its status, which can be scraped by Prometheus and alerted upon. Access to such endpoint should be governed by kubernetes RBAC policies.

This is a preliminary list of interesting metric:

  • go runtime stats
  • nodes being managed by CLUO
  • nodes in reboot-needed state
  • nodes in before-reboot state
  • nodes in after-reboot state
  • optional "before" and "after" checks state
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant