Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Supervisor process environment issue #55

Open
davissp14 opened this issue Jan 11, 2022 · 0 comments
Open

Supervisor process environment issue #55

davissp14 opened this issue Jan 11, 2022 · 0 comments
Labels
bug Something isn't working

Comments

@davissp14
Copy link
Contributor

During Restores and Standalone -> HA setup conversions, Stolon is booted with "existing" and is seeded with a old/fake keeperUID. Stolon notices the keeperUID doesn't exist and will work to generate a new one. Stolon updates the keeperstate file with the new UID that we need to use, but our supervisor will continue to use the original UID until the VM gets rebooted or hits its timeout. The end result is that Stolon won't be able to connect with the backend store until the VM is rebooted and the supervisor process is able to resolve the correct environment.

https://github.com/fly-apps/postgres-ha/blob/main/cmd/start/main.go#L136

While this is a noticeable problem, the problem does auto-resolve after the supervisor process restarts post-timeout.

@davissp14 davissp14 added the bug Something isn't working label Jan 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant