Monitoring broken (since April 29?)
Originally created by @intrigeri on #17703 (Redmine)
On https://icingaweb2.tails.boum.org/monitoring/health/info I see that the last Icinga status update happened on April 29.
On ecours, I see that icinga2.service
cannot start because the config
files in the teels.tails.boum.org
zone refer to a zone that is not
declared anymore. Indeed, in etckeeper’s log on ecours, I see that
Puppet removed that zone on April 29
(6ffca75bb39d22133064d5d3f306c5be77a9eb46). I could not find where that
zone was configured in Puppet so I tried deleting
/etc/icinga2/zones.d/teels.tails.boum.org/
, which allowed
icinga2.service
to start. Then I ran Puppet on ecours, and those files
did not come back, so I’m confused.
Then I saw the exact same problem on monitor.lizard
, and applied the
same solution. Here again, running Puppet on that host did not bring
back the config files I had deleted.
I’ll stop here for today. I hope I did more good than harm.