Power failure again, with no obvious cause. This time there were problems: n0007 appears to be dead (no POST), and the server failed to shut down cleanly. xfs_check repaired one unlinked file coming from slurm. The server booted up normally (?).
The current hypothesis for the unclean shutdowns is that the culprit is the powerfail-duration line in /etc/pwrstatd.conf: it seems that whenever powerfail-duration is not zero, the shutdown is not clean. We'll see …
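To check the setting quickly after each incident, something like the following could be used. This is only a sketch: it assumes the file uses plain `key = value` lines with `#` comments, and the helper name `read_powerfail_duration` is made up for this note.

```python
import re


def read_powerfail_duration(conf_text):
    """Return the powerfail-duration value as an int, or None if it is not set.

    Assumes 'key = value' lines and '#' comments (an assumption about the
    file format, not taken from the pwrstatd documentation).
    """
    for line in conf_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        match = re.match(r"powerfail-duration\s*=\s*(\d+)$", line)
        if match:
            return int(match.group(1))
    return None


# Example with an inline sample instead of the real file.
sample = "# sample settings\npowerfail-duration = 30\n"
value = read_powerfail_duration(sample)
if value:
    print("powerfail-duration is non-zero:", value)
```

On the server itself, the text would come from reading /etc/pwrstatd.conf; a result other than 0 is the case suspected above.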
Power supply replaced on n0007 → back to service.