Overnight health check failures - linaro-validation

21 Nov 2012


      Hi all,
Two last night, which means we're averaging approximately one health check failure per day, which equates to a 95% pass rate. Not great.
------------
panda04
------------
http://validation.linaro.org/lava-server/scheduler/job/39484
When it got into the test image, the device was spewing out lots of weird error messages. Went onto the board and rebooted the test image: same problem. Shell prompt was also corrupted. I wasn't clear if this is a board/sd card/corrupt image deployment problem, so I booted the master image, and that seems fine. Putting back online to see if it was a one off corruption.
If the board passes this time, then the only way to fix this problem would be to set up so that if for some reason things fail in the test image, go round and do it all again - including deployment, because rebooting the test image wouldn't have worked.
------------
panda06
------------
http://validation.linaro.org/lava-server/scheduler/job/39477
wget weirdness. Kept getting "Connection reset by peer", and then retrying. Putting back online to see if it's a one off glitch.
If the board passes this time, then the way to fix the problem is, if deployment fails, reboot to the master image and try again.
Thanks
Dave