Dave Pigott dave.pigott@linaro.org writes:
All on snowballs this time. Some have been stuck for 3 days.
snowball02
http://validation.linaro.org/lava-server/scheduler/job/40045
The usual story. Pushed bundle, and then stuck. Did a cancel and kill -2
You don't need to cancel the jobs -- just kill -2 them.
This is https://bugs.launchpad.net/lava-scheduler/+bug/1043059 -- I'm completely stumped on it, unfortunately. What happens is that we have an extra fd open between the scheduler monitor process and the dispatcher and this extra fd is never reported closed to the monitor code so it doesn't think that it's exited. Probably we should just give up on this --oob-fd cuteness and parse the dashboard-put-result: message out of stdout.
Cheers, mwh