On 23 May 2018 at 15:36, Robert Marshall <robert.marshall@codethink.co.uk> wrote:
At some point last week, I think because of network connectivity issues,
a job got stuck and I cancelled it. When run again it again appeared to
hang. I again cancelled it and am now seeing the health check not start
(at least no output appears on the job's web page).


What is the status of the relevant device(s) and any associated test jobs?

Check /var/log/lava-server/lava-master.log for the reason why the device is not being assigned.

Check the status of all daemons, including lava-logs:

sudo service lava-master status
sudo service lava-logs status
sudo service lava-slave status
 
Looking at the output.yaml (in /var/lib/lava-server/default/media/job-output/2018/05/23/32) I see
... progress output for downloading https://images.validation.linaro.org/kvm/standard/stretch-2.img.gz

- {"dt": "2018-05-23T07:39:54.728015", "lvl": "debug", "msg": "[common] Preparing overlay tarball in /var/lib/lava/dispatcher/tmp/32/lava-overlay-aye3n2ke"}
- {"dt":
- "2018-05-23T07:39:54.728root@stretch:/var/lib/lava-server/default/media/job-output/2018/05/23/32

But none of this appears in http://localhost:8080/scheduler/job/32

and at the head of that page I see the message:

Unable to parse invalid logs: This is maybe a bug in LAVA that should be reported.
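
The excerpt above stops partway through an entry, so one thing worth
confirming is whether output.yaml itself is cut off. A minimal sketch,
assuming every entry in the file has the shape shown in the excerpt ("- "
followed by one JSON object on a single line); first_bad_line is a
hypothetical helper for illustration, not part of LAVA:

```python
import json


def first_bad_line(lines):
    """Return (line_number, text) for the first entry that does not parse.

    Assumes each non-empty line is "- " followed by one JSON value, as in
    the excerpt from output.yaml. Returns None if every line parses.
    """
    for number, raw in enumerate(lines, start=1):
        text = raw.rstrip("\n")
        if not text.strip():
            continue  # skip blank lines
        if not text.startswith("- "):
            return number, text
        try:
            json.loads(text[2:])
        except ValueError:
            # Truncated or otherwise malformed entry
            return number, text
    return None
```

Passing the open output.yaml file object to first_bad_line should give
the line number where the log stops being parseable, which can then be
compared with the timestamps in the daemon logs.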

Which other logs are best for checking whether this is an error that
should be fed back?

(LAVA 2018.4)

Robert