On 07/01/2014 09:16 AM, Daniel Thompson wrote:
Currently if an active CPU fails to respond to a roundup request the CPU that requested the roundup will become stuck. This needlessly reduces the robustness of the debugger.
This patch introduces a timeout allowing the system state to be examined even when the system contains unresponsive processors. It also modifies kdb's cpu command to make it censor attempts to switch to unresponsive processors and to report their state as (D)ead.
It seems reasonable to allow entry on the master core because there certainly could be useful information to be had with respect to how you got there in the first place, but I wonder about the case for resuming the system. In general if you couldn't sync in the the first place, the system is dead. My opinion is that we probably should explicitly disallow a resume or single step at that point.
Jason.