On Fri, May 10, 2024 at 05:36:03PM +0800, Li, Xin3 wrote:
if (kvm_is_fred_enabled(vcpu)) {
u64 event_data = 0;
if (is_debug(intr_info))
/*
* Compared to DR6, FRED #DB event data saved on
* the stack frame have bits 4 ~ 11 and 16 ~ 31
* inverted, i.e.,
* fred_db_event_data = dr6 ^ 0xFFFF0FF0UL
*/
event_data = vcpu->arch.dr6 ^ DR6_RESERVED;
else if (is_page_fault(intr_info))
event_data = vcpu->arch.cr2;
else if (is_nm_fault(intr_info))
event_data =
- to_vmx(vcpu)->fred_xfd_event_data;
IMO, deriving an event_data from CR2/DR6 is a little short-sighted because the event_data and CR2/DR6 __can__ be different, e.g., L1 VMM __can__ set CR2 to A and event_data field to B (!=A) when injecting #PF.
VMM should guarantee a FRED guest _sees_ consistent values in CR6/DR6 and event data. If not it's just a VMM bug that we need to fix.
I don't get why VMM should.
I know the hardware will guarantee this. And likely KVM will also do this. but I don't think it is necessary for KVM to assume L1 VMM will guarantee this. because as long as L2 guest is enlightened to read event_data from stack only, the ABI between L1 VMM and L2 guest can be: CR2/DR6 may be out of sync with the event_data. I am not saying it is good that L1 VMM deviates from the real hardware behavior. But how L1 VMM defines this ABI with L2 has nothing to do with KVM as L0. KVM shouldn't make assumptions on that.