On Wed, Sep 14, 2022 at 10:21:04PM -0400, Jason Wittlin-Cohen wrote:
#regzbot introduced 6d17a112e9a63ff6a5edffd1676b99e0ffbcd269
Issue: When running a 5.10.140 kernel compiled from kernel.org source, or a bisected kernel with commit 6d17a112e9a63ff6a5edffd1676b99e0ffbcd269, 6 of the 48 drives attached to my LSI 9207-8e SAS HBA (P20 firmware, IT mode) will drop from the controller shortly after the boot process completes. At this point, the drives are not visible to the LSI controller, verified using LSI's lsiutil.x86_64 to list all attached devices, nor are the drives enumerated in /dev/disk/by-id. Attempts to access the drives result in I/O errors reported in syslog. At some point thereafter, the drives reappear and are accessible.
Running a vanilla 5.10.139 kernel or a bisected kernel with commit 6d17a112e9a63ff6a5edffd1676b99e0ffbcd269 removed, all 48 drives appear correctly, are listed in LSI's lsiutil tool, and appear in /dev/disk/by-id. No I/O errors are reported from any drive and none of the drives drop off the controller as experienced in 5.10.140. SMART testing shows normal results for all impacted drives.
Does this also have problems in the latest 5.15 and 5.19 release, or is it somehow limited to 5.10.y?
Also, html emails are rejected by the mailing lists, so you might want to resend it all in text-only mode so that everyone can see the full details.
thanks,
greg k-h