On Wed, Nov 18, 2020 at 07:28:30PM +0100, Timo Rothenpieler wrote:
Hi,
After 50b2412b7e7862c5af0cbf4b10d93bc5c712d021 was backported to stable branches (I only tested 5.4), some serious issues started to arrise.
According to linux-rdma, the following two patches that need to go along with 50b2412b7e are missing:
- 1d5558b1f0de net/mlx5: poll cmd EQ in case of command timeout
- 410bd754cd73 net/mlx5: Add retry mechanism to the command entry ...
I managed to apply those mostly cleanly after also applying two dependencies. So the complete list of needed commits for 5.4 is:
- 3ed879965cc4 net/mlx5: Use async EQ setup cleanup helpers ...
- 1d5558b1f0de net/mlx5: poll cmd EQ in case of command timeout
- d43b7007dbd1 net/mlx5: Fix a race when moving command ...
- 410bd754cd73 net/mlx5: Add retry mechanism to the command entry ...
With those 4 commits applied, the issue is fixed. For reference, that's the output I get with 5.4.77:
All now queued up, thanks.
greg k-h