[PATCH 5.15 045/196] nvme-tcp: fix possible use-after-free in transport error_recovery work

21 Feb 2022

From: Sagi Grimberg sagi@grimberg.me
[ Upstream commit ff9fc7ebf5c06de1ef72a69f9b1ab40af8b07f9e ]
While nvme_tcp_submit_async_event_work is checking the ctrl and queue
state before preparing the AER command and scheduling io_work, in order
to fully prevent a race where this check is not reliable the error
recovery work must flush async_event_work before continuing to destroy
the admin queue after setting the ctrl state to RESETTING such that
there is no race .submit_async_event and the error recovery handler
itself changing the ctrl state.
Tested-by: Chris Leech cleech@redhat.com
Signed-off-by: Sagi Grimberg sagi@grimberg.me
Signed-off-by: Sasha Levin sashal@kernel.org
---
 drivers/nvme/host/tcp.c | 1 +
 1 file changed, 1 insertion(+)

diff --git a/drivers/nvme/host/tcp.c b/drivers/nvme/host/tcp.c
index efa9037da53c9..ef65d24639c44 100644
--- a/drivers/nvme/host/tcp.c
+++ b/drivers/nvme/host/tcp.c
@@ -2105,6 +2105,7 @@ static void nvme_tcp_error_recovery_work(struct work_struct *work)
    struct nvme_ctrl *ctrl = &tcp_ctrl->ctrl;
nvme_stop_keep_alive(ctrl);
+	flush_work(&ctrl->async_event_work);
    nvme_tcp_teardown_io_queues(ctrl, false);
    /* unquiesce to fail fast pending requests */
    nvme_start_queues(ctrl);
-- 
2.34.1




    

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

[PATCH 5.15 045/196] nvme-tcp: fix possible use-after-free in transport error_recovery work