Diff - 0f6e2f4e06be4da35c1b9e52d638218bafa91e25^! - LeafOS-Devices/android_kernel_samsung_gta4xl

commit	0f6e2f4e06be4da35c1b9e52d638218bafa91e25	[log] [tgz]
author	James Smart <jsmart2021@gmail.com>	Tue Mar 13 09:48:07 2018 -0700
committer	Greg Kroah-Hartman <gregkh@linuxfoundation.org>	Sat Oct 13 09:27:28 2018 +0200
tree	e713337a1108a40cbf3d1ded2b38b643d7d0ee1c
parent	1b2ad48a85c4011f2cb620fa96fd50645bd11263 [diff] [blame]

nvme_fc: fix ctrl create failures racing with workq items

commit cf25809bec2c7df4b45df5b2196845d9a4a3c89b upstream.

If there are errors during initial controller create, the transport
will teardown the partially initialized controller struct and free
the ctlr memory.  Trouble is - most of those errors can occur due
to asynchronous events happening such io timeouts and subsystem
connectivity failures. Those failures invoke async workq items to
reset the controller and attempt reconnect.  Those may be in progress
as the main thread frees the ctrl memory, resulting in NULL ptr oops.

Prevent this from happening by having the main ctrl failure thread
changing state to DELETING followed by synchronously cancelling any
pending queued work item. The change of state will prevent the
scheduling of resets or reconnect events.

Signed-off-by: James Smart <james.smart@broadcom.com>
Signed-off-by: Keith Busch <keith.busch@intel.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
Signed-off-by: Amit Pundir <amit.pundir@linaro.org>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

diff --git a/drivers/nvme/host/fc.c b/drivers/nvme/host/fc.c
index 7deb7b5..058d542 100644
--- a/drivers/nvme/host/fc.c
+++ b/drivers/nvme/host/fc.c

@@ -2868,6 +2868,10 @@
 	}
 
 	if (ret) {
+		nvme_change_ctrl_state(&ctrl->ctrl, NVME_CTRL_DELETING);
+		cancel_work_sync(&ctrl->ctrl.reset_work);
+		cancel_delayed_work_sync(&ctrl->connect_work);
+
 		/* couldn't schedule retry - fail out */
 		dev_err(ctrl->ctrl.device,
 			"NVME-FC{%d}: Connect retry failed\n", ctrl->cnum);