From mboxrd@z Thu Jan 1 00:00:00 1970 From: Bart Van Assche Subject: Re: [PATCH] multipathd: check and cleanup zombie paths Date: Tue, 20 Mar 2018 15:14:14 +0000 Message-ID: <1521558853.3156.10.camel@wdc.com> References: <1520325779.4131.4.camel@suse.com> <1520349519.4131.20.camel@suse.com> <1520426679.11340.5.camel@suse.com> <20180308154435.GB14513@octiron.msp.redhat.com> <20180309162243.GE14513@octiron.msp.redhat.com> <1521495726.3798.141.camel@suse.com> <1521557887.3156.5.camel@wdc.com> <3537d472-27ad-53fd-7d13-229359a53849@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <3537d472-27ad-53fd-7d13-229359a53849@gmail.com> Content-Language: en-US Content-ID: <88D95E8E192BFF4B8F221AE651FAD5CB@namprd04.prod.outlook.com> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: dm-devel-bounces@redhat.com Errors-To: dm-devel-bounces@redhat.com To: "xose.vazquez@gmail.com" , "bmarzins@redhat.com" , "wu.chongyun@h3c.com" , "mwilck@suse.com" Cc: "guozhonghua@h3c.com" , "dm-devel@redhat.com" , "changlimin@h3c.com" , "ge.changwei@h3c.com" List-Id: dm-devel.ids On Tue, 2018-03-20 at 16:12 +0100, Xose Vazquez Perez wrote: > On 03/20/2018 03:58 PM, Bart Van Assche wrote: > > > It is on purpose that the SCSI core does not remove stale SCSI device nodes. > > If you want that these stale SCSI device nodes get removed automatically, > > two possible approaches are (there might be other approaches): > > * Write a new user space daemon that periodically checks for stale devices > > (e.g. by running grep -aH . /sys/class/scsi_device/*/*/state | > > grep -v running) and that triggers a SCSI rescan if any stale devices are > > found. > > * Write a udev rule that listens for SDEV_UA=REPORTED_LUNS_DATA_HAS_CHANGED > > and that triggers a SCSI rescan if this event is triggered by the kernel. > > There are some "remove" flags in rescan-scsi-bus.sh: > https://github.com/hreinecke/sg3_utils/blob/d4dbbede04db21c206e4c2acc1cf766117f003c3/scripts/rescan-scsi-bus.sh#L1080 > > -r enables removing of devices [default: disabled] > --forceremove: Remove stale devices (DANGEROUS) > --forcerescan: Remove and readd existing devices (DANGEROUS) Last time I checked the rescan-scsi-bus.sh script relied on the SCSI sysfs delete attribute to remove stale devices. That is the mechanism that can trigger a deadlock in the kernel. Bart.