* Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
  [not found] ` <f6e73c8e-1186-744b-c856-2c56ceeeab10@redhat.com>
From: Martin Steigerwald @ 2018-03-13 13:08 UTC
To: Hans de Goede
Cc: Linux Kernel Mailing List, Thorsten Leemhuis, Tejun Heo, linux-block, Ming Lei, Bart Van Assche

Hans de Goede - 11.03.18, 15:37:
> Hi Martin,
>
> On 11-03-18 09:20, Martin Steigerwald wrote:
> > Hello.
> >
> > Since 4.16-rc4 (upgraded from 4.15.2 which worked) I have an issue
> > with SMART checks occasionally failing like this:
> >
> > smartd[28017]: Device: /dev/sdb [SAT], is in SLEEP mode, suspending checks
> > udisksd[24408]: Error performing housekeeping for drive
> > /org/freedesktop/UDisks2/drives/INTEL_SSDSA2CW300G3_[…]: Error updating
> > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > data returned:#0120000: 0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50
> > 00 ..............P.#0120010: 00 00 00 00 00 00 00 00 00 00 00 00 00
> > 00 00 00 ................#012 (g-io-error-quark, 0)
> > merkaba udisksd[24408]: Error performing housekeeping for drive
> > /org/freedesktop/UDisks2/drives/Crucial_CT480M500SSD3_[…]: Error updating
> > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > data returned:#0120000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00
> > 00 ................#0120010: 00 00 00 00 50 00 00 00 00 00 00 00
> > 00 00 00 00 ....P...........#012 (g-io-error-quark, 0)
> >
> > (Intel SSD is connected via SATA, Crucial via mSATA in a ThinkPad T520)
> >
> > However, when I then check manually with smartctl -a | -x | -H, the device
> > reports SMART data just fine.
> >
> > As smartd correctly detects that the device is in sleep mode, this may be
> > a userspace issue in udisksd.
> >
> > Also, at some boot attempts the boot hangs with a message like "could not
> > connect to lvmetad, scanning manually for devices". I use BTRFS RAID 1
> > on two LVs (each on one of the SSDs), a configuration that requires a
> > manual adaptation of the initramfs in order to boot (basically
> > vgchange -ay before btrfs device scan).
> >
> > I wonder whether that has to do with the new SATA LPM policy stuff, but as
> > I had issues with
> >
> > 3 => Medium power with Device Initiated PM enabled
> >
> > (machine did not boot, which could also have been caused by me
> > accidentally removing all TCP/IP network support in the kernel with
> > that setting)
> >
> > I set it back to
> >
> > CONFIG_SATA_MOBILE_LPM_POLICY=0
> >
> > (firmware settings)
>
> Right, so at that setting the LPM policy changes are effectively
> disabled and cannot explain your SMART issues.

Yes. I now got a photo of one of those boot failures I mentioned, and it seems
to be related to blk-mq, as the backtrace contains "blk_mq_terminate_expired".

I am adding the screenshot to my bug report:

[Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and
boot failures with blk_mq_terminate_expired in backtrace
https://bugzilla.kernel.org/show_bug.cgi?id=199077

Hans, I will test your LPM policy horkage for Crucial m500 patch at a later
time. I first wanted to add the photo of the boot failure to the bug report.

Ming and Bart, I added you to Cc because I dealt with you on another
blk-mq report; please feel free to adapt.

Thanks,
--
Martin
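The sense dumps quoted above can be read by hand. The following is a minimal sketch, assuming that the byte pair 09 0c visible in both dumps marks a SAT "ATA Status Return" descriptor (descriptor code 0x09, additional length 0x0c); nothing in the thread confirms this reading. The helper names find_ata_status_descriptor, decode and POWER_MODES are hypothetical and exist only for this illustration.

```python
# Illustrative sketch only: decode an ATA Status Return descriptor
# (assumed layout per SAT: code 0x09, length 0x0c) from a hex dump of
# sense data such as the one udisksd logged. All names are hypothetical.

# CHECK POWER MODE reports the power state in the COUNT field (assumption
# based on the ATA command set; only the common values are listed here).
POWER_MODES = {0x00: "STANDBY", 0x80: "IDLE", 0xFF: "ACTIVE or IDLE"}


def find_ata_status_descriptor(data: bytes):
    """Return the 14-byte descriptor starting at '09 0c', or None."""
    for i in range(len(data) - 13):
        if data[i] == 0x09 and data[i + 1] == 0x0C:
            return data[i:i + 14]
    return None


def decode(hex_dump: str) -> dict:
    """Extract the ATA register values from a space-separated hex dump."""
    data = bytes.fromhex(hex_dump)
    desc = find_ata_status_descriptor(data)
    if desc is None:
        raise ValueError("no ATA Status Return descriptor found")
    count = desc[5]  # COUNT (7:0)
    return {
        "error": desc[3],    # ATA ERROR register
        "count": count,
        "device": desc[12],  # ATA DEVICE register
        "status": desc[13],  # ATA STATUS register
        "power_mode": POWER_MODES.get(count, "unknown"),
    }


if __name__ == "__main__":
    # First 16 bytes of the dump logged for the Intel SSD.
    print(decode("0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50 00"))
```

Under these assumptions, the Intel dump decodes to COUNT 0xff and STATUS 0x50 with ERROR 0x00, which would be consistent with the drive answering CHECK POWER MODE normally. Whether that is what udisksd actually received is a question for the bug report, not something this sketch can settle.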
* Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
From: Ming Lei @ 2018-03-13 14:32 UTC
To: Martin Steigerwald
Cc: Hans de Goede, Linux Kernel Mailing List, Thorsten Leemhuis, Tejun Heo, linux-block, Bart Van Assche, linux-scsi, Martin K. Petersen, James E.J. Bottomley

On Tue, Mar 13, 2018 at 02:08:23PM +0100, Martin Steigerwald wrote:
> Hans de Goede - 11.03.18, 15:37:
> > Hi Martin,
> >
> > On 11-03-18 09:20, Martin Steigerwald wrote:
> > > Hello.
> > >
> > > Since 4.16-rc4 (upgraded from 4.15.2 which worked) I have an issue
> > > with SMART checks occasionally failing like this:
> > >
> > > smartd[28017]: Device: /dev/sdb [SAT], is in SLEEP mode, suspending checks
> > > udisksd[24408]: Error performing housekeeping for drive
> > > /org/freedesktop/UDisks2/drives/INTEL_SSDSA2CW300G3_[…]: Error updating
> > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > > data returned:#0120000: 0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50
> > > 00 ..............P.#0120010: 00 00 00 00 00 00 00 00 00 00 00 00 00
> > > 00 00 00 ................#012 (g-io-error-quark, 0)
> > > merkaba udisksd[24408]: Error performing housekeeping for drive
> > > /org/freedesktop/UDisks2/drives/Crucial_CT480M500SSD3_[…]: Error updating
> > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > > data returned:#0120000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00
> > > 00 ................#0120010: 00 00 00 00 50 00 00 00 00 00 00 00
> > > 00 00 00 00 ....P...........#012 (g-io-error-quark, 0)
> > >
> > > (Intel SSD is connected via SATA, Crucial via mSATA in a ThinkPad T520)
> > >
> > > However, when I then check manually with smartctl -a | -x | -H, the device
> > > reports SMART data just fine.
> > >
> > > As smartd correctly detects that the device is in sleep mode, this may be
> > > a userspace issue in udisksd.
> > >
> > > Also, at some boot attempts the boot hangs with a message like "could not
> > > connect to lvmetad, scanning manually for devices". I use BTRFS RAID 1
> > > on two LVs (each on one of the SSDs), a configuration that requires a
> > > manual adaptation of the initramfs in order to boot (basically
> > > vgchange -ay before btrfs device scan).
> > >
> > > I wonder whether that has to do with the new SATA LPM policy stuff, but as
> > > I had issues with
> > >
> > > 3 => Medium power with Device Initiated PM enabled
> > >
> > > (machine did not boot, which could also have been caused by me
> > > accidentally removing all TCP/IP network support in the kernel with
> > > that setting)
> > >
> > > I set it back to
> > >
> > > CONFIG_SATA_MOBILE_LPM_POLICY=0
> > >
> > > (firmware settings)
> >
> > Right, so at that setting the LPM policy changes are effectively
> > disabled and cannot explain your SMART issues.
>
> Yes. I now got a photo of one of those boot failures I mentioned, and it seems
> to be related to blk-mq, as the backtrace contains "blk_mq_terminate_expired".
>
> I am adding the screenshot to my bug report:
>
> [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and
> boot failures with blk_mq_terminate_expired in backtrace
> https://bugzilla.kernel.org/show_bug.cgi?id=199077
>
> Hans, I will test your LPM policy horkage for Crucial m500 patch at a later
> time. I first wanted to add the photo of the boot failure to the bug report.
>
> Ming and Bart, I added you to Cc because I dealt with you on another
> blk-mq report; please feel free to adapt.

It looks like RIP points to scsi_times_out+0x17/0x1d0, so maybe a SCSI regression?

Thanks,
Ming
* Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
From: Bart Van Assche @ 2018-03-13 14:56 UTC
To: martin@lichtvoll.de, ming.lei@redhat.com
Cc: linux-kernel@vger.kernel.org, linux-block@vger.kernel.org, hdegoede@redhat.com, martin.petersen@oracle.com, linux-scsi@vger.kernel.org, regressions@leemhuis.info, tj@kernel.org, jejb@linux.vnet.ibm.com

On Tue, 2018-03-13 at 22:32 +0800, Ming Lei wrote:
> On Tue, Mar 13, 2018 at 02:08:23PM +0100, Martin Steigerwald wrote:
> > Ming and Bart, I added you to Cc because I dealt with you on another
> > blk-mq report; please feel free to adapt.
>
> It looks like RIP points to scsi_times_out+0x17/0x1d0, so maybe a SCSI regression?

I think that it's much more likely that this is a block layer regression. See
e.g. "[PATCH v2] blk-mq: Fix race between resetting the timer and completion
handling" (https://www.mail-archive.com/linux-block@vger.kernel.org/msg18338.html).

Bart.