* Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
From: Martin Steigerwald @ 2018-03-13 13:08 UTC
To: Hans de Goede
Cc: Linux Kernel Mailing List, Thorsten Leemhuis, Tejun Heo,
linux-block, Ming Lei, Bart Van Assche
Hans de Goede - 11.03.18, 15:37:
> Hi Martin,
>
> On 11-03-18 09:20, Martin Steigerwald wrote:
> > Hello.
> >
> > Since 4.16-rc4 (upgraded from 4.15.2 which worked) I have an issue
> > with SMART checks occasionally failing like this:
> >
> > smartd[28017]: Device: /dev/sdb [SAT], is in SLEEP mode, suspending checks
> > udisksd[24408]: Error performing housekeeping for drive
> > /org/freedesktop/UDisks2/drives/INTEL_SSDSA2CW300G3_[…]: Error updating
> > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > data returned:#0120000: 0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50
> > 00 ..............P.#0120010: 00 00 00 00 00 00 00 00 00 00 00 00 00
> > 00 00 00 ................#012 (g-io-error-quark, 0) merkaba
> > udisksd[24408]: Error performing housekeeping for drive
> > /org/freedesktop/UDisks2/drives/Crucial_CT480M500SSD3_[…]: Error updating
> > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > data returned:#0120000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00
> > 00 ................#0120010: 00 00 00 00 50 00 00 00 00 00 00 00
> > 00 00 00 00 ....P...........#012 (g-io-error-quark, 0)
> >
> > (Intel SSD is connected via SATA, Crucial via mSATA in a ThinkPad T520)
> >
> > However when I then check manually with smartctl -a | -x | -H the device
> > reports SMART data just fine.
> >
> > As smartd correctly detects that the device is in sleep mode, this may be a
> > userspace issue in udisksd.
> >
> > Also at some boot attempts the boot hangs with a message like "could not
> > connect to lvmetad, scanning manually for devices". I use BTRFS RAID 1
> > on two LVs (each on one of the SSDs), a configuration that requires a
> > manual adaptation of the InitRAMFS in order to boot (basically vgchange -ay
> > before btrfs device scan).
> >
> > I wonder whether that has to do with the new SATA LPM policy stuff, but as
> > I had issues with
> >
> > 3 => Medium power with Device Initiated PM enabled
> >
> > (machine did not boot, which could also have been caused by me accidentally
> > removing all TCP/IP network support in the kernel with that setting)
> >
> > I set it back to
> >
> > CONFIG_SATA_MOBILE_LPM_POLICY=0
> >
> > (firmware settings)
>
> Right, so with that setting the LPM policy changes are effectively
> disabled and cannot explain your SMART issues.
Yes, I now got a photo of one of those boot failures I mentioned, and it seems
to be related to blk-mq, as the backtrace contains "blk_mq_terminate_expired".
I added the screenshot to my bug report.
[Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and
boot failures with blk_mq_terminate_expired in backtrace
https://bugzilla.kernel.org/show_bug.cgi?id=199077
Hans, I will test your LPM policy horkage for Crucial m500 patch at a later
time. I first wanted to add the photo of the boot failure to the bug report.
Ming and Bart, I added you to cc, cause I had to do with you about another
blk-mq report, please feel free to adapt.
Thanks,
--
Martin
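
For readers who want to inspect the sense bytes in the udisksd log lines
quoted above by hand, here is a minimal sketch. It is not part of the
thread. It assumes the dump contains a SAT "ATA Status Return" descriptor
(descriptor code 0x09, additional length 0x0c) and that the field offsets
are those given in the SAT specification. The function name is
hypothetical, and the trailing bytes of each dump are taken as zero, as
the log shows.

```python
def decode_ata_status_return(sense: bytes):
    """Find a SAT ATA Status Return descriptor and decode a few fields.

    Assumed layout, counted from the descriptor start:
    byte 0 = 0x09 (descriptor code), byte 1 = 0x0c (length),
    byte 3 = ERROR, byte 5 = COUNT (7:0), byte 12 = DEVICE, byte 13 = STATUS.
    """
    for i in range(len(sense) - 13):
        if sense[i] == 0x09 and sense[i + 1] == 0x0C:
            d = sense[i:i + 14]
            return {"error": d[3], "count": d[5], "device": d[12], "status": d[13]}
    return None  # no such descriptor in the buffer


# Bytes copied from the two log lines in the report (32 bytes each).
intel = bytes.fromhex(
    "0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50 00"
    "00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00"
)
crucial = bytes.fromhex(
    "01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00 00"
    "00 00 00 00 50 00 00 00 00 00 00 00 00 00 00 00"
)

for name, dump in (("Intel", intel), ("Crucial", crucial)):
    fields = decode_ata_status_return(dump)
    print(name, {k: hex(v) for k, v in fields.items()})
```

Under these assumptions both dumps decode to COUNT 0xff and STATUS 0x50.
For CHECK POWER MODE, ATA defines a COUNT of 0xff as "active or idle";
whether that matches what udisksd expected is not something this sketch
can tell.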
* Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
From: Ming Lei @ 2018-03-13 14:32 UTC
To: Martin Steigerwald
Cc: Hans de Goede, Linux Kernel Mailing List, Thorsten Leemhuis,
Tejun Heo, linux-block, Bart Van Assche, linux-scsi,
Martin K. Petersen, James E.J. Bottomley
On Tue, Mar 13, 2018 at 02:08:23PM +0100, Martin Steigerwald wrote:
> Hans de Goede - 11.03.18, 15:37:
> > Hi Martin,
> >
> > On 11-03-18 09:20, Martin Steigerwald wrote:
> > > Hello.
> > >
> > > Since 4.16-rc4 (upgraded from 4.15.2 which worked) I have an issue
> > > with SMART checks occasionally failing like this:
> > >
> > > smartd[28017]: Device: /dev/sdb [SAT], is in SLEEP mode, suspending checks
> > > udisksd[24408]: Error performing housekeeping for drive
> > > /org/freedesktop/UDisks2/drives/INTEL_SSDSA2CW300G3_[…]: Error updating
> > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > > data returned:#0120000: 0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50
> > > 00 ..............P.#0120010: 00 00 00 00 00 00 00 00 00 00 00 00 00
> > > 00 00 00 ................#012 (g-io-error-quark, 0) merkaba
> > > udisksd[24408]: Error performing housekeeping for drive
> > > /org/freedesktop/UDisks2/drives/Crucial_CT480M500SSD3_[…]: Error updating
> > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > > data returned:#0120000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00
> > > 00 ................#0120010: 00 00 00 00 50 00 00 00 00 00 00 00
> > > 00 00 00 00 ....P...........#012 (g-io-error-quark, 0)
> > >
> > > (Intel SSD is connected via SATA, Crucial via mSATA in a ThinkPad T520)
> > >
> > > However when I then check manually with smartctl -a | -x | -H the device
> > > reports SMART data just fine.
> > >
> > > As smartd correctly detects that the device is in sleep mode, this may be a
> > > userspace issue in udisksd.
> > >
> > > Also at some boot attempts the boot hangs with a message like "could not
> > > connect to lvmetad, scanning manually for devices". I use BTRFS RAID 1
> > > on two LVs (each on one of the SSDs), a configuration that requires a
> > > manual adaptation of the InitRAMFS in order to boot (basically vgchange -ay
> > > before btrfs device scan).
> > >
> > > I wonder whether that has to do with the new SATA LPM policy stuff, but as
> > > I had issues with
> > >
> > > 3 => Medium power with Device Initiated PM enabled
> > >
> > > (machine did not boot, which could also have been caused by me accidentally
> > > removing all TCP/IP network support in the kernel with that setting)
> > >
> > > I set it back to
> > >
> > > CONFIG_SATA_MOBILE_LPM_POLICY=0
> > >
> > > (firmware settings)
> >
> > Right, so with that setting the LPM policy changes are effectively
> > disabled and cannot explain your SMART issues.
>
> Yes, I now got a photo of one of those boot failures I mentioned, and it seems
> to be related to blk-mq, as the backtrace contains "blk_mq_terminate_expired".
>
> I added the screenshot to my bug report.
>
> [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and
> boot failures with blk_mq_terminate_expired in backtrace
> https://bugzilla.kernel.org/show_bug.cgi?id=199077
>
> Hans, I will test your LPM policy horkage for Crucial m500 patch at a later
> time. I first wanted to add the photo of the boot failure to the bug report.
>
> Ming and Bart, I added you to cc, cause I had to do with you about another
> blk-mq report, please feel free to adapt.
It looks like RIP points to scsi_times_out+0x17/0x1d0; maybe a SCSI regression?
Thanks,
Ming
* Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
From: Bart Van Assche @ 2018-03-13 14:56 UTC
To: martin@lichtvoll.de, ming.lei@redhat.com
Cc: linux-kernel@vger.kernel.org, linux-block@vger.kernel.org,
hdegoede@redhat.com, martin.petersen@oracle.com,
linux-scsi@vger.kernel.org, regressions@leemhuis.info,
tj@kernel.org, jejb@linux.vnet.ibm.com
On Tue, 2018-03-13 at 22:32 +0800, Ming Lei wrote:
> On Tue, Mar 13, 2018 at 02:08:23PM +0100, Martin Steigerwald wrote:
> > Ming and Bart, I added you to cc, cause I had to do with you about another
> > blk-mq report, please feel free to adapt.
>
> Looks RIP points to scsi_times_out+0x17/0x1d0, maybe a SCSI regression?

I think that it's much more likely that this is a block layer regression. See
e.g. "[PATCH v2] blk-mq: Fix race between resetting the timer and completion
handling" (https://www.mail-archive.com/linux-block@vger.kernel.org/msg18338.html).

Bart.