From: AlanCui4080 <me@alancui.cc>
To: Niklas Cassel <cassel@kernel.org>
Cc: linux-ide@vger.kernel.org
Subject: Re: Default IDENTIFY timeout is 5000ms which is too short for enterprise disks
Date: Thu, 23 Apr 2026 22:26:55 +0800 [thread overview]
Message-ID: <p9jLdz6RRGKjgu3OloMgWw@alancui.cc> (raw)
In-Reply-To: <aen_SQ-7fPfdAylr@ryzen>
Hi,
Thank you for your really really detailed response. And i was surprised by your
reply, hopefully that doesn't cost your time too much.
On Thursday, 23 April 2026 19:15,you wrote:
> My best guess is that it is a HDD firmware bug where the drive sometimes
> hangs after a STANDBY + COMRESET + IDENTIFY. Or claims to be ready before
> it is actually ready.
>
> It could of course also be a bug in e.g. ata_wait_ready(), and we are sending
> the IDENTIFY command too quickly after the COMRESET, but if that was the case,
> I think we would have seen way more bug reports from different vendors by now.
>
That's the same as what i guessed, the drive's firmware didn't implement the ATA
correctly, they either ignored the command or just don't want to reply. The vendor
do really not instrest about specification, they do even break the APM and changed
the meaning of standard SMART value.
I'm not a native speaker so the actually question that i want to ask is that
will kernel do quirk for those drives so let it just don't fail to avoid costing
extra 5 seconds and producing annoying erros on console?
> >
> > > So I think the question is, at this point, can you read from the drive?
> > >
> > > E.g.:
> > > # dd if=/dev/sda of=/dev/null iflag=direct bs=4K count=1
> >
> > I will be blocked out of the shell for 5 secs unless the IDENTIFY succeed.
>
> But as soon as you get a shell after a system resume, the above command
> succeeds, right?
Yes, that's correct.
>
> My suggestion is to look at the zpool code to see how long it waits to finds
> all devices after a system resume before it kicks devices off the RAID array.
>
> My initial feeling is that if your device is ready after 5 seconds after a
> system resume, then the timeout value for zpool to kick off a device must be
> very low.
I do believe that's about zpool, i migrated it into btrfs RAID now, it works very
well. And if you want to know, the timeout of TXG to commit is 5000ms also.
Alan
next prev parent reply other threads:[~2026-04-23 14:27 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-09 10:21 Default IDENTIFY timeout is 5000ms which is too short for enterprise disks AlanCui4080
2026-04-09 11:55 ` Damien Le Moal
2026-04-09 12:01 ` Damien Le Moal
2026-04-15 12:40 ` Niklas Cassel
2026-04-16 12:59 ` AlanCui4080
2026-04-20 16:27 ` Niklas Cassel
2026-04-23 9:18 ` AlanCui4080
2026-04-23 11:15 ` Niklas Cassel
2026-04-23 14:26 ` AlanCui4080 [this message]
2026-04-23 16:17 ` Niklas Cassel
2026-05-08 20:48 ` AlanCui4080
[not found] ` <14062658.dW097sEU6C@alanarchdesktop>
[not found] ` <4482b737-1454-48cb-a941-165aa84fb2eb@kernel.org>
2026-04-10 11:24 ` AlanCui4080
2026-04-10 12:14 ` AlanCui4080
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=p9jLdz6RRGKjgu3OloMgWw@alancui.cc \
--to=me@alancui.cc \
--cc=cassel@kernel.org \
--cc=linux-ide@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox