All of lore.kernel.org
 help / color / mirror / Atom feed
From: Christian <christiand59@web.de>
To: linux-kernel@vger.kernel.org
Subject: Re: ata2: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl
Date: Sun, 19 Nov 2006 15:19:50 +0100	[thread overview]
Message-ID: <200611191519.50163.christiand59@web.de> (raw)
In-Reply-To: <cb8795142da89.455f6345@shaw.ca>

Am Sonntag, 19. November 2006 01:47 schrieb ROBERT HANCOCK:
> Christian wrote:
> > During my I/O load test, after about half an hour of heavy I/O on three
> > SATAII disks the system suddenly hung for about 3 seconds. After that I
> > checked dmesg and found the following error output:
> >
> > [ 4574.193809] ata2: EH in ADMA mode, notifier 0x0 notifier_error 0x0
> > gen_ctl 0x1501000 status 0x400
> > [ 4574.193826] ata2: CPB 0: ctl_flags 0x1f, resp_flags 0x1
> > [ 4574.193835] ata2: CPB 1: ctl_flags 0x1f, resp_flags 0x2
>
> All this output is from the debugging code I have in the error handler in
> sata_nv for ADMA mode.
>
> > [ 4574.194366] ata2: Resetting port
> > [ 4574.194411] ata2.00: exception Emask 0x0 SAct 0x2 SErr 0x0 action 0x2
> > frozen
> > [ 4574.194453] ata2.00: tag 1 cmd 0x60 Emask 0x4 stat 0x40 err 0x0
> > (timeout)
>
> Hmm, it looks like the controller thinks the command has been sent to the
> drive and has "released" the command for the drive to do its thing, and
> hasn't received a response back yet. (At least that's what I believe bit 1
> in the response flags means..) This might not be the fault of the
> controller or driver, it might just be the drive not responding. Can you
> post some drive information (like full dmesg from bootup)? -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/

FYI:
My system config is one 400GB disk at sda, and two 250GB disks on a dmraid 
nvidia-fakeraid set of sdb and sdc.

My kernel message buffer gets quickly overrun by a flood of these error 
messages:

Nov 17 22:48:12 ubuntu kernel: [  119.566540] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.566602] sdb: rw=0, want=976784000, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.566661] printk: 62 messages suppressed.
Nov 17 22:48:12 ubuntu kernel: [  119.566719] Buffer I/O error on device sdb3, 
logical block 669380224
Nov 17 22:48:12 ubuntu kernel: [  119.566779] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.566836] sdb: rw=0, want=976784001, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.566892] Buffer I/O error on device sdb3, 
logical block 669380225
Nov 17 22:48:12 ubuntu kernel: [  119.566951] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.567009] sdb: rw=0, want=976784002, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.567066] Buffer I/O error on device sdb3, 
logical block 669380226
Nov 17 22:48:12 ubuntu kernel: [  119.567124] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.567184] sdb: rw=0, want=976784003, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.567241] Buffer I/O error on device sdb3, 
logical block 669380227
Nov 17 22:48:12 ubuntu kernel: [  119.567299] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.567400] sdb: rw=0, want=976784004, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.567457] Buffer I/O error on device sdb3, 
logical block 669380228
Nov 17 22:48:12 ubuntu kernel: [  119.567515] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.567572] sdb: rw=0, want=976784005, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.567629] Buffer I/O error on device sdb3, 
logical block 669380229
Nov 17 22:48:12 ubuntu kernel: [  119.567687] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.567744] sdb: rw=0, want=976784006, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.567800] Buffer I/O error on device sdb3, 
logical block 669380230
Nov 17 22:48:12 ubuntu kernel: [  119.567868] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.567925] sdb: rw=0, want=976784007, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.567982] Buffer I/O error on device sdb3, 
logical block 669380231
Nov 17 22:48:12 ubuntu kernel: [  119.568042] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.568098] sdb: rw=0, want=976784000, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.568159] Buffer I/O error on device sdb3, 
logical block 669380224
Nov 17 22:48:12 ubuntu kernel: [  119.568217] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.568274] sdb: rw=0, want=976784001, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [  119.568332] Buffer I/O error on device sdb3, 
logical block 669380225
Nov 17 22:48:12 ubuntu kernel: [  119.568390] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [  119.568447] sdb: rw=0, want=976784002, 
limit=488397168


Here is the relevant content of /var/log/kern.log:

Nov 17 22:48:12 ubuntu kernel: Inspecting /boot/System.map-2.6.19-rc5-mm1
Nov 17 22:48:12 ubuntu kernel: Loaded 26134 symbols 
from /boot/System.map-2.6.19-rc5-mm1.
Nov 17 22:48:12 ubuntu kernel: Symbols match kernel version 2.6.19.
Nov 17 22:48:12 ubuntu kernel: No module symbols loaded - kernel modules not 
enabled.
Nov 17 22:48:12 ubuntu kernel:   50.068747] scsi2 : sata_nv
Nov 17 22:48:12 ubuntu kernel: [   50.521057] ata1: SATA link up 3.0 Gbps 
(SStatus 123 SControl 300)
Nov 17 22:48:12 ubuntu kernel: [   50.522095] ata3: SATA link up 3.0 Gbps 
(SStatus 123 SControl 300)
Nov 17 22:48:12 ubuntu kernel: [   50.525228] ata1.00: ATA-7, max UDMA7, 
781422768 sectors: LBA48 NCQ (depth 31/32)
Nov 17 22:48:12 ubuntu kernel: [   50.525319] ata1.00: ata1: dev 0 multi count 
16
Nov 17 22:48:12 ubuntu kernel: [   50.529045] ata3.00: ATA-7, max UDMA7, 
488397168 sectors: LBA48 NCQ (depth 31/32)
Nov 17 22:48:12 ubuntu kernel: [   50.529136] ata3.00: ata3: dev 0 multi count 
16
Nov 17 22:48:12 ubuntu kernel: [   50.556623] ata1.00: configured for UDMA/133
Nov 17 22:48:12 ubuntu kernel: [   50.557059] scsi1 : sata_nv
Nov 17 22:48:12 ubuntu kernel: [   50.562842] ata3.00: configured for UDMA/133
Nov 17 22:48:12 ubuntu kernel: [   50.562942] scsi3 : sata_nv
Nov 17 22:48:12 ubuntu kernel: [   50.859873] ata2: SATA link down (SStatus 0 
SControl 300)
Nov 17 22:48:12 ubuntu kernel: [   50.860017] scsi 0:0:0:0: Direct-Access     
ATA      SAMSUNG HD401LJ  ZZ10 PQ: 0 ANSI: 5
Nov 17 22:48:12 ubuntu kernel: [   50.860110] ata1: bounce limit 
0xFFFFFFFFFFFFFFFF, segment boundary 0xFFFFFFFF, hw segs 61
Nov 17 22:48:12 ubuntu kernel: [   50.863680] SCSI device sda: 781422768 
512-byte hdwr sectors (400088 MB)
Nov 17 22:48:12 ubuntu kernel: [   50.863750] sda: Write Protect is off
Nov 17 22:48:12 ubuntu kernel: [   50.863806] sda: Mode Sense: 00 3a 00 00
Nov 17 22:48:12 ubuntu kernel: [   50.863816] SCSI device sda: drive cache: 
write back
Nov 17 22:48:12 ubuntu kernel: [   50.863909] SCSI device sda: 781422768 
512-byte hdwr sectors (400088 MB)
Nov 17 22:48:12 ubuntu kernel: [   50.863973] sda: Write Protect is off
Nov 17 22:48:12 ubuntu kernel: [   50.864029] sda: Mode Sense: 00 3a 00 00
Nov 17 22:48:12 ubuntu kernel: [   50.864039] SCSI device sda: drive cache: 
write back
Nov 17 22:48:12 ubuntu kernel: [   50.864098]  sda: sda1
Nov 17 22:48:12 ubuntu kernel: [   50.887940] sd 0:0:0:0: Attached scsi disk 
sda
Nov 17 22:48:12 ubuntu kernel: [   51.016792] ata4: SATA link up 3.0 Gbps 
(SStatus 123 SControl 300)
Nov 17 22:48:12 ubuntu kernel: [   51.019832] ata4.00: ATA-7, max UDMA7, 
488397168 sectors: LBA48 NCQ (depth 31/32)
Nov 17 22:48:12 ubuntu kernel: [   51.019923] ata4.00: ata4: dev 0 multi count 
16
Nov 17 22:48:12 ubuntu kernel: [   51.062341] ata4.00: configured for UDMA/133
Nov 17 22:48:12 ubuntu kernel: [   51.062470] scsi 2:0:0:0: Direct-Access     
ATA      SAMSUNG SP2504C  VT10 PQ: 0 ANSI: 5
Nov 17 22:48:12 ubuntu kernel: [   51.062563] ata3: bounce limit 
0xFFFFFFFFFFFFFFFF, segment boundary 0xFFFFFFFF, hw segs 61
Nov 17 22:48:12 ubuntu kernel: [   51.063186] SCSI device sdb: 488397168 
512-byte hdwr sectors (250059 MB)
Nov 17 22:48:12 ubuntu kernel: [   51.063257] sdb: Write Protect is off
Nov 17 22:48:12 ubuntu kernel: [   51.063312] sdb: Mode Sense: 00 3a 00 00
Nov 17 22:48:12 ubuntu kernel: [   51.063323] SCSI device sdb: drive cache: 
write back
Nov 17 22:48:12 ubuntu kernel: [   51.063413] SCSI device sdb: 488397168 
512-byte hdwr sectors (250059 MB)
Nov 17 22:48:12 ubuntu kernel: [   51.063475] sdb: Write Protect is off
Nov 17 22:48:12 ubuntu kernel: [   51.063530] sdb: Mode Sense: 00 3a 00 00
Nov 17 22:48:12 ubuntu kernel: [   51.063539] SCSI device sdb: drive cache: 
write back
Nov 17 22:48:12 ubuntu kernel: [   51.063601]  sdb: sdb1 sdb2 sdb3
Nov 17 22:48:12 ubuntu kernel: [   51.072794]  sdb: p3 exceeds device capacity
Nov 17 22:48:12 ubuntu kernel: [   51.072895] sd 2:0:0:0: Attached scsi disk 
sdb
Nov 17 22:48:12 ubuntu kernel: [   51.073439] scsi 3:0:0:0: Direct-Access     
ATA      SAMSUNG SP2504C  VT10 PQ: 0 ANSI: 5
Nov 17 22:48:12 ubuntu kernel: [   51.073532] ata4: bounce limit 
0xFFFFFFFFFFFFFFFF, segment boundary 0xFFFFFFFF, hw segs 61
Nov 17 22:48:12 ubuntu kernel: [   51.074110] SCSI device sdc: 488397168 
512-byte hdwr sectors (250059 MB)
Nov 17 22:48:12 ubuntu kernel: [   51.074180] sdc: Write Protect is off
Nov 17 22:48:12 ubuntu kernel: [   51.074235] sdc: Mode Sense: 00 3a 00 00
Nov 17 22:48:12 ubuntu kernel: [   51.074245] SCSI device sdc: drive cache: 
write back
Nov 17 22:48:12 ubuntu kernel: [   51.074334] SCSI device sdc: 488397168 
512-byte hdwr sectors (250059 MB)
Nov 17 22:48:12 ubuntu kernel: [   51.074396] sdc: Write Protect is off
Nov 17 22:48:12 ubuntu kernel: [   51.074451] sdc: Mode Sense: 00 3a 00 00
Nov 17 22:48:12 ubuntu kernel: [   51.074461] SCSI device sdc: drive cache: 
write back
Nov 17 22:48:12 ubuntu kernel: [   51.074518]  sdc: unknown partition table
Nov 17 22:48:12 ubuntu kernel: [   51.085297] sd 3:0:0:0: Attached scsi disk 
sdc
Nov 17 22:48:12 ubuntu kernel: [   51.124775] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [   51.124836] sdb: rw=0, want=976784000, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [   51.124894] Buffer I/O error on device sdb3, 
logical block 669380224
Nov 17 22:48:12 ubuntu kernel: [   51.124955] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [   51.125012] sdb: rw=0, want=976784001, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [   51.125068] Buffer I/O error on device sdb3, 
logical block 669380225
Nov 17 22:48:12 ubuntu kernel: [   51.125126] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [   51.125183] sdb: rw=0, want=976784002, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [   51.125239] Buffer I/O error on device sdb3, 
logical block 669380226
Nov 17 22:48:12 ubuntu kernel: [   51.125297] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [   51.125354] sdb: rw=0, want=976784003, 
limit=488397168
Nov 17 22:48:12 ubuntu kernel: [   51.125410] Buffer I/O error on device sdb3, 
logical block 669380227
Nov 17 22:48:12 ubuntu kernel: [   51.125468] attempt to access beyond end of 
device
Nov 17 22:48:12 ubuntu kernel: [   51.125524] sdb: rw=0, want=976784004, 
limit=488397168


user@ubuntu:~$ sudo hdparm -I /dev/sd[a-c]

/dev/sda:

ATA device, with non-removable media
        Model Number:       SAMSUNG HD401LJ
        Serial Number:      S0HVJ1FL900207
        Firmware Revision:  ZZ100-15
Standards:
        Used: ATA/ATAPI-7 T13 1532D revision 4a
        Supported: 7 6 5 4
Configuration:
        Logical         max     current
        cylinders       16383   16383
        heads           16      16
        sectors/track   63      63
        --
        CHS current addressable sectors:   16514064
        LBA    user addressable sectors:  268435455
        LBA48  user addressable sectors:  781422768
        device size with M = 1024*1024:      381554 MBytes
        device size with M = 1000*1000:      400088 MBytes (400 GB)
Capabilities:
        LBA, IORDY(can be disabled)
        Queue depth: 32
        Standby timer values: spec'd by Standard, no device specific minimum
        R/W multiple sector transfer: Max = 16  Current = 16
        Recommended acoustic management value: 254, current value: 0
        DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
             Cycle time: min=120ns recommended=120ns
        PIO: pio0 pio1 pio2 pio3 pio4
             Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
        Enabled Supported:
           *    SMART feature set
                Security Mode feature set
           *    Power Management feature set
           *    Write cache
           *    Look-ahead
           *    Host Protected Area feature set
           *    WRITE_BUFFER command
           *    READ_BUFFER command
           *    NOP cmd
           *    DOWNLOAD_MICROCODE
                SET_MAX security extension
                Automatic Acoustic Management feature set
           *    48-bit Address feature set
           *    Device Configuration Overlay feature set
           *    Mandatory FLUSH_CACHE
           *    FLUSH_CACHE_EXT
           *    SMART error logging
           *    SMART self-test
           *    General Purpose Logging feature set
           *    SATA-I signaling speed (1.5Gb/s)
           *    SATA-II signaling speed (3.0Gb/s)
           *    Native Command Queueing (NCQ)
           *    Host-initiated interface power management
           *    Phy event counters
                DMA Setup Auto-Activate optimization
                Device-initiated interface power management
           *    Software settings preservation
Security:
        Master password revision code = 65534
                supported
        not     enabled
        not     locked
                frozen
        not     expired: security count
                supported: enhanced erase
        228min for SECURITY ERASE UNIT. 228min for ENHANCED SECURITY ERASE 
UNIT.
Checksum: correct

/dev/sdb:

ATA device, with non-removable media
        Model Number:       SAMSUNG SP2504C
        Serial Number:      S09QJ1LYC06381
        Firmware Revision:  VT100-33
Standards:
        Used: ATA/ATAPI-7 T13 1532D revision 4a
        Supported: 7 6 5 4
Configuration:
        Logical         max     current
        cylinders       16383   16383
        heads           16      16
        sectors/track   63      63
        --
        CHS current addressable sectors:   16514064
        LBA    user addressable sectors:  268435455
        LBA48  user addressable sectors:  488397168
        device size with M = 1024*1024:      238475 MBytes
        device size with M = 1000*1000:      250059 MBytes (250 GB)
Capabilities:
        LBA, IORDY(can be disabled)
        Queue depth: 32
        Standby timer values: spec'd by Standard, no device specific minimum
        R/W multiple sector transfer: Max = 16  Current = 16
        Recommended acoustic management value: 254, current value: 254
        DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
             Cycle time: min=120ns recommended=120ns
        PIO: pio0 pio1 pio2 pio3 pio4
             Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
        Enabled Supported:
           *    SMART feature set
                Security Mode feature set
           *    Power Management feature set
           *    Write cache
           *    Look-ahead
           *    Host Protected Area feature set
           *    WRITE_BUFFER command
           *    READ_BUFFER command
           *    NOP cmd
           *    DOWNLOAD_MICROCODE
                SET_MAX security extension
           *    Automatic Acoustic Management feature set
           *    48-bit Address feature set
           *    Device Configuration Overlay feature set
           *    Mandatory FLUSH_CACHE
           *    FLUSH_CACHE_EXT
           *    SMART error logging
           *    SMART self-test
           *    General Purpose Logging feature set
           *    SATA-I signaling speed (1.5Gb/s)
           *    SATA-II signaling speed (3.0Gb/s)
           *    Native Command Queueing (NCQ)
           *    Host-initiated interface power management
           *    Phy event counters
                DMA Setup Auto-Activate optimization
                Device-initiated interface power management
           *    Software settings preservation
Security:
        Master password revision code = 65534
                supported
        not     enabled
        not     locked
                frozen
        not     expired: security count
                supported: enhanced erase
        120min for SECURITY ERASE UNIT. 120min for ENHANCED SECURITY ERASE 
UNIT.
Checksum: correct

/dev/sdc:

ATA device, with non-removable media
        Model Number:       SAMSUNG SP2504C
        Serial Number:      S09QJ10L420645
        Firmware Revision:  VT100-41
Standards:
        Used: ATA/ATAPI-7 T13 1532D revision 4a
        Supported: 7 6 5 4
Configuration:
        Logical         max     current
        cylinders       16383   16383
        heads           16      16
        sectors/track   63      63
        --
        CHS current addressable sectors:   16514064
        LBA    user addressable sectors:  268435455
        LBA48  user addressable sectors:  488397168
        device size with M = 1024*1024:      238475 MBytes
        device size with M = 1000*1000:      250059 MBytes (250 GB)
Capabilities:
        LBA, IORDY(can be disabled)
        Queue depth: 32
        Standby timer values: spec'd by Standard, no device specific minimum
        R/W multiple sector transfer: Max = 16  Current = 16
        Recommended acoustic management value: 254, current value: 0
        DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
             Cycle time: min=120ns recommended=120ns
        PIO: pio0 pio1 pio2 pio3 pio4
             Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
        Enabled Supported:
                SMART feature set
                Security Mode feature set
           *    Power Management feature set
           *    Write cache
           *    Look-ahead
           *    Host Protected Area feature set
           *    WRITE_BUFFER command
           *    READ_BUFFER command
           *    NOP cmd
           *    DOWNLOAD_MICROCODE
                SET_MAX security extension
                Automatic Acoustic Management feature set
           *    48-bit Address feature set
           *    Device Configuration Overlay feature set
           *    Mandatory FLUSH_CACHE
           *    FLUSH_CACHE_EXT
           *    SMART error logging
           *    SMART self-test
           *    General Purpose Logging feature set
           *    Segmented DOWNLOAD_MICROCODE
           *    SATA-I signaling speed (1.5Gb/s)
           *    SATA-II signaling speed (3.0Gb/s)
           *    Native Command Queueing (NCQ)
           *    Host-initiated interface power management
           *    Phy event counters
                DMA Setup Auto-Activate optimization
                Device-initiated interface power management
           *    Software settings preservation
Security:
        Master password revision code = 65534
                supported
        not     enabled
        not     locked
                frozen
        not     expired: security count
                supported: enhanced erase
        88min for SECURITY ERASE UNIT. 88min for ENHANCED SECURITY ERASE UNIT.
Checksum: correct


Hope that helps!


p.s:
Why does the kernel report a queue depth of 31/32, but hdparm says its 32? Is 
this correct?

ata1.00: ATA-7, max UDMA7, 781422768 sectors: LBA48 NCQ (depth 31/32)

-Christian

  reply	other threads:[~2006-11-19 14:20 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-11-19  0:47 ata2: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl ROBERT HANCOCK
2006-11-19 14:19 ` Christian [this message]
2006-11-19 16:18 ` ROBERT HANCOCK
  -- strict thread matches above, loose matches on Subject: below --
2006-11-18 13:36 Christian

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200611191519.50163.christiand59@web.de \
    --to=christiand59@web.de \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.