public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: George Georgalis <george@galis.org>
To: Linux Kernel Mail List <linux-kernel@vger.kernel.org>
Subject: SATA_SIL works with 2.6.7-bk8 seagate drive, but oops
Date: Fri, 25 Jun 2004 17:34:33 -0400	[thread overview]
Message-ID: <20040625213433.GB6502@trot.local> (raw)
In-Reply-To: <Pine.GSO.4.33.0406241442430.25702-100000@sweetums.bluetronic.net>

[-- Attachment #1: Type: text/plain, Size: 4165 bytes --]

On Thu, Jun 24, 2004 at 02:46:39PM -0400, Ricky Beam wrote:
>On Thu, 24 Jun 2004, George Georgalis wrote:
>...
>>has caused pdflush to block IO, any access to /mnt and the process
>>does not return. other than the pdflush load of ~99% the box seems to
>>function normally. 2.6.7-bk6, seagate drive
>
>-bk6 is not new enough.  bk7 has the necessary max_sectors fix.  You
>may need to add your drive model to the sil_blacklist in
>drivers/scsi/sata_sil.c.

Okay, 2.6.7-bk8 has written 8Gb to the sda4 with SATA_SIL and still
going strong! "dd if=/dev/zero of=/mnt/zero-`date +%s`"

However at about 3Gb (if that is relevant) top segfaulted with a
non critical oops. top will not restart, but the box is otherwise
functioning well considering the write load.

Is there any way to determine the drive model without first connecting
with the other sata driver (as hdc) and using hdparm?


Unable to handle kernel NULL pointer dereference at virtual address 000000b4
 printing eip:
c017c78a
*pde = 00000000
Oops: 0000 [#1]
PREEMPT 
CPU:    0
EIP:    0060:[<c017c78a>]    Not tainted
EFLAGS: 00010286   (2.6.7-sta-bk8) 
EIP is at pid_alive+0xa/0x30
eax: 000000b8   ebx: d32b0310   ecx: 00000000   edx: 00000000
esi: 00000000   edi: ef7bb7a0   ebp: d22b1b40   esp: db473e4c
ds: 007b   es: 007b   ss: 0068
Process top (pid: 489, threadinfo=db472000 task=e60ac7c0)
Stack: c017cca4 00000000 d22b1b40 db473f18 ef7bb7a0 db473ec4 c0159754 d22b1b40 
       db473f18 eaa1f006 eaa1f009 db473ec4 db473f18 c0159cc5 db473f18 db473ecc 
       db473ec4 ef7b86e0 d22b1dfc ee655240 bffff000 c0141ec8 c15cd660 c013e95c 
Call Trace:
 [<c017cca4>] pid_revalidate+0x14/0xc0
 [<c0159754>] do_lookup+0x44/0x80
 [<c0159cc5>] link_path_walk+0x535/0xa20
 [<c0141ec8>] find_extend_vma+0x18/0x70
 [<c013e95c>] follow_page+0x8c/0xb0
 [<c013ea3c>] get_user_pages+0xbc/0x3d0
 [<c015a406>] path_lookup+0x86/0x1a0
 [<c015a6a9>] __user_walk+0x39/0x70
 [<c0155a95>] vfs_stat+0x15/0x60
 [<c02445dd>] copy_to_user+0x2d/0x40
 [<c0156151>] sys_stat64+0x11/0x30
 [<c014dcbd>] __fput+0x8d/0xf0
 [<c014c6c3>] filp_close+0x43/0x70
 [<c014c744>] sys_close+0x54/0x80
 [<c0105dc7>] syscall_call+0x7/0xb




Could this be related to "Unknown HZ value! (91) Assume 100." which
started showing up with VIA motherboards on 2.5.x (I think) on top or ps
commands.  When I researched it before, It never caused ill, had been
identified as a "kernel bug" but benign. I know nothing more.

ATM, ps also seg faults, here is a corresponding oops,

 <1>Unable to handle kernel NULL pointer dereference at virtual address 000000b4
 printing eip:
c017c78a
*pde = 00000000
Oops: 0000 [#5]
PREEMPT 
CPU:    0
EIP:    0060:[<c017c78a>]    Not tainted
EFLAGS: 00010286   (2.6.7-sta-bk8) 
EIP is at pid_alive+0xa/0x30
eax: 000000b8   ebx: d32b0310   ecx: 00000000   edx: 00000000
esi: 00000000   edi: ef7bb7a0   ebp: d22b1b40   esp: ecc59e4c
ds: 007b   es: 007b   ss: 0068
Process ps (pid: 3456, threadinfo=ecc58000 task=e60ac7c0)
Stack: c017cca4 00000000 d22b1b40 ecc59f18 ef7bb7a0 ecc59ec4 c0159754 d22b1b40 
       ecc59f18 cf499006 cf499009 ecc59ec4 ecc59f18 c0159cc5 ecc59f18 ecc59ecc 
       ecc59ec4 ef7b86e0 d22b1dfc ee655240 bffff000 c0141ec8 c15cd660 c013e95c 
Call Trace:
 [<c017cca4>] pid_revalidate+0x14/0xc0
 [<c0159754>] do_lookup+0x44/0x80
 [<c0159cc5>] link_path_walk+0x535/0xa20
 [<c0141ec8>] find_extend_vma+0x18/0x70
 [<c013e95c>] follow_page+0x8c/0xb0
 [<c013ea3c>] get_user_pages+0xbc/0x3d0
 [<c015a406>] path_lookup+0x86/0x1a0
 [<c015a6a9>] __user_walk+0x39/0x70
 [<c0155a95>] vfs_stat+0x15/0x60
 [<c02445dd>] copy_to_user+0x2d/0x40
 [<c0156151>] sys_stat64+0x11/0x30
 [<c014dcbd>] __fput+0x8d/0xf0
 [<c014c6c3>] filp_close+0x43/0x70
 [<c014c744>] sys_close+0x54/0x80
 [<c0105dc7>] syscall_call+0x7/0xb
Code: 39 82 b4 00 00 00 75 07 8b 82 bc 00 00 00 c3 0f 0b 04 03 72 


config attached. I wrote 25G of zero and killed the dd process, top and
ps still segfault. Thanks all for your help!

// George



-- 
George Georgalis, Architect and administrator, Linux services. IXOYE
http://galis.org/george/  cell:646-331-2027  mailto:george@galis.org
Key fingerprint = 5415 2738 61CF 6AE1 E9A7  9EF0 0186 503B 9831 1631

[-- Attachment #2: 2.6.7-sta-bk8.config.gz --]
[-- Type: application/octet-stream, Size: 8062 bytes --]

  reply	other threads:[~2004-06-25 21:34 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-06-22 17:05 SIIMAGE sata fails with 2.6.7 George Georgalis
2004-06-22 20:25 ` Jeff Garzik
2004-06-23 14:16   ` George Georgalis
2004-06-23 15:32     ` Hugo Mills
2004-06-23 16:35       ` George Georgalis
2004-06-23 17:30         ` Ricky Beam
2004-06-23 20:25           ` George Georgalis
2004-06-23 21:23             ` Fabian Fenaut
2004-06-23 21:50               ` SIIMAGE and sata_sil with 2.6.7-bk George Georgalis
2004-06-24  0:59             ` SIIMAGE sata fails with 2.6.7 Ricky Beam
2004-06-24 15:59               ` SATA_SIL fails with 2.6.7-bk6 seagate drive George Georgalis
2004-06-24 18:46                 ` Ricky Beam
2004-06-25 21:34                   ` George Georgalis [this message]
2004-06-25 23:16                     ` SATA_SIL works with 2.6.7-bk8 seagate drive, but oops Linus Torvalds
2004-06-28  2:12                       ` George Georgalis
2004-06-29  8:28                         ` radix-tree.c or sata_sil.c 2.6.7-bk oops George Georgalis
2004-07-02 22:17                           ` George Georgalis
2004-06-29  8:46                     ` SATA_SIL works with 2.6.7-bk8 seagate drive, but oops Sebastian Slota
2004-06-30  4:43                       ` George Georgalis
2004-06-30  6:16                         ` Jeff Garzik
2004-07-02 23:01                           ` George Georgalis
  -- strict thread matches above, loose matches on Subject: below --
2004-06-26 12:37 Albert Cahalan
2004-06-26 15:12 ` Arjan van de Ven
2004-06-26 16:00   ` Linus Torvalds
2004-06-26 16:14     ` Arjan van de Ven
2004-06-26 17:17     ` Albert Cahalan
2004-06-26 17:13   ` Albert Cahalan
2004-06-26 15:54 ` Linus Torvalds

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20040625213433.GB6502@trot.local \
    --to=george@galis.org \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox