* oops if unplug dd of=/dev/scd$n
@ 2003-09-12 14:57 Pat LaVarre
  2003-09-12 18:19 ` console lost to Ctrl+Alt+F$n in 2.6.0-test5 Pat LaVarre
  2003-09-12 19:44 ` oops if unplug dd of=/dev/scd$n Pat LaVarre
  0 siblings, 2 replies; 14+ messages in thread
From: Pat LaVarre @ 2003-09-12 14:57 UTC (permalink / raw)
  To: linux-scsi

Here's an oops for us on linux-scsi to fix, brought to us courtesy of the
usb-storage mailing list  ...

> > > Subject: [usb-storage] unplug while writing
> > >
> > > I inadvertently unplugged a [usb] drive during a
> > > 2.6.0-test5:
> > >
> > > sudo dd of=/dev/scd1 if=/dev/zero bs=1M count=128
> > >
> > > And I got the following /proc/kmsg ...
> > >
> > > I ask because seemingly til reboot I'm
> > > stuck with:
> > > $
> > > $ /sbin/modprobe -r sr_mod
> > > FATAL: Module sr_mod is in use.
> > > $
> >
> > ... always ... an oops ... Must be fixed.
>
> ... doesn't look like a usb-storage specific
> problem; ... looks ... like a SCSI hotplug
> problem ... Unplug while writing should 'work'
> gracefully.  Much of the filesystem code
> doesn't yet handle this, but something like
> dd to a scd node should.

Pat LaVarre

...
<4>SCSI error : <1 0 0 0> return code = 0x70000
<4>end_request: I/O error, dev sr1, sector 154656
<3>Buffer I/O error on device sr1, logical block 19332
<4>lost page write due to I/O error on sr1
...
<4>lost page write due to I/O error on sr1
<4>SCSI error : <1 0 0 0> return code = 0x10000
<4>end_request: I/O error, dev sr1, sector 262136
<3>Buffer I/O error on device sr1, logical block 32767
<4>lost page write due to I/O error on sr1
<1>Unable to handle kernel NULL pointer dereference at virtual address 00000000
<4> printing eip:
<4>c01eb383
<1>*pde = 00000000
<4>Oops: 0000 [#1]
<4>CPU:    1
<4>EIP:    0060:[<c01eb383>]    Not tainted
<4>EFLAGS: 00010282
<4>EIP is at get_kobj_path_length+0x19/0x30
<4>eax: 00000000   ebx: 00000000   ecx: ffffffff   edx: d60e0af4
<4>esi: 00000015   edi: 00000000   ebp: ce1cde78   esp: ce1cde6c
<4>ds: 007b   es: 007b   ss: 0068
<4>Process dd (pid: 3015, threadinfo=ce1cc000 task=ce73f940)
<4>Stack: ce1cc000 0000008c dda34300 ce1cdec0 c01eb543 c03ff060 d65fe1ac 0000008c 
<4>       d65fe188 ce1cdea4 df91e1be ce17e400 dda34280 ce17e419 c03c35a0 c038cfce 
<4>       00000000 d65fe1b0 d65fe1ac c03ff0a8 de6066ac ce1cded8 c01eba47 c0387165 
<4>Call Trace:
<4> [<c01eb543>] kset_hotplug+0x15e/0x2b0
<4> [<df91e1be>] sr_remove+0x6f/0x8d [sr_mod]
<4> [<c01eba47>] kobject_del+0x66/0x6d
<4> [<c024bbd1>] device_del+0x83/0xa5
<4> [<c0280f64>] scsi_device_put+0xe0/0xfe
<4> [<df8e78d6>] cdrom_release+0x8c/0x105 [cdrom]
<4> [<c0166597>] blkdev_put+0x20f/0x23e
<4> [<c015e497>] __fput+0x123/0x135
<4> [<c015caf4>] filp_close+0x57/0x81
<4> [<c015cb9f>] sys_close+0x81/0xc7
<4> [<c010b46d>] sysenter_past_esp+0x52/0x71
<4>
<4>Code: f2 ae f7 d1 49 8b 52 24 8d 74 31 01 85 d2 75 e7 5b 89 f0 5e 
<4> <6>updfstab: numerical sysctl 1 23 is obsolete.
...




* console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-12 14:57 oops if unplug dd of=/dev/scd$n Pat LaVarre
@ 2003-09-12 18:19 ` Pat LaVarre
  2003-09-12 23:06   ` Matt Mackall
  2003-09-12 19:44 ` oops if unplug dd of=/dev/scd$n Pat LaVarre
  1 sibling, 1 reply; 14+ messages in thread
From: Pat LaVarre @ 2003-09-12 18:19 UTC (permalink / raw)
  To: linux-kernel


> ... always ... an oops ... Must be fixed.

Once upon a time Ctrl+Alt+F1 gave me a plain text console, and Ctrl+Alt+F7
returned me to my X Windows console.

Much has changed; the last thing I changed was upgrading to 2.6.0-test5
from 2.6.0-test4, and now I find that toggling back and forth a few
times leaves my display permanently dark.  Recovered from my ext3
journal are the following two examples of  `cat /proc/kmsg | tee ...`
output.

This report differs slightly, e.g. by severity, repeatability, and
mention of handle_vm86_fault, from much of:
http://groups.google.com/groups?q=__might_sleep&scoring=d

Example #1:

...
<4>sr0: scsi3-mmc drive: 0x/48x writer cd/rw xa/form2 cdda tray
<4>sr0: scsi3-mmc maybe not writeable
<6>Uniform CD-ROM driver Revision: 3.12
<7>Attached scsi CD-ROM sr0 at scsi0, channel 0, id 0, lun 0
<4>sr1: scsi3-mmc writable profile: 0x0002
<7>Attached scsi CD-ROM sr1 at scsi1, channel 0, id 0, lun 0
<3>Debug: sleeping function called from invalid context at include/asm/uaccess.h:473
<4>Call Trace:
<4> [<c0121f16>] __might_sleep+0x5f/0x72
<4> [<c010e76a>] save_v86_state+0x6a/0x20f
<4> [<c010f32d>] handle_vm86_fault+0xa7/0x8fb
<4> [<c010cc8f>] do_general_protection+0x0/0x93
<4> [<c010bf49>] error_code+0x2d/0x38
<4> [<c010b4bf>] syscall_call+0x7/0xb
<4>

Example #2:

...
<3>Debug: sleeping function called from invalid context at include/asm/uaccess.h:473
<4>Call Trace:
<4> [<c0121f16>] __might_sleep+0x5f/0x72
<4> [<c010e76a>] save_v86_state+0x6a/0x20f
<4> [<c010f32d>] handle_vm86_fault+0xa7/0x8fb
<4> [<c02323aa>] ipi_handler+0x0/0x7
<4> [<c010cc8f>] do_general_protection+0x0/0x93
<4> [<c010bf49>] error_code+0x2d/0x38
<4> [<c010b4bf>] syscall_call+0x7/0xb
<4>
...

Pat LaVarre





* Re: oops if unplug dd of=/dev/scd$n
  2003-09-12 14:57 oops if unplug dd of=/dev/scd$n Pat LaVarre
  2003-09-12 18:19 ` console lost to Ctrl+Alt+F$n in 2.6.0-test5 Pat LaVarre
@ 2003-09-12 19:44 ` Pat LaVarre
  1 sibling, 0 replies; 14+ messages in thread
From: Pat LaVarre @ 2003-09-12 19:44 UTC (permalink / raw)
  To: linux-scsi

I unplugged (& therefore rebooted) again per request offline.

Similar /proc/kmsg output appears.  (And by the way, I often see here that
"sector" = 8 * "block", e.g. 16608288 = 8 * 2076036.)
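
The factor of 8 is what one would expect if end_request counts 512-byte sectors while the buffer layer counts 4096-byte blocks. That is an assumption; the logs do not state either size. A minimal shell sketch that checks the four sector/block pairs quoted in this thread under that assumption (the helper name sector_to_block is made up for illustration):

```shell
#!/bin/bash
# Assumed sizes, not taken from the logs: 512-byte sectors, 4096-byte blocks.
sector_size=512
block_size=4096
sectors_per_block=$((block_size / sector_size))   # 8 under the assumption above

# Hypothetical helper: map an end_request sector to a buffer logical block.
sector_to_block() {
  echo $(( $1 / sectors_per_block ))
}

# Sector numbers copied from the kmsg excerpts in this thread.
for sector in 154656 262136 16608288 54044664; do
  echo "sector $sector -> logical block $(sector_to_block "$sector")"
done
```

Each result can be compared with the "logical block" value printed next to the corresponding sector in the logs.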

<4>SCSI error : <1 0 0 0> return code = 0x70000
<4>end_request: I/O error, dev sr1, sector 16608288
<3>Buffer I/O error on device sr1, logical block 2076036
<4>lost page write due to I/O error on sr1
...
<4>lost page write due to I/O error on sr1
<4>SCSI error : <1 0 0 0> return code = 0x10000
<4>end_request: I/O error, dev sr1, sector 54044664
<3>Buffer I/O error on device sr1, logical block 6755583
<4>lost page write due to I/O error on sr1
<1>Unable to handle kernel NULL pointer dereference at virtual address 00000000
<4> printing eip:
<4>c01eb383
<1>*pde = 00000000
<4>Oops: 0000 [#1]
<4>CPU:    1
<4>EIP:    0060:[<c01eb383>]    Not tainted
<4>EFLAGS: 00210282
<4>EIP is at get_kobj_path_length+0x19/0x30
<4>eax: 00000000   ebx: 00000000   ecx: ffffffff   edx: dde64af4
<4>esi: 00000015   edi: 00000000   ebp: c683de78   esp: c683de6c
<4>ds: 007b   es: 007b   ss: 0068
<4>Process dd (pid: 3870, threadinfo=c683c000 task=d0b33940)
<4>Stack: c683c000 0000008c c178e280 c683dec0 c01eb543 c03ff060 c17c65ac 0000008c 
<4>       c17c6588 c683dea4 df91a1be d6255c00 c178e200 d6255c19 c03c35a0 c038cfce 
<4>       00000000 c17c65b0 c17c65ac c03ff0a8 ddec9eac c683ded8 c01eba47 c0387165 
<4>Call Trace:
<4> [<c01eb543>] kset_hotplug+0x15e/0x2b0
<4> [<df91a1be>] sr_remove+0x6f/0x8d [sr_mod]
<4> [<c01eba47>] kobject_del+0x66/0x6d
<4> [<c024bbd1>] device_del+0x83/0xa5
<4> [<c0280f64>] scsi_device_put+0xe0/0xfe
<4> [<df8e78d6>] cdrom_release+0x8c/0x105 [cdrom]
<4> [<c0166597>] blkdev_put+0x20f/0x23e
<4> [<c015e497>] __fput+0x123/0x135
<4> [<c015caf4>] filp_close+0x57/0x81
<4> [<c015cb9f>] sys_close+0x81/0xc7
<4> [<c010b46d>] sysenter_past_esp+0x52/0x71
<4>
<4>Code: f2 ae f7 d1 49 8b 52 24 8d 74 31 01 85 d2 75 e7 5b 89 f0 5e 
<4> 

Pat LaVarre




* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-12 18:19 ` console lost to Ctrl+Alt+F$n in 2.6.0-test5 Pat LaVarre
@ 2003-09-12 23:06   ` Matt Mackall
  2003-09-13  0:49     ` Pat LaVarre
  0 siblings, 1 reply; 14+ messages in thread
From: Matt Mackall @ 2003-09-12 23:06 UTC (permalink / raw)
  To: Pat LaVarre; +Cc: linux-kernel

On Fri, Sep 12, 2003 at 12:19:28PM -0600, Pat LaVarre wrote:
> 
> > ... always ... an oops ... Must be fixed.
> 
> Once upon a time Ctrl+Alt+F1 gave me a plain text console, and Ctrl+Alt+F7
> returned me to my X Windows console.
> 
> Much has changed; the last thing I changed was upgrading to 2.6.0-test5
> from 2.6.0-test4, and now I find that toggling back and forth a few
> times leaves my display permanently dark.  Recovered from my ext3
> journal are the following two examples of  `cat /proc/kmsg | tee ...`
> output.
> 
> This report differs slightly, e.g. by severity, repeatability, and
> mention of handle_vm86_fault, from much of:
> http://groups.google.com/groups?q=__might_sleep&scoring=d
> 
> Example #1:
> 
> ...
> <4>sr0: scsi3-mmc drive: 0x/48x writer cd/rw xa/form2 cdda tray
> <4>sr0: scsi3-mmc maybe not writeable
> <6>Uniform CD-ROM driver Revision: 3.12
> <7>Attached scsi CD-ROM sr0 at scsi0, channel 0, id 0, lun 0
> <4>sr1: scsi3-mmc writable profile: 0x0002
> <7>Attached scsi CD-ROM sr1 at scsi1, channel 0, id 0, lun 0
> <3>Debug: sleeping function called from invalid context at include/asm/uaccess.h:473
> <4>Call Trace:
> <4> [<c0121f16>] __might_sleep+0x5f/0x72
> <4> [<c010e76a>] save_v86_state+0x6a/0x20f
> <4> [<c010f32d>] handle_vm86_fault+0xa7/0x8fb
> <4> [<c010cc8f>] do_general_protection+0x0/0x93
> <4> [<c010bf49>] error_code+0x2d/0x38
> <4> [<c010b4bf>] syscall_call+0x7/0xb
> <4>
> 
> Example #2:
> 
> ...
> <3>Debug: sleeping function called from invalid context at include/asm/uaccess.h:473
> <4>Call Trace:
> <4> [<c0121f16>] __might_sleep+0x5f/0x72
> <4> [<c010e76a>] save_v86_state+0x6a/0x20f
> <4> [<c010f32d>] handle_vm86_fault+0xa7/0x8fb
> <4> [<c02323aa>] ipi_handler+0x0/0x7
> <4> [<c010cc8f>] do_general_protection+0x0/0x93
> <4> [<c010bf49>] error_code+0x2d/0x38
> <4> [<c010b4bf>] syscall_call+0x7/0xb
> <4>
> ...
> 
> Pat LaVarre

I'm working on this, it's rather messy. Your lockup might be caused by
printk spew during console switch, see if it still locks up with the
sleep debugging turned off.

-- 
Matt Mackall : http://www.selenic.com : of or relating to the moon


* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-12 23:06   ` Matt Mackall
@ 2003-09-13  0:49     ` Pat LaVarre
  2003-09-13  1:57       ` Matt Mackall
  0 siblings, 1 reply; 14+ messages in thread
From: Pat LaVarre @ 2003-09-13  0:49 UTC (permalink / raw)
  To: mpm; +Cc: linux-kernel


> I'm working on this, it's rather messy. Your lockup might be caused by
> printk spew during console switch, see if it still locks up with the
> sleep debugging turned off.

Yes, thank you, Ctrl+Alt+F$n now works if only I set
CONFIG_DEBUG_SPINLOCK_SLEEP=n.

`sudo cat /proc/kmsg | tee ...` also suddenly starts working.

I wonder if somehow /proc/kmsg now working is a clue?  Back with =y, my
`dmesg` was clean but via /proc/kmsg I was seeing garbage like

mmae t itbl

or:

mmae t itle

for what now again is such reassuring chatter as:

<6>scsi2 : SCSI emulation for USB Mass Storage devices
<5>  Vendor: Iomega    Model: RRD               Rev: 23.D
<5>  Type:   CD-ROM                             ANSI SCSI revision: 02
<7>WARNING: USB Mass Storage data integrity not assured
<7>USB Mass Storage device found at 3
<4>sr1: scsi3-mmc drive: 125x/125x caddy
<4>sr1: scsi3-mmc maybe not writeable
<4>sr1: scsi3-mmc writable profile: 0x0002
<7>Attached scsi CD-ROM sr1 at scsi2, channel 0, id 0, lun 0

Pat LaVarre

P.S. I could easily check to see if =y kills an ssh session or just the
display, if that helps.

P.P.S.

Tentatively I conclude "sleep debugging ... off" meant this .config
change because I see:

$ make defconfig
...
$ grep -i sleep .config
CONFIG_ACPI_SLEEP=y
CONFIG_ACPI_SLEEP_PROC_FS=y
CONFIG_DEBUG_SPINLOCK_SLEEP=y
$ vi +/DEBUG_SPINLOCK_SLEEP .config
...
Kernel hacking ...

Sleep-inside-spinlock checking (DEBUG_SPINLOCK_SLEEP)

If you say Y here, various routines which may sleep will become very
noisy if they are called with a spinlock held.
...

Google isn't quickly confirming/ denying this tentative conclusion of
mine.
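
To confirm which way the option is set before rebuilding, one can inspect .config directly. A minimal sketch, assuming the usual kernel convention that a disabled option appears as a "# ... is not set" comment rather than as "=n"; the function name config_state is hypothetical:

```shell
#!/bin/bash
# Hypothetical helper: report how one symbol is set in a kernel .config file.
# Usage: config_state SYMBOL [FILE]   (FILE defaults to ./.config)
config_state() {
  local symbol=$1 file=${2:-.config}
  if grep -q "^${symbol}=" "$file"; then
    # Enabled (=y or =m) or given a value: print the line as found.
    grep "^${symbol}=" "$file"
  elif grep -q "^# ${symbol} is not set" "$file"; then
    echo "${symbol} is not set"
  else
    echo "${symbol} not found"
  fi
}

# Only query the current directory if a .config is actually there.
if [ -f .config ]; then
  config_state CONFIG_DEBUG_SPINLOCK_SLEEP
fi
```

Run from the top of the kernel tree, this prints either the CONFIG_DEBUG_SPINLOCK_SLEEP line or a note that the option is unset or absent.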





* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-13  0:49     ` Pat LaVarre
@ 2003-09-13  1:57       ` Matt Mackall
  2003-09-13 13:38         ` Pat LaVarre
  0 siblings, 1 reply; 14+ messages in thread
From: Matt Mackall @ 2003-09-13  1:57 UTC (permalink / raw)
  To: Pat LaVarre; +Cc: linux-kernel

On Fri, Sep 12, 2003 at 06:49:08PM -0600, Pat LaVarre wrote:
> 
> > I'm working on this, it's rather messy. Your lockup might be caused by
> > printk spew during console switch, see if it still locks up with the
> > sleep debugging turned off.
> 
> Yes, thank you, Ctrl+Alt+F$n now works if only I set
> CONFIG_DEBUG_SPINLOCK_SLEEP=n.
> 
> `sudo cat /proc/kmsg | tee ...` also suddenly starts working.
> 
> I wonder if somehow /proc/kmsg now working is a clue?  Back with =y, my
> `dmesg` was clean but via /proc/kmsg I was seeing garbage like
> 
> mmae t itbl
> 
> or:
> 
> mmae t itle

What video are you using? I'm guessing you've got a framebuffer console?
VESA by any chance?

> for what now again is such reassuring chatter as:
> 
> <6>scsi2 : SCSI emulation for USB Mass Storage devices
> <5>  Vendor: Iomega    Model: RRD               Rev: 23.D
> <5>  Type:   CD-ROM                             ANSI SCSI revision: 02
> <7>WARNING: USB Mass Storage data integrity not assured
> <7>USB Mass Storage device found at 3
> <4>sr1: scsi3-mmc drive: 125x/125x caddy
> <4>sr1: scsi3-mmc maybe not writeable
> <4>sr1: scsi3-mmc writable profile: 0x0002
> <7>Attached scsi CD-ROM sr1 at scsi2, channel 0, id 0, lun 0
> 
> Pat LaVarre
> 
> P.S. I could easily check to see if =y kills an ssh session or just the
> display, if that helps.

That might help track down a bug in the console, sure. Not sure what's
going on with /proc/kmsg though.
 
> P.P.S.
> 
> Tentatively I conclude "sleep debugging ... off" meant this .config
> change because I see:

Yep, that's the one.

-- 
Matt Mackall : http://www.selenic.com : of or relating to the moon


* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-13  1:57       ` Matt Mackall
@ 2003-09-13 13:38         ` Pat LaVarre
  2003-09-13 14:49           ` Michael Frank
  2003-09-14  0:24           ` 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n " Michael Frank
  0 siblings, 2 replies; 14+ messages in thread
From: Pat LaVarre @ 2003-09-13 13:38 UTC (permalink / raw)
  To: mpm; +Cc: linux-kernel


> What video are you using?
> I'm guessing you've got a framebuffer console?
> VESA by any chance?

I do not yet know how to answer such questions confidently.

I see (redhat-config-xfree86 --> tab Advanced) reports:

Video Card
Video Card Type = Intel 865
Memory Size = 16 megabytes
Driver = i810
Enable Hardware 3D Acceleration = no

> > Yes, thank you, Ctrl+Alt+F$n now works
> > if only I set CONFIG_DEBUG_SPINLOCK_SLEEP=n.

Please allow me to disavow that first impression.

With CONFIG_DEBUG_SPINLOCK_SLEEP=y, I've now been counting keystrokes
til crash.  I count each of Ctrl+Alt+F5 and Ctrl+Alt+F7 as one stroke. 
Sometimes I crash, sometimes I do not.  I began logging life more
carefully when first I saw a few strokes cause a crash, and thereafter,
per boot:

8 strokes crashed.

60 strokes did not crash, so I gave up and rebooted to try again.

4 strokes crashed.  The first 2 seemingly had logged me out, killing my
cat /proc/kmsg process.

8 strokes crashed.

26 strokes crashed.

...

The only consistency I see is that always an even number of strokes
cause a crash i.e. always the Ctrl+Alt+F7 switch back to my X console,
not the switch to a text console.

To prepare to crash, I only know of: sync umount ext3.  For me as yet
"Checking ... filesystem..." wastes less than three minutes per crash,
and I haven't yet perceptibly lost a disk.

> > I wonder if somehow /proc/kmsg now working is a clue?

Meanwhile, whether `sudo cat /proc/kmsg | tee ...` displays printk
intact or not also varies, without clearly correlating with whether a
crash will or will not occur.

So far, with CONFIG_DEBUG_SPINLOCK_SLEEP=y, trying `sudo cat /proc/kmsg
| tee ...` has never run well enough to capture the cause of the crash.

> >  I could easily check ... ssh ...

Remote ssh freezes and remote ping starts losing all packets.

Pat LaVarre





* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-13 13:38         ` Pat LaVarre
@ 2003-09-13 14:49           ` Michael Frank
  2003-09-13 15:47             ` Michael Frank
  2003-09-14  0:24           ` 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n " Michael Frank
  1 sibling, 1 reply; 14+ messages in thread
From: Michael Frank @ 2003-09-13 14:49 UTC (permalink / raw)
  To: Pat LaVarre, mpm; +Cc: linux-kernel

Don't think this is a keyboard problem and have seen
this several times related to video drivers in particular
when switching back to X.

The script below switches every $wait (5) seconds between
VT $vt (1) and X @vt$x (7). It logs to $log (log=/tmp/_vt.log),
syncs, and waits before doing the switch to allow data to
reach the disk. So if it dies, you should know quite
accurately when and where, and whether it also happens without
the keyboard.

Put this into a file and make it executable. It must
be run as root unless chvt is in sudoers or suid root. 

#!/bin/bash

cycle=1
log=/tmp/_vt.log
vt=1
x=7
wait=5
#rm -f $log
echo Starting VT <> X test
while ((1)); do
  echo Cycle $cycle switching to VT $vt >> $log    
  sync
  sleep $wait
  chvt $vt
  echo Cycle $cycle switching to X >> $log    
  sync
  sleep $wait
  chvt $x
  echo Cycle $cycle
  ((cycle += 1))
done


On Saturday 13 September 2003 21:38, Pat LaVarre wrote:
> 
> > What video are you using?
> > I'm guessing you've got a framebuffer console?
> > VESA by any chance?
> 
> I do not yet know how to answer such questions confidently.
> 
> I see (redhat-config-xfree86 --> tab Advanced) reports:
> 
> Video Card
> Video Card Type = Intel 865
> Memory Size = 16 megabytes
> Driver = i810
> Enable Hardware 3D Acceleration = no
> 
> > > Yes, thank you, Ctrl+Alt+F$n now works
> > > if only I set CONFIG_DEBUG_SPINLOCK_SLEEP=n.
> 
> Please allow me to disavow that first impression.
> 
> With CONFIG_DEBUG_SPINLOCK_SLEEP=y, I've now been counting keystrokes
> til crash.  I count each of Ctrl+Alt+F5 and Ctrl+Alt+F7 as one stroke. 
> Sometimes I crash, sometimes I do not.  I began logging life more
> carefully when first I saw a few strokes cause a crash, and thereafter,
> per boot:
> 
> 8 strokes crashed.
> 
> 60 strokes did not crash, so I gave up and rebooted to try again.
> 
> 4 strokes crashed.  The first 2 seemingly had logged me out, killing my
> cat /proc/kmsg process.
> 
> 8 strokes crashed.
> 
> 26 strokes crashed.
> 
> ...
> 
> The only consistency I see is that always an even number of strokes
> cause a crash i.e. always the Ctrl+Alt+F7 switch back to my X console,
> not the switch to a text console.
> 
> To prepare to crash, I only know of: sync umount ext3.  For me as yet
> "Checking ... filesystem..." wastes less than three minutes per crash,
> and I haven't yet perceptibly lost a disk.
> 
> > > I wonder if somehow /proc/kmsg now working is a clue?
> 
> Meanwhile, whether `sudo cat /proc/kmsg | tee ...` displays printk
> intact or not also varies, without clearly correlating with whether a
> crash will or will not occur.
> 
> So far, with CONFIG_DEBUG_SPINLOCK_SLEEP=y, trying `sudo cat /proc/kmsg
> | tee ...` has never run well enough to capture the cause of the crash.
> 
> > >  I could easily check ... ssh ...
> 
> Remote ssh freezes and remote ping starts losing all packets.
> 
> Pat LaVarre




* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-13 14:49           ` Michael Frank
@ 2003-09-13 15:47             ` Michael Frank
  2003-09-13 23:12               ` Pat LaVarre
  2003-09-14  9:32               ` console lost to Ctrl+Alt+F_n " Pavel Machek
  0 siblings, 2 replies; 14+ messages in thread
From: Michael Frank @ 2003-09-13 15:47 UTC (permalink / raw)
  To: Pat LaVarre, mpm; +Cc: linux-kernel

On Saturday 13 September 2003 22:49, Michael Frank wrote:
> Don't think this is a keyboard problem and have seen
> this several times related to video drivers in particular
> when switching back to X.

Used script with 2.6.0-test5 + pm2 patch

Kernel: VGA16FB 

X4.3:	Driver      "vesa"
	VendorName  "Silicon Integrated Systems [SiS]"
	BoardName   "VESA driver (generic)"

200 cycles without problems.



* Re: console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-13 15:47             ` Michael Frank
@ 2003-09-13 23:12               ` Pat LaVarre
  2003-09-14  9:32               ` console lost to Ctrl+Alt+F_n " Pavel Machek
  1 sibling, 0 replies; 14+ messages in thread
From: Pat LaVarre @ 2003-09-13 23:12 UTC (permalink / raw)
  To: mhf; +Cc: mpm, linux-kernel


> script ... switches every $wait (5) seconds between VT
> $vt (1) and X @vt$x (7) ...

Yes, logging in via ssh to run this script does reliably crash my
2.6.0-test5, I think because of `chvt $x`, while I am not touching the
console keyboard or mouse.

Cycles required varies.  Counting cycles completed before crashing, I
saw: 1 18 20 20 ... 4 16 ...

The script as was posted creates a small ./X and produces logs such as:

Cycle 1 switching to VT 1
Cycle 1 switching to X
...
Cycle 20 switching to VT 1
Cycle 20 switching to X
Cycle 21 switching to VT 1

This log could have been produced by crashing in `chvt $vt`, but I think
I saw it was produced by crashing in the `sleep $wait` that follows
`chvt $x`.  That is, I think the $vt was the last non-blank display, not
the $x.

To increase my confidence, I ran with every command echoed, and indeed
via ssh I saw the last command echoed was `sleep 5`.

I ended by running the third variant script quoted below.  Now my logs
comfortingly end with 'switching to X'.  I presume I'm catching the
crash in the last sleep $wait.

Pat LaVarre

#!/bin/bash
cycle=1
log=/tmp/_vt.log
vt=1
x=7
wait=3
rm -f $log
echo 'Starting VT <> X test'
while ((1)); do
	echo Cycle $cycle switching to VT $vt | tee -a $log
	sync
	sleep $wait
	chvt $vt
	sleep $wait
	echo Cycle $cycle switching to X | tee -a $log
	sync
	sleep $wait
	chvt $x
	sleep $wait
	((cycle += 1))
done
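
After the reboot that follows a crash, the last line of the log is the evidence for which switch was in flight. A minimal sketch of reading it back, assuming the log path /tmp/_vt.log and the line formats produced by the script above; the function name crash_phase is hypothetical:

```shell
#!/bin/bash
# Hypothetical helper: classify the last line of the VT <> X test log.
# Usage: crash_phase LOGFILE
crash_phase() {
  local last
  last=$(tail -n 1 "$1")
  case $last in
    *"switching to X")
      echo "last logged step: switch to X ($last)" ;;
    *"switching to VT "*)
      echo "last logged step: switch to text VT ($last)" ;;
    *)
      echo "unrecognized last line: $last" ;;
  esac
}

# Only inspect the default log if a previous run left one behind.
if [ -f /tmp/_vt.log ]; then
  crash_phase /tmp/_vt.log
fi
```

This only reports what reached the disk before the crash; a final write that was still unflushed would not show up.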





* 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-13 13:38         ` Pat LaVarre
  2003-09-13 14:49           ` Michael Frank
@ 2003-09-14  0:24           ` Michael Frank
  2003-09-14 23:04             ` Pat LaVarre
  1 sibling, 1 reply; 14+ messages in thread
From: Michael Frank @ 2003-09-14  0:24 UTC (permalink / raw)
  To: Pat LaVarre, mpm; +Cc: linux-kernel


> I ended by running the third variant script quoted below. Now my logs
> comfortingly end with 'switching to X'. I presume I'm catching the
> crash in the last sleep $wait.

Makes sense because the crash may (in your case does) happen later 
in the switching sequence at which time the "new" log already made 
it to disk - I'll use your variant from now on ;)

The question now is whether this is kernel or X related.
Have you had this problem with an earlier 2.6 or 2.4 kernel?

If it is specific to -test5, post (as tar.bz2)

- lspci -v  
- /var/log/dmesg
- X version and driver info from /var/log/XFree86.log 
- .config

Regards
Michael



* Re: console lost to Ctrl+Alt+F_n in 2.6.0-test5
  2003-09-13 15:47             ` Michael Frank
  2003-09-13 23:12               ` Pat LaVarre
@ 2003-09-14  9:32               ` Pavel Machek
  1 sibling, 0 replies; 14+ messages in thread
From: Pavel Machek @ 2003-09-14  9:32 UTC (permalink / raw)
  To: Michael Frank; +Cc: Pat LaVarre, mpm, linux-kernel

Hi!

> > Don't think this is a keyboard problem and have seen
> > this several times related to video drivers in particular
> > when switching back to X.
> 
> Used script with 2.6.0-test5 + pm2 patch
> 
> Kernel: VGA16FB 
> 
> X4.3:	Driver      "vesa"
> 	VendorName  "Silicon Integrated Systems [SiS]"
> 	BoardName   "VESA driver (generic)"
> 
> 200 cycles without problems.

VESA is special: it's the kernel that drives the hw => X can't crash it
so easily.

-- 
				Pavel
Written on sharp zaurus, because my Velo1 broke. If you have Velo you don't need...



* RE: 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n in 2.6.0-test5
@ 2003-09-14 19:00 Pat LaVarre
  0 siblings, 0 replies; 14+ messages in thread
From: Pat LaVarre @ 2003-09-14 19:00 UTC (permalink / raw)
  To: mhf; +Cc: mpm, p.lavarre, linux-kernel


> If it is specific to -test5, post (as tar.bz2)

Definitely not specific to 2.6.0-test5: I have
repeatedly seen 2.6.0-test4 crash this way too.

Whether I have had this vulnerability for days or
weeks or months or years I am not yet sure; I do not
customarily make a point of trying chvt much with each
new kernel.

Unless I hear otherwise, hopefully within hours, when
next I manage to visit my console in person:

1) I will test to see if the script can crash:

2.6.0-test4 with my near default .config
2.4.22 with my near default .config
2.4.21-xfs Knoppix booted via cd

2) After the reboot following a crash of 2.6.0-test5 I
will sample:

lspci -v
/var/log/dmesg
X version and driver info from /var/log/XFree86.log 
.config

3) I will report back here, attaching (2) as .tar.bz2.

Pat LaVarre





* Re: 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n in 2.6.0-test5
  2003-09-14  0:24           ` 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n " Michael Frank
@ 2003-09-14 23:04             ` Pat LaVarre
  0 siblings, 0 replies; 14+ messages in thread
From: Pat LaVarre @ 2003-09-14 23:04 UTC (permalink / raw)
  To: mhf; +Cc: mpm, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1341 bytes --]


> question now is whether this is kernel or X related.

Oh.

> > echo ... | tee -a $log
> > sync
> > sleep $wait
> > chvt ...
> > sleep $wait
>
> the crash may (in your case does) happen later

In repeating those five commands, is there any purpose to the first
sleep?  I left the first sleep in place to match the original, but if
I assume sync leaves no writes unflushed, I do not see what the
first sleep accomplishes, if anything.

> Have you had this problem with an earlier 2.6 or 2.4 kernel?

Today 2.4.21-xfs Knoppix booted via cd:
in over 75 cycles I saw no crash so I gave up.

Today 2.4.22 with a near default .config:
"in over 75 cycles I saw no crash so I gave up".

Today 2.6.0-test4 with a near default .config:
Counting cycles before crash per boot I saw: 3 2 16 ...

Yester/today 2.6.0-test5 with a near default .config:
"I saw: 1 18 20 20 ... 4 16 ..." 3 ...

> -test5, post (as tar.bz2) ...

After that last -test5 crash I rebooted and then produced the attached
via:

#!/bin/bash
# rm -r chvtx
mkdir chvtx

sudo /sbin/lspci -v >>chvtx/v.lspci
sudo cat /var/log/dmesg >>chvtx/var.log.dmesg
egrep -i 'version|release|driver' /var/log/XFree86.*.log >>chvtx/var.log.XFree86.log
# cp -ip /var/log/XFree86.*.log chvtx/
cp -ip .config chvtx/config

tar -c chvtx | bzip2 -zc >chvtx.2.6.0-test5.tar.bz2

# Pat LaVarre



[-- Attachment #2: chvtx.2.6.0-test5.tar.bz2 --]
[-- Type: application/x-bzip, Size: 12515 bytes --]


Thread overview: 14+ messages
2003-09-12 14:57 oops if unplug dd of=/dev/scd$n Pat LaVarre
2003-09-12 18:19 ` console lost to Ctrl+Alt+F$n in 2.6.0-test5 Pat LaVarre
2003-09-12 23:06   ` Matt Mackall
2003-09-13  0:49     ` Pat LaVarre
2003-09-13  1:57       ` Matt Mackall
2003-09-13 13:38         ` Pat LaVarre
2003-09-13 14:49           ` Michael Frank
2003-09-13 15:47             ` Michael Frank
2003-09-13 23:12               ` Pat LaVarre
2003-09-14  9:32               ` console lost to Ctrl+Alt+F_n " Pavel Machek
2003-09-14  0:24           ` 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n " Michael Frank
2003-09-14 23:04             ` Pat LaVarre
2003-09-12 19:44 ` oops if unplug dd of=/dev/scd$n Pat LaVarre
  -- strict thread matches above, loose matches on Subject: below --
2003-09-14 19:00 2.6.0-test5: intermittent crash on chvt to X; was console lost to Ctrl+Alt+F$n in 2.6.0-test5 Pat LaVarre
