* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
[not found] ` <20060110235554.GA3527-MEqNC12sBsHxa7XIdbXXog@public.gmane.org>
@ 2006-01-11 1:00 ` Andrew Morton
[not found] ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
0 siblings, 1 reply; 6+ messages in thread
From: Andrew Morton @ 2006-01-11 1:00 UTC (permalink / raw)
To: Mattia Dongili
Cc: linux-kernel-u79uwXL29TY76Z2rM5mHXA,
reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
linux-acpi-u79uwXL29TY76Z2rM5mHXA, Pavel Machek
Mattia Dongili <malattia-k2GhghHVRtY@public.gmane.org> wrote:
>
> Hello,
>
> I didn't tested -mm1 but -mm2 has definitely too many problems currently,
> let's start:
Thanks for testing and reporting - it really helps.
> 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> reproducible (have some activity on the fs and suspend)
No significant reiser3 changes in there, so I'd be suspecting something
else has gone haywire.
> 2- I had already written this email once, but the box completely
> froze, nothing in the logs, only mouse and X activity. I suspect
> again of reiser3.
Yes, that sounds like a filesystem failed while holding locks.
> 3- This laptop experienced 2 long stalls (20~25 sec) during boot,
> apparently after scanning usb_storage devices and starting portmap.
You mean before starting portmap?
> I logged the call traces (sysrq+t) during this time, I don't know if
> it is useful[2].
Hard to see anything in there. If you set CONFIG_FRAME_POINTER=y you'll
get better traces.
> Is it time for me to learn to git bisect? (Tomorrow morning I'll try
> (CET) if plain 2.6.15 also shows the same stalls).
Please test the next Linus git tree (2.6.15-git7) and see if we've
propagated it into there too.
There's not much point in fiddling with -mm2. If git7 is OK then please
test the next -mm and if it still fails then yes, doing a bisection would
really help.
<types madly>
See http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt
> 4- I'm also affected by the ACPI Misaligned resource pointer error.
ACPI cc'ed
> 5- That's an older problem I never reported (never tracked to be a
> reiser4 problem): reiser4 shows a very bad slowness. Use case: backup
> my ~/ (rsync)
> a- from reiser4 to xfs rsync stalls for some seconds from time to
> time while building the file list (call trace during the stall[3])
> Even using mutt and editing a file with vim causes short freezes)
> b- from xfs to reiser4 after finishing the copy, sync-ing takes ages,
> gkrellm disk monitor shows 1MB/s
Don't know, sorry.
> [1]: http://oioio.altervista.org/linux/dsc03133.jpg
> [2]: http://oioio.altervista.org/linux/boot-2.6.15-mm2.3
> [3]: http://oioio.altervista.org/linux/dmesg_reiser4_stalls
>
> The reiser oops seems reproducible by suspending with some dirty cache
> (I've been able to suspend/resume cycle 3 times without reiser crashing
> but I also didn't have big activities on that partition).
> If really necessary I can try to reproduce it (oh, poor filesystem).
> Other than that are ther suggestions/patches to start with?
Pavel, have you heard of anything like this??
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
[not found] ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
@ 2006-01-11 10:00 ` Pavel Machek
[not found] ` <20060111100016.GC2574-I/5MKhXcvmPrBKCeMvbIDA@public.gmane.org>
2006-01-11 10:02 ` Pavel Machek
` (2 subsequent siblings)
3 siblings, 1 reply; 6+ messages in thread
From: Pavel Machek @ 2006-01-11 10:00 UTC (permalink / raw)
To: Andrew Morton
Cc: Mattia Dongili, linux-kernel-u79uwXL29TY76Z2rM5mHXA,
reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
linux-acpi-u79uwXL29TY76Z2rM5mHXA
Hi!
> Thanks for testing and reporting - it really helps.
>
> > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> > reproducible (have some activity on the fs and suspend)
>
> No significant reiser3 changes in there, so I'd be suspecting something
> else has gone haywire.
Suspend to *RAM*? That really does not do anything that should kill
the filesystems. Has it ever worked before? When? Any SATA?
> > The reiser oops seems reproducible by suspending with some dirty cache
> > (I've been able to suspend/resume cycle 3 times without reiser crashing
> > but I also didn't have big activities on that partition).
> > If really necessary I can try to reproduce it (oh, poor filesystem).
> > Other than that are ther suggestions/patches to start with?
>
> Pavel, have you heard of anything like this??
No, never seen this before. With suspend to disk and wrong setup,
"kill my filesystem" is easy; but suspend to RAM? Best bet would be
disk driver doing something really stupid.
Pavel
--
Thanks, Sharp!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
[not found] ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
2006-01-11 10:00 ` Pavel Machek
@ 2006-01-11 10:02 ` Pavel Machek
2006-01-11 14:27 ` Mattia Dongili
2006-01-11 18:40 ` Mattia Dongili
3 siblings, 0 replies; 6+ messages in thread
From: Pavel Machek @ 2006-01-11 10:02 UTC (permalink / raw)
To: Andrew Morton
Cc: Mattia Dongili, linux-kernel-u79uwXL29TY76Z2rM5mHXA,
reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
linux-acpi-u79uwXL29TY76Z2rM5mHXA
Hi!
> > [1]: http://oioio.altervista.org/linux/dsc03133.jpg
Looking at the picture... I'd try again without preempt. Any taints?
Is machine rock solid otherwise?
Pavel
--
Thanks, Sharp!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
[not found] ` <20060111100016.GC2574-I/5MKhXcvmPrBKCeMvbIDA@public.gmane.org>
@ 2006-01-11 11:24 ` Mattia Dongili
0 siblings, 0 replies; 6+ messages in thread
From: Mattia Dongili @ 2006-01-11 11:24 UTC (permalink / raw)
To: Pavel Machek
Cc: Andrew Morton, linux-kernel-u79uwXL29TY76Z2rM5mHXA,
reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
linux-acpi-u79uwXL29TY76Z2rM5mHXA
On Wed, January 11, 2006 11:00 am, Pavel Machek said:
> Hi!
>
>> Thanks for testing and reporting - it really helps.
>>
>> > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
>> > reproducible (have some activity on the fs and suspend)
>>
>> No significant reiser3 changes in there, so I'd be suspecting something
>> else has gone haywire.
>
> Suspend to *RAM*? That really does not do anything that should kill
> the filesystems. Has it ever worked before? When? Any SATA?
yes, s2ram! it's a somewhat old laptop (ICH3 chipset), no SATA and it's
been working since quite a long time (don't know exactely I stopped
testing s2ram long time ago and retried only on 2.6.14), lspci follows:
0000:00:00.0 Host bridge: Intel Corporation 82830 830 Chipset Host Bridge
(rev 02)
0000:00:01.0 PCI bridge: Intel Corporation 82830 830 Chipset AGP Bridge
(rev 02)
0000:00:1d.0 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #1)
(rev 01)
0000:00:1d.1 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #2)
(rev 01)
0000:00:1d.2 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #3)
(rev 01)
0000:00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev 41)
0000:00:1f.0 ISA bridge: Intel Corporation 82801CAM ISA Bridge (LPC) (rev 01)
0000:00:1f.1 IDE interface: Intel Corporation 82801CAM IDE U100 (rev 01)
0000:00:1f.3 SMBus: Intel Corporation 82801CA/CAM SMBus Controller (rev 01)
0000:00:1f.5 Multimedia audio controller: Intel Corporation 82801CA/CAM
AC'97 Audio Controller (rev 01)
0000:00:1f.6 Modem: Intel Corporation 82801CA/CAM AC'97 Modem Controller
(rev 01)
0000:01:00.0 VGA compatible controller: ATI Technologies Inc Radeon
Mobility M6 LY
0000:02:02.0 FireWire (IEEE 1394): Texas Instruments TSB43AA22 IEEE-1394
Controller (PHY/Link Integrated) (rev 02)
0000:02:05.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev 80)
0000:02:05.1 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev 80)
0000:02:08.0 Ethernet controller: Intel Corporation 82801CAM (ICH3)
PRO/100 VE (LOM) Ethernet Controller (rev 41)
It's one of the sony PCG-GR214EP/GR214MP/GR215MP/GR314MP/GR315MP/7K/9K
(I'm sorry for line wraps or amenities, I only have a webmail client
available at work...)
Pavel, soon you'll receive a successful s2ram report for
Documentation/power :)
--
mattia
:wq!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
[not found] ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
2006-01-11 10:00 ` Pavel Machek
2006-01-11 10:02 ` Pavel Machek
@ 2006-01-11 14:27 ` Mattia Dongili
2006-01-11 18:40 ` Mattia Dongili
3 siblings, 0 replies; 6+ messages in thread
From: Mattia Dongili @ 2006-01-11 14:27 UTC (permalink / raw)
To: Andrew Morton
Cc: linux-kernel-u79uwXL29TY76Z2rM5mHXA,
reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
linux-acpi-u79uwXL29TY76Z2rM5mHXA, Pavel Machek
On Wed, January 11, 2006 2:00 am, Andrew Morton said:
> Mattia Dongili <malattia-k2GhghHVRtY@public.gmane.org> wrote:
[...]
>> 3- This laptop experienced 2 long stalls (20~25 sec) during boot,
>> apparently after scanning usb_storage devices and starting portmap.
>
> You mean before starting portmap?
well, _while_ starting portmap. As you can see from the traces I have
portmap sleeping in sys_poll, consider my reflexes are not that fast
so the trace might be well more than 10 secs after the /etc/init.d/portmap
was run.
Trying to stop and start it again didn't have any delay.
>> Is it time for me to learn to git bisect? (Tomorrow morning I'll try
>> (CET) if plain 2.6.15 also shows the same stalls).
>
> Please test the next Linus git tree (2.6.15-git7) and see if we've
> propagated it into there too.
>
> There's not much point in fiddling with -mm2. If git7 is OK then please
> test the next -mm and if it still fails then yes, doing a bisection would
> really help.
>
> <types madly>
>
> See http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt
ooooh :) really really thanks!!
I was starting to script something that just some hours later revealed to
be like stGit (well at least had the same base idea).
Anyway I'm currently using -git7 and building -mm3, -git7 is OK:
no stalls, no reiser3 oops (yet) and no ACPI misaligned pointer.
Will report on -mm3 as soon as possible
--
mattia
:wq!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
[not found] ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
` (2 preceding siblings ...)
2006-01-11 14:27 ` Mattia Dongili
@ 2006-01-11 18:40 ` Mattia Dongili
3 siblings, 0 replies; 6+ messages in thread
From: Mattia Dongili @ 2006-01-11 18:40 UTC (permalink / raw)
To: Andrew Morton, Pavel Machek
Cc: linux-kernel-u79uwXL29TY76Z2rM5mHXA,
reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
linux-acpi-u79uwXL29TY76Z2rM5mHXA
Hello,
I gave -mm3 a run:
On Tue, Jan 10, 2006 at 05:00:37PM -0800, Andrew Morton wrote:
> Mattia Dongili <malattia-k2GhghHVRtY@public.gmane.org> wrote:
[...]
> > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> > reproducible (have some activity on the fs and suspend)
>
> No significant reiser3 changes in there, so I'd be suspecting something
> else has gone haywire.
It's still there. But I caught it during normal runtime, a couple more
pictures (yes, I'm going to try to setup a netconsole):
the oops (sorry, it sucks, probably useless):
http://oioio.altervista.org/linux/dsc03136.jpg
a screen with the call traces, the keyboard was still active and sysrq
combos worked. It showed many processes stuck within reiser's log writer:
http://oioio.altervista.org/linux/dsc03138.jpg
going to revert the
> > 3- This laptop experienced 2 long stalls (20~25 sec) during boot,
> > apparently after scanning usb_storage devices and starting portmap.
umph... it's still here call traces:
http://oioio.altervista.org/linux/portmap_stall_trace1
[...]
> There's not much point in fiddling with -mm2. If git7 is OK then please
> test the next -mm and if it still fails then yes, doing a bisection would
> really help.
>
> <types madly>
>
> See http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt
give me some hours for this.
Anyway I'd like to try to revert some reiser3 changes first and see if
the oops goes (I can leave with a longer boot but I'm worried about my
root fs :))
> > 4- I'm also affected by the ACPI Misaligned resource pointer error.
still here:
# grep -e 'Jan 11.*Misaligned.*' /var/log/syslog
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cff4b7e2 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cff4b7e2 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cff4b2e2 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657902 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657602 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657402 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657402 Type 07 Len 0
Jan 11 18:18:53 inferi kernel: **** SET: Misaligned resource pointer: cf657d02 Type 07 Len 0
Jan 11 18:21:14 inferi kernel: **** SET: Misaligned resource pointer: cf657d02 Type 07 Len 0
DSDT and lspci can be found here if useful:
http://oioio.altervista.org/linux/DSDT.aml
http://oioio.altervista.org/linux/DSDT.dsl
http://oioio.altervista.org/linux/lspci-v
On Wed, Jan 11, 2006 at 11:00:16AM +0100, Pavel Machek wrote:
[...]
> > > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> > > reproducible (have some activity on the fs and suspend)
[...]
> Suspend to *RAM*? That really does not do anything that should kill
Sorry, it seems it has nothing to do with s2ram.
--
mattia
:wq!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2006-01-11 18:40 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <20060110235554.GA3527@inferi.kami.home>
[not found] ` <20060110235554.GA3527-MEqNC12sBsHxa7XIdbXXog@public.gmane.org>
2006-01-11 1:00 ` 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!) Andrew Morton
[not found] ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
2006-01-11 10:00 ` Pavel Machek
[not found] ` <20060111100016.GC2574-I/5MKhXcvmPrBKCeMvbIDA@public.gmane.org>
2006-01-11 11:24 ` Mattia Dongili
2006-01-11 10:02 ` Pavel Machek
2006-01-11 14:27 ` Mattia Dongili
2006-01-11 18:40 ` Mattia Dongili
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox