public inbox for linux-acpi@vger.kernel.org
 help / color / mirror / Atom feed
* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
       [not found] ` <20060110235554.GA3527-MEqNC12sBsHxa7XIdbXXog@public.gmane.org>
@ 2006-01-11  1:00   ` Andrew Morton
       [not found]     ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Andrew Morton @ 2006-01-11  1:00 UTC (permalink / raw)
  To: Mattia Dongili
  Cc: linux-kernel-u79uwXL29TY76Z2rM5mHXA,
	reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
	linux-acpi-u79uwXL29TY76Z2rM5mHXA, Pavel Machek

Mattia Dongili <malattia-k2GhghHVRtY@public.gmane.org> wrote:
>
> Hello,
> 
> I didn't tested -mm1 but -mm2 has definitely too many problems currently,
> let's start:

Thanks for testing and reporting - it really helps.

> 1- reiser3 oopsed[1] twice while suspending to ram. It seems
>    reproducible (have some activity on the fs and suspend)

No significant reiser3 changes in there, so I'd be suspecting something
else has gone haywire.

> 2- I had already written this email once, but the box completely
>    froze, nothing in the logs, only mouse and X activity. I suspect
>    again of reiser3.

Yes, that sounds like a filesystem failed while holding locks.

> 3- This laptop experienced 2 long stalls (20~25 sec) during boot,
>    apparently after scanning usb_storage devices and starting portmap.

You mean before starting portmap?

>    I logged the call traces (sysrq+t) during this time, I don't know if
>    it is useful[2].

Hard to see anything in there.  If you set CONFIG_FRAME_POINTER=y you'll
get better traces.

>    Is it time for me to learn to git bisect? (Tomorrow morning I'll try
>    (CET) if plain 2.6.15 also shows the same stalls).

Please test the next Linus git tree (2.6.15-git7) and see if we've
propagated it into there too.

There's not much point in fiddling with -mm2.  If git7 is OK then please
test the next -mm and if it still fails then yes, doing a bisection would
really help.

<types madly>

See http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt

> 4- I'm also affected by the ACPI Misaligned resource pointer error.

ACPI cc'ed

> 5- That's an older problem I never reported (never tracked to be a
>    reiser4 problem): reiser4 shows a very bad slowness. Use case: backup
>    my ~/ (rsync)
>    a- from reiser4 to xfs rsync stalls for some seconds from time to
>       time while building the file list (call trace during the stall[3])
>       Even using mutt and editing a file with vim causes short freezes)
>    b- from xfs to reiser4 after finishing the copy, sync-ing takes ages,
>       gkrellm disk monitor shows 1MB/s

Don't know, sorry.

> [1]: http://oioio.altervista.org/linux/dsc03133.jpg
> [2]: http://oioio.altervista.org/linux/boot-2.6.15-mm2.3
> [3]: http://oioio.altervista.org/linux/dmesg_reiser4_stalls
> 
> The reiser oops seems reproducible by suspending with some dirty cache
> (I've been able to suspend/resume cycle 3 times without reiser crashing
> but I also didn't have big activities on that partition).
> If really necessary I can try to reproduce it (oh, poor filesystem).
> Other than that are ther suggestions/patches to start with?

Pavel, have you heard of anything like this??

-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
       [not found]     ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
@ 2006-01-11 10:00       ` Pavel Machek
       [not found]         ` <20060111100016.GC2574-I/5MKhXcvmPrBKCeMvbIDA@public.gmane.org>
  2006-01-11 10:02       ` Pavel Machek
                         ` (2 subsequent siblings)
  3 siblings, 1 reply; 6+ messages in thread
From: Pavel Machek @ 2006-01-11 10:00 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Mattia Dongili, linux-kernel-u79uwXL29TY76Z2rM5mHXA,
	reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
	linux-acpi-u79uwXL29TY76Z2rM5mHXA

Hi!

> Thanks for testing and reporting - it really helps.
> 
> > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> >    reproducible (have some activity on the fs and suspend)
> 
> No significant reiser3 changes in there, so I'd be suspecting something
> else has gone haywire.

Suspend to *RAM*? That really does not do anything that should kill
the filesystems. Has it ever worked before? When? Any SATA?

> > The reiser oops seems reproducible by suspending with some dirty cache
> > (I've been able to suspend/resume cycle 3 times without reiser crashing
> > but I also didn't have big activities on that partition).
> > If really necessary I can try to reproduce it (oh, poor filesystem).
> > Other than that are ther suggestions/patches to start with?
> 
> Pavel, have you heard of anything like this??

No, never seen this before. With suspend to disk and wrong setup,
"kill my filesystem" is easy; but suspend to RAM? Best bet would be
disk driver doing something really stupid.
								Pavel
-- 
Thanks, Sharp!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
       [not found]     ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
  2006-01-11 10:00       ` Pavel Machek
@ 2006-01-11 10:02       ` Pavel Machek
  2006-01-11 14:27       ` Mattia Dongili
  2006-01-11 18:40       ` Mattia Dongili
  3 siblings, 0 replies; 6+ messages in thread
From: Pavel Machek @ 2006-01-11 10:02 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Mattia Dongili, linux-kernel-u79uwXL29TY76Z2rM5mHXA,
	reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
	linux-acpi-u79uwXL29TY76Z2rM5mHXA

Hi!

> > [1]: http://oioio.altervista.org/linux/dsc03133.jpg

Looking at the picture... I'd try again without preempt. Any taints?
Is machine rock solid otherwise?
								Pavel

-- 
Thanks, Sharp!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
       [not found]         ` <20060111100016.GC2574-I/5MKhXcvmPrBKCeMvbIDA@public.gmane.org>
@ 2006-01-11 11:24           ` Mattia Dongili
  0 siblings, 0 replies; 6+ messages in thread
From: Mattia Dongili @ 2006-01-11 11:24 UTC (permalink / raw)
  To: Pavel Machek
  Cc: Andrew Morton, linux-kernel-u79uwXL29TY76Z2rM5mHXA,
	reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
	linux-acpi-u79uwXL29TY76Z2rM5mHXA

On Wed, January 11, 2006 11:00 am, Pavel Machek said:
> Hi!
>
>> Thanks for testing and reporting - it really helps.
>>
>> > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
>> >    reproducible (have some activity on the fs and suspend)
>>
>> No significant reiser3 changes in there, so I'd be suspecting something
>> else has gone haywire.
>
> Suspend to *RAM*? That really does not do anything that should kill
> the filesystems. Has it ever worked before? When? Any SATA?

yes, s2ram! it's a somewhat old laptop (ICH3 chipset), no SATA and it's
been working since quite a long time (don't know exactely I stopped
testing s2ram long time ago  and retried only on 2.6.14), lspci follows:
0000:00:00.0 Host bridge: Intel Corporation 82830 830 Chipset Host Bridge
(rev 02)
0000:00:01.0 PCI bridge: Intel Corporation 82830 830 Chipset AGP Bridge
(rev 02)
0000:00:1d.0 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #1)
(rev 01)
0000:00:1d.1 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #2)
(rev 01)
0000:00:1d.2 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #3)
(rev 01)
0000:00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev 41)
0000:00:1f.0 ISA bridge: Intel Corporation 82801CAM ISA Bridge (LPC) (rev 01)
0000:00:1f.1 IDE interface: Intel Corporation 82801CAM IDE U100 (rev 01)
0000:00:1f.3 SMBus: Intel Corporation 82801CA/CAM SMBus Controller (rev 01)
0000:00:1f.5 Multimedia audio controller: Intel Corporation 82801CA/CAM
AC'97 Audio Controller (rev 01)
0000:00:1f.6 Modem: Intel Corporation 82801CA/CAM AC'97 Modem Controller
(rev 01)
0000:01:00.0 VGA compatible controller: ATI Technologies Inc Radeon
Mobility M6 LY
0000:02:02.0 FireWire (IEEE 1394): Texas Instruments TSB43AA22 IEEE-1394
Controller (PHY/Link Integrated) (rev 02)
0000:02:05.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev 80)
0000:02:05.1 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev 80)
0000:02:08.0 Ethernet controller: Intel Corporation 82801CAM (ICH3)
PRO/100 VE (LOM) Ethernet Controller (rev 41)

It's one of the sony PCG-GR214EP/GR214MP/GR215MP/GR314MP/GR315MP/7K/9K
(I'm sorry for line wraps or amenities, I only have a webmail client
available at work...)
Pavel, soon you'll receive a successful s2ram report for
Documentation/power :)

-- 
mattia
:wq!


-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
       [not found]     ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
  2006-01-11 10:00       ` Pavel Machek
  2006-01-11 10:02       ` Pavel Machek
@ 2006-01-11 14:27       ` Mattia Dongili
  2006-01-11 18:40       ` Mattia Dongili
  3 siblings, 0 replies; 6+ messages in thread
From: Mattia Dongili @ 2006-01-11 14:27 UTC (permalink / raw)
  To: Andrew Morton
  Cc: linux-kernel-u79uwXL29TY76Z2rM5mHXA,
	reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
	linux-acpi-u79uwXL29TY76Z2rM5mHXA, Pavel Machek

On Wed, January 11, 2006 2:00 am, Andrew Morton said:
> Mattia Dongili <malattia-k2GhghHVRtY@public.gmane.org> wrote:
[...]
>> 3- This laptop experienced 2 long stalls (20~25 sec) during boot,
>>    apparently after scanning usb_storage devices and starting portmap.
>
> You mean before starting portmap?

well, _while_ starting portmap. As you can see from the traces I have
portmap sleeping in sys_poll, consider my reflexes are not that fast
so the trace might be well more than 10 secs after the /etc/init.d/portmap
was run.
Trying to stop and start it again didn't have any delay.

>>    Is it time for me to learn to git bisect? (Tomorrow morning I'll try
>>    (CET) if plain 2.6.15 also shows the same stalls).
>
> Please test the next Linus git tree (2.6.15-git7) and see if we've
> propagated it into there too.
>
> There's not much point in fiddling with -mm2.  If git7 is OK then please
> test the next -mm and if it still fails then yes, doing a bisection would
> really help.
>
> <types madly>
>
> See http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt

ooooh :) really really thanks!!
I was starting to script something that just some hours later revealed to
be like stGit (well at least had the same base idea).

Anyway I'm currently using -git7 and building -mm3, -git7 is OK:
no stalls, no reiser3 oops (yet) and no ACPI misaligned pointer.

Will report on -mm3 as soon as possible

-- 
mattia
:wq!


-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!)
       [not found]     ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
                         ` (2 preceding siblings ...)
  2006-01-11 14:27       ` Mattia Dongili
@ 2006-01-11 18:40       ` Mattia Dongili
  3 siblings, 0 replies; 6+ messages in thread
From: Mattia Dongili @ 2006-01-11 18:40 UTC (permalink / raw)
  To: Andrew Morton, Pavel Machek
  Cc: linux-kernel-u79uwXL29TY76Z2rM5mHXA,
	reiserfs-dev-nJ1KrdHEGnBBDgjK7y7TUQ,
	linux-acpi-u79uwXL29TY76Z2rM5mHXA

Hello,
I gave -mm3 a run:

On Tue, Jan 10, 2006 at 05:00:37PM -0800, Andrew Morton wrote:
> Mattia Dongili <malattia-k2GhghHVRtY@public.gmane.org> wrote:
[...]
> > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> >    reproducible (have some activity on the fs and suspend)
> 
> No significant reiser3 changes in there, so I'd be suspecting something
> else has gone haywire.

It's still there. But I caught it during normal runtime, a couple more
pictures (yes, I'm going to try to setup a netconsole):

the oops (sorry, it sucks, probably useless):
http://oioio.altervista.org/linux/dsc03136.jpg

a screen with the call traces, the keyboard was still active and sysrq
combos worked. It showed many processes stuck within reiser's log writer:
http://oioio.altervista.org/linux/dsc03138.jpg

going to revert the 

> > 3- This laptop experienced 2 long stalls (20~25 sec) during boot,
> >    apparently after scanning usb_storage devices and starting portmap.

umph... it's still here call traces:
http://oioio.altervista.org/linux/portmap_stall_trace1

[...]
> There's not much point in fiddling with -mm2.  If git7 is OK then please
> test the next -mm and if it still fails then yes, doing a bisection would
> really help.
> 
> <types madly>
> 
> See http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt

give me some hours for this.
Anyway I'd like to try to revert some reiser3 changes first and see if
the oops goes (I can leave with a longer boot but I'm worried about my
root fs :))

> > 4- I'm also affected by the ACPI Misaligned resource pointer error.

still here:
# grep -e 'Jan 11.*Misaligned.*' /var/log/syslog
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cff4b7e2 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cff4b7e2 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cff4b2e2 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657902 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657602 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657402 Type 07 Len 0
Jan 11 18:07:10 inferi kernel: **** SET: Misaligned resource pointer: cf657402 Type 07 Len 0
Jan 11 18:18:53 inferi kernel: **** SET: Misaligned resource pointer: cf657d02 Type 07 Len 0
Jan 11 18:21:14 inferi kernel: **** SET: Misaligned resource pointer: cf657d02 Type 07 Len 0

DSDT and lspci can be found here if useful:
http://oioio.altervista.org/linux/DSDT.aml
http://oioio.altervista.org/linux/DSDT.dsl
http://oioio.altervista.org/linux/lspci-v

On Wed, Jan 11, 2006 at 11:00:16AM +0100, Pavel Machek wrote:
[...]
> > > 1- reiser3 oopsed[1] twice while suspending to ram. It seems
> > >    reproducible (have some activity on the fs and suspend)
[...]
> Suspend to *RAM*? That really does not do anything that should kill

Sorry, it seems it has nothing to do with s2ram.

-- 
mattia
:wq!
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2006-01-11 18:40 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20060110235554.GA3527@inferi.kami.home>
     [not found] ` <20060110235554.GA3527-MEqNC12sBsHxa7XIdbXXog@public.gmane.org>
2006-01-11  1:00   ` 2.6.15-mm2: reiser3 oops on suspend and more (bonus oops shot!) Andrew Morton
     [not found]     ` <20060110170037.4a614245.akpm-3NddpPZAyC0@public.gmane.org>
2006-01-11 10:00       ` Pavel Machek
     [not found]         ` <20060111100016.GC2574-I/5MKhXcvmPrBKCeMvbIDA@public.gmane.org>
2006-01-11 11:24           ` Mattia Dongili
2006-01-11 10:02       ` Pavel Machek
2006-01-11 14:27       ` Mattia Dongili
2006-01-11 18:40       ` Mattia Dongili

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox