* Re: Linux 3.7-rc1 (nouveau_bios_score oops).
[not found] ` <2155889.FOmdXlva7q@localhost>
@ 2012-10-21 12:09 ` Marcin Slusarz
2012-10-21 13:31 ` Heinz Diehl
2012-10-21 14:38 ` Linus Torvalds
0 siblings, 2 replies; 6+ messages in thread
From: Marcin Slusarz @ 2012-10-21 12:09 UTC (permalink / raw)
To: linux-acpi, Len Brown
Cc: Heinz Diehl, Martin Peres, Heinz Diehl, Linus Torvalds,
David Airlie, Ben Skeggs, dri-devel, linux-kernel,
Paweł Sikora
On Sun, Oct 21, 2012 at 08:58:07AM +0200, Paweł Sikora wrote:
> On Sunday 21 of October 2012 00:19:48 Marcin Slusarz wrote:
> > On Sat, Oct 20, 2012 at 11:20:36PM +0200, Heinz Diehl wrote:
> > > On 20.10.2012, Marcin Slusarz wrote:
> > >
> > > > Try this one.
> > >
> > > It works, now I can boot again. However, nouveau seems to be dead now.
> > > The dmesg output with your patch on top of 3.7-rc1 is:
> > >
> > > [ 3.685909] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0
> > > [ 3.687784] nouveau [ DEVICE][0000:01:00.0] BOOT0 : 0x0a8800b1
> > > [ 3.689960] nouveau [ DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
> > > [ 3.692471] nouveau [ DEVICE][0000:01:00.0] Family : NV50
> > > [ 3.695716] nouveau [ VBIOS][0000:01:00.0] checking PRAMIN for image...
> > > [ 3.697087] nouveau [ VBIOS][0000:01:00.0] ... signature not found
> > > [ 3.698471] nouveau [ VBIOS][0000:01:00.0] checking PROM for image...
> > > [ 3.699838] nouveau [ VBIOS][0000:01:00.0] ... signature not found
> > > [ 3.701223] nouveau [ VBIOS][0000:01:00.0] checking ACPI for image...
> > > [ 3.702684] ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> > > [ 3.704139] ACPI Error: Method parse/execution failed[\_SB_.PCI0.PEG1.GFX0._ROM] (Node ffff880142e85cf8), AE_AML_REGION_LIMIT (20120913/psparse-536)
> > > [ 3.716183] failed to evaluate ROM got AE_AML_REGION_LIMIT
> > > [ 3.718776] nouveau [ VBIOS][0000:01:00.0] ... signature not found
> > > [ 3.721349] nouveau [ VBIOS][0000:01:00.0] checking PCIROM for image...
> > > [ 3.724111] nouveau 0000:01:00.0: Invalid ROM contents
> > > [ 3.726663] nouveau [ VBIOS][0000:01:00.0] ... signature not found
> > > [ 3.729159] nouveau E[ VBIOS][0000:01:00.0] unable to locate usable image
> > > [ 3.731677] nouveau E[ DEVICE][0000:01:00.0] failed to create 0x10000001, -22
> > > [ 3.734231] nouveau E[ DRM] failed to create 0x80000080, -22
> > > [ 3.736097] nouveau: probe of 0000:01:00.0 failed with error -22
> > > [ 3.740523] dracut: Starting plymouth daemon
> >
> > Hmm, maybe we can't fetch 3 bytes only...
> >
> > Let's check this:
> >
> > ---
> > diff --git a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> > index 824eea0..8bd71aa 100644
> > --- a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> > +++ b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> > @@ -192,14 +192,16 @@ nouveau_bios_shadow_acpi(struct nouveau_bios *bios)
> > {
> > struct pci_dev *pdev = nv_device(bios)->pdev;
> > int ret, cnt, i;
> > - u8 data[3];
> > + u8 *data;
> >
> > if (!nouveau_acpi_rom_supported(pdev))
> > return;
> >
> > bios->size = 0;
> > - if (nouveau_acpi_get_bios_chunk(data, 0, 3) == 3)
> > + data = kmalloc(4096, GFP_KERNEL);
> > + if (data && nouveau_acpi_get_bios_chunk(data, 0, 4096) >= 3)
> > bios->size = data[2] * 512;
> > + kfree(data);
> > if (!bios->size)
> > return;
>
> with these both patches applied my laptop boots and gui works fine.
> here's dmesg:
>
> (...)
> [ 8.751795] nouveau [ VBIOS][0000:01:00.0] ... appears to be valid
> [ 8.751798] nouveau [ VBIOS][0000:01:00.0] using image from ACPI
> [ 8.751895] nouveau [ VBIOS][0000:01:00.0] BIT signature found
> [ 8.751898] nouveau [ VBIOS][0000:01:00.0] version 70.08.45.00
> [ 8.752366] nouveau [ DEVINIT][0000:01:00.0] adaptor not initialised
> [ 8.752368] nouveau [ VBIOS][0000:01:00.0] running init tables
> [ 8.867660] nouveau [ MXM][0000:01:00.0] no VBIOS data, nothing to do
> [ 8.867690] nouveau [ PFB][0000:01:00.0] RAM type: DDR3
> [ 8.867692] nouveau [ PFB][0000:01:00.0] RAM size: 512 MiB
> [ 8.901523] vga_switcheroo: enabled
> [ 8.901979] [TTM] Zone kernel: Available graphics memory: 6104282 kiB
> [ 8.901980] [TTM] Zone dma32: Available graphics memory: 2097152 kiB
> [ 8.901982] [TTM] Initializing pool allocator
> [ 8.902014] [TTM] Initializing DMA pool allocator
> [ 8.902180] mtrr: type mismatch for c0000000,10000000 old: write-back new: write-combining
> [ 8.902184] nouveau [ DRM] VRAM: 512 MiB
> [ 8.902185] nouveau [ DRM] GART: 512 MiB
> [ 8.902188] nouveau [ DRM] BIT BIOS found
> [ 8.902190] nouveau [ DRM] Bios version 70.08.45.00
> [ 8.902192] nouveau [ DRM] TMDS table version 2.0
> [ 8.902194] nouveau [ DRM] DCB version 4.0
> [ 8.902196] nouveau [ DRM] DCB outp 00: 02011300 00000000
> [ 8.902198] nouveau [ DRM] DCB conn 01: 00000100
> [ 8.903540] [drm] Supports vblank timestamp caching Rev 1 (10.10.2010).
> [ 8.903541] [drm] No driver support for vblank timestamp query.
> [ 8.903545] nouveau [ DRM] ACPI backlight interface available, not registering our own
> [ 8.903938] nouveau [ DRM] 3 available performance level(s)
> [ 8.903941] nouveau [ DRM] 0: core 50MHz shader 101MHz memory 135MHz voltage 830mV
> [ 8.903943] nouveau [ DRM] 1: core 202MHz shader 405MHz memory 324MHz voltage 830mV
> [ 8.903946] nouveau [ DRM] 3: core 672MHz shader 1344MHz memory 900MHz voltage 980mV
> [ 8.903948] nouveau [ DRM] c: core 202MHz shader 405MHz memory 324MHz voltage 980mV
> [ 8.908895] nouveau [ DRM] MM: using COPY1 for buffer copies
> [ 8.988695] nouveau [ DRM] allocated 1024x768 fb: 0x60000, bo ffff88033b5f1800
> [ 8.988955] fb1: nouveaufb frame buffer device
> [ 8.988959] [drm] Initialized nouveau 1.1.0 20120801 for 0000:01:00.0 on minor 1
> (...)
>
> > Please attach acpidump output.
>
> http://pluto.agmk.net/nv/acpidump.txt
>
This looks like ACPI bug...
Nouveau calls this method:
Method (_ROM, 2, NotSerialized)// _ROM: Read-Only Memory
{
Add (Arg0, RBUF, Local0)
ShiftLeft (Arg1, 0x03, Local1)
Name (VBUF, Buffer (Arg1) {})
OperationRegion (VROM, SystemMemory, Local0, Local1)
Field (VROM, ByteAcc, NoLock, Preserve)
{
ROMI, 65536
}
Store (ROMI, VBUF)
Return (VBUF)
}
with Arg0 == 0, Arg1 == 3 and gets:
ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEGR.GFX0._ROM] (Node ffff88033e47fe88), AE_AML_REGION_LIMIT (20120913/psparse-536)
We can workaround it by aligning Arg1 to 4096 (I'm wondering what is the minimal
value), but do we really have to?
Marcin
--
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Linux 3.7-rc1 (nouveau_bios_score oops).
2012-10-21 12:09 ` Linux 3.7-rc1 (nouveau_bios_score oops) Marcin Slusarz
@ 2012-10-21 13:31 ` Heinz Diehl
2012-10-21 14:38 ` Linus Torvalds
1 sibling, 0 replies; 6+ messages in thread
From: Heinz Diehl @ 2012-10-21 13:31 UTC (permalink / raw)
To: Marcin Slusarz
Cc: linux-acpi, Len Brown, Martin Peres, Heinz Diehl, Linus Torvalds,
David Airlie, Ben Skeggs, dri-devel, linux-kernel,
Paweł Sikora
On 21.10.2012, Marcin Slusarz wrote:
> > > Please attach acpidump output.
> >
> > http://pluto.agmk.net/nv/acpidump.txt
> >
>
> This looks like ACPI bug...
I guess my acpidump didn't make it to the list. Anyway, here it is:
http://www.fritha.org/acpidump.gz
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Linux 3.7-rc1 (nouveau_bios_score oops).
2012-10-21 12:09 ` Linux 3.7-rc1 (nouveau_bios_score oops) Marcin Slusarz
2012-10-21 13:31 ` Heinz Diehl
@ 2012-10-21 14:38 ` Linus Torvalds
2012-10-21 14:49 ` Marcin Slusarz
1 sibling, 1 reply; 6+ messages in thread
From: Linus Torvalds @ 2012-10-21 14:38 UTC (permalink / raw)
To: Marcin Slusarz
Cc: linux-acpi, Len Brown, Heinz Diehl, Martin Peres, Heinz Diehl,
David Airlie, Ben Skeggs, dri-devel, linux-kernel,
Paweł Sikora
On Sun, Oct 21, 2012 at 5:09 AM, Marcin Slusarz
<marcin.slusarz@gmail.com> wrote:
>
> This looks like ACPI bug...
I'm _shocked_ to hear that firmware would be fragile.
Anyway, here's the #1 thing to keep in mind about firmware:
- firmware is *always* buggy.
It's that simple. Don't expect anything else. Firmware is written by
people who have lost the will to live (why? Because they do firmware
development for a living), and the only thing keeping them going is
the drugs. And they're not the "fun" kind of drugs. The end result is
predictable. In their drug-induced haze, they make a mess.
So saying "ACPI is buggy" is like saying "water is wet". Deal with it.
> Nouveau calls this method: [...]
>
> with Arg0 == 0, Arg1 == 3 and gets:
>
> ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEGR.GFX0._ROM] (Node ffff88033e47fe88), AE_AML_REGION_LIMIT (20120913/psparse-536)
>
> We can workaround it by aligning Arg1 to 4096 (I'm wondering what is the minimal
> value), but do we really have to?
Yes, you really have to. You need to understand that ACPI has been
tested with one thing, and one thing only: Windows. Clearly windows
doesn't ask for some three-byte region. So it doesn't work. Big
surprise. Untested code written by monkeys on crack - what did you
expect?
So don't do "clever" things. When it comes to firmware, you need to
expect it to be buggy, and try to access it the way Windows accesses
it.
Now, whether Windows realy always does things with a 4kB-aligned
access, or whether you just need to make sure that you're (say)
4-byte-aligned, I don't know. Maybe doing things four (or eight, or
sixteen) bytes at a time will work. You can try. But if we know from
past experience that a 4kB block-size will work, I'd suggest just
saying
"It's stupid, but stupid is good - we're working with firmware"
Think of firmware the way you think of hardware: it's buggy and needs
workarounds.
Linus
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Linux 3.7-rc1 (nouveau_bios_score oops).
2012-10-21 14:38 ` Linus Torvalds
@ 2012-10-21 14:49 ` Marcin Slusarz
2012-10-21 17:13 ` Linus Torvalds
2012-10-21 20:07 ` Lekensteyn
0 siblings, 2 replies; 6+ messages in thread
From: Marcin Slusarz @ 2012-10-21 14:49 UTC (permalink / raw)
To: Linus Torvalds
Cc: linux-acpi, Len Brown, Heinz Diehl, Martin Peres, Heinz Diehl,
David Airlie, Ben Skeggs, dri-devel, linux-kernel,
Paweł Sikora
On Sun, Oct 21, 2012 at 07:38:58AM -0700, Linus Torvalds wrote:
> On Sun, Oct 21, 2012 at 5:09 AM, Marcin Slusarz
> <marcin.slusarz@gmail.com> wrote:
> >
> > This looks like ACPI bug...
>
> I'm _shocked_ to hear that firmware would be fragile.
>
> Anyway, here's the #1 thing to keep in mind about firmware:
>
> - firmware is *always* buggy.
I know. But this bug is not about broken firmware. It's about Linux kernel
ACPI implementation, which (I think) wrongly interprets ACPI script.
Marcin
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Linux 3.7-rc1 (nouveau_bios_score oops).
2012-10-21 14:49 ` Marcin Slusarz
@ 2012-10-21 17:13 ` Linus Torvalds
2012-10-21 20:07 ` Lekensteyn
1 sibling, 0 replies; 6+ messages in thread
From: Linus Torvalds @ 2012-10-21 17:13 UTC (permalink / raw)
To: Marcin Slusarz
Cc: linux-acpi, Len Brown, Heinz Diehl, Martin Peres, Heinz Diehl,
David Airlie, Ben Skeggs, dri-devel, linux-kernel,
Paweł Sikora
On Sun, Oct 21, 2012 at 5:49 PM, Marcin Slusarz
<marcin.slusarz@gmail.com> wrote:
>
> I know. But this bug is not about broken firmware. It's about Linux kernel
> ACPI implementation, which (I think) wrongly interprets ACPI script.
Hmm. Len, care to comment? Marcin quoted the AML and our arguments in
an earlier thing. I don't read AML, so I was assuming it's the AML
itself that was buggy, not our interpretation of it..
Linus
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Linux 3.7-rc1 (nouveau_bios_score oops).
2012-10-21 14:49 ` Marcin Slusarz
2012-10-21 17:13 ` Linus Torvalds
@ 2012-10-21 20:07 ` Lekensteyn
1 sibling, 0 replies; 6+ messages in thread
From: Lekensteyn @ 2012-10-21 20:07 UTC (permalink / raw)
To: Marcin Slusarz
Cc: Linus Torvalds, linux-acpi, Len Brown, Heinz Diehl, Martin Peres,
Heinz Diehl, David Airlie, Ben Skeggs, dri-devel, linux-kernel,
Paweł Sikora
On Sunday 21 October 2012 16:49:08 Marcin Slusarz wrote:
> On Sun, Oct 21, 2012 at 07:38:58AM -0700, Linus Torvalds wrote:
> > On Sun, Oct 21, 2012 at 5:09 AM, Marcin Slusarz
> >
> > <marcin.slusarz@gmail.com> wrote:
> > > This looks like ACPI bug...
> >
> > I'm shocked to hear that firmware would be fragile.
> >
> > Anyway, here's the #1 thing to keep in mind about firmware:
> >
> > - firmware is always buggy.
>
> I know. But this bug is not about broken firmware. It's about Linux kernel
> ACPI implementation, which (I think) wrongly interprets ACPI script.
The ACPI implementation is fine, it is just the fw engineers that suck. I see I
have not cc'd the linux-vger ml when replying to a previous mail. I'll paste
it below:
Since commit 9a334cd "drm/nouveau/bios: fix shadowing of ACPI ROMs larger than
64KiB" chunks are not always read in multiples of 4KiB anymore (less is
possible). That is the only obvious thing I can think of atm (besides the
kmalloc(0) bug for which Martin submitted a patch in the previous mail).
Do you still still have an Asus laptop with the same BIOS as in
https://bugzilla.kernel.org/show_bug.cgi?id=19702? (if yes, then the acpidump
from that bug still applies).
This is the ACPI _ROM method that is being called:
Method (_ROM, 2, NotSerialized) // _ROM: Read-Only Memory
{
Add (Arg0, RBUF, Local0)
ShiftLeft (Arg1, 0x03, Local1) // times 8, bytes to bits?
Name (VBUF, Buffer (Arg1) {})
OperationRegion (VROM, SystemMemory, Local0, Local1)
Field (VROM, ByteAcc, NoLock, Preserve)
{
ROMI, 65536
}
Store (ROMI, VBUF)
Return (VBUF)
}
Arg0 is the offset (0 when first reading the size), Arg1 is the number of read
bytes (3). Note the use of Local1 in OperationRegion. The meaning there is
bytes, but a multiple of the requested bytes is passed. So if we request 4096
bytes, we end up with a VROM of size 32KiB. ROMI is 65536 bits (or 8192
bytes), hence reading 4096 does not give errors. But reading only 3 bytes will
fail. Martin, I saw your second patch (https://lkml.org/lkml/2012/10/20/150)
which takes care of the first case, but it skips the case where the ROM is of
an odd size (does that even happen, something like 64KiB+1 bytes? No idea.)
Addition: Conclusion: this means that the request must have a length must be
at least 1 KiB or it will fail with the ACPI error that you have seen before.
This AML snippet suck.
Regards,
Peter
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2012-10-21 20:07 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <1724445.dN2yMEzN6d@localhost>
[not found] ` <20121020212036.GA1698@fritha.org>
[not found] ` <20121020221948.GE5826@joi.lan>
[not found] ` <2155889.FOmdXlva7q@localhost>
2012-10-21 12:09 ` Linux 3.7-rc1 (nouveau_bios_score oops) Marcin Slusarz
2012-10-21 13:31 ` Heinz Diehl
2012-10-21 14:38 ` Linus Torvalds
2012-10-21 14:49 ` Marcin Slusarz
2012-10-21 17:13 ` Linus Torvalds
2012-10-21 20:07 ` Lekensteyn
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox