Re: acpi_ex_system_memory_space_handler oops

From: Sergey Senozhatsky
Date: Thu Aug 13 2009 - 04:31:21 EST


On (08/12/09 20:18), Len Brown wrote:
> > Kernel oopses on early boot stages (boots normally with boot_delay=10).
>
> > rc5-git8
>
> Please find which kernel this failure starts in.
> Start by testing 2.6.30.stable so we can tell
> if this is a regression in the current RC.
>
OK... I'll try to bisect. It'll take time.
I consider 7 successfull boots to mark that kernel is ok (it may be wrong).

-git8
maxcpus=1
+ + + + + + +

acpi_serialize=1
+ -

boot_delay=1
+ + + + + + +

boot_delay=0
+ + -

vga=791
+ + + + + + +


> What is the difference between the two dmesg?
> they have the same cmdline, neither has boot_delay,
> and neither dmesg shows the oops.
>
boot params are the same. Both kernels booted successfully. There are no oopses in syslog (I guess 0.21 is
too early for logd to start).

The only difference (which I find suspicious) is around 0.21 second:
[ 0.205668] ACPI Error (dswload-0790): [PCFG] Namespace lookup failure, AE_ALREADY_EXISTS
[ 0.205899] ACPI Exception: AE_ALREADY_EXISTS, During name lookup/catalog 20090521 psloop-227
[ 0.206124] ACPI Error (psparse-0537): Method parse/execution failed [\_SB_.RBPE] (Node f7017678), AE_ALREADY_EXISTS
[ 0.206417] ACPI: Marking method RBPE as Serialized because of AE_ALREADY_EXISTS error
[ 0.206773] ACPI Error (psparse-0537): Method parse/execution failed [\_SB_.PCI0.P0P4.XCRD._STA] (Node f7015648), AE_ALREADY_EXISTS
[ 0.207070] ACPI: Marking method _STA as Serialized because of AE_ALREADY_EXISTS error
[ 0.207326] ACPI Error (uteval-0256): Method execution failed [\_SB_.PCI0.P0P4.XCRD._STA] (Node f7015648), AE_ALREADY_EXISTS
[ 0.213503] ACPI Error: Could not map memory at 0000000000000000, size 0 20090521 exregion-173
[ 0.213734] ACPI Exception: AE_NO_MEMORY, Returned by Handler for [SystemMemory] 20090521 evregion-424
[ 0.213963] ACPI Error (psparse-0537): Method parse/execution failed [\_SB_.RBPE] (Node f7017678), AE_NO_MEMORY
[ 0.214286] ACPI Error (psparse-0537): Method parse/execution failed [\_GPE._L01] (Node f701d198), AE_NO_MEMORY
[ 0.214616] ACPI Exception: AE_NO_MEMORY, while evaluating GPE method [_L01] 20090521 evgpe-568


> The fact that boot_delay=10 makes the problem go away
> suggests some sort of race condition.
I think it is.

> Is 10 the smallest value of printk_time that works, or is it a random value?
>
No. 1 is the smallest.

> Do you still see the oops with CONFIG_PRINTK_TIME=n?
>
Yes.

> Do you still see the oops with maxcpus=1?
>
No.

> Do you still see the oops with "acpi_serialize=1"?
>
Yes.

Sergey

Attachment: signature.asc
Description: Digital signature