Re: [RFC] Strange code in cpu_idle()

From: Michael Buesch
Date: Mon Dec 06 2004 - 05:05:15 EST


Quoting Zwane Mwaikambo <zwane@xxxxxxxxxxxxxxxx>:
> On Mon, 6 Dec 2004, Michael Buesch wrote:
>
> > Hi Paul,
> >
> > Well, your mail is _very_ interresting for me.
> >
> > I get this oops for weeks with several kernel versions now:
>
> There shouldn't be any oopses with the changes in question, the original
> bugfix was for crashes whilst unloading ACPI processor or APM module.
>
> > Unable to handle kernel paging request at virtual address 00099108
> > printing eip:
> > b01010c0
> > *pde = 00000000
> > Oops: 0000 [#1]
> > SMP
> > Modules linked in: nfs lockd sunrpc nvidia ipv6 ohci_hcd tuner tvaudio msp3400 bttv video_buf firmware_class btcx_risc ehci_hcd uhci_hcd usbcore intel_agp agpgart evdev
> > CPU: 0
> > EIP: 0060:[<b01010c0>] Tainted: P VLI
> > EFLAGS: 00010286 (2.6.10-rc2-ck2-nozeroram-findvmastat)
>
> It would be a lot easier to debug with a vanilla kernel and no nvidia
> loaded.
>
> > The above oops shows a crash when we try to access the
> > thread_info->flags field. This is done by need_resched()
> > to check the TIF_NEED_RESCHED flag.
> > I don't know how to interpret the oops correctly to find
> > the source of the crash.
>
> Looks like thread_info was. Wow i think you're in a different world of
> hurt. If you can reproduce with less patches and no nvidia please send
> over the bugreport.

Yes. That's the problem.
It's impossible to reproduce and it's pure luck when it's triggered.
I'll look what I can do. nvidia already received this bugreport.

> Thanks,
> Zwane
>
>
>

--
Regards Michael Buesch [ http://www.tuxsoft.de.vu ]


Attachment: pgp00000.pgp
Description: PGP signature