Re: [PATCH] Prevent nested interrupts when the IRQ stack is near overflowing v2

From: Andi Kleen
Date: Thu Mar 25 2010 - 09:32:15 EST


> > > > Anyways if such a thing was done it would be a long term project
> > > > and that short term fix would be still needed.
> > >
> > > Your patch is not a fix. It's a lousy, horrible and unreliable
> > > workaround. It's not fixing the root cause of the problem at hand.
> >
> > It fixes the bug in a minimally intrusive way.
>
> It papers over the problem. We already know that the NIC driver floods
> the machine with interrupts, so why are you insisting that we need to

Well, in this case it's simply because it has 4 ports, all of them are
active, and they have a lot of MSI-X vectors for each stream.

Even if you had the perfect interrupt handler that ran in one cycle,
with enough of them nesting in parallel from different ports there
could still be a stack overflow on individual CPUs.
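
To put rough numbers on that: what matters is how many handlers nest,
not how fast each one runs. A minimal user-space sketch follows. The
stack size and the function names are assumptions made up for
illustration, not values or code taken from the patch.

```c
#include <stdbool.h>
#include <stddef.h>

/* Assumed per-CPU IRQ stack size in bytes; illustrative only. */
#define IRQ_STACK_BYTES 16384u

/*
 * Worst-case stack use when `nested` handlers interrupt one another on
 * a single CPU and each entry costs `frame_bytes` (exception frame plus
 * handler locals). Handler run time does not appear here at all.
 */
static size_t worst_case_stack_use(unsigned int nested, size_t frame_bytes)
{
    return (size_t)nested * frame_bytes;
}

/* True if that worst case no longer fits in the assumed IRQ stack. */
static bool irq_stack_overflows(unsigned int nested, size_t frame_bytes)
{
    return worst_case_stack_use(nested, frame_bytes) > IRQ_STACK_BYTES;
}
```

With these assumed numbers, 8 nested entries of 512 bytes fit easily,
while 64 of them do not, no matter how short each handler is.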

> bandaid that problem ?

Because the system crashes otherwise on that test?

> The minimally intrusive way is a one liner in that very driver code and
> if it causes problems for that very driver then we don't fix them by
> adding a callback in the generic interrupt code path.

Ok.

>
> The message which we would send out with applying that band aid would
> be simply: Go ahead driver writers and let your handlers run as long

Well, that's simply the state of affairs today. I'm merely attempting
to make it slightly safer without breaking anything in the process.

> as they want, we'll save you in 99.9% of the cases and we'll happily
> go and debug the 0.1% of completely undebuggable shit which will
> result out of that.

I'm not sure I fully understand your suggestion.

Is your suggestion to set IRQF_DISABLED only in that one driver and
ignore the other ones? (let's call that the "ostrich approach")

Or is your suggestion to set IRQF_DISABLED by default?

Or is it something else?
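
For reference, the difference between those options can be shown with
a toy user-space model. It assumes that IRQF_DISABLED means the handler
runs with local interrupts off, and that without it the next pending
vector can arrive before the current handler returns (the worst case).
The function name is hypothetical and this is not kernel code.

```c
#include <stdbool.h>

/*
 * Toy model: `pending` vectors are raised on one CPU. Returns the
 * deepest handler nesting reached.
 */
static unsigned int simulate_max_depth(unsigned int pending,
                                       bool irqs_off_in_handler)
{
    unsigned int depth = 0;
    unsigned int max_depth = 0;

    while (pending > 0) {
        pending--;
        depth++;                /* enter a handler */
        if (depth > max_depth)
            max_depth = depth;
        if (irqs_off_in_handler)
            depth--;            /* runs to completion before the next one */
        /* else: worst case, the next vector preempts before we return */
    }
    return max_depth;
}
```

In this model, setting the flag on one driver only bounds the nesting
caused by that driver's vectors; handlers registered without it can
still stack up on top of each other.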

Thanks,

-Andi

--
ak@xxxxxxxxxxxxxxx -- Speaking for myself only.