Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5

From: Andreas Dilger (adilger@turbolabs.com)
Date: Wed Oct 03 2001 - 17:22:10 EST

Next message: James Bottomley: "Re: how to get virtual address from dma address"
Previous message: Matt Bernstein: "[OT] Re: Which journalised filesystem uses Linus Torvalds ?"
In reply to: Robert Olsson: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Next in thread: Davide Libenzi: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Reply: Davide Libenzi: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Reply: Robert Olsson: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Oct 03, 2001 23:08 +0200, Robert Olsson wrote:
> Ingo Molnar writes:
> > (i did not criticize the list_add/list_del in any way, it's obviously
> > correct to cycle the polled devices. I highlited that code only to show
> > that the current patch as-is polls too agressively for generic server
> > load.)
>
> Yes I think we need some data here...
>
> > can you really make it 100% successful for rx? Ie. do you only ever call
> > the ->poll() function if there is a new packet waiting? How do you know
> > with a 100% probability that someone on the network just sent a new packet
> > waiting? (without receiving an interrupt to begin with that is.)
>
> Well we need RX-interrupts not to spin away the CPU or exhaust the the PCI-
> bus. The NAPI scheme is simple, turn off RX-interrupts when the first packet
> comes and have the kernel to pull packets from the RX-ring.
>
> I tried have pure polling... it easy do just have your driver return
> "not_done" all the time. Not a good idea. :-) Maybe as sofirq test.

I think it is rather easy to make this self-regulating (I may be wrong).

If you get to the stage where you are turning off IRQs and going to a
polling mode, then don't turn IRQs back on until you have a poll (or
two or whatever) that there is no work to be done. This will at worst
give you 50% polling success, but in practise you wouldn't start polling
until there is lots of work to be done, so the real success rate will
be much higher.

At this point (no work to be done when polling) there are clearly no
interrupts would be generated (because no packets have arrived), so it
should be reasonable to turn interrupts back on and stop polling (assuming
non-broken hardware). You now go back to interrupt-driven work until
the rate increases again. This means you limit IRQ rates when needed,
but only do one or two excess polls before going back to IRQ-driven work.

Granted, I don't know what the overhead of turning the IRQs on and off
is, but since we do it all the time already (for each ISR) it can't be
that bad.

If you are always having work to do when polling, then interrupts will
never be turned on again, but who cares at that point because the work
is getting done? Similarly, if you have IRQs disabled, but are sharing
IRQs there is nothing wrong in polling all devices sharing that IRQ
(at least conceptually).

I don't know much about IRQ handlers, but I assume that this is already
what happens if you are sharing an IRQ - you don't know which of many
sources it comes from, so you poll all of them to see if they have any
work to be done. If you are polling some of the shared-IRQ devices too
frequently (i.e. they never have work to do), you could have some sort
of progressive backoff, so you skip polling those for a growing number
of polls (this could also be set by the driver if it knows that it could
only generate real work every X ms, so we skip about X/poll_rate polls).

Cheers, Andreas

-- Andreas Dilger \ "If a man ate a pound of pasta and a pound of antipasto, \ would they cancel out, leaving him still hungry?" http://www-mddsp.enel.ucalgary.ca/People/adilger/ -- Dogbert

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/

Next message: James Bottomley: "Re: how to get virtual address from dma address"
Previous message: Matt Bernstein: "[OT] Re: Which journalised filesystem uses Linus Torvalds ?"
In reply to: Robert Olsson: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Next in thread: Davide Libenzi: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Reply: Davide Libenzi: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Reply: Robert Olsson: "Re: [announce] [patch] limiting IRQ load, irq-rewrite-2.4.11-B5"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Sun Oct 07 2001 - 21:00:29 EST