Re: Question about usage of RCU in the input layer

From: Paul E. McKenney
Date: Fri Mar 20 2009 - 21:28:08 EST


On Fri, Mar 20, 2009 at 11:13:54AM -0700, Arjan van de Ven wrote:
> On Fri, 20 Mar 2009 07:31:04 -0700
> "Paul E. McKenney" <paulmck@xxxxxxxxxxxxxxxxxx> wrote:
> > >
> > > that'd be throwing out the baby with the bathwater... I'm trying to
> > > use the other cpus to do some of the boot work (so that the total
> > > goes faster); not using the other cpus would be counter productive
> > > to that. (As is just sitting in synchronize_rcu() when the other
> > > cpu is working.. hence this discussion ;-)
> >
> > OK, so you are definitely running multiple CPUs when the offending
> > synchronize_rcu() executes, then?
>
> absolutely.
> (and I'm using bootgraph.pl in scripts to track who's stalling etc)
> >
> > If so, here are some follow-on questions:
> >
> > 1. How many synchronize_rcu() calls are you seeing on the
> > critical boot path
>
> I've seen only this (input) one to take a long time

Ouch!!! A -single- synchronize_rcu() taking a full second??? That
indicates breakage.

> > and what value of HZ are you running?
>
> 1000

K, in absence of readers for RCU_CLASSIC, we should see a handful
of milliseconds for synchronize_rcu().

> > If each synchronize_rcu() is taking (say) tens of jiffies,
> > then, as Peter Zijlstra notes earlier in this thread, we need to focus
> > on what is taking too long to get through its RCU read-side
> > critical sections
>
> I know that "the other guy" is not optimal and takes waaay too long.

That could explain why Peter focused on this case. ;-)

> > Otherwise, if each synchronize_rcu() is
> > in the 3-5 jiffy range, I may finally be forced to create an
> > expedited version of the synchronize_rcu() API.
>
> I think a simplified API for the "add to a list" case might make sense.
> Because the request isn't for a full sync for sure...
>
> (independent of that .. the open question is if this specific case is
> even needed; I think the code confused "send to others" with "wait
> until everyone sees"; afaik synchronize_rcu() has no pushing behavior
> at all, nor should it)

Quite possibly, perhaps Dmitry will come up with something.

> > 2. If expediting is required, then the code calling
> > synchronize_rcu() might or might not have any idea whether or not
> > expediting is appropriate. If it does not, then we would need some
> > sort of way to tell synchronize_rcu() that it should act more
> > aggressively, perhaps /proc flag or kernel global variable indicating
> > that boot is in progress.
> >
> > No, we do not want to make synchronize_rcu() aggressive all
> > the time, as this would harm performance and energy efficiency in
> > the normal runtime situation.
> >
> > So, if it turns out that synchronize_rcu()'s caller does not
> > know whether or not expediting is appropriate, can the boot
> > path manipulate such a flag or variable?
> >
> > 3. Which RCU implementation are you using? CONFIG_CLASSIC_RCU,
> > CONFIG_TREE_RCU, or CONFIG_PREEMPT_RCU?
>
> CLASSIC

OK, it usually has the fastest synchronize_rcu() at the moment, though
I will be giving TREE_RCU some more help.

Sounds like I should hold off in favor of Dmitry's and Peter's efforts.

Thanx, Paul
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/