Re: [PATCH] Percpu tag allocator

From: Jeff Layton
Date: Thu Jun 13 2013 - 15:09:30 EST

On Thu, 13 Jun 2013 11:53:18 -0700
Tejun Heo <tj@xxxxxxxxxx> wrote:

> Hello, Andrew, Kent.
> (cc'ing NFS folks for id[r|a] discussion)
> On Wed, Jun 12, 2013 at 08:03:11PM -0700, Andrew Morton wrote:
> > They all sound like pretty crappy reasons ;) If the idr/ida interface
> > is nasty then it can be wrapped to provide the same interface as the
> > percpu tag allocator.
> >
> > I could understand performance being an issue, but diligence demands
> > that we test that, or at least provide a convincing argument.
> The thing is that id[r|a] guarantee that the lowest available slot is
> allocated and this is important because it's used to name things which
> are visible to userland - things like block device minor number,
> device indicies and so on. That alone pretty much ensures that
> alloc/free paths can't be very scalable which usually is fine for most
> id[r|a] use cases as long as lookup is fast. I'm doubtful that it's a
> good idea to push per-cpu tag allocation into id[r|a]. The use cases
> are quite different.
> In fact, maybe what we can do is adding some features on top of the
> tag allocator and moving id[r|a] users which don't require strict
> in-order allocation to it. For example, NFS allocates an ID for each
> transaction it performs and uses it to index the associate command
> structure (Jeff, Bruce, please correct me if I'm getting it wrong).

> The only requirement on IDs is that they shouldn't be recycled too
> fast. Currently, idr implements cyclic mode for it but it can easily
> be replaced with per-cpu tag allocator like this one and it'd be a lot
> more scalable. There are a couple things to worry about tho - it
> probably should use the highbits as generation number as a tag is
> given out so that the actual ID doesn't get recycled quickly, and some
> form dynamic tag sizing would be nice too.
> Thanks.

Actually, nfsd just uses idr to allocate stateids, which sort of
correspond to an open or locked file state. This is not a terribly
high-performance codepath -- we allocate one of these roughly on the
frequency of one (or sometimes two) per open() or fcntl() call on the
client. have the important piece right. The main constraint is
that we don't recycle these too quickly. We'd be fine with totally
random IDs as long as we can be ensured that we don't reuse any until
we've used all 2^31 possible values once.

Jeff Layton <jlayton@xxxxxxxxxx>
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at
Please read the FAQ at