Re: [PATCH] allocate page caches pages in round robin fasion

From: Martin J. Bligh
Date: Fri Aug 13 2004 - 11:49:19 EST

Next message: John Riggs: "RE: PROBLEM: 2.6.7 Linux Kernel Crash While Detecting PCI Devices"
Previous message: Randy.Dunlap: "Re: epoll, aio etc..."
In reply to: Jesse Barnes: "Re: [PATCH] allocate page caches pages in round robin fasion"
Next in thread: Nick Piggin: "Re: [PATCH] allocate page caches pages in round robin fasion"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

--Jesse Barnes <jbarnes@xxxxxxxxxxxx> wrote (on Friday, August 13, 2004 09:34:20 -0700):

> On Friday, August 13, 2004 9:20 am, Martin J. Bligh wrote:
>> >> I really don't think this is a good idea - you're assuming there's
>> >> really no locality of reference, which I don't think is at all true in
>> >> most cases.
>> >
>> > No, not at all, just that locality of reference matters more for stack
>> > and anonymous pages than it does for page cache pages. I.e. we don't
>> > want a node to be filled up with page cache pages causing all other
>> > memory references from the process to be off node.
>>
>> Does that actually happen though? Looking at the current code makes me
>> think it'll keep some pages free on all nodes at all times, and if kswapd
>> does it's job, we'll never fall back across nodes. Now ... I think that's
>> broken, but I think that's what currently happens - that was what we
>> discussed at KS ... I might be misreading it though, I should test it.
>
> Not nearly enough pages for any sizeable app though. Maybe the behavior could
> be configurable?

Well, either we're:

1. Falling back and putting all our most recent accesses off-node.

or.

2. Not falling back and only able to use one node's memory for any one
(single threaded) app.

Either situation is crap, though I'm not sure which turd we picked right
now ... I'd have to look at the code again ;-) I thought it was 2, but
I might be wrong.

>> Even if that's not true, allocating all your most recent stuff off-node is
>> still crap (so either way, I'd agree the current situation is broken), but
>> I don't think the solution is to push ALL your accesses (with n-1/n
>> probability) off-node ... we need to be more careful than that ...
>
> Only page cache references...

Yeah, depends how important those are to the app though ;-) I absolutely
agree with you that the current situation is broken ... we need to do
*something*.

>> Not sure I'd agree with that - it's the same problem as swappiness on a
>> global basis for non-NUMA machines. We want the pages we're using MOST to
>> be local, the others to be not-local, and that doesn't equate (necessarily)
>> to whether it's pagecache or not. Shared pages could indeed be dealt with
>> differently, and spread more global ... but I don't agree that pagecache
>> pages equate 1-1 with being globally shared - in fact, I think most often
>> the opposite is true.
>
> Yeah, that's a good point. That argues for configurability too. We should
> behave differently depending on whether the page is shared or not.

Right. An app that mmap'ed a big file then thrashed on it would be a good
example, though simple read-write heavily across a small fileset would do
much same thing.

M.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: John Riggs: "RE: PROBLEM: 2.6.7 Linux Kernel Crash While Detecting PCI Devices"
Previous message: Randy.Dunlap: "Re: epoll, aio etc..."
In reply to: Jesse Barnes: "Re: [PATCH] allocate page caches pages in round robin fasion"
Next in thread: Nick Piggin: "Re: [PATCH] allocate page caches pages in round robin fasion"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]