Re: [PATCH 7/9] readahead: sequential mmap readahead

From: Andrew Morton
Date: Fri Apr 10 2009 - 19:36:57 EST

Next message: Al Viro: "Re: [2.6.30-rc1] RCU detected CPU 1 stall"
Previous message: Yu, Fenghua: "RE: 2.6.29: can't resume from suspend with DMAR (intel iommu)enabled"
In reply to: Wu Fengguang: "[PATCH 7/9] readahead: sequential mmap readahead"
Next in thread: Wu Fengguang: "[PATCH] readahead: enforce full sync mmap readahead size"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Fri, 10 Apr 2009 14:10:04 +0800
Wu Fengguang <fengguang.wu@xxxxxxxxx> wrote:

> Auto-detect sequential mmap reads and do readahead for them.
>
> The sequential mmap readahead will be triggered when
> - sync readahead: it's a major fault and (prev_offset == offset-1);
> - async readahead: minor fault on PG_readahead page with valid readahead state.
>
> The benefits of doing readahead instead of read-around:
> - less I/O wait thanks to async readahead
> - double real I/O size and no more cache hits
>
> The single stream case is improved a little.
> For 100,000 sequential mmap reads:
>
> user system cpu total
> (1-1) plain -mm, 128KB readaround: 3.224 2.554 48.40% 11.838
> (1-2) plain -mm, 256KB readaround: 3.170 2.392 46.20% 11.976
> (2) patched -mm, 128KB readahead: 3.117 2.448 47.33% 11.607
>
> The patched (2) has smallest total time, since it has no cache hit overheads
> and less I/O block time(thanks to async readahead). Here the I/O size
> makes no much difference, since there's only one single stream.
>
> Note that (1-1)'s real I/O size is 64KB and (1-2)'s real I/O size is 128KB,
> since the half of the read-around pages will be readahead cache hits.
>
> This is going to make _real_ differences for _concurrent_ IO streams.
>
> Cc: Nick Piggin <npiggin@xxxxxxx>
> Cc: Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>
> Signed-off-by: Wu Fengguang <fengguang.wu@xxxxxxxxx>
> ---
> mm/filemap.c | 3 ++-
> 1 file changed, 2 insertions(+), 1 deletion(-)
>
> --- mm.orig/mm/filemap.c
> +++ mm/mm/filemap.c
> @@ -1471,7 +1471,8 @@ static void do_sync_mmap_readahead(struc
> if (VM_RandomReadHint(vma))
> return;
>
> - if (VM_SequentialReadHint(vma)) {
> + if (VM_SequentialReadHint(vma) ||
> + offset - 1 == (ra->prev_pos >> PAGE_CACHE_SHIFT)) {
> page_cache_sync_readahead(mapping, ra, file, offset, 1);
> return;
> }
>

We've always believed that readaround was beneficial for more random
access patterns - classically faulting in an executable. Although I
don't recall that this belief was very well substantiated.

(The best results I ever got was by doing readaround and setting the
size to a few MB, so we slurp the entire executable into memory in one
hit. lol.)

So my question is: what is the probability that this change will
inadvertently cause a randomish-access workload to fall into readahead
(rather than readaround) mode, and what is the impact when this
happens?

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Al Viro: "Re: [2.6.30-rc1] RCU detected CPU 1 stall"
Previous message: Yu, Fenghua: "RE: 2.6.29: can't resume from suspend with DMAR (intel iommu)enabled"
In reply to: Wu Fengguang: "[PATCH 7/9] readahead: sequential mmap readahead"
Next in thread: Wu Fengguang: "[PATCH] readahead: enforce full sync mmap readahead size"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]