Re: Queuing of disk writes

From: Charles Samuels
Date: Mon Apr 04 2011 - 13:50:19 EST


Hi,

Thanks for the reply.

On Sunday, April 03, 2011 7:02:35 pm Ted Ts'o wrote:
> On Fri, Apr 01, 2011 at 12:59:53PM -0700, Charles Samuels wrote:
> > I have an application that is writing large amounts of very
> > fragmented data to harddrives. That is, I could write megabytes of
> > data in blocks of a few bytes scattered around a multi-gigabyte
> > file.
>
> Doctor, doctor, it hurts when I do this.... any way you can avoid
> doing this? What is your application doing at the high level.
Not really, I need the on-disk data organized in this pattern, so that the
reads are optimized nicely. It's a database application.

>
> > Obviously, doing this causes the harddrive to seek a lot and takes a
> > while. From what I understand, if I allow linux to cache the
> > writes, it will fill up the kernel's write cache, and then
> > consequently the disk drive's DMA queue. As a result of that, the
> > harddrive can pick the correct order to do these writes,
> > significantly reducing seek times.
>
> This is one way to avoid some of the seeks, yes.

What's another way? Other than not doing it :)

> Who or what is calling fsync()? Is it being called by your
> application because you want to initiate writeout? Or is it being
> called by some completely unrelated process?

It's being called by my own process. When fsync finishes, I update another file
with some offset counters, fsync that, and with some luck, my writes are
transactional.

> If it is being called by the application, one thing you can do is to
> use the Linux-specific system call sync_file_range(). You can use
> this to do asynchronous data flushes of the file, and control which
> range of bytes are written out, which can also help avoid flooding the
> disk with too many write requests.

What would be good use of sync_file_range? It looks pretty useful, but I don't
know how to make good use of it.

For example, SYNC_FILE_RANGE_WRITE, wouldn't linux start this pretty much
immediately? And wouldn't I really not want to give it a suggestion for what
order it does it in?

Would calling sync_file_range with a flag that allows blocking have a
performance benefit compared to fsync? Specifically, can I expect Linux to not
totally block all reads and writes to other files?

Charles
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/