Re: [PATCH 01/15] writeback: introduce .tagged_writepages for theWB_SYNC_NONE sync stage

From: Wu Fengguang
Date: Tue Jun 07 2011 - 19:25:05 EST


On Wed, Jun 08, 2011 at 07:02:34AM +0800, Andrew Morton wrote:
> On Wed, 08 Jun 2011 05:32:37 +0800
> Wu Fengguang <fengguang.wu@xxxxxxxxx> wrote:
>
> > sync(2) is performed in two stages: the WB_SYNC_NONE sync and the
> > WB_SYNC_ALL sync. Identify the first stage with .tagged_writepages and
> > do livelock prevention for it, too.
> >
> > Note that writeback_inodes_sb() is called by not only sync(), they are
> > treated the same because the other callers also need livelock prevention.
> >
> > Impact: It changes the order in which pages/inodes are synced to disk.
> > Now in the WB_SYNC_NONE stage, it won't proceed to write the next inode
> > until finished with the current inode.
>
> What problem is this patch actually fixing? It sounds like there's
> some livelock scenario in the WB_SYNC_NONE phase. otoh the final
> paragraph implies that the WB_SYNC_NONE phase is failing to write some
> pages under some situations.

Problem is: the WB_SYNC_NONE phase has no livelock prevention _at all_.

Jan's commit f446daaea9 ("mm: implement writeback livelock avoidance
using page tagging") is a partial fix in that it only fixed the
WB_SYNC_ALL phase livelock.

Although ext4 is tested to no longer livelock with commit f446daaea9,
it may due to some "redirty_tail() after pages_skipped" effect which
is by no means a guarantee for _all_ the file systems.

> Suggest that the changelog be fleshed out to cover all of this.

OK, I'll add the above two paragraphs to the changelog.

Thanks,
Fengguang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/