Re: [PATCH 01/13] writeback: IO-less balance_dirty_pages()

From: Wu Fengguang
Date: Wed Dec 01 2010 - 20:56:55 EST


On Thu, Dec 02, 2010 at 07:03:33AM +0800, Andrew Morton wrote:
> On Wed, 1 Dec 2010 21:38:18 +0800
> Wu Fengguang <fengguang.wu@xxxxxxxxx> wrote:
>
> > It shows that
> >
> > 1) io_schedule_timeout(200ms) always return immediately for iostat,
> > forming a busy loop. How can this happen? When iostat received
> > some signal? Then we may have to break out of the loop on catching
> > signals. Note that I already have
> > if (fatal_signal_pending(current))
> > break;
> > in the balance_dirty_pages() loop. Obviously that's not enough.
>
> Presumably the calling task has singal_pending().
>
> Using TASK_INTERRUPTIBLE in balance_dirty_pages() seems wrong. If it's
> going to do that then it must break out if signal_pending(), otherwise
> it's pretty much guaranteed to degenerate into a busywait loop.

Right. It seems not rewarding enough to check signal_pending(). We've
already been able to response to signals much faster than before
(which takes more time to block in get_request_wait()).

> Plus we *do* want these processes to appear in D state and to
> contribute to load average.
>
> So it should be TASK_UNINTERRUPTIBLE.

Fair enough. I do missed the D state (without the long wait :).
Here is the patch.

Thanks,
Fengguang
---
Subject: writeback: do uninterruptible sleep in balance_dirty_pages()
Date: Thu Dec 02 09:31:19 CST 2010

Using TASK_INTERRUPTIBLE in balance_dirty_pages() seems wrong. If it's
going to do that then it must break out if signal_pending(), otherwise
it's pretty much guaranteed to degenerate into a busywait loop. Plus
we *do* want these processes to appear in D state and to contribute to
load average.

So it should be TASK_UNINTERRUPTIBLE. -- Andrew Morton

Signed-off-by: Wu Fengguang <fengguang.wu@xxxxxxxxx>
---
mm/page-writeback.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)

--- linux-next.orig/mm/page-writeback.c 2010-12-02 09:30:29.000000000 +0800
+++ linux-next/mm/page-writeback.c 2010-12-02 09:30:34.000000000 +0800
@@ -636,7 +636,7 @@ pause:
pages_dirtied,
pause);
bdi_update_write_bandwidth(bdi, &bw_time, &bw_written);
- __set_current_state(TASK_INTERRUPTIBLE);
+ __set_current_state(TASK_UNINTERRUPTIBLE);
io_schedule_timeout(pause);
bdi_update_write_bandwidth(bdi, &bw_time, &bw_written);

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/