Re: Ext4 and the "30 second window of death"

From: david
Date: Thu Apr 02 2009 - 14:44:48 EST


On Thu, 2 Apr 2009, Matthew Garrett wrote:

On Thu, Apr 02, 2009 at 11:22:48AM -0700, david@xxxxxxx wrote:
On Wed, 1 Apr 2009, Matthew Garrett wrote:
laptop-mode is something that makes sense as a default behaviour under a
lot of circumstances. Adding fsync() suppression means it's utterly
impossible to use it in that way. An additional mode would be perfectly
reasonable, as long as it's made clear that it's really a request for
data to be discarded at some point. The current mode isn't.

this issue seems pretty straightforward to me

the apps do fsync (and similar) to the degree that they think their data
is important (potentially with config options if they acknowlege that
their data isn't _always_ that important)

the system allows the admin to override the application and say "I'm
willing to loose up to X seconds of data for other benifits"

if this can work cleanly (with the ordering issue that was identified,
which may involve having multiple versions of the metadata cached) it
seems like a very clean interface.

It does, but it's a different interface to the current one with a
different aim and a different set of tradeoffs. The current behaviour of
laptop-mode is that fsync() results in things hitting disk. The only
configurability of laptop-mode is how long it then waits to flush out
everything else as well.

The solution to "fsync() causes disk spinups" isn't "ignore fsync()".
It's "ensure that applications only use fsync() when they really need
it", which requires us to also be able to say "fsync() should not be
required to ensure that events occur in order".

ignore the issue of order on the local disk for the moment.

what should an application do to make sure it's data isn't lost?

let's not talk a database here, let's talk something simpler, like a POP3 mail client (even though I strongly favor IMAP ;-)

it wants to have the message saved before it deletes it from the server.

how should it try to do this?

the only portable method is to fsync the file after it's written and before sending the delete to the server.

so your mail client _should_ issue fsync calls.

however, some (many, most??) users would probably be willing to loose a little e-mail to gain a significant increase in battery life on their laptops.

today they have no choice (other than picking a mail client that doesn't try to protect it's local data)

with the proposed addition to laptop mode (delaying fsync until the disk is awake), the user (or more precisely the admin) gains the ability to define this trade-off rather than depending on the application developers all doing this right.

without this, we end up in a situation like the powertop wakeups. it only takes one 'buggy' application to destroy your power management and performance. but in this case, the application that is 'buggy' from a power management point of view may be entirely correct from a data safety point of view.

David Lang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/