Re: [linux-pm] [SUSPECTED SPAM] Re: Proposal for a new algorithmfor reading & writing a hibernation image.

From: Nigel Cunningham
Date: Mon Jun 07 2010 - 01:23:39 EST


Hi.

On 07/06/10 00:06, Rafael J. Wysocki wrote:
On Sunday 06 June 2010, Nigel Cunningham wrote:
On 06/06/10 09:20, Rafael J. Wysocki wrote:
On Sunday 06 June 2010, Nigel Cunningham wrote:
...
I'm not talking about dropping the page cache, but about keeping it in place
and saving as a part of the image - later. The part I think is too complicated
is the re-using of that memory for creating the "atomic" image. That in my
opinion really goes too far and causes things to be excessively fragile -
without a really good reason (it is like "we do that because we can" IMO).

First, it's not fragile.

Well, I obviously don't agree and I'm not convinced by the arguments below.

Okay. I'm going to assume you're not being unreasonable and ask: "What do you find unconvincing in the arguments below? That is, what can I do to help build a better case for you?"

All it depends on is the freezer being
effective, just as the other parts of hibernation depend on the freezer
being effective. Checksumming has been used to confirm that the contents
of memory haven't changed prior to this page fault idea. I can think of
examples where pages have been found to have changed, but they're few
and far between, and easily addressed by resaving the affected pages in
the atomic copy.

Second, it's not done without reason or simply because we can. It's done
because it's been proven to make it more likely for us to be able to
hibernate successfully in the first place AND gives us a more responsive
system post-resume.

We haven't mentioned the first part so far, so let me go into more
detail there. The problem with not doing things the TuxOnIce way is that
you when you have more than (say) 80% of memory in use, you MUST free
memory. Depending upon your workload, that simply might not be possible.
In other cases, the only way to free memory might be to swap it out, but
you're then reducing the amount of storage available for the image,
which means you have to free more memory again, which means... For
maximum reliability, you need an algorithm wherein you can save the
contents of memory as they are at the start of the cycle.

...
I do agree that doing a single atomic copy and saving the result makes
for a simpler algorithm, but I've always been of the opinion that we're
writing code to satisfy real work needs and desires, not our own desires
for simpler or easier to understand algorithms. Doing the bare minimum
isn't an option for me.

I'm not talking about that!

In short, if your observation that the page cache doesn't really change during
hibernation is correct, then it should be possible to avoid making an atomic
copy of it and to save it directly from its original locations. I think that
would allow us to save about 80% of memory in the majority of cases without
the entire complexity that makes things extremely fragile and depends haevily
on the current (undocumented) behavior of our mm subsystem that _happens_
to be favourable to TuxOnIce. HTH

I'm not sure what this current undocumented behaviour is.

Easy. The behavior that allows you to use memory used for the page cache
hibernation without the risk of it being overwritten in the process. This is
not documented anywhere and I don't think it'll ever be.

All I'm relying on is the freezer working and the mm subsystem not deciding to
free process pages or LRU for no good reason.

It's more than just freeing them. In fact you need a guarantee that their
contents won't be modified over the entire hibernation in a way that you don't
control. There's no such guarantee at the moment I know of, so you have to
assume that that won't happen, which is _exactly_ relying on undocumented
behavior that's not guaranteed to change in future.

I think it's rather unfair to talk about undocumented and unguaranteed behaviour when you know I'm relying on the freezer, which is documented and guaranteed to work.

I'm willing to modify things so we use this page-fault idea to make the guarantee even more certain - would that satisfy you?

Remember that kswapd is also frozen.

But some day it may turn out that it would be better not to freeze it for some
reason. If we go the TuxOnIce route, that won't ever be possible I think.

It may also turn out some day that it's better not to freeze any processes at all.

But seriously, what could possibly lead us to that decision? The only reason we'd want to not freeze kswapd would be if we wanted it to be able to free memory while we're hibernating. What demand would there be for such memory apart from our own routines for writing the image? What impetus would it have to do any freeing? After the atomic copy, any other work is pointless - it's going to be thrown away when we power off.

Regards,

Nigel
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/