Re: [PATCH] add some drop_caches documentation and info messsge

From: Andrew Morton
Date: Wed Oct 24 2012 - 16:48:27 EST


On Wed, 24 Oct 2012 13:28:19 -0700
Dave Hansen <dave@xxxxxxxxxxxxxxxxxx> wrote:

> On 10/24/2012 12:54 PM, Andrew Morton wrote:
> > hmpf. This patch worries me. If there are people out there who are
> > regularly using drop_caches because the VM sucks, it seems pretty
> > obnoxious of us to go dumping stuff into their syslog. What are they
> > supposed to do? Stop using drop_caches?
>
> People use drop_caches because they _think_ the VM sucks, or they
> _think_ they're "tuning" their system. _They_ are supposed to stop
> using drop_caches. :)

Well who knows. Could be that people's vm *does* suck. Or they have
some particularly peculiar worklosd or requirement[*]. Or their VM
*used* to suck, and the drop_caches is not really needed any more but
it's there in vendor-provided code and they can't practically prevent
it.

[*] If your workload consists of having to handle large bursts of data
with minimum latency and then waiting around for another burst, it
makes sense to drop all your cached data between bursts.

> What kind of interface _is_ it in the first place? Is it really a
> production-level thing that we expect users to be poking at? Or, is it
> a rarely-used debugging and benchmarking knob which is fair game for us
> to tweak like this?

It was a rarely-used mainly-developer-only thing which, apparently, real
people found useful at some point in the past. Perhaps we should never
have offered it.

> Do we have any valid uses of drop_caches where the printk() would truly
> _be_ disruptive? Are those cases where we _also_ have real kernel bugs
> or issues that we should be working? If it disrupts them and they go to
> their vendor or the community directly, it gives us at least a shot at
> fixing the real problems (or fixing the "invalid" use).

Heaven knows - I'm just going from what Michal has told me and various
rumors which keep surfacing on the internet ;)

> Adding taint, making this a single-shot printk, or adding vmstat
> counters are all good ideas. I guess I think the disruption is a
> feature because I hope it will draw some folks out of the woodwork.

I had a "send mail to akpm@xxxxxxxxxx" printk in 3c59x.c many years
ago. For about two months. It took *years* before I stopped getting
emails ;)



Gee, I dunno. I have issues with it :( We could do
printk_ratelimited(one-hour) but I suspect that would defeat Michal's
purpose.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/