Re: [PATCH v3 1/3] cpufreq: Add boost frequency support in core

From: Rafael J. Wysocki
Date: Tue Jun 18 2013 - 09:17:03 EST


On Tuesday, June 18, 2013 12:12:13 PM Viresh Kumar wrote:
> On 17 June 2013 19:21, Lukasz Majewski <l.majewski@xxxxxxxxxxx> wrote:
> > On Mon, 17 Jun 2013 18:40:50 +0530, Viresh Kumar wrote:
> >> >> > The core acpi-cpufreq.c code hadn't been changed by me, so I
> >> >> > assume that it will work as before.
> >> >>
> >> >> That should adapt your patch in your patchset.
> >
> > Could you explain what do you mean?
>
> Update acpi-cpufreq to use your infrastructure.
>
> >> >> From sysfs?? I thought we are going to have some automatic control
> >> >> of this stuff from inside kernel.
> >> >
> >> > From sysfs I just enable the boost. I do not order from userpace the
> >> > cpufreq to run with a particular (boosted) frequency.
> >> >
> >> > When I enable boost - I ask (politely) the cpufreq core to
> >> > reevaluate policies and when applicable increase policy->max.
> >> >
> >> > Then governor can use this new frequencies for normal operation.
> >>
> >> So, with your current patchset in, ondemand or conservative governors
> >> will start using boost frequencies. Which might burn your chip.
> >
> > Two scenarios:
> > 1. Thermal framework is compiled in and enabled (for exynos/other SoC)
> > -> trip point is reached -> boost is disabled -> frequency is reduced ->
> > SoC is cooled down.
> >
> > I think, that thermal framework is a good "safe valve" here.
>
> Even in this case boost shouldn't have been enabled by default.
> Its not only about burning the chip but more than that.
>
> According to my understanding, boost was important for power
> saving. In case a high load can be managed by a single cpu with
> boost freqs, then its better to use boost freqs rather than bringing
> another cpu up.

Do you mean the 'boost' sysfs attribute or the 'turbo frequencies' concept?

> Normally boost freqs are not so useful if we talk about powersaving,
> as their energy consumption is much higher with not so great impact
> on performance.

Er, er, please be careful here. The impact on performance may be sufficient
for deep C-states to become relevant in some cases.

> That's why when this thread started we talked about boost only when
> one cpu is operational. But with your patch all cores can use boost
> freq and thermal will come into picture just to save the chip.

Well, that's why on x86 turbo is controlled by hardware that takes care of
keeping things within the chip's thermal limits.

> That's wrong. This isn't why we invented boost here. Otherwise you
> just don't need boost feature at all for your SoC. Just make these
> freqs as available freqs and let thermal control policy->max/min
> to save your chip.

The 'boost' attribute added by acpi-cpufreq means "let the hardware use turbo
frequencies".

I'd recommend you both to read Documentation/cpu-freq/boost.txt now. :-)

I think we can extend the meaning to "let turbo frequencies be used", but if
we need software to play the role of the hardware's thermal control, we need
to be very careful.

> > 2. Thermal framework is disabled and user has enabled boost and used
> > ondemand / conservative governor.
> > Then execute gzip < /dev/urandom > /dev/null & (on all cores).
> >
> > Then yes, chip will hang/burn due to crossing operating point (or burn).
> >
> >
> > What other means of control (despite of thermal) would you consider
> > applicable?
> >
> > What comes to my mind is modifying governor logic to count how long the
> > CPU run with boost enabled and then disable it.
>
> Its not about how long.. One cpu type can work longer with boost freq
> compared to other.
>
> What we probably need is:
> - Enabled boost from sysfs if required (now below steps will come into
> picture)

This has to be compatible with the existing stuff.

> - See how many cpus are running, if only one then start using boost freqs
> - Now thermal should be come into picture to save chip in case a single
> cpu running at boost can burn it out.

I'd say there needs to be a separate controller/monitor for that that will
know what the chip's thermal limit is and how that relates to how fast the
CPU core(s) may run and for how much time. I'm not sure it is sufficient
to "wait for thermal to kick in" here, because you may need to slow down
things in advance (i.e. before thermal sensors tell you there's too much heat,
because that may be too late already).

Thanks,
Rafael


--
I speak only for myself.
Rafael J. Wysocki, Intel Open Source Technology Center.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/