Re: [PATCH] [V2] ata: libata: add workaround to flip LPM during suspend/resume

From: Thorsten Leemhuis
Date: Mon Sep 04 2023 - 06:55:48 EST


On 01.09.23 22:14, Niklas Cassel wrote:
> On Fri, Sep 01, 2023 at 10:34:57AM +0800, Koba Ko wrote:
>> Due to TigerLake/Adler Lake AHCI controller's LPM regression,
>> can't apply LPM on TigerLake/AdlerLake AHCI controller.
>>
>> Add a workaround to flip LPM during suspend/resume.
>> When suspneding,

Side note: s/suspneding/suspending/

>> apply LPM on TigerLake/AdlerLake AHCI.
>> Restore it to target_lpm_policy after resuming.
>>
>> Link: https://bugzilla.kernel.org/show_bug.cgi?id=217775
>> Signed-off-by: Koba Ko <koba.ko@xxxxxxxxxxxxx>
>> ---
>
> I understand that it is very frustrating to not be able to go to the
> deepest C-state.
>
> If you want LPM, we should add the PCI device and vendor id as a
> board_ahci_low_power entry.
>
> I am awake that a lot of people reported regressions when that entry was
> added, and that is was thus reverted.
> [...]
> If you do own one of those systems, isn't it better if we instead:
> 1) re-introduce the TigerLake AHCI board_ahci_low_power entry
> 2) debug and fix the root cause of the regressions on TigerLake laptops

Three stupid questions from the cheap seats, as I'm not involved in this
at all (the questions thus might not help at all or even confuse things;
so feel free to ignore or tell me). But it's one of those tricky
situations I more than once encountered as regression tracker, so maybe
sharing a few thoughts from my point might help to lessen the problem
for some users:

- Was something done so to free ourselves from this tricky situation in
the future? E.g. something like "for any future Intel controllers,
enable LPM by default (as then it's not a regression if things don't
work, which might be a better base to get down to the root of the problem)"

- is the LPM working fine on the Laptops affected by
https://bugzilla.kernel.org/show_bug.cgi?id=217114 with the
pre-installed OS (e.g. Windows)

- is switching to LPM maybe safe when there is nothing connected to that
controller (which I guess might be the case on some of those Laptops?)
In that case we might be able to improve the sitaution for some users.

Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
--
Everything you wanna know about Linux kernel regression tracking:
https://linux-regtracking.leemhuis.info/about/#tldr
If I did something stupid, please tell me, as explained on that page.