linux-6.2-rc4+ hangs on poweroff/reboot: Bisected

From: Chris Clayton
Date: Wed Jan 18 2023 - 16:00:40 EST


Hi.

I built and installed the latest development kernel earlier this week. I've found that when I try to shut the laptop
down (or reboot it), it hangs right at the end of closing the current session. The last line I see on the screen when
rebooting is:

sd 4:0:0:0: [sda] Synchronizing SCSI cache

When shutting down, I see one additional line:

sd 4:0:0:0: [sda] Stopping disk

In both cases the machine then hangs and I have to hold down the power button for a few seconds to switch it off.

Linux 6.1 is OK but 6.2-rc1 hangs, so I bisected between the two and landed on:

# first bad commit: [0e44c21708761977dcbea9b846b51a6fb684907a] drm/nouveau/flcn: new code to load+boot simple HS FWs
(VPR scrubber)

I built and installed a kernel with f15cde64b66161bfa74fb58f4e5697d8265b802e (the parent of the bad commit) checked out
and that shuts down and reboots fine. I then did the same with the bad commit checked out and that does indeed hang, so
I'm confident the bisect outcome is OK.

Kernels 6.1.6 and 5.15.88 are also OK.

My system has dual GPUs - one Intel and one NVIDIA. The relevant extracts from 'lspci -v' are:

00:02.0 VGA compatible controller: Intel Corporation CometLake-H GT2 [UHD Graphics] (rev 05) (prog-if 00 [VGA controller])
Subsystem: CLEVO/KAPOK Computer CometLake-H GT2 [UHD Graphics]
Flags: bus master, fast devsel, latency 0, IRQ 142
Memory at c2000000 (64-bit, non-prefetchable) [size=16M]
Memory at a0000000 (64-bit, prefetchable) [size=256M]
I/O ports at 5000 [size=64]
Expansion ROM at 000c0000 [virtual] [disabled] [size=128K]
Capabilities: [40] Vendor Specific Information: Len=0c <?>
Capabilities: [70] Express Root Complex Integrated Endpoint, MSI 00
Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
Capabilities: [d0] Power Management version 2
Kernel driver in use: i915
Kernel modules: i915

01:00.0 VGA compatible controller: NVIDIA Corporation TU117M [GeForce GTX 1650 Ti Mobile] (rev a1) (prog-if 00 [VGA controller])
Subsystem: CLEVO/KAPOK Computer TU117M [GeForce GTX 1650 Ti Mobile]
Flags: bus master, fast devsel, latency 0, IRQ 141
Memory at c4000000 (32-bit, non-prefetchable) [size=16M]
Memory at b0000000 (64-bit, prefetchable) [size=256M]
Memory at c0000000 (64-bit, prefetchable) [size=32M]
I/O ports at 4000 [size=128]
Expansion ROM at c3000000 [disabled] [size=512K]
Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Legacy Endpoint, MSI 00
Kernel driver in use: nouveau
Kernel modules: nouveau

DRI_PRIME=1 is exported in one of my init scripts (yes, I am still using sysvinit).

I've attached the bisect.log, but please let me know if I can provide any other diagnostics. Please cc me as I'm not
subscribed.


Chris

# bad: [1b929c02afd37871d5afb9d498426f83432e71c2] Linux 6.2-rc1
# good: [830b3c68c1fb1e9176028d02ef86f3cf76aa2476] Linux 6.1
git bisect start 'v6.2-rc1' 'v6.1'
# bad: [fc4c9f450493daef1c996c9d4b3c647ec3121509] Merge tag 'efi-next-for-v6.2' of git://git.kernel.org/pub/scm/linux/kernel/git/efi/efi
git bisect bad fc4c9f450493daef1c996c9d4b3c647ec3121509
# good: [8ecd28b7a3a4c43a875a8840851f72468a2ca1d7] Merge tag 'ata-6.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/dlemoal/libata
git bisect good 8ecd28b7a3a4c43a875a8840851f72468a2ca1d7
# bad: [66efff515a6500d4b4976fbab3bee8b92a1137fb] Merge tag 'amd-drm-next-6.2-2022-12-07' of https://gitlab.freedesktop.org/agd5f/linux into drm-next
git bisect bad 66efff515a6500d4b4976fbab3bee8b92a1137fb
# good: [49e8e6343df688d68b12c2af50791ca37520f0b7] Merge tag 'amd-drm-next-6.2-2022-11-04' of https://gitlab.freedesktop.org/agd5f/linux into drm-next
git bisect good 49e8e6343df688d68b12c2af50791ca37520f0b7
# bad: [fc58764bbf602b65a6f63c53e5fd6feae76c510c] Merge tag 'amd-drm-next-6.2-2022-11-18' of https://gitlab.freedesktop.org/agd5f/linux into drm-next
git bisect bad fc58764bbf602b65a6f63c53e5fd6feae76c510c
# bad: [4e291f2f585313efa5200cce655e17c94906e50a] Merge tag 'drm-misc-next-2022-11-10-1' of git://anongit.freedesktop.org/drm/drm-misc into drm-next
git bisect bad 4e291f2f585313efa5200cce655e17c94906e50a
# bad: [78a43c7e3b2ff5aed1809f93b4f87a418355789e] drm/nouveau/gr/gf100-: make global attrib_cb actually global
git bisect bad 78a43c7e3b2ff5aed1809f93b4f87a418355789e
# good: [eb39c613481fd2fe6b2f66ec2ca21f8fdcdd4cac] drm/nouveau/fifo: expose per-runlist CHID information
git bisect good eb39c613481fd2fe6b2f66ec2ca21f8fdcdd4cac
# good: [8ab849d6dd4c2eb8880096e53e91dfb6ca37b589] drm/nouveau/fifo: add new engine context handling
git bisect good 8ab849d6dd4c2eb8880096e53e91dfb6ca37b589
# bad: [0e44c21708761977dcbea9b846b51a6fb684907a] drm/nouveau/flcn: new code to load+boot simple HS FWs (VPR scrubber)
git bisect bad 0e44c21708761977dcbea9b846b51a6fb684907a
# good: [b7f44ef7320a50dd6718df06f8750489c1b1d5ea] drm/nouveau/pmu: move preinit() falcon reset to devinit
git bisect good b7f44ef7320a50dd6718df06f8750489c1b1d5ea
# good: [3b330f0875014b475ca3dc37e4397aee13f00986] drm/nouveau/sec2: unload RTOS before tearing down WPR
git bisect good 3b330f0875014b475ca3dc37e4397aee13f00986
# good: [f15cde64b66161bfa74fb58f4e5697d8265b802e] drm/nouveau/flcn: rework falcon reset
git bisect good f15cde64b66161bfa74fb58f4e5697d8265b802e
# first bad commit: [0e44c21708761977dcbea9b846b51a6fb684907a] drm/nouveau/flcn: new code to load+boot simple HS FWs (VPR scrubber)
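
A log like the one above can be saved to a file and replayed in a kernel tree with 'git bisect replay <logfile>', which should restore the same bisect state. As a minimal sketch for anyone scripting around such logs, the helper below extracts the culprit's hash from the final "# first bad commit" line. The function name first_bad_commit and the file name bisect.log are illustrative assumptions, not part of git.

```shell
#!/bin/sh
# Hypothetical helper (not a git command): print the 40-character hash found
# on the "# first bad commit: [<hash>] <subject>" line of a git bisect log.
# Prints nothing if the log has no such line (e.g. the bisect never finished).
first_bad_commit() {
    sed -n 's/^# first bad commit: \[\([0-9a-f]\{40\}\)\].*/\1/p' "$1"
}

# Example usage, assuming the log was saved as bisect.log:
#   first_bad_commit bisect.log
```

The printed hash can then be passed to 'git show' to inspect the change, or used with a trailing '^' (for example 'git checkout <hash>^') to build the parent commit for comparison.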