Re: [BUG][6.15][perf] Kernel panic not syncing: Fatal exception in interrupt
From: Yeoreum Yun
Date: Mon Jun 02 2025 - 11:37:44 EST
Hi David,
> > > Before I start testing, I feel concerned about following chain:
> > >
> > > ./kernel/fork.c:
> > > bad_fork_cleanup_perf:
> > > perf_event_free_task()
> > > perf_free_event()
> > > list_del_event()
> > >
> > > This patch seems changes the behavior in this callchain.
> > > Would this have other side-effect?
> >
> > What behavior is changed you're worry about?
> > both error patch is handled by __perf_remove_from_context(),
> > There wouldn't be no problem since this patch just move the
> > time of disabling cgroup before changing event state.
> >
> > also, the cgroup event is for only cpuctx not added in taskctx.
> > So, there's no effect for event attached in taskctx.
> >
> > Thanks.
>
> Am I reading it wrong?
> The call chain I mentioned above dose not walk through __perf_remove_from_context,
> It is a fail path in fork, which happens rarely, but still possible. I guess...
Since commit 90661365021a
("perf Unify perf_event_free_task() / perf_evenet_exit_task_context()")
perf_event_free_task() is integrated with perf_event_exit_task_context()
So, it calls __perf_remove_from_context().
In v6.15, I think you can test with below change only:
@@ -2471,6 +2459,16 @@ __perf_remove_from_context(struct perf_event *event,
ctx_time_update(cpuctx, ctx);
+ /*
+ * If event was in error state, then keep it
+ * that way, otherwise bogus counts will be
+ * returned on read(). The only way to get out
+ * of error state is by explicit re-enabling
+ * of the event
+ */
+ if (event->state > PERF_EVENT_STATE_OFF)
+ perf_cgroup_event_disable(event, ctx);
+
/*
* Ensure event_sched_out() switches to OFF, at the very least
* this avoids raising perf_pending_task() at this time.
not with modification with "list_del_event()".
Thanks
--
Sincerely,
Yeoreum Yun