Hi, thanks. I believe both patches are based on the same reasoning. I did a quick search after reading Tejun's message. It seems numactl with the interleave option helps in this case; you can test it to see if it's what you want.
How about my per-process NUMA balancing patch? ;)
https://lore.kernel.org/all/20211206024530.11336-1-ligang.bdlg@xxxxxxxxxxxxx/