[PATCH 1/2] genirq/affinity: improve __irq_build_affinity_masks()

From: Ming Lei
Date: Fri Aug 09 2019 - 06:23:46 EST


One invariant of __irq_build_affinity_masks() is that all CPUs in the
specified masks( cpu_mask AND node_to_cpumask for each node) should be
covered during the spread. Even though all requested vectors have been
reached, we still need to spread vectors among left CPUs. The similar
policy has been taken in case of 'numvecs <= nodes'.

So remove the following check inside the loop:

if (done >= numvecs)
break;

Meantime assign at least 1 vector for left nodes if 'numvecs' vectors
have been spread.

Also, if the specified cpumask for one numa node is empty, simply not
spread vectors on this node.

Cc: Christoph Hellwig <hch@xxxxxx>
Cc: Keith Busch <kbusch@xxxxxxxxxx>
Cc: linux-nvme@xxxxxxxxxxxxxxxxxxx,
Cc: Jon Derrick <jonathan.derrick@xxxxxxxxx>
Signed-off-by: Ming Lei <ming.lei@xxxxxxxxxx>
---
kernel/irq/affinity.c | 33 +++++++++++++++++++++------------
1 file changed, 21 insertions(+), 12 deletions(-)

diff --git a/kernel/irq/affinity.c b/kernel/irq/affinity.c
index 6fef48033f96..bc3652a2c61b 100644
--- a/kernel/irq/affinity.c
+++ b/kernel/irq/affinity.c
@@ -129,21 +129,32 @@ static int __irq_build_affinity_masks(unsigned int startvec,
for_each_node_mask(n, nodemsk) {
unsigned int ncpus, v, vecs_to_assign, vecs_per_node;

- /* Spread the vectors per node */
- vecs_per_node = (numvecs - (curvec - firstvec)) / nodes;
-
/* Get the cpus on this node which are in the mask */
cpumask_and(nmsk, cpu_mask, node_to_cpumask[n]);
-
- /* Calculate the number of cpus per vector */
ncpus = cpumask_weight(nmsk);
+ if (!ncpus)
+ continue;
+
+ /*
+ * Calculate the number of cpus per vector
+ *
+ * Spread the vectors evenly per node. If the requested
+ * vector number has been reached, simply allocate one
+ * vector for each remaining node so that all nodes can
+ * be covered
+ */
+ if (numvecs > done)
+ vecs_per_node = max_t(unsigned,
+ (numvecs - done) / nodes, 1);
+ else
+ vecs_per_node = 1;
+
vecs_to_assign = min(vecs_per_node, ncpus);

/* Account for rounding errors */
extra_vecs = ncpus - vecs_to_assign * (ncpus / vecs_to_assign);

- for (v = 0; curvec < last_affv && v < vecs_to_assign;
- curvec++, v++) {
+ for (v = 0; v < vecs_to_assign; v++) {
cpus_per_vec = ncpus / vecs_to_assign;

/* Account for extra vectors to compensate rounding errors */
@@ -153,16 +164,14 @@ static int __irq_build_affinity_masks(unsigned int startvec,
}
irq_spread_init_one(&masks[curvec].mask, nmsk,
cpus_per_vec);
+ if (++curvec >= last_affv)
+ curvec = firstvec;
}

done += v;
- if (done >= numvecs)
- break;
- if (curvec >= last_affv)
- curvec = firstvec;
--nodes;
}
- return done;
+ return done < numvecs ? done : numvecs;
}

/*
--
2.20.1