Re: sysbench+mysql(oltp, readonly) 30% regression with 2.6.26-rc1

From: Ingo Molnar
Date: Thu May 08 2008 - 04:46:37 EST



* Mike Galbraith <efault@xxxxxx> wrote:

> > With the patch, oltp testing result is about 50% worse than the one
> > of pure 2.6.26-rc1.
>
> Hm. I was doing some sysbench+postgress(oltp, ro) testing on my
> little Q6600 box this morning, and saw a different picture.
>
> In attached pdf, .bkl refers to Linus' BKL patch, .weight is the
> weight fix, both are applied to git.today. The script I used is also
> attached.

nice numbers. I'm curious about the following detail related to Linus's
BKL-spinlock patch: am i correct that it shows a small but systematic
improvement with many clients? sysbench has not shown sensitivity to any
BKL detail before.

Could you perhaps try the hack below that uses /proc/sys/kernel/panic as
a flag whether the BKL should cause us to block or should be spun upon.

Can you confirm that with that patch too sysbench shows sensitivity to
the value of the panic flag? [this is the surest way to measure such
effects as flipping the sysctl only minimally impacts the system.]

if sysbench truly is dependent on the BKL then that's another argument
in favor of Linus's patch.

Ingo

-------------------------->
Subject: BKL: spin on acquire
From: Ingo Molnar <mingo@xxxxxxx>
Date: Wed May 07 19:05:40 CEST 2008

Signed-off-by: Ingo Molnar <mingo@xxxxxxx>
---
lib/kernel_lock.c | 13 ++++++++++---
1 file changed, 10 insertions(+), 3 deletions(-)

Index: linux/lib/kernel_lock.c
===================================================================
--- linux.orig/lib/kernel_lock.c
+++ linux/lib/kernel_lock.c
@@ -46,7 +46,10 @@ int __lockfunc __reacquire_kernel_lock(v
task->lock_depth = -1;
preempt_enable_no_resched();

- down(&kernel_sem);
+ if (panic_timeout) {
+ while (down_trylock(&kernel_sem))
+ cpu_relax();
+ }

preempt_disable();
task->lock_depth = saved_lock_depth;
@@ -67,11 +70,15 @@ void __lockfunc lock_kernel(void)
struct task_struct *task = current;
int depth = task->lock_depth + 1;

- if (likely(!depth))
+ if (likely(!depth)) {
/*
* No recursion worries - we set up lock_depth _after_
*/
- down(&kernel_sem);
+ if (panic_timeout) {
+ while (down_trylock(&kernel_sem))
+ cpu_relax();
+ }
+ }

task->lock_depth = depth;
}
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/