Re: Perfromance drop on SCSI hard disk

From: Shaohua Li
Date: Thu May 19 2011 - 20:41:03 EST

Next message: Michael Neuling: "Re: [PATCH 3/7] [RFC] add support for BlueGene/P FPU"
Previous message: Benjamin Herrenschmidt: "Re: [PATCH 6/7] [RFC] enable early TLBs for BG/P"
In reply to: Alex,Shi: "Re: Perfromance drop on SCSI hard disk"
Next in thread: Alex,Shi: "Re: Perfromance drop on SCSI hard disk"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

2011/5/20 Alex,Shi <alex.shi@xxxxxxxxx>:
> On Fri, 2011-05-20 at 02:27 +0800, Jens Axboe wrote:
>> On 2011-05-19 10:26, Alex,Shi wrote:
>> >
>> >> I will queue up the combined patch, it looks fine from here as well.
>> >>
>> >
>> > When I have some time to study Jens and shaohua's patch today. I find a
>> > simpler way to resolved the re-enter issue on starved_list. Following
>> > Jens' idea, we can just put the starved_list device into kblockd if it
>> > come from __scsi_queue_insert().
>> > It can resolve the re-enter issue and recover performance totally, and
>> > need not a work_struct in every scsi_device. The logic/code also looks a
>> > bit simpler.
>> > What's your opinion of this?
>>
>> Isn't this _identical_ to my original patch, with the added async run of
>> the queue passed in (which is important, an oversight)?
>
> Not exactly same. It bases on your patch, but added a bypass way for
> starved_list device. If a starved_list device come from
> __scsi_queue_insert(), that may caused by our talking recursion, kblockd
> with take over the process. Maybe you oversight this point in original
> patch. :)
>
> The different part from yours is below:
> ---
> static void __scsi_run_queue(struct request_queue *q, bool async)
> {
> struct scsi_device *sdev = q->queuedata;
> struct Scsi_Host *shost;
> @@ -435,30 +437,35 @@ static void scsi_run_queue(struct request_queue
> *q)
> &shost->starved_list);
> continue;
> }
> -
> - spin_unlock(shost->host_lock);
> - spin_lock(sdev->request_queue->queue_lock);
> - __blk_run_queue(sdev->request_queue);
> - spin_unlock(sdev->request_queue->queue_lock);
> - spin_lock(shost->host_lock);
> + if (async)
> + blk_run_queue_async(sdev->request_queue);
> + else {
> + spin_unlock(shost->host_lock);
> + spin_lock(sdev->request_queue->queue_lock);
> + __blk_run_queue(sdev->request_queue);
> + spin_unlock(sdev->request_queue->queue_lock);
> + spin_lock(shost->host_lock);
>>
I don't quite like this approach. blk_run_queue_async() could
introduce fairness issue as I said in previous mail, because we drop
the sdev from starved list but didn't run its queue immediately. The
issue exists before, but it's a bug to me.
Alex, is there any real advantage of your patch?

Thanks,
Shaohua
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Michael Neuling: "Re: [PATCH 3/7] [RFC] add support for BlueGene/P FPU"
Previous message: Benjamin Herrenschmidt: "Re: [PATCH 6/7] [RFC] enable early TLBs for BG/P"
In reply to: Alex,Shi: "Re: Perfromance drop on SCSI hard disk"
Next in thread: Alex,Shi: "Re: Perfromance drop on SCSI hard disk"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]