Re: [BUGFIX][PATCH] fix wrong lru rotate back at lumpty reclaim

From: Minchan Kim
Date: Tue Jun 09 2009 - 10:01:38 EST


Hi, Andy.

On Tue, Jun 9, 2009 at 10:00 PM, Andy Whitcroft<apw@xxxxxxxxxxxxx> wrote:
> On Tue, Jun 09, 2009 at 09:07:16PM +0900, Minchan Kim wrote:
>> 2009/6/9 KAMEZAWA Hiroyuki <kamezawa.hiroyu@xxxxxxxxxxxxxx>:
>> > Minchan Kim wrote:
>> >
>> >> I mean follow as
>> >> Â908 Â Â Â Â /*
>> >> Â909 Â Â Â Â Â* Attempt to take all pages in the order aligned region
>> >> Â910 Â Â Â Â Â* surrounding the tag page. ÂOnly take those pages of
>> >> Â911 Â Â Â Â Â* the same active state as that tag page. ÂWe may safely
>> >> Â912 Â Â Â Â Â* round the target page pfn down to the requested order
>> >> Â913 Â Â Â Â Â* as the mem_map is guarenteed valid out to MAX_ORDER,
>> >> Â914 Â Â Â Â Â* where that page is in a different zone we will detect
>> >> Â915 Â Â Â Â Â* it from its zone id and abort this block scan.
>> >> Â916 Â Â Â Â Â*/
>> >> Â917 Â Â Â Â zone_id = page_zone_id(page);
>> >>
>> > But what this code really do is.
>> > ==
>> > 931 Â Â Â Â Â Â Â Â Â Â Â Â /* Check that we have not crossed a zone
>> > boundary. */
>> > Â932 Â Â Â Â Â Â Â Â Â Â Â Â if (unlikely(page_zone_id(cursor_page) !=
>> > zone_id))
>> > Â933 Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â continue;
>> > ==
>> > continue. I think this should be "break"
>> > I wonder what "This block scan" means is "scanning this aligned block".
>>
>> It is to find first page in same zone with target page when we have
>> crossed a zone.
>> so it shouldn't stop due to that.
>>
>> I think 'abort' means stopping only the page.
>> If it is right, it would be better to change follow as.
>> "and continue scanning next page"
>>
>> Let's Cced Andy Whitcroft.
>>
>> > But I think the whoe code is not written as commented.
>> >
>> >>
>> >>>> If I understand it properly , don't we add goto phrase ?
>> >>>>
>> >>> No.
>> >>
>> >> If it is so, the break also is meaningless.
>> >>
>> > yes. I'll remove it. But need to add "exit from for loop" logic again.
>> >
>> > I'm sorry that the wrong logic of this loop was out of my sight.
>> > I'll review and rewrite this part all, tomorrow.
>>
>> Yes. I will review tomorrow, too. :)
>
> The comment is not the best wording. ÂThe point here is that we need to
> round down in order to safely scan the free blocks as they are only
> marked at the start. ÂIn rounding down however we may move back into the
> previous zone as zones are not necessarily MAX_ORDER aligned. ÂWe want
> to ignore the bit before our zone starts and that check moves us on to
> the next page. ÂIt should be noted that this occurs rarely, ie. only
> when we touch the start of a zone and only then where the zone
> boundaries are not MAX_ORDER aligned.

Thanks for kind explanation.

I think this thread's issue is the 'break' following as.

...
cursor_page = pfn_to_page(pfn);

/* Check that we have not crossed a zone boundary. */
if (unlikely(page_zone_id(cursor_page) != zone_id))
continue;
switch (__isolate_lru_page(cursor_page, mode, file)) {
case 0:
list_move(&cursor_page->lru, dst);
nr_taken++;
scan++;
break;

case -EBUSY:
/* else it is being freed elsewhere */
list_move(&cursor_page->lru, src);
default:
break; /* ! on LRU or wrong list */
<====== HERE
}
}
}
...

I think you meant that if we met not lru pages, it should stop scanning.
That's because we have in trouble with high order page allocation.
So, if we fail to allocate contiguous page frame, scanning isn't a
point any more.

But that break can't stop loop. It is in switch case. so if we want to
break in loop really, we have to use goto phrase.
What do you think about it ?

> -apw
>



--
Kinds regards,
Minchan Kim
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/