Re: Apparent serious progressive ext4 data corruption bug in 3.6.3(and other stable branches?)

From: Ric Wheeler
Date: Thu Oct 25 2012 - 20:11:09 EST


On 10/24/2012 12:15 AM, Nix wrote:
On 24 Oct 2012, Eric Sandeen uttered the following:

On 10/23/12 3:57 PM, Nix wrote:
The only unusual thing about the filesystems on this machine are that
they have hardware RAID-5 (using the Areca driver), so I'm mounting with
'nobarrier':
I should have read more. :( More questions follow:

* Does the Areca have a battery backed write cache?
Yes (though I'm not powering off, just rebooting). Battery at 100% and
happy, though the lack of power-off means it's not actually getting
used, since the cache is obviously mains-backed as well.

Sending this just to you two to avoid embarrassing myself if I misread the thread, but....

Can we reproduce this with any other hardware RAID card? Or with MD?

If we cannot reproduce this in other machines, why assume this is an ext4 issue and not a hardware firmware bug?

As an ex-storage guy, this really smells like the hardware raid card might be misleading us....

ric



--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/