From: Tao Ma <tm@tao.ma>
To: Li Zefan <lizefan@huawei.com>
Cc: Eric Sandeen <sandeen@redhat.com>,
Yafang Shao <laoar.shao@gmail.com>,
linux-fsdevel@vger.kernel.org, linux-ext4@vger.kernel.org,
wuqixuan@huawei.com, wuqixuan@gmail.com
Subject: Re: help about ext3 read-only issue on ext3(2.6.16.30)
Date: Wed, 05 Dec 2012 22:02:15 +0800 [thread overview]
Message-ID: <50BF53E7.7010307@tao.ma> (raw)
In-Reply-To: <50BF25E9.3090807@huawei.com>
On 12/05/2012 06:46 PM, Li Zefan wrote:
>>>>> We highly doubt it's hardware failures with this frequency in mind, so
>>>>> we're wondering regarding to this issue if there's some ext3 bug-fix
>>>>> having merged into mainline but not in our old kernel?
>>>>
>>>> Absolutely there are. There have been 87 changes just to namei.c since 2.6.16.
>>>> You could look through git logs to see if anything looks applicable.
>>>>
>>>> You might try:
>>>>
>>>> ef2b02d3e617cb0400eedf2668f86215e1b0e6af ext34: ensure do_split leaves enough free space in both blocks
>>>
>>> I've been asked to investigate this issue. Thanks for the reply!
>>>
>>> I found this fix while searching for similar bug reports, but I don't think it
>>> worths trying as we don't use dir_index feature.
>>>
>>> I've collected some logs in different machines, and the error was always
>>> triggered in ext3_readdir:
>>>
>>> EXT3-fs error (device sda7): ext3_readdir: bad entry in directory #6685458: rec_len is smaller than minimal - offset=3860, inode=0, rec_len=0, name_len=0
>>> EXT3-fs error (device sda7): ext3_readdir: bad entry in directory #9650541: rec_len is smaller than minimal - offset=3960, inode=0, rec_len=0, name_len=0
>>> EXT3-fs error (device sda7): ext3_readdir: bad entry in directory #11124783: rec_len is smaller than minimal - offset=4072, inode=0, rec_len=0, name_len=0
>>> EXT3-fs error (device sda7): ext3_readdir: bad entry in directory #52740880: rec_len is smaller than minimal - offset=4024, inode=0, rec_len=0, name_len=0
>>> EXT3-fs error (device sda7): ext3_readdir: bad entry in directory #52740880: rec_len is smaller than minimal - offset=4084, inode=0, rec_len=0, name_len=0
>>>
>>> The last two errors happened on the same machine, and the same inode! One
>>> happened in 11/22 (I was told they had run fsck later on), and one in 12/01.
>> So now this directory has been fscked to be right? You can try by just
>
> right.
>
>> ls this directory and check whether there are any errors in dmesg.
>>
>
> no error at all.
OK, so now it is fixed by e2fsck. hmm, is there any stress inode
creation/deletion in this dir? 2.6.16 is too older although I am not
sure whether this is a bug or not.
>
>> Having said that, as this error happens 2 times for the same inode,
>> maybe there is a kernel bug. At least as Ted said in another mail, the
>> end of this buffer head seems to be cleared. So I guess next time when
>> you see this error, please do:
>> 1. use debugfs to find the disk layout for this dir
>> 2. read the blocks from the block device directly
>> 3. check whether the end of a block(from offset to the end) is zeroed.
>> 4. If yes, I guess there should be a kernel bug and we can go on to
>> investigate the code.
>>
>
> This may give us different output with that by dumping dir via debugfs?
> If so I'll try next time.
In step 2, I mean dd out these blocks, decode and read them by
yourselves to check whether there are zeroes.
Thanks
Tao
>
> Seeing from the output dumpped via debugfs in one machine, more than
> harf of the dir block is all zero, but the offset is near 4K. I also
> checked several other machines, no difference.
Thanks
Tao
next prev parent reply other threads:[~2012-12-05 14:02 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-12-01 14:22 help about ext3 read-only issue on ext3(2.6.16.30) Yafang Shao
2012-12-03 17:59 ` Eric Sandeen
2012-12-04 13:54 ` Li Zefan
2012-12-04 15:09 ` Theodore Ts'o
2012-12-05 10:43 ` Li Zefan
2012-12-05 14:26 ` Tao Ma
2012-12-05 15:51 ` qixuan wu
2012-12-06 1:13 ` Li Zefan
2012-12-06 12:37 ` Jan Kara
2012-12-06 16:21 ` qixuan wu
2012-12-06 17:09 ` Jan Kara
2012-12-07 10:03 ` Li Zefan
2012-12-11 8:01 ` Li Zefan
2012-12-12 10:04 ` Jan Kara
2012-12-12 11:31 ` Li Zefan
2012-12-14 3:32 ` Peng, Tao
2012-12-17 10:51 ` Li Zefan
2012-12-20 11:32 ` Jan Kara
2013-02-12 12:19 ` Jan Kara
2012-12-04 15:29 ` Tao Ma
2012-12-04 16:11 ` Bernd Schubert
2012-12-04 20:20 ` Theodore Ts'o
2012-12-04 16:16 ` qixuan wu
2012-12-04 20:45 ` Theodore Ts'o
2012-12-05 13:58 ` Tao Ma
2012-12-05 15:05 ` Theodore Ts'o
2012-12-06 1:54 ` Tao Ma
2012-12-06 15:48 ` qixuan wu
2012-12-05 15:46 ` qixuan wu
2012-12-06 2:58 ` Yongqiang Yang
2012-12-06 16:26 ` qixuan wu
2012-12-07 1:49 ` Yongqiang Yang
2012-12-05 10:46 ` Li Zefan
2012-12-05 14:02 ` Tao Ma [this message]
2012-12-06 1:17 ` Li Zefan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50BF53E7.7010307@tao.ma \
--to=tm@tao.ma \
--cc=laoar.shao@gmail.com \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=lizefan@huawei.com \
--cc=sandeen@redhat.com \
--cc=wuqixuan@gmail.com \
--cc=wuqixuan@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).