* reiser4 and megaraid problems with debian 2.6.5 @ 2004-04-14 6:51 Paul Wagland 2004-04-14 9:05 ` Domenico Andreoli 2004-04-14 15:13 ` reiser4 and megaraid problems with debian 2.6.5 Hans Reiser 0 siblings, 2 replies; 9+ messages in thread From: Paul Wagland @ 2004-04-14 6:51 UTC (permalink / raw) To: Linux mailing list SCSI, Linux mailing list kernel Cc: Hans Reiser, Atul Mukker [-- Attachment #1: Type: text/plain, Size: 1406 bytes --] Hi all, I would like to report on a problem that I am having. I am just testing out the new megaraid unified driver, and have been doing some baseline testing with bonnie++. My problem is that, although reiserfs, ext2, jfs and xfs all work, reiser4 fails with the following error: --- Can't write block. Bonnie: drastic I/O error (write(2)): No such file or directory --- I am using the debian prepared kernel with the debian reiser4 patch. I made a cursory examination of the patch, and it appears to correlate fairly closely with the patch from the namesys site. Given that this works with reiserfs, ext2, jfs and xfs it would appear to be a reiser4 problem, however ext3 also fails, though with a different error, it claims that the disk is full, but it is trying to write a 2 1GB files onto a 2.5GB filesystem, so it should have enough room, and indeed it did even work two or three times out of about 10 runs (lots of timing :-). This implies that it might be a megaraid problem. As you can tell, I really have no idea ;-) I will try playing around tonight with an official kernel and the official reiser4 patch to see if that makes any difference, but would just like to raise this potential problem sooner rather than later. If I can help debug this situation (I am probably the only person trying this combination :-) please let me know how I should go about it. Cheers, Paul [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 186 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 6:51 reiser4 and megaraid problems with debian 2.6.5 Paul Wagland @ 2004-04-14 9:05 ` Domenico Andreoli 2004-04-14 12:36 ` Paul Wagland 2004-04-14 15:13 ` reiser4 and megaraid problems with debian 2.6.5 Hans Reiser 1 sibling, 1 reply; 9+ messages in thread From: Domenico Andreoli @ 2004-04-14 9:05 UTC (permalink / raw) To: Paul Wagland Cc: Linux mailing list SCSI, Linux mailing list kernel, Hans Reiser, Atul Mukker, reiserfs-list [ bringing this also on reiserfs ml, a great place for this kind of posts. this is also the reason of the full quoting. sorry ] On Wed, Apr 14, 2004 at 08:51:53AM +0200, Paul Wagland wrote: > Hi all, hi Paul, > I would like to report on a problem that I am having. I am just testing > out the new megaraid unified driver, and have been doing some baseline > testing with bonnie++. > > My problem is that, although reiserfs, ext2, jfs and xfs all work, > reiser4 fails with the following error: > --- > Can't write block. > Bonnie: drastic I/O error (write(2)): No such file or directory > --- > > I am using the debian prepared kernel with the debian reiser4 patch. I > made a cursory examination of the patch, and it appears to correlate > fairly closely with the patch from the namesys site. of course it is correlated to that of namesys! i have no skills at all to invent reiser4 :)) you forgot to specify version of the patch you are talking about, currently debian provides two versions. anyway i suppose you are talking about version 20040326-2, aren't you? > Given that this works with reiserfs, ext2, jfs and xfs it would appear > to be a reiser4 problem, however ext3 also fails, though with a > different error, it claims that the disk is full, but it is trying to > write a 2 1GB files onto a 2.5GB filesystem, so it should have enough > room, and indeed it did even work two or three times out of about 10 > runs (lots of timing :-). This implies that it might be a megaraid > problem. As you can tell, I really have no idea ;-) > > I will try playing around tonight with an official kernel and the > official reiser4 patch to see if that makes any difference, but would > just like to raise this potential problem sooner rather than later. latest reiser4 snapshot provided a patch which applied cleanly on 2.6.5-rc2 but not to 2.6.5. i had to modify it as suggested on the reiserfs ml. if you look at the debian package's changelog you can find the reference to that thread. > If I can help debug this situation (I am probably the only person > trying this combination :-) please let me know how I should go about > it. i'm sorry but i can't help further. cheers domenico -----[ Domenico Andreoli, aka cavok --[ http://filibusta.crema.unimi.it/~cavok/gpgkey.asc ---[ 3A0F 2F80 F79C 678A 8936 4FEE 0677 9033 A20E BC50 ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 9:05 ` Domenico Andreoli @ 2004-04-14 12:36 ` Paul Wagland 2004-04-14 13:09 ` Nikita Danilov 0 siblings, 1 reply; 9+ messages in thread From: Paul Wagland @ 2004-04-14 12:36 UTC (permalink / raw) To: Domenico Andreoli Cc: reiserfs-list, Linux mailing list SCSI, Atul Mukker, Hans Reiser, Linux mailing list kernel [-- Attachment #1: Type: text/plain, Size: 1440 bytes --] On Apr 14, 2004, at 11:05, Domenico Andreoli wrote: > [ bringing this also on reiserfs ml, a great place for this kind > of posts. this is also the reason of the full quoting. sorry ] Thanks ;-) >> I am using the debian prepared kernel with the debian reiser4 patch. I >> made a cursory examination of the patch, and it appears to correlate >> fairly closely with the patch from the namesys site. > > you forgot to specify version of the patch you are talking about, > currently debian provides two versions. anyway i suppose you are > talking > about version 20040326-2, aren't you? Yes, that is correct. >> If I can help debug this situation (I am probably the only person >> trying this combination :-) please let me know how I should go about >> it. > > i'm sorry but i can't help further. Thanks for the tip... the link that you referred to was most useful. I might now have an idea what the problem might be... Further on in the thread <http://marc.theaimsgroup.com/?l=reiserfs&m=108117079808733&w=2> it says that there is something in the patch that "can lead to a dirtied_when in the future, and missed writeback". Well, what happens if the directory that I am missing was in that writeback that got missed? I will try updating the debian patch myself and give it another test tonight and will report back on my findings. But, before I do so, does it seem likely that this could cause the problem? Cheers, Paul [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 186 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 12:36 ` Paul Wagland @ 2004-04-14 13:09 ` Nikita Danilov 2004-04-14 13:25 ` Paul Wagland 0 siblings, 1 reply; 9+ messages in thread From: Nikita Danilov @ 2004-04-14 13:09 UTC (permalink / raw) To: Paul Wagland Cc: Domenico Andreoli, reiserfs-list, Linux mailing list SCSI, Atul Mukker, Hans Reiser, Linux mailing list kernel Paul Wagland writes: > > On Apr 14, 2004, at 11:05, Domenico Andreoli wrote: > > > [ bringing this also on reiserfs ml, a great place for this kind > > of posts. this is also the reason of the full quoting. sorry ] > > Thanks ;-) > > >> I am using the debian prepared kernel with the debian reiser4 patch. I > >> made a cursory examination of the patch, and it appears to correlate > >> fairly closely with the patch from the namesys site. > > > > you forgot to specify version of the patch you are talking about, > > currently debian provides two versions. anyway i suppose you are > > talking > > about version 20040326-2, aren't you? > > Yes, that is correct. > > >> If I can help debug this situation (I am probably the only person > >> trying this combination :-) please let me know how I should go about > >> it. Is there anything in the logs? [...] > > Cheers, > Paul Nikita. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 13:09 ` Nikita Danilov @ 2004-04-14 13:25 ` Paul Wagland 2004-04-14 23:59 ` Paul Wagland 0 siblings, 1 reply; 9+ messages in thread From: Paul Wagland @ 2004-04-14 13:25 UTC (permalink / raw) To: Nikita Danilov Cc: reiserfs-list, Linux mailing list SCSI, Atul Mukker, Domenico Andreoli, Hans Reiser, Linux mailing list kernel [-- Attachment #1: Type: text/plain, Size: 416 bytes --] On Apr 14, 2004, at 15:09, Nikita Danilov wrote: >>> Paul Wagland writes: >>>> If I can help debug this situation (I am probably the only person >>>> trying this combination :-) please let me know how I should go about >>>> it. > > Is there anything in the logs? Sadly I forgot to check... though I will check again tonight since the problem is quite reproducible for me. Will report back later... Cheers, Paul [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 186 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 13:25 ` Paul Wagland @ 2004-04-14 23:59 ` Paul Wagland 2004-04-18 22:36 ` reiser4 and megaraid problems with debian 2.6.5 (*solved*) Paul Wagland 0 siblings, 1 reply; 9+ messages in thread From: Paul Wagland @ 2004-04-14 23:59 UTC (permalink / raw) To: Nikita Danilov Cc: reiserfs-list, Linux SCSI mailing list, Atul Mukker, Domenico Andreoli, Hans Reiser, Linux kernel mailing list On Wed, 2004-04-14 at 15:25, Paul Wagland wrote: > On Apr 14, 2004, at 15:09, Nikita Danilov wrote: > > >>> Paul Wagland writes: > >>>> If I can help debug this situation (I am probably the only person > >>>> trying this combination :-) please let me know how I should go about > >>>> it. > > > > Is there anything in the logs? > > Sadly I forgot to check... though I will check again tonight since the > problem is quite reproducible for me. Will report back later... OK. There is nothing in the logs. I have recompiled the kernel with extra REISER4 debugging and checking and still nothing. This error is 100% reproducible for me. I have had a thought, what if it is "only" the wrong error code that is being returned? What if the real problem is that we are running out of free blocks. To test this theory (a little at least) I ran: # bonnie++ -q -x4 -d /mnt/sdq -u 0:0 -f -r500 name,file_size,putc,putc_cpu,put_block,put_block_cpu,rewrite,rewrite_cpu,getc,getc_cpu,get_block,get_block_cpu,seeks,seeks_cpu,num_files,seq_create,seq_create_cpu,seq_stat,seq_stat_cpu,seq_del,seq_del_cpu,ran_create,ran_create_cpu,ran_stat,ran_stat_cpu,ran_del,ran_del_cpu tidbit.kungfoocoder.org,1G,,,55236,11,36165,10,,,73514,8,2138.3,2,16,+++++,+++,+++++,+++,25015,99,28712,100,+++++,+++,26846,100 tidbit.kungfoocoder.org,1G,,,55236,11,30073,8,,,84287,10,2046.9,2,16,+++++,+++,+++++,+++,24862,99,28340,99,+++++,+++,26490,99 tidbit.kungfoocoder.org,1G,,,55391,11,30140,9,,,84506,10,2050.2,2,16,+++++,+++,+++++,+++,24642,100,28725,100,+++++,+++,26653,100 tidbit.kungfoocoder.org,1G,,,55364,11,30165,8,,,83055,11,2051.9,2,16,+++++,+++,+++++,+++,24682,100,28264,100,+++++,+++,26804,99 Note that even with debugging turned on we are about 5% faster at reading and 20% slower than writing compared to reiserfs. Pretty good I dare say. However, when I run: ~# bonnie++ -x4 -d /mnt/sdq -u 0:0 -f -q -r800 name,file_size,putc,putc_cpu,put_block,put_block_cpu,rewrite,rewrite_cpu,getc,getc_cpu,get_block,get_block_cpu,seeks,seeks_cpu,num_files,seq_create,seq_create_cpu,seq_stat,seq_stat_cpu,seq_del,seq_del_cpu,ran_create,ran_create_cpu,ran_stat,ran_stat_cpu,ran_del,ran_del_cpu Can't write block. Bonnie: drastic I/O error (re write(2)): No such file or directory Using reiserfs I can happily run: # bonnie++ -x4 -d /mnt/sdq -u 0:0 -f -q -r1008 and the partition is 2.5GB in size. Some more background information: my hardware is not overclocked, and has been 100% reliable, about two weeks ago I sat it through about 24 hours of memtest86+ without any problems. The machine has 1GB of RAM. The logical partition that I am testing is 2.5Gb Here are the REISER4 settings from my configuration: tidbit:~# grep REISER4 /boot/config-2.6.5pw-newmega-k7-1 CONFIG_REISER4_FS=m # CONFIG_REISER4_FS_SYSCALL is not set CONFIG_REISER4_LARGE_KEY=y CONFIG_REISER4_CHECK=y CONFIG_REISER4_FS_SYSCALL_DEBUG=y # CONFIG_REISER4_DEBUG_MODIFY is not set # CONFIG_REISER4_DEBUG_MEMCPY is not set # CONFIG_REISER4_DEBUG_NODE is not set # CONFIG_REISER4_ZERO_NEW_NODE is not set # CONFIG_REISER4_TRACE is not set # CONFIG_REISER4_EVENT_LOG is not set # CONFIG_REISER4_STATS is not set # CONFIG_REISER4_PROF is not set # CONFIG_REISER4_LOCKPROF is not set # CONFIG_REISER4_DEBUG_OUTPUT is not set # CONFIG_REISER4_NOOPT is not set CONFIG_REISER4_USE_EFLUSH=y # CONFIG_REISER4_COPY_ON_CAPTURE is not set # CONFIG_REISER4_BADBLOCKS is not set I have removed the |1 from the jiffies|1 assignment. It still works, which means that the kernel must have been fixed :-) But it didn't help :-\ Hope this helps provide some illumination to the gurus out there... Cheers, Paul ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 (*solved*) 2004-04-14 23:59 ` Paul Wagland @ 2004-04-18 22:36 ` Paul Wagland 0 siblings, 0 replies; 9+ messages in thread From: Paul Wagland @ 2004-04-18 22:36 UTC (permalink / raw) To: Nikita Danilov Cc: reiserfs-list, Linux SCSI mailing list, Atul Mukker, Domenico Andreoli, Hans Reiser, Linux kernel mailing list [-- Attachment #1: Type: text/plain, Size: 892 bytes --] Hi all, well partly solved anyway... I am just posting this so that if anyone finds this thread later they can also find this conclusion... There is still more work to be done before this problem can be properly closed, but at least now I am certain that it has nothing to do with the hardware :-) It appears (my own unsupported theory) that the problem is that reiser4 is taking some time to free up the free blocks that are currently in use by the wandering log. Since I was running a test that causes a lot of wandering log to be created, and I was doing it on a filesystem with very little free space, then I was running into the problem. Rerunning the test with either a) more space, or b) a smaller data set solved the problem. On the reiserfs-list we are now trying to find out exactly why this is happening, and how to solve the problem properly. Cheers, Paul [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 6:51 reiser4 and megaraid problems with debian 2.6.5 Paul Wagland 2004-04-14 9:05 ` Domenico Andreoli @ 2004-04-14 15:13 ` Hans Reiser 2004-04-14 15:37 ` Paul Wagland 1 sibling, 1 reply; 9+ messages in thread From: Hans Reiser @ 2004-04-14 15:13 UTC (permalink / raw) To: Paul Wagland Cc: Linux mailing list SCSI, Linux mailing list kernel, Atul Mukker Paul Wagland wrote: > Hi all, > > I would like to report on a problem that I am having. I am just > testing out the new megaraid unified driver, and have been doing some > baseline testing with bonnie++. > > My problem is that, although reiserfs, ext2, jfs and xfs all work, > reiser4 fails with the following error: > --- > Can't write block. > Bonnie: drastic I/O error (write(2)): No such file or directory > --- > > I am using the debian prepared kernel with the debian reiser4 patch. I > made a cursory examination of the patch, and it appears to correlate > fairly closely with the patch from the namesys site. In what way does it not correlate? > > Given that this works with reiserfs, ext2, jfs and xfs it would appear > to be a reiser4 problem, however ext3 also fails, though with a > different error, it claims that the disk is full, but it is trying to > write a 2 1GB files onto a 2.5GB filesystem, so it should have enough > room, and indeed it did even work two or three times out of about 10 > runs (lots of timing :-). This implies that it might be a megaraid > problem. As you can tell, I really have no idea ;-) > > I will try playing around tonight with an official kernel and the > official reiser4 patch to see if that makes any difference, but would > just like to raise this potential problem sooner rather than later. > > If I can help debug this situation (I am probably the only person > trying this combination :-) please let me know how I should go about it. > > Cheers, > Paul I don't have the hardware to test it, can you get the error without your hardware? -- Hans ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: reiser4 and megaraid problems with debian 2.6.5 2004-04-14 15:13 ` reiser4 and megaraid problems with debian 2.6.5 Hans Reiser @ 2004-04-14 15:37 ` Paul Wagland 0 siblings, 0 replies; 9+ messages in thread From: Paul Wagland @ 2004-04-14 15:37 UTC (permalink / raw) To: Hans Reiser; +Cc: Linux mailing list SCSI, Linux mailing list kernel [-- Attachment #1: Type: text/plain, Size: 1045 bytes --] Hi, On Apr 14, 2004, at 17:13, Hans Reiser wrote: > Paul Wagland wrote: > >> I am using the debian prepared kernel with the debian reiser4 patch. >> I made a cursory examination of the patch, and it appears to >> correlate fairly closely with the patch from the namesys site. > > In what way does it not correlate? As was mentioned by Domenico Andreoli the changes are just those required to get reiser4 to work under 2.6.5. Other differences are line offsets due to the fact that the debian kernel also has patches applied. >> If I can help debug this situation (I am probably the only person >> trying this combination :-) please let me know how I should go about >> it. > > I don't have the hardware to test it, can you get the error without > your hardware? Unfortunately, not easily, since this is the only box that I can currently test this out on. However, there a couple of tests that I can still perform (as mentioned elsewhere in this thread) and I will report back on the results of those later tonight. Cheers, Paul [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 186 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2004-04-18 22:36 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2004-04-14 6:51 reiser4 and megaraid problems with debian 2.6.5 Paul Wagland 2004-04-14 9:05 ` Domenico Andreoli 2004-04-14 12:36 ` Paul Wagland 2004-04-14 13:09 ` Nikita Danilov 2004-04-14 13:25 ` Paul Wagland 2004-04-14 23:59 ` Paul Wagland 2004-04-18 22:36 ` reiser4 and megaraid problems with debian 2.6.5 (*solved*) Paul Wagland 2004-04-14 15:13 ` reiser4 and megaraid problems with debian 2.6.5 Hans Reiser 2004-04-14 15:37 ` Paul Wagland
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox