From: Anton Eliasson <devel-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
To: Vyacheslav Dubeyko <slava-yeENwD64cLxBDgjK7y7TUQ@public.gmane.org>
Cc: linux-nilfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Nilfs2 crash debugging (was: Broken nilfs2 filesystem)
Date: Thu, 15 Aug 2013 12:40:50 +0200 [thread overview]
Message-ID: <520CB032.2000602@antoneliasson.se> (raw)
In-Reply-To: <9016EBD5-1E01-476F-B1B9-66AE593F4728-yeENwD64cLxBDgjK7y7TUQ@public.gmane.org>
Vyacheslav Dubeyko skrev 2013-07-27 18:23:
> Hi Anton,
>
> On Jul 26, 2013, at 8:52 PM, Anton Eliasson wrote:
>
> Thank you for your efforts. But, as I understand, currently, you
> don't reproduce the issue and shared system log doesn't contain
> any new details about the issue. Please, see my description below.
>
[snip]
Hi again. I was able to reproduce the crash on a fully updated system by
starting the two virtual machines simultaneously as described in my
e-mail from May 25. I made a new attempt to rebuild the kernel with your
patches. I selected these options in make menuconfig [1], which resulted
in this generated config.x86_64 [2] which has the following diff
compared to the stock config.x86_64:
--- config.x86_64 2013-08-11 00:06:09.000000000 +0200
+++ config.x86_64.last 2013-08-11 12:48:44.094979947 +0200
@@ -1,6 +1,6 @@
#
# Automatically generated file; DO NOT EDIT.
-# Linux/x86 3.10.0-1 Kernel Configuration
+# Linux/x86 3.10.5-1 Kernel Configuration
#
CONFIG_64BIT=y
CONFIG_X86_64=y
@@ -5450,6 +5450,11 @@
# CONFIG_BTRFS_FS_RUN_SANITY_TESTS is not set
# CONFIG_BTRFS_DEBUG is not set
CONFIG_NILFS2_FS=m
+CONFIG_NILFS2_DEBUG=y
+# CONFIG_NILFS2_USE_PR_DEBUG is not set
+CONFIG_NILFS2_DEBUG_SHOW_ERRORS=y
+CONFIG_NILFS2_DEBUG_DUMP_STACK=y
+# CONFIG_NILFS2_DEBUG_SUBSYSTEMS is not set
CONFIG_FS_POSIX_ACL=y
CONFIG_EXPORTFS=y
CONFIG_FILE_LOCKING=y
I hope those build options are the ones you want. Using the custom
kernel and mount options, I could reproduce the crash right away. Here's
the log [3] (crash at timestamp "Aug 15 10:26:26 riven kernel: [
376.625992]"). The cleaner wasn't running at the time. I don't remember
if I used the mount option nogc or if I killed it manually after booting up.
Because of these uncertainties and the fact that the log is a bit messy,
I attempted to rotate the logs, reboot and try again. Of course, that
caused this heisenbug to disappear again. I produced some pretty logs
showing lots of errors without the cleaner[4], with the cleaner started
manually [5] and with the cleaner started at boot [6]. None of them show
the crash however so they may be of limited use for you.
Okay, one final attempt. I reinstalled the stock kernel and managed to
crash the system using the virtual machines like before. I then
reinstalled the custom kernel, rotated the logs, rebooted with the mount
options "rw,noatime,discard", left the cleanerd running and fired up
VMware. I was happy to see the system die as expected. [7] and [8]
should contain beautiful logs of everything from boot to crash.
[1]: http://antoneliasson.se/publicdump/menuconfig.png
[2]: http://antoneliasson.se/publicdump/config.x86_64.last
[3]: http://antoneliasson.se/publicdump/kernel.log.2.gz
[4]:
http://antoneliasson.se/publicdump/kernel.log.nogc-nocleanerd-nocrash.2013-08-15.1048.log.gz
[5]:
http://antoneliasson.se/publicdump/kernel.log.nogc-cleanerd-nocrash.2013-08-15.1054.log.gz
[6]:
http://antoneliasson.se/publicdump/kernel.log.gc-cleanerd-nocrash.2013-08-15.1104.log.gz
[7]:
http://antoneliasson.se/publicdump/kernel.log.gc-cleanerd-crash.2013-08-15.1205.log.gz
[8]:
http://antoneliasson.se/publicdump/everything.log.gc-cleanerd-crash.2013-08-15.1211.log.gz
--
Best Regards,
Anton Eliasson
--
To unsubscribe from this list: send the line "unsubscribe linux-nilfs" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2013-08-15 10:40 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <51F2A8A4.4020400@antoneliasson.se>
[not found] ` <51F2A8A4.4020400-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
2013-07-26 16:52 ` Fwd: Re: Broken nilfs2 filesystem Anton Eliasson
[not found] ` <51F2A945.6050909-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
2013-07-27 16:23 ` Vyacheslav Dubeyko
[not found] ` <9016EBD5-1E01-476F-B1B9-66AE593F4728-yeENwD64cLxBDgjK7y7TUQ@public.gmane.org>
2013-07-27 22:32 ` Anton Eliasson
2013-08-15 10:40 ` Anton Eliasson [this message]
[not found] ` <520CB032.2000602-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
2013-08-16 7:11 ` Nilfs2 crash debugging (was: Broken nilfs2 filesystem) Vyacheslav Dubeyko
2013-08-19 19:55 ` Vyacheslav Dubeyko
[not found] ` <FEA41B6A-7D82-4563-AAF5-D5AFA3734D79-yeENwD64cLxBDgjK7y7TUQ@public.gmane.org>
2013-08-25 15:02 ` Nilfs2 crash debugging Anton Eliasson
[not found] ` <521A1C88.9080100-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
2013-08-26 9:56 ` Vyacheslav Dubeyko
2013-08-26 18:37 ` Anton Eliasson
[not found] ` <521BA084.80901-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
2013-08-30 5:58 ` Vyacheslav Dubeyko
2013-09-04 19:39 ` Anton Eliasson
[not found] ` <52278C63.6090303-17Olwe7vw2dLC78zk6coLg@public.gmane.org>
2013-09-04 20:00 ` Vyacheslav Dubeyko
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=520CB032.2000602@antoneliasson.se \
--to=devel-17olwe7vw2dlc78zk6colg@public.gmane.org \
--cc=linux-nilfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=slava-yeENwD64cLxBDgjK7y7TUQ@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.