* truncated files
@ 2008-11-25 21:44 Martin Steigerwald
2008-11-25 22:27 ` Dave Chinner
0 siblings, 1 reply; 7+ messages in thread
From: Martin Steigerwald @ 2008-11-25 21:44 UTC (permalink / raw)
To: linux-xfs
Hi!
Today on one try to hibernate via tuxonice it machine appeared dead. I am
not completely sure if it was. But I was in a hurry and had no time to
try to ping or SSH it from a different machine and thus I just switched
it off hard.
After booting again which worked fine, parts of my KDE configuration
appeared broken. Color scheme and window position was lost. And KMail did
not know how to sent out a mail anymore, all transports were missing.
Thus I checked my /home filesystem on /dev/sda5 this evening. It appeared
absolutely fine with xfs_check and xfs_repair -n from xfsprogs 2.9.8
(grml 2008.11rc).
But comparing some of the config files of KDE with the versions in my
backup showed truncated files:
martin@shambhala:~> ls -l .kde-backup-2008-11-22/share/config/kmailrc
-rw------- 1 martin martin 247680 22. Nov
18:29 .kde-backup-2008-11-22/share/config/kmailrc
martin@shambhala:~> ls -l .kde-broken-2008-11-25/share/config/kmailrc
-rw------- 1 martin martin 116902 25. Nov
21:16 .kde-broken-2008-11-25/share/config/kmailrc
But no hole in it:
martin@shambhala:~>
su -c "xfs_bmap -v .kde-broken-2008-11-25/share/config/kmailrc"
Passwort:
.kde-broken-2008-11-25/share/config/kmailrc:
EXT: FILE-OFFSET BLOCK-RANGE AG AG-OFFSET TOTAL
0: [0..231]: 22444768..22444999 0 (22444768..22444999) 232
Similar stuff happened on the encfs encrypted KDE user that I use for
professional stuff - encfs is layered on top of XFS via FUSE:
shambhala:~ms> ls -l .kde-broken-2008-11-25/share/config/kdeglobals
-rw------- 1 ms teamix 4772 25. Nov
20:35 .kde-broken-2008-11-25/share/config/kdeglobals
shambhala:~ms> ls -l .kde-backup-2008-11-22/share/config/kdeglobals
-rw------- 1 ms teamix 6621 18. Nov
16:13 .kde-backup-2008-11-22/share/config/kdeglobals
Unfortunately I cannot check this one for holes directly as its on encfs:
shambhala:~ms> xfs_bmap -v .kde-broken-2008-11-25/share/config/kdeglobals
xfs_bmap: specified file
[".kde-broken-2008-11-25/share/config/kdeglobals"] is not on an XFS
filesystem
(Since encfs encrypted pathes as well and the file length may differ a
bit, the encrypted file could be difficult to find... well maybe by
date.)
And well my basket index was lost (http://basket.kde.org is a note taking
application):
shambhala:~ms>
ls -l .kde-backup-2008-11-22/share/apps/basket/baskets/baskets.xml
-rw-r--r-- 1 ms teamix 10271 18. Nov
15:03 .kde-backup-2008-11-22/share/apps/basket/baskets/baskets.xml
shambhala:~ms>
ls -l .kde-broken-2008-11-25/share/apps/basket/baskets/baskets.xml
-rw-r--r-- 1 ms teamix 1905 25. Nov
13:24 .kde-broken-2008-11-25/share/apps/basket/baskets/baskets.xml
But since I found no holes in the other file and AFAIK the truncated file
problem has long been solved - anyone has a idea, how this could have
happened?
I fixed it by copying ~/.kde/share/config for my private user, and
complete ~/.kde for my company user from the backup last weekend. In
~/.kde/share/apps for the private too much has changed since the backup,
thus for the moment I left it. I do not miss anything in there right now,
but feel a tad bit uncomfortable that files might be truncated in there
also and I might only find out later when I already worked with them for
some time.
I thought that KDE maybe was writing those files at the moment. But its a
bit wide-spread for that. And it must have happened that it wrote to
config / app files on two user accounts. Might be the explaination, but I
am not completely convinced.
I did not have seen a thing like this since at least a year I think.
This is with:
martin@shambhala:~> cat /proc/version
Linux version 2.6.27.7-tp42-toi-3.0-rc7a (martin@shambhala) (gcc version
4.3.2 (Debian 4.3.2-1) ) #1 PREEMPT Mon Nov 24 11:30:39 CET 2008
I did not find anything cumbersone in /var/log/syslog... no XFS crash or
something like this.
If no one has any idea I take this as really bad luck. Might really be
that I turned off the machine while quite some KDE files were being
written out. I know that some KDE applications make backups prior to
overwriting. Maybe something could be improved in KDE applications. If
so, I would be interested to hear and I would file bug reports with KDE.
Well actually thats my bet already. XFS appears to be fine.
Ciao,
--
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
* truncated files
@ 2008-11-25 22:03 Martin Steigerwald
0 siblings, 0 replies; 7+ messages in thread
From: Martin Steigerwald @ 2008-11-25 22:03 UTC (permalink / raw)
To: xfs
Hi!
Today on one try to hibernate via tuxonice it machine appeared dead. I am
not completely sure if it was. But I was in a hurry and had no time to
try to ping or SSH it from a different machine and thus I just switched
it off hard.
After booting again which worked fine, parts of my KDE configuration
appeared broken. Color scheme and window position was lost. And KMail did
not know how to sent out a mail anymore, all transports were missing.
Thus I checked my /home filesystem on /dev/sda5 this evening. It appeared
absolutely fine with xfs_check and xfs_repair -n from xfsprogs 2.9.8
(grml 2008.11rc).
But comparing some of the config files of KDE with the versions in my
backup showed truncated files:
martin@shambhala:~> ls -l .kde-backup-2008-11-22/share/config/kmailrc
-rw------- 1 martin martin 247680 22. Nov
18:29 .kde-backup-2008-11-22/share/config/kmailrc
martin@shambhala:~> ls -l .kde-broken-2008-11-25/share/config/kmailrc
-rw------- 1 martin martin 116902 25. Nov
21:16 .kde-broken-2008-11-25/share/config/kmailrc
But no hole in it:
martin@shambhala:~>
su -c "xfs_bmap -v .kde-broken-2008-11-25/share/config/kmailrc"
Passwort:
.kde-broken-2008-11-25/share/config/kmailrc:
EXT: FILE-OFFSET BLOCK-RANGE AG AG-OFFSET TOTAL
0: [0..231]: 22444768..22444999 0 (22444768..22444999) 232
Similar stuff happened on the encfs encrypted KDE user that I use for
professional stuff - encfs is layered on top of XFS via FUSE:
shambhala:~ms> ls -l .kde-broken-2008-11-25/share/config/kdeglobals
-rw------- 1 ms teamix 4772 25. Nov
20:35 .kde-broken-2008-11-25/share/config/kdeglobals
shambhala:~ms> ls -l .kde-backup-2008-11-22/share/config/kdeglobals
-rw------- 1 ms teamix 6621 18. Nov
16:13 .kde-backup-2008-11-22/share/config/kdeglobals
Unfortunately I cannot check this one for holes directly as its on encfs:
shambhala:~ms> xfs_bmap -v .kde-broken-2008-11-25/share/config/kdeglobals
xfs_bmap: specified file
[".kde-broken-2008-11-25/share/config/kdeglobals"] is not on an XFS
filesystem
(Since encfs encrypted pathes as well and the file length may differ a
bit, the encrypted file could be difficult to find... well maybe by
date.)
And well my basket index was lost (http://basket.kde.org is a note taking
application):
shambhala:~ms>
ls -l .kde-backup-2008-11-22/share/apps/basket/baskets/baskets.xml
-rw-r--r-- 1 ms teamix 10271 18. Nov
15:03 .kde-backup-2008-11-22/share/apps/basket/baskets/baskets.xml
shambhala:~ms>
ls -l .kde-broken-2008-11-25/share/apps/basket/baskets/baskets.xml
-rw-r--r-- 1 ms teamix 1905 25. Nov
13:24 .kde-broken-2008-11-25/share/apps/basket/baskets/baskets.xml
But since I found no holes in the other file and AFAIK the truncated file
problem has long been solved - anyone has a idea, how this could have
happened?
I fixed it by copying ~/.kde/share/config for my private user, and
complete ~/.kde for my company user from the backup last weekend. In
~/.kde/share/apps for the private too much has changed since the backup,
thus for the moment I left it. I do not miss anything in there right now,
but feel a tad bit uncomfortable that files might be truncated in there
also and I might only find out later when I already worked with them for
some time.
I thought that KDE maybe was writing those files at the moment. But its a
bit wide-spread for that. And it must have happened that it wrote to
config / app files on two user accounts. Might be the explaination, but I
am not completely convinced.
I did not have seen a thing like this since at least a year I think.
This is with:
martin@shambhala:~> cat /proc/version
Linux version 2.6.27.7-tp42-toi-3.0-rc7a (martin@shambhala) (gcc version
4.3.2 (Debian 4.3.2-1) ) #1 PREEMPT Mon Nov 24 11:30:39 CET 2008
I did not find anything cumbersone in /var/log/syslog... no XFS crash or
something like this.
If no one has any idea I take this as really bad luck. Might really be
that I turned off the machine while quite some KDE files were being
written out. I know that some KDE applications make backups prior to
overwriting. Maybe something could be improved in KDE applications. If
so, I would be interested to hear and I would file bug reports with KDE.
Well actually thats my bet already. XFS appears to be fine.
Ciao,
--
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: truncated files
2008-11-25 21:44 Martin Steigerwald
@ 2008-11-25 22:27 ` Dave Chinner
2008-11-26 8:49 ` Martin Steigerwald
0 siblings, 1 reply; 7+ messages in thread
From: Dave Chinner @ 2008-11-25 22:27 UTC (permalink / raw)
To: Martin Steigerwald; +Cc: linux-xfs
On Tue, Nov 25, 2008 at 10:44:14PM +0100, Martin Steigerwald wrote:
>
> Hi!
>
> Today on one try to hibernate via tuxonice it machine appeared dead. I am
^^^^^^^^^
When (not if) suspend to disk/resume fails, you get to keep all the broken
pieces of your filesystem. It works most of the time, but it has
some fundamentally broken corner cases that you probably just
hit....
> Similar stuff happened on the encfs encrypted KDE user that I use for
> professional stuff - encfs is layered on top of XFS via FUSE:
IIRC, FUSE cannot be suspended safely at all, so expect corruption
of FUSE filesystems on a failed suspend.
I've never had a system that suspends reliably (let alone resumes
from the suspend) so it's no real surprise that I don't trust
suspend to disk....
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: truncated files
2008-11-25 22:27 ` Dave Chinner
@ 2008-11-26 8:49 ` Martin Steigerwald
2008-11-26 22:59 ` Dave Chinner
0 siblings, 1 reply; 7+ messages in thread
From: Martin Steigerwald @ 2008-11-26 8:49 UTC (permalink / raw)
To: xfs
Am Dienstag 25 November 2008 schrieb Dave Chinner:
> On Tue, Nov 25, 2008 at 10:44:14PM +0100, Martin Steigerwald wrote:
> > Hi!
> >
> > Today on one try to hibernate via tuxonice it machine appeared dead.
> > I am
>
> ^^^^^^^^^
> When (not if) suspend to disk/resume fails, you get to keep all the
> broken pieces of your filesystem. It works most of the time, but it has
> some fundamentally broken corner cases that you probably just
> hit....
Well I use TuxOnIce for a reason! I had uptimes of up to 70 days with it
already. And they are usually only interrupted by kernel updates or
manual shutdowns. I was never convinced by in-kernel solutions for
hibernate.
> > Similar stuff happened on the encfs encrypted KDE user that I use for
> > professional stuff - encfs is layered on top of XFS via FUSE:
>
> IIRC, FUSE cannot be suspended safely at all, so expect corruption
> of FUSE filesystems on a failed suspend.
Ok, fair enough. My private data isn't on encfs tough.
> I've never had a system that suspends reliably (let alone resumes
> from the suspend) so it's no real surprise that I don't trust
> suspend to disk....
Well I take it as bad luck then, especially since there are no hints that
XFS had a problem. I am not sure whether the machine really was dead, but
I can't reproduce what exactly happened. So thats it.
Ciao,
--
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: truncated files
2008-11-26 8:49 ` Martin Steigerwald
@ 2008-11-26 22:59 ` Dave Chinner
2008-11-28 22:02 ` Martin Steigerwald
0 siblings, 1 reply; 7+ messages in thread
From: Dave Chinner @ 2008-11-26 22:59 UTC (permalink / raw)
To: Martin Steigerwald; +Cc: xfs
On Wed, Nov 26, 2008 at 09:49:18AM +0100, Martin Steigerwald wrote:
> Am Dienstag 25 November 2008 schrieb Dave Chinner:
> > On Tue, Nov 25, 2008 at 10:44:14PM +0100, Martin Steigerwald wrote:
> > > Hi!
> > >
> > > Today on one try to hibernate via tuxonice it machine appeared dead.
> > > I am
> >
> > ^^^^^^^^^
> > When (not if) suspend to disk/resume fails, you get to keep all the
> > broken pieces of your filesystem. It works most of the time, but it has
> > some fundamentally broken corner cases that you probably just
> > hit....
>
> Well I use TuxOnIce for a reason! I had uptimes of up to 70 days with it
> already. And they are usually only interrupted by kernel updates or
> manual shutdowns. I was never convinced by in-kernel solutions for
> hibernate.
Sure, though I'm not convinced that TuxOnIce is any better because
it still uses the same fundamental design as the in-kernel ones.
> > I've never had a system that suspends reliably (let alone resumes
> > from the suspend) so it's no real surprise that I don't trust
> > suspend to disk....
>
> Well I take it as bad luck then, especially since there are no hints that
> XFS had a problem. I am not sure whether the machine really was dead, but
> I can't reproduce what exactly happened. So thats it.
And therein lies the problem. I can't get suspend/resume to work
reliably on anything I own, so I can't do anything about problems
reported as a result of suspend/resume. Hell, I even considered
running linux on my new laptop inside a virtual machine on windows
just so I could have functioning suspend/resume....
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: truncated files
2008-11-26 22:59 ` Dave Chinner
@ 2008-11-28 22:02 ` Martin Steigerwald
2008-11-28 22:39 ` Martin Steigerwald
0 siblings, 1 reply; 7+ messages in thread
From: Martin Steigerwald @ 2008-11-28 22:02 UTC (permalink / raw)
To: xfs
Am Mittwoch 26 November 2008 schrieb Dave Chinner:
> On Wed, Nov 26, 2008 at 09:49:18AM +0100, Martin Steigerwald wrote:
> > Am Dienstag 25 November 2008 schrieb Dave Chinner:
> > > On Tue, Nov 25, 2008 at 10:44:14PM +0100, Martin Steigerwald wrote:
> > > > Hi!
> > > >
> > > > Today on one try to hibernate via tuxonice it machine appeared
> > > > dead. I am
> > >
> > > ^^^^^^^^^
> > > When (not if) suspend to disk/resume fails, you get to keep all the
> > > broken pieces of your filesystem. It works most of the time, but it
> > > has some fundamentally broken corner cases that you probably just
> > > hit....
> >
> > Well I use TuxOnIce for a reason! I had uptimes of up to 70 days with
> > it already. And they are usually only interrupted by kernel updates
> > or manual shutdowns. I was never convinced by in-kernel solutions for
> > hibernate.
>
> Sure, though I'm not convinced that TuxOnIce is any better because
> it still uses the same fundamental design as the in-kernel ones.
Might be.
But something is fishy here. I had it a second time today. This time I
know for sure that the machine freezed hard. Mouse pointer froze and the
machine didn't even respond to a ping anymore. Nothing in logs - doesn't
surprise me.
I didn't have this issue with 2.6.26, and I also don't think I had it with
2.6.27.5. I will downgrade to 2.6.27.5 now.
> > > I've never had a system that suspends reliably (let alone resumes
> > > from the suspend) so it's no real surprise that I don't trust
> > > suspend to disk....
> >
> > Well I take it as bad luck then, especially since there are no hints
> > that XFS had a problem. I am not sure whether the machine really was
> > dead, but I can't reproduce what exactly happened. So thats it.
>
> And therein lies the problem. I can't get suspend/resume to work
> reliably on anything I own, so I can't do anything about problems
> reported as a result of suspend/resume. Hell, I even considered
> running linux on my new laptop inside a virtual machine on windows
> just so I could have functioning suspend/resume....
Then you really had back luck. TuxOnIce works stable for me - at least it
did so for a long long while, until my recent kernel.
Ciao,
--
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: truncated files
2008-11-28 22:02 ` Martin Steigerwald
@ 2008-11-28 22:39 ` Martin Steigerwald
0 siblings, 0 replies; 7+ messages in thread
From: Martin Steigerwald @ 2008-11-28 22:39 UTC (permalink / raw)
To: xfs
Am Freitag 28 November 2008 schrieb Martin Steigerwald:
> Am Mittwoch 26 November 2008 schrieb Dave Chinner:
> > On Wed, Nov 26, 2008 at 09:49:18AM +0100, Martin Steigerwald wrote:
> > > Am Dienstag 25 November 2008 schrieb Dave Chinner:
> > > > On Tue, Nov 25, 2008 at 10:44:14PM +0100, Martin Steigerwald
wrote:
> > > > > Hi!
> > > > >
> > > > > Today on one try to hibernate via tuxonice it machine appeared
> > > > > dead. I am
> > > >
> > > > ^^^^^^^^^
> > > > When (not if) suspend to disk/resume fails, you get to keep all
> > > > the broken pieces of your filesystem. It works most of the time,
> > > > but it has some fundamentally broken corner cases that you
> > > > probably just hit....
> > >
> > > Well I use TuxOnIce for a reason! I had uptimes of up to 70 days
> > > with it already. And they are usually only interrupted by kernel
> > > updates or manual shutdowns. I was never convinced by in-kernel
> > > solutions for hibernate.
> >
> > Sure, though I'm not convinced that TuxOnIce is any better because
> > it still uses the same fundamental design as the in-kernel ones.
>
> Might be.
>
> But something is fishy here. I had it a second time today. This time I
> know for sure that the machine freezed hard. Mouse pointer froze and
> the machine didn't even respond to a ping anymore. Nothing in logs -
> doesn't surprise me.
>
> I didn't have this issue with 2.6.26, and I also don't think I had it
> with 2.6.27.5. I will downgrade to 2.6.27.5 now.
I wonder about those truncated files nonetheless. As I don't think that
KDE is writing config files all the time. Well I might be wrong, but I
didn't even change KDE configuration during time of the crash... OTOH XFS
uses a in memory inode size and should be safe with the point in time
when it writes the size to disk as far as I read here. Well this time at
least again the file "kdeglobals" was affected and this file might be
written rather often.
Anyway, since I am not even using an official TuxOnIce patch... lets see
what the downgrade to 2.6.27.5 brings - no time to really dig deeper into
that right now.
Maybe I should update notebook kernels a little less. 2.6.25.3,6,8,10
worked nicer than 2.6.26.5 which has had hangs on USB sound on my T23
Amarok machine, especially after several hibernate cycles.
Or use the Debian distro kernel, but I never had much luck with hibernate
with that. Might be worth trying again.
--
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2008-11-28 22:45 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-11-25 22:03 truncated files Martin Steigerwald
-- strict thread matches above, loose matches on Subject: below --
2008-11-25 21:44 Martin Steigerwald
2008-11-25 22:27 ` Dave Chinner
2008-11-26 8:49 ` Martin Steigerwald
2008-11-26 22:59 ` Dave Chinner
2008-11-28 22:02 ` Martin Steigerwald
2008-11-28 22:39 ` Martin Steigerwald
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox