* UBI: recover_peb and power cut safety
@ 2016-06-16 8:37 Jörg Pfähler
2016-06-16 9:46 ` Richard Weinberger
0 siblings, 1 reply; 5+ messages in thread
From: Jörg Pfähler @ 2016-06-16 8:37 UTC (permalink / raw)
To: linux-mtd
Hi,
I would greatly appreciate some clarification with respect to power cut safety
during writing of an erase block in UBI, specifically power cut safety of
recover_peb.
During a normal write operation (ubi_eba_write_leb in mtd/ubi/eba.c) UBI tries
to move the contents of the block (and the new contents) to a new location via
recover_peb, if the write fails. However, recover_peb does not seem to use the
capability to exchange the (logical) block atomically (as
ubi_eba_atomic_leb_change in mtd/ubi/eba.c does). Specifically, it does not
seem to write the amount of data and its checksum to the VID header. Thus, if
the system crashes in the middle of recover_peb before the old/broken block
could be erased, we are left with a newer version of the block (the sequence
number in the header is increased by recover_peb), but without having moved
all the contents of the old block. This would obviously lead to data loss.
Thus, It seems to me that recover_peb (and therefore ubi_eba_write_leb) is not
power cut safe or is there some other mechanism distinct from the one used by
ubi_eba_atomic_leb_change to achieve this? If not I would suggest using
ubi_eba_atomic_leb_change in ubi_eba_write_leb instead of recover_peb.
Thanks in advance for any clarifications!
Regards,
Jörg
--------------------------------------------------------------------------------------
Jörg Pfähler
Lehrstuhl für Softwaretechnik
Institut für Software and Systems Engineering
Universität Augsburg
Universitätsstr. 6a, Raum 3014
tel: (+49) 821/598-2229
e-mail: joerg.pfaehler@informatik.uni-
augsburg.de<mailto:joerg.pfaehler@informatik.uni-augsburg.de>
--------------------------------------------------------------------------------------
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: UBI: recover_peb and power cut safety
2016-06-16 8:37 UBI: recover_peb and power cut safety Jörg Pfähler
@ 2016-06-16 9:46 ` Richard Weinberger
2016-06-16 10:03 ` Richard Weinberger
2016-06-20 13:48 ` Jörg Pfähler
0 siblings, 2 replies; 5+ messages in thread
From: Richard Weinberger @ 2016-06-16 9:46 UTC (permalink / raw)
To: Jörg Pfähler; +Cc: linux-mtd@lists.infradead.org
Jörg,
On Thu, Jun 16, 2016 at 10:37 AM, Jörg Pfähler <pfaehler@isse.de> wrote:
> Hi,
>
> I would greatly appreciate some clarification with respect to power cut safety
> during writing of an erase block in UBI, specifically power cut safety of
> recover_peb.
>
> During a normal write operation (ubi_eba_write_leb in mtd/ubi/eba.c) UBI tries
> to move the contents of the block (and the new contents) to a new location via
> recover_peb, if the write fails. However, recover_peb does not seem to use the
> capability to exchange the (logical) block atomically (as
> ubi_eba_atomic_leb_change in mtd/ubi/eba.c does). Specifically, it does not
> seem to write the amount of data and its checksum to the VID header. Thus, if
> the system crashes in the middle of recover_peb before the old/broken block
> could be erased, we are left with a newer version of the block (the sequence
> number in the header is increased by recover_peb), but without having moved
> all the contents of the old block. This would obviously lead to data loss.
> Thus, It seems to me that recover_peb (and therefore ubi_eba_write_leb) is not
> power cut safe or is there some other mechanism distinct from the one used by
> ubi_eba_atomic_leb_change to achieve this? If not I would suggest using
> ubi_eba_atomic_leb_change in ubi_eba_write_leb instead of recover_peb.
Hmm, you are right, if ubi_eba_write() is facing -EIO from the MTD driver we can
lose the whole erase block upon power cut.
So you found a bug. :-)
Artem, can you tell more on this?
I'd guess that recover_peb() is older than ubi_eba_atomic_leb_change() and
therefore it was not used.
And nobody noticed so far since the condition is hard to hit.
That said, switching to ubi_eba_atomic_leb_change() seems like a good
plan to me.
Jörg, please send a patch and explain how you tested it.
--
Thanks,
//richard
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: UBI: recover_peb and power cut safety
2016-06-16 9:46 ` Richard Weinberger
@ 2016-06-16 10:03 ` Richard Weinberger
2016-06-17 6:37 ` Artem Bityutskiy
2016-06-20 13:48 ` Jörg Pfähler
1 sibling, 1 reply; 5+ messages in thread
From: Richard Weinberger @ 2016-06-16 10:03 UTC (permalink / raw)
To: Jörg Pfähler, Artem Bityutskiy; +Cc: linux-mtd@lists.infradead.org
Forgot to CC Artem.
On Thu, Jun 16, 2016 at 11:46 AM, Richard Weinberger
<richard.weinberger@gmail.com> wrote:
> Jörg,
>
> On Thu, Jun 16, 2016 at 10:37 AM, Jörg Pfähler <pfaehler@isse.de> wrote:
>> Hi,
>>
>> I would greatly appreciate some clarification with respect to power cut safety
>> during writing of an erase block in UBI, specifically power cut safety of
>> recover_peb.
>>
>> During a normal write operation (ubi_eba_write_leb in mtd/ubi/eba.c) UBI tries
>> to move the contents of the block (and the new contents) to a new location via
>> recover_peb, if the write fails. However, recover_peb does not seem to use the
>> capability to exchange the (logical) block atomically (as
>> ubi_eba_atomic_leb_change in mtd/ubi/eba.c does). Specifically, it does not
>> seem to write the amount of data and its checksum to the VID header. Thus, if
>> the system crashes in the middle of recover_peb before the old/broken block
>> could be erased, we are left with a newer version of the block (the sequence
>> number in the header is increased by recover_peb), but without having moved
>> all the contents of the old block. This would obviously lead to data loss.
>> Thus, It seems to me that recover_peb (and therefore ubi_eba_write_leb) is not
>> power cut safe or is there some other mechanism distinct from the one used by
>> ubi_eba_atomic_leb_change to achieve this? If not I would suggest using
>> ubi_eba_atomic_leb_change in ubi_eba_write_leb instead of recover_peb.
>
> Hmm, you are right, if ubi_eba_write() is facing -EIO from the MTD driver we can
> lose the whole erase block upon power cut.
> So you found a bug. :-)
>
> Artem, can you tell more on this?
> I'd guess that recover_peb() is older than ubi_eba_atomic_leb_change() and
> therefore it was not used.
> And nobody noticed so far since the condition is hard to hit.
>
> That said, switching to ubi_eba_atomic_leb_change() seems like a good
> plan to me.
> Jörg, please send a patch and explain how you tested it.
>
> --
> Thanks,
> //richard
--
Thanks,
//richard
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: UBI: recover_peb and power cut safety
2016-06-16 10:03 ` Richard Weinberger
@ 2016-06-17 6:37 ` Artem Bityutskiy
0 siblings, 0 replies; 5+ messages in thread
From: Artem Bityutskiy @ 2016-06-17 6:37 UTC (permalink / raw)
To: Richard Weinberger, Jörg Pfähler; +Cc: linux-mtd@lists.infradead.org
On Thu, 2016-06-16 at 12:03 +0200, Richard Weinberger wrote:
> Forgot to CC Artem.
>
> On Thu, Jun 16, 2016 at 11:46 AM, Richard Weinberger
> <richard.weinberger@gmail.com> wrote:
> >
> > Jörg,
> >
> > On Thu, Jun 16, 2016 at 10:37 AM, Jörg Pfähler <pfaehler@isse.de>
> > wrote:
> > >
> > > Hi,
> > >
> > > I would greatly appreciate some clarification with respect to
> > > power cut safety
> > > during writing of an erase block in UBI, specifically power cut
> > > safety of
> > > recover_peb.
> > >
> > > During a normal write operation (ubi_eba_write_leb in
> > > mtd/ubi/eba.c) UBI tries
> > > to move the contents of the block (and the new contents) to a new
> > > location via
> > > recover_peb, if the write fails. However, recover_peb does not
> > > seem to use the
> > > capability to exchange the (logical) block atomically (as
> > > ubi_eba_atomic_leb_change in mtd/ubi/eba.c does). Specifically,
> > > it does not
> > > seem to write the amount of data and its checksum to the VID
> > > header. Thus, if
> > > the system crashes in the middle of recover_peb before the
> > > old/broken block
> > > could be erased, we are left with a newer version of the block
> > > (the sequence
> > > number in the header is increased by recover_peb), but without
> > > having moved
> > > all the contents of the old block. This would obviously lead to
> > > data loss.
> > > Thus, It seems to me that recover_peb (and therefore
> > > ubi_eba_write_leb) is not
> > > power cut safe or is there some other mechanism distinct from the
> > > one used by
> > > ubi_eba_atomic_leb_change to achieve this? If not I would suggest
> > > using
> > > ubi_eba_atomic_leb_change in ubi_eba_write_leb instead of
> > > recover_peb.
> > Hmm, you are right, if ubi_eba_write() is facing -EIO from the MTD
> > driver we can
> > lose the whole erase block upon power cut.
> > So you found a bug. :-)
> >
> > Artem, can you tell more on this?
> > I'd guess that recover_peb() is older than
> > ubi_eba_atomic_leb_change() and
> > therefore it was not used.
> > And nobody noticed so far since the condition is hard to hit.
> >
> > That said, switching to ubi_eba_atomic_leb_change() seems like a
> > good
> > plan to me.
> > Jörg, please send a patch and explain how you tested it.
Yes indeed, very bad bug, good catch.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: UBI: recover_peb and power cut safety
2016-06-16 9:46 ` Richard Weinberger
2016-06-16 10:03 ` Richard Weinberger
@ 2016-06-20 13:48 ` Jörg Pfähler
1 sibling, 0 replies; 5+ messages in thread
From: Jörg Pfähler @ 2016-06-20 13:48 UTC (permalink / raw)
To: Richard Weinberger; +Cc: linux-mtd@lists.infradead.org
Hi Richard,
> Hmm, you are right, if ubi_eba_write() is facing -EIO from the MTD driver we
> can lose the whole erase block upon power cut.
> So you found a bug. :-)
>
> Artem, can you tell more on this?
> I'd guess that recover_peb() is older than ubi_eba_atomic_leb_change() and
> therefore it was not used.
> And nobody noticed so far since the condition is hard to hit.
>
> That said, switching to ubi_eba_atomic_leb_change() seems like a good
> plan to me.
> Jörg, please send a patch and explain how you tested it.
First of all, thanks for confirming this bug so quickly.
However, we would like to refrain from providing a patch. A little inspection
of the code revealed that the locking is different for both methods, i.e., one
cannot just call ubi_eba_atomic_leb_change. So it would be a larger change to
the code than we thought. Furthermore, we have no experience in writing or
debugging code for the linux kernel and we own only one flash chip for
testing, so we would not feel very confident in any patch we could provide.
We found the bug rather in the formal verification of a model of UBI/UBIFS
(see http://www.isse.de/flashix for more details on our project), where it
turned out that we modeled the feature differently.
MfG,
Jörg
--------------------------------------------------------------------------------------
Jörg Pfähler
Lehrstuhl für Softwaretechnik
Institut für Software and Systems Engineering
Universität Augsburg
Universitätsstr. 6a, Raum 3014
tel: (+49) 821/598-2229
e-mail: pfaehler@isse.de<mailto:pfaehler@isse.de>
--------------------------------------------------------------------------------------
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2016-06-20 13:48 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-06-16 8:37 UBI: recover_peb and power cut safety Jörg Pfähler
2016-06-16 9:46 ` Richard Weinberger
2016-06-16 10:03 ` Richard Weinberger
2016-06-17 6:37 ` Artem Bityutskiy
2016-06-20 13:48 ` Jörg Pfähler
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).