From: "John Stoffel" <john@stoffel.org>
To: Phil Turmel <philip@turmel.org>
Cc: John Stoffel <john@stoffel.org>,
Peter Sanders <plsander@gmail.com>,
Wols Lists <antlists@youngman.org.uk>, NeilBrown <neilb@suse.com>,
linux-raid@vger.kernel.org
Subject: Re: RAID 6, 6 device array - all devices lost superblock
Date: Sun, 28 Aug 2022 15:49:11 -0400 [thread overview]
Message-ID: <25355.50871.743993.605394@quad.stoffel.home> (raw)
In-Reply-To: <ee66bcbe-0a9b-57a6-439f-72cc46debe48@turmel.org>
>>>>> "Phil" == Phil Turmel <philip@turmel.org> writes:
Phil> Pssst! John,
Phil> All of my comments were attributed to Peter by your mail client. ):
Yeah... sometimes my mail reader gets confused when it cites previous
emails. I shoudl probably just drop to > only from now on.
Phil> On 8/28/22 14:45, John Stoffel wrote:
>>>>>>> "Peter" == Peter Sanders <plsander@gmail.com> writes:
>>
Peter> It was set up on the device level, not partitions. (I remember
Peter> getting some advice on the web that device was better than
Peter> partition... Yay for internet advice)
>>
>> Yeah, this is NOT good advice. Generally systems will not destroy
>> existing partition tables, but if they see an empty (to them)
>> disk... all bets are off.
>>
Peter> I'm surveying my other disks to see what I have available to do
Peter> the overlay attempt.
>>
>> They're small. They are sparse files, so just follow the
>> instructions.
>>
Peter> What are the size of the overlay files going to end up being?
>>
>> Not too large, but it depends on how much data is written to the
>> overlayfs to get your data back. If you follow the instructions on
>> this page:
>>
>> https://raid.wiki.kernel.org/index.php/Recovering_a_damaged_RAID
>>
>> It says to create a sparse file for each disk that is 1% of the size
>> of the disk. This can add up... you might need to add a blank disk to
>> your system to hold these.
>>
>> In this case, if you think you know which order the disks were in, you
>> could try to create the RAID6 array (but only using the overlayfs
>> devices!!!!!) I can't stress this enough.
>>
>>
Peter> I did run into UEFI vs AHCI issues early in the process.. they
Peter> are all set to non-UEFI.
>>
>> That's good.
>>
Peter> OS update was onto a new SSD...
>>
>> Ok. Do you have the old OS disk around by any chance? That might
>> give some pointers to how the disks are setup.. You could look in
>> /var/tmp/initrd/... for old mdadm.conf files, which might give more
>> details.
>>
Peter> On Sun, Aug 28, 2022, 12:47 Phil Turmel <philip@turmel.org> wrote:
>>
Peter> Hi Peter, et al,
>>
Peter> On 8/28/22 05:54, Wols Lists wrote:
>>>> On 28/08/2022 10:14, Wols Lists wrote:
>>>>> Currently I have no /dev/md* devices.
>>>>> I have access to the old mdadm.conf file - have tried assembling with
>>>>> it, with the default mdadm.conf, and with no mdadm.conf file in /etc
>>>>> and /etc/mdadm.
>>>>>
>>>>> It looks like the drives weren't partitioned :-( I think you're into
>>>>> forensics.
>>
Peter> It is too soon to say this. The supplied mdadm.conf file does not
Peter> contain specific partition information. It is possible the partition
Peter> tables have just been wiped.
>>
>>>> Whoops - my system froze while I was originally writing my reply, and I
>>>> forgot to put this into my rewrite ...
>>>>
>>>> Look up overlays in the wiki. I've never done it myself, but a fair few
>>>> people have said the instructions worked a treat.
>>>>
>>>> You're basically making the drives read-only (all writes get dumped into
>>>> the overlay file), and then re-creating the array over the top, so you
>>>> can test whether you got it right. If you don't, you just ditch the
>>>> overlays and start again, if you did get it right you can recreate the
>>>> array for real.
>>>>
>>>> Cheers,
>>>> Wol
>>
Peter> On 8/28/22 11:10, John Stoffel wrote:
>>>>>>>>> "Peter" == Peter Sanders <plsander@gmail.com> writes:
>>>>
Peter> have a RAID 6 array, 6 devices. Been running it for years without much issue.
Peter> Had hardware issues with my system - ended up replacing the
Peter> motherboard, video card, and power supply and re-installing the OS
Peter> (Debian 11).
>>>>
>>>> Can you give us details on the old vs new motherboard/cpu? It might
>>>> be that you need to tweak the BIOS of the motherboard to expose the
>>>> old SATA formats as well.
>>>>
>>>> Did you install debian onto a fresh boot disk? Is your BIOS setup to
>>>> only do the new form of booting from UEFI devices, so maybe check your
>>>> BIOS settings that the data drives are all in AHCI mode, or possibly
>>>> even in IDE mode. It all depends on how old the original hardware
>>>> was.
>>>>
>>>> I just recenly upgraded from a 2010 MB/CPU combo and I had to tweak
>>>> the BIOS defaults to see my disks. I guess I should do a clean
>>>> install from a blank disk, but I wanted to minimize downtime.
>>
Peter> It is important to end up in AHCI mode on all MOBO ports. If not set
Peter> that way now, please change them.
>>
>>>> Wols has some great advice here, and I heartily recommend that you use
>>>> overlayfs when doing your testing. Check the RAID WIKI for
>>>> suggestions.
>>
Peter> Concur.
>>
>>>> And don't panic! Your data is probably there, but just missing the
>>>> super blocks or partition tables.
>>
Peter> Both, I suspect.
>>
Peter> On 8/27/22 22:00, Peter Sanders wrote:
>>>> lsdrv ------------------------
>>>> PCI [nvme] 01:00.0 Non-Volatile memory controller: Phison Electronics
>>>> Corporation E12 NVMe Controller (rev 01)
>>>> └nvme nvme0 PCIe SSD {21112925606047}
>>>> └nvme0n1 238.47g [259:0] Partitioned (dos)
>>>> ├nvme0n1p1 485.00m [259:1] ext4 {f38776ac-1ce9-4fc8-ba50-94844b9f504e}
>>>> │└Mounted as /dev/nvme0n1p1 @ /boot
>>>> ├nvme0n1p2 1.00k [259:2] Partitioned (dos)
>>>> ├nvme0n1p5 60.54g [259:3] ext4 {5ee1c3c0-3a05-466c-9f98-f5807c8d813b}
>>>> │└Mounted as /dev/nvme0n1p5 @ /
>>>> ├nvme0n1p6 93.13g [259:4] ext4 {9064169f-4fe3-4836-a906-28c1b445cdff}
>>>> │└Mounted as /dev/nvme0n1p6 @ /var
>>>> ├nvme0n1p7 37.00m [259:5] ext4 {25e161ad-94a0-4298-afaf-18e2433766ee}
>>>> ├nvme0n1p8 82.89g [259:6] ext4 {ac874071-d759-4d33-b32f-83272f3eacd9}
>>>> │└Mounted as /dev/nvme0n1p8 @ /home
>>>> └nvme0n1p9 1.41g [259:7] swap {02cef84b-9a9d-4a0a-973c-fda1a78c533c}
>>>> PCI [pata_jmicron] 26:00.1 IDE interface: JMicron Technology Corp.
>>>> JMB368 IDE controller (rev 10)
>>>> └scsi 0:0:0:0 MAD DOG LS-DVDRW TSH652M {MAD_DOG_LS-DVDRW_TSH652M}
>>>> └sr0 1.00g [11:0] Empty/Unknown
>>>> PCI [ahci] 26:00.0 SATA controller: JMicron Technology Corp. JMB363
>>>> SATA/IDE Controller (rev 10)
>>>> └scsi 2:x:x:x [Empty]
>>>> PCI [ahci] 2b:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD]
>>>> FCH SATA Controller [AHCI mode] (rev 51)
>>>> ├scsi 6:0:0:0 ATA TOSHIBA HDWD130 {477ALBNAS}
>>>> │└sda 2.73t [8:0] Partitioned (PMBR)
>>>> └scsi 7:0:0:0 ATA TOSHIBA HDWD130 {Y7211KPAS}
>>>> └sdc 2.73t [8:32] Partitioned (gpt)
>>>> PCI [ahci] 2c:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD]
>>>> FCH SATA Controller [AHCI mode] (rev 51)
>>>> ├scsi 8:0:0:0 ATA WDC WD30EZRX-00D {WD-WCC1T0668790}
>>>> │└sdb 2.73t [8:16] Partitioned (gpt)
>>>> ├scsi 9:0:0:0 ATA WDC WD30EZRX-00D {WD-WCC4N0091255}
>>>> │└sdd 2.73t [8:48] Partitioned (gpt)
>>>> ├scsi 12:0:0:0 ATA WDC WD30EZRX-00M {WD-WCAWZ2669166}
>>>> │└sde 2.73t [8:64] Partitioned (gpt)
>>>> └scsi 13:0:0:0 ATA TOSHIBA HDWD130 {477ABEJAS}
>>>> └sdf 2.73t [8:80] Partitioned (gpt)
>>
Peter> Unfortunately, my lsdrv tool is not able to reconstruct missing parts.
Peter> It is most useful when used on a *good* system and *saved* for help
Peter> diagnosing *future* problems.
>>
Peter> Please share your /etc/fstab, and if you were using LVM on top of the
Peter> raid, share your lvm.conf and anything in /etc/lvm/backup.
>>
Peter> Please describe the layer(s) that were on top of the raid.
>>
Peter> We need to help you look for signatures, and it helps to be selective in
Peter> what signatures to look for.
>>
Peter> After that, we will want to figure out your raid's chunk size and data
Peter> offsets. If you know of a particular large file (8MB or larger) that is
Peter> sure to be in the raid and you happen to have a copy tucked away, then
Peter> my findHash[1] tool might be able to definitively determine those
Peter> values. (Time consuming, though.)
>>
Peter> Meanwhile, don't do *anything* that would write to those drives.
>>
Peter> Phil
>>
Peter> [1] https://github.com/pturmel/findHash
>>
next prev parent reply other threads:[~2022-08-28 19:50 UTC|newest]
Thread overview: 29+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-08-28 2:00 RAID 6, 6 device array - all devices lost superblock Peter Sanders
2022-08-28 9:14 ` Wols Lists
2022-08-28 9:54 ` Wols Lists
2022-08-28 16:47 ` Phil Turmel
[not found] ` <CAKAPSkJAQYsec-4zzcePbkJ7Ee0=sd_QvHj4Stnyineq+T8BXw@mail.gmail.com>
2022-08-28 17:16 ` Wols Lists
2022-08-28 18:45 ` John Stoffel
2022-08-28 19:36 ` Phil Turmel
2022-08-28 19:49 ` John Stoffel [this message]
2022-08-28 23:24 ` Peter Sanders
2022-08-29 13:12 ` Peter Sanders
2022-08-29 21:45 ` John Stoffel
2022-08-29 22:29 ` Eyal Lebedinsky
2022-08-29 23:53 ` Peter Sanders
2022-08-30 13:27 ` Peter Sanders
2022-08-30 18:03 ` Wols Lists
2022-08-31 17:48 ` Peter Sanders
2022-08-31 20:37 ` John Stoffel
2022-09-02 14:56 ` Peter Sanders
2022-09-02 18:52 ` Peter Sanders
2022-09-02 19:12 ` John Stoffel
2022-09-03 0:39 ` Peter Sanders
2022-09-03 5:51 ` Peter Sanders
2022-09-05 19:36 ` John Stoffel
2022-09-05 20:16 ` Peter Sanders
2022-09-05 19:25 ` John Stoffel
2022-08-28 15:10 ` John Stoffel
2022-08-28 17:11 ` Andy Smith
2022-08-28 17:22 ` Andy Smith
2022-08-28 17:34 ` Peter Sanders
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=25355.50871.743993.605394@quad.stoffel.home \
--to=john@stoffel.org \
--cc=antlists@youngman.org.uk \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@suse.com \
--cc=philip@turmel.org \
--cc=plsander@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox