* [linux-lvm] Strange LVM Error With AoE Disks
@ 2007-02-07 14:16 Jayson Vantuyl
2007-02-07 14:44 ` Patrick Caulfield
2007-02-07 19:21 ` [linux-lvm] " Ed L. Cashin
0 siblings, 2 replies; 5+ messages in thread
From: Jayson Vantuyl @ 2007-02-07 14:16 UTC (permalink / raw)
To: Linux LVM List; +Cc: Coraid Support
[-- Attachment #1: Type: text/plain, Size: 2656 bytes --]
Hello,
We have been using Coraid's ATA-Over-Ethernet shelves for a while
with much success.
Recently, we added a second shelf (numbered 1) to our first shelf
(numbered 0). CLVM has been running on the old shelf perfectly fine.
As soon as I added the second shelf, attempting to lvcreate a new lv
utilizing the new disks generated roughly the following errors:
Error locking on node ey00-02: Internal lvm error, check syslog
Error locking on node ey00-05: Internal lvm error, check syslog
Error locking on node ey00-01: Internal lvm error, check syslog
Error locking on node ey00-00: Internal lvm error, check syslog
Error locking on node ey00-04: Internal lvm error, check syslog
Error locking on node ey00-03: Internal lvm error, check syslog
Failed to activate new LV.
All of the nodes show the following errors in syslog:
Feb 7 06:09:36 ey00-00 lvm[4869]: Couldn't find all physical volumes
for volume group ey00-data.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
'0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
for volume group ey00-data.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
'0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
for volume group ey00-data.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
'0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
for volume group ey00-data.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
'0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
for volume group ey00-data.
Feb 7 06:09:37 ey00-00 lvm[4869]: Volume group for uuid not found:
WWbD8SXOsAJzDYRCFQiciQho84Rl99nVF7QbO0ArRxnH4cZeKgzG0Nx4gbEhgALU
Inspecting the lvm.conf shows that these devices are the new ones
that were added.
Even more bizarre, pvscan finds them just fine on all nodes.
The only thing I can note about these devices that is particularly
different is that they appear to be using minor numbers above 256.
Note this ls output:
brw-rw---- 1 root disk 152, 288 Feb 7 04:43 /dev/etherd/e1.2
brw-rw---- 1 root disk 152, 289 Feb 7 06:10 /dev/etherd/e1.2p1
brw-rw---- 1 root disk 152, 304 Feb 7 04:44 /dev/etherd/e1.3
brw-rw---- 1 root disk 152, 305 Feb 7 06:10 /dev/etherd/e1.3p1
Is there a known problem with LVM or CLVM related to large device
minor numbers?
--
Jayson Vantuyl
Systems Architect
Engine Yard
jvantuyl@engineyard.com
[-- Attachment #2: Type: text/html, Size: 5927 bytes --]
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [linux-lvm] Strange LVM Error With AoE Disks
2007-02-07 14:16 [linux-lvm] Strange LVM Error With AoE Disks Jayson Vantuyl
@ 2007-02-07 14:44 ` Patrick Caulfield
2007-02-07 22:23 ` Jayson Vantuyl
2007-02-07 19:21 ` [linux-lvm] " Ed L. Cashin
1 sibling, 1 reply; 5+ messages in thread
From: Patrick Caulfield @ 2007-02-07 14:44 UTC (permalink / raw)
To: LVM general discussion and development; +Cc: Coraid Support
Jayson Vantuyl wrote:
> Hello,
>
> We have been using Coraid's ATA-Over-Ethernet shelves for a while with
> much success.
>
> Recently, we added a second shelf (numbered 1) to our first shelf
> (numbered 0). CLVM has been running on the old shelf perfectly fine.
>
> As soon as I added the second shelf, attempting to lvcreate a new lv
> utilizing the new disks generated roughly the following errors:
>
> Error locking on node ey00-02: Internal lvm error, check syslog
> Error locking on node ey00-05: Internal lvm error, check syslog
> Error locking on node ey00-01: Internal lvm error, check syslog
> Error locking on node ey00-00: Internal lvm error, check syslog
> Error locking on node ey00-04: Internal lvm error, check syslog
> Error locking on node ey00-03: Internal lvm error, check syslog
> Failed to activate new LV.
>
> All of the nodes show the following errors in syslog:
>
> Feb 7 06:09:36 ey00-00 lvm[4869]: Couldn't find all physical volumes
> for volume group ey00-data.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
> '0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
> for volume group ey00-data.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
> '0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
> for volume group ey00-data.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
> '0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
> for volume group ey00-data.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
> '0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find all physical volumes
> for volume group ey00-data.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Volume group for uuid not found:
> WWbD8SXOsAJzDYRCFQiciQho84Rl99nVF7QbO0ArRxnH4cZeKgzG0Nx4gbEhgALU
>
> Inspecting the lvm.conf shows that these devices are the new ones that
> were added.
>
> Even more bizarre, pvscan finds them just fine on all nodes.
>
> The only thing I can note about these devices that is particularly
> different is that they appear to be using minor numbers above 256. Note
> this ls output:
>
> brw-rw---- 1 root disk 152, 288 Feb 7 04:43 /dev/etherd/e1.2
> brw-rw---- 1 root disk 152, 289 Feb 7 06:10 /dev/etherd/e1.2p1
> brw-rw---- 1 root disk 152, 304 Feb 7 04:44 /dev/etherd/e1.3
> brw-rw---- 1 root disk 152, 305 Feb 7 06:10 /dev/etherd/e1.3p1
>
> Is there a known problem with LVM or CLVM related to large device minor
> numbers?
Did you restart clvmd after adding the new disks? There is a bug where it
doesn't always recognise new hardware.
--
patrick
^ permalink raw reply [flat|nested] 5+ messages in thread
* [linux-lvm] Re: Strange LVM Error With AoE Disks
2007-02-07 14:16 [linux-lvm] Strange LVM Error With AoE Disks Jayson Vantuyl
2007-02-07 14:44 ` Patrick Caulfield
@ 2007-02-07 19:21 ` Ed L. Cashin
1 sibling, 0 replies; 5+ messages in thread
From: Ed L. Cashin @ 2007-02-07 19:21 UTC (permalink / raw)
To: Jayson Vantuyl; +Cc: Coraid Support, Linux LVM List
On Wed, Feb 07, 2007 at 08:16:23AM -0600, Jayson Vantuyl wrote:
...
> All of the nodes show the following errors in syslog:
>
> Feb 7 06:09:36 ey00-00 lvm[4869]: Couldn't find all physical volumes
> for volume group ey00-data.
> Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
> '0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
...
> Even more bizarre, pvscan finds them just fine on all nodes.
>
> The only thing I can note about these devices that is particularly
> different is that they appear to be using minor numbers above 256.
...
> Is there a known problem with LVM or CLVM related to large device
> minor numbers?
Hello, Jayson Vantuyl. I just wanted to chime in to say that we are
not aware of such a problem, but we wouldn't necessarily have heard
about it, since the population using CLVM on AoE is probably not large
yet. (Many use LVM, but I haven't heard much feedback from folks
using CLVM.)
I haven't looked at the CLVM sources really recently, but we have seen
the large minor device number problem before in older versions of LVM
where glibc's macros are used to work with the major and minor parts
that are encoded within device numbers. So if there's a problem, it
might be possible to find it by looking for those macros.
--
Ed L Cashin <ecashin@coraid.com>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [linux-lvm] Strange LVM Error With AoE Disks
2007-02-07 14:44 ` Patrick Caulfield
@ 2007-02-07 22:23 ` Jayson Vantuyl
0 siblings, 0 replies; 5+ messages in thread
From: Jayson Vantuyl @ 2007-02-07 22:23 UTC (permalink / raw)
To: LVM general discussion and development; +Cc: Coraid Support
[-- Attachment #1: Type: text/plain, Size: 311 bytes --]
No, I didn't, actually. That worked. Thanks a MILLION!
On Feb 7, 2007, at 8:44 AM, Patrick Caulfield wrote:
> Did you restart clvmd after adding the new disks? There is a bug
> where it
> doesn't always recognise new hardware.
--
Jayson Vantuyl
Systems Architect
Engine Yard
jvantuyl@engineyard.com
[-- Attachment #2: Type: text/html, Size: 3077 bytes --]
^ permalink raw reply [flat|nested] 5+ messages in thread
* [linux-lvm] Re: Strange LVM Error With AoE Disks
@ 2007-02-22 13:11 Daniele Palumbo
0 siblings, 0 replies; 5+ messages in thread
From: Daniele Palumbo @ 2007-02-22 13:11 UTC (permalink / raw)
To: linux-lvm; +Cc: Ed L Cashin
First of all, sorry but i went on this ML after feb 7 2007, so i have to make
a 'manual reply', hoping that thread will be not broken.
> > Feb 7 06:09:36 ey00-00 lvm[4869]: Couldn't find all physical volumes
> > for volume group ey00-data.
> > Feb 7 06:09:37 ey00-00 lvm[4869]: Couldn't find device with uuid
> > '0Cot9Z-BHjK-2Nkw-eEdy-fbFF-Wh1q-qhRaut'.
[...]
> > The only thing I can note about these devices that is particularly
> > different is that they appear to be using minor numbers above 256.
i have this error and setup too (e4.0, e5.0).
> I haven't looked at the CLVM sources really recently, but we have seen
> the large minor device number problem before in older versions of LVM
> where glibc's macros are used to work with the major and minor parts
> that are encoded within device numbers. So if there's a problem, it
> might be possible to find it by looking for those macros.
i am using vblade daemon on gentoo (waiting for my coraid...).
i am now using e0.1 and e0.2 (i have changed my setup), but the problem was
that i have a clustered volume but not the volume on the other server.
i am now trying to explain with ascii:
xen2 and backuphost are 2 machine.
Xen2Volume is a vg that belong to xen2, XenBackupVolume belong to backuphost.
now on:
i have vblade on backuphost, and xen2 can read it.
but, xen2 won't export (my choise) its Xen2Volume.
note that Xen2Volume is marked clustered (vgchange -cy Xen2Volume)
so, when i create a lv in Xen2Volume, an error will came on screen, cause
backuphost can't see the volume.
clvm knows by cman that there is Xen2Volume but cannot find it!
setting clustered bit of Xen2Volume off, will resolve problem.
or, you can export your Xen2Volume to your backuphost...
HTH
bye
d.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2007-02-22 13:10 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-02-07 14:16 [linux-lvm] Strange LVM Error With AoE Disks Jayson Vantuyl
2007-02-07 14:44 ` Patrick Caulfield
2007-02-07 22:23 ` Jayson Vantuyl
2007-02-07 19:21 ` [linux-lvm] " Ed L. Cashin
-- strict thread matches above, loose matches on Subject: below --
2007-02-22 13:11 Daniele Palumbo
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).