Linux LVM users
 help / color / mirror / Atom feed
* [linux-lvm] Every now and then LVM is not recognized - LVM2 on RAID10
@ 2008-07-23 12:07 exxs.net - matevz sernc-urban
  2008-07-23 17:35 ` Lars Ellenberg
  0 siblings, 1 reply; 3+ messages in thread
From: exxs.net - matevz sernc-urban @ 2008-07-23 12:07 UTC (permalink / raw)
  To: linux-lvm

[-- Attachment #1: Type: text/plain, Size: 2435 bytes --]

Dear LVM-Users,

i am running into a very interesting situation today, that our LVM 
metadata is recognized only sometimes. LVM was running fine for some 
weeks and now from today on running pvscan 10 times one after another i 
get about 4 positive and 6 negative results (its somehow a coincidence 
that LVM is recognized).

* WORKING STATE:
server$ pvscan
   PV /dev/md2   VG mainvg   lvm2 [909.59 GB / 299.59 GB free]
   Total: 1 [909.59 GB] / in use: 1 [909.59 GB] / in no VG: 0 [0   ]

* NOT-WORKING STATE:
server$ pvscan
   No matching physical volumes found

notice: Same behaviour with vgscan and lvscan.

Our Setup:
* RAID-10 /dev/md2 on /dev/sd[a-d]5
cat /proc/mdstat
Personalities : [raid1] [raid10]
md2 : active raid10 sdb5[1] sda5[0] sdd5[3] sdc5[2]
       953778688 blocks 64K chunks 2 far-copies [4/4] [UUUU]

* LVM2 on top of /dev/md2 (no separate lvm partition)
lvm metadata backup is present and it is exactly the same as backups we 
did months ago

I will attach the lvm2 metadata file to this message (vg name: mainvg)

WORKING STATE:
server$ pvscan -vv
       Setting global/locking_type to 1
       File-based locking selected.
       Setting global/locking_dir to /var/lock/lvm
       Locking /var/lock/lvm/P_global WB
     Wiping cache of LVM-capable devices
       /dev/sndstat: stat failed: No such file or directory
     Wiping internal VG cache
     Walking through all physical volumes
       /dev/md2: size is 1907557376 sectors
       /dev/md2: lvm2 label detected
   PV /dev/md2   VG mainvg   lvm2 [909.59 GB / 299.59 GB free]
   Total: 1 [909.59 GB] / in use: 1 [909.59 GB] / in no VG: 0 [0   ]
       Unlocking /var/lock/lvm/P_global

NOT-WORKING STATE:
server$ pvscan -vv
       Setting global/locking_type to 1
       File-based locking selected.
       Setting global/locking_dir to /var/lock/lvm
       Locking /var/lock/lvm/P_global WB
     Wiping cache of LVM-capable devices
       /dev/sndstat: stat failed: No such file or directory
     Wiping internal VG cache
     Walking through all physical volumes
       /dev/md2: size is 1907557376 sectors
       /dev/md2: No label detected
   No matching physical volumes found
       Unlocking /var/lock/lvm/P_global


Any hints?

Where should it look regarding "/dev/md2: No label detected" vs. 
"/dev/md2: lvm2 label detected"??

It seems like RAID-10 is not sync'ed well maybe? But, mdstat reports all 
disks are up.

Thanks for your help.
Mat

[-- Attachment #2: mainvg.txt --]
[-- Type: text/plain, Size: 2374 bytes --]

# Generated by LVM2 version 2.02.38 (2008-06-11): Wed Jul 23 13:16:36 2008

contents = "Text Format Volume Group"
version = 1

description = "Created *after* executing 'vgcfgbackup -vvv'"

creation_host = "fileserver.exxs.net"	# Linux fileserver.exxs.net 2.6.18-6-xen-686 #1 SMP Sun Feb 10 22:43:13 UTC 2008 i686
creation_time = 1216811796	# Wed Jul 23 13:16:36 2008

mainvg {
	id = "mNtCkm-qYPB-RzWY-Kfke-asbD-xMDD-7ABoDq"
	seqno = 15
	status = ["RESIZEABLE", "READ", "WRITE"]
	extent_size = 8192		# 4 Megabytes
	max_lv = 0
	max_pv = 0

	physical_volumes {

		pv0 {
			id = "nKql7U-MLbt-5Grp-Hrp7-DOwQ-E2fd-R0n4aD"
			device = "/dev/md2"	# Hint only

			status = ["ALLOCATABLE"]
			dev_size = 1907557376	# 909.594 Gigabytes
			pe_start = 384
			pe_count = 232856	# 909.594 Gigabytes
		}
	}

	logical_volumes {

		yangc-root {
			id = "oSKqz7-2zJK-FXZ3-r79j-mFCS-2oaK-CIIcN7"
			status = ["READ", "WRITE", "VISIBLE"]
			read_ahead = 3072
			segment_count = 1

			segment1 {
				start_extent = 0
				extent_count = 2560	# 10 Gigabytes

				type = "striped"
				stripe_count = 1	# linear

				stripes = [
					"pv0", 51200
				]
			}
		}

		yangc-imap {
			id = "maitoY-2pNZ-eJt2-YgbA-QXlh-i7WC-xOSwp2"
			status = ["READ", "WRITE", "VISIBLE"]
			segment_count = 1

			segment1 {
				start_extent = 0
				extent_count = 25600	# 100 Gigabytes

				type = "striped"
				stripe_count = 1	# linear

				stripes = [
					"pv0", 53760
				]
			}
		}

		yangc-home {
			id = "hI0d5Q-GFRV-D3H1-0IiQ-IU8m-bnXw-LE5jdH"
			status = ["READ", "WRITE", "VISIBLE"]
			segment_count = 1

			segment1 {
				start_extent = 0
				extent_count = 20480	# 80 Gigabytes

				type = "striped"
				stripe_count = 1	# linear

				stripes = [
					"pv0", 79360
				]
			}
		}

		test.exxs.net-disk {
			id = "CjHEEF-viBE-UHvg-qIhW-Muk1-9m53-2tMH00"
			status = ["READ", "WRITE", "VISIBLE"]
			segment_count = 1

			segment1 {
				start_extent = 0
				extent_count = 5120	# 20 Gigabytes

				type = "striped"
				stripe_count = 1	# linear

				stripes = [
					"pv0", 99840
				]
			}
		}

		yangc-backup {
			id = "WqcTef-vvW1-2tfa-HbUQ-NBjN-zai2-l3dd2F"
			status = ["READ", "WRITE", "VISIBLE"]
			segment_count = 1

			segment1 {
				start_extent = 0
				extent_count = 102400	# 400 Gigabytes

				type = "striped"
				stripe_count = 1	# linear

				stripes = [
					"pv0", 104960
				]
			}
		}
	}
}

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [linux-lvm] Every now and then LVM is not recognized - LVM2 on RAID10
  2008-07-23 12:07 [linux-lvm] Every now and then LVM is not recognized - LVM2 on RAID10 exxs.net - matevz sernc-urban
@ 2008-07-23 17:35 ` Lars Ellenberg
  2008-07-24 15:36   ` exxs.net - matevz sernc-urban
  0 siblings, 1 reply; 3+ messages in thread
From: Lars Ellenberg @ 2008-07-23 17:35 UTC (permalink / raw)
  To: linux-lvm

On Wed, Jul 23, 2008 at 02:07:32PM +0200, exxs.net - matevz sernc-urban wrote:
> Dear LVM-Users,
>
> i am running into a very interesting situation today, that our LVM  
> metadata is recognized only sometimes. LVM was running fine for some  
> weeks and now from today on running pvscan 10 times one after another i  
> get about 4 positive and 6 negative results (its somehow a coincidence  
> that LVM is recognized).
>
> * WORKING STATE:
> server$ pvscan
>   PV /dev/md2   VG mainvg   lvm2 [909.59 GB / 299.59 GB free]
>   Total: 1 [909.59 GB] / in use: 1 [909.59 GB] / in no VG: 0 [0   ]
>
> * NOT-WORKING STATE:
> server$ pvscan
>   No matching physical volumes found
>
> notice: Same behaviour with vgscan and lvscan.
>
> Our Setup:
> * RAID-10 /dev/md2 on /dev/sd[a-d]5
> cat /proc/mdstat
> Personalities : [raid1] [raid10]
> md2 : active raid10 sdb5[1] sda5[0] sdd5[3] sdc5[2]
>       953778688 blocks 64K chunks 2 far-copies [4/4] [UUUU]
>
> * LVM2 on top of /dev/md2 (no separate lvm partition)
> lvm metadata backup is present and it is exactly the same as backups we  
> did months ago
>
> I will attach the lvm2 metadata file to this message (vg name: mainvg)
>
> WORKING STATE:
> server$ pvscan -vv
>       Setting global/locking_type to 1
>       File-based locking selected.
>       Setting global/locking_dir to /var/lock/lvm
>       Locking /var/lock/lvm/P_global WB
>     Wiping cache of LVM-capable devices
>       /dev/sndstat: stat failed: No such file or directory
>     Wiping internal VG cache
>     Walking through all physical volumes
>       /dev/md2: size is 1907557376 sectors
>       /dev/md2: lvm2 label detected
>   PV /dev/md2   VG mainvg   lvm2 [909.59 GB / 299.59 GB free]
>   Total: 1 [909.59 GB] / in use: 1 [909.59 GB] / in no VG: 0 [0   ]
>       Unlocking /var/lock/lvm/P_global
>
> NOT-WORKING STATE:
> server$ pvscan -vv
>       Setting global/locking_type to 1
>       File-based locking selected.
>       Setting global/locking_dir to /var/lock/lvm
>       Locking /var/lock/lvm/P_global WB
>     Wiping cache of LVM-capable devices
>       /dev/sndstat: stat failed: No such file or directory
>     Wiping internal VG cache
>     Walking through all physical volumes
>       /dev/md2: size is 1907557376 sectors
>       /dev/md2: No label detected
>   No matching physical volumes found
>       Unlocking /var/lock/lvm/P_global
>
>
> Any hints?
>
> Where should it look regarding "/dev/md2: No label detected" vs.  
> "/dev/md2: lvm2 label detected"??
>
> It seems like RAID-10 is not sync'ed well maybe? But, mdstat reports all  
> disks are up.

just a shot in the dark:

for dev in /dev/sd{a,b,c,d}5; do
	md5=$(dd if=$dev iflag=direct bs=1M count=1 | md5sum)
	echo "$dev: $md5"
done

echo check > /sys/block/md2/md/sync_action

	Lars

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [linux-lvm] Every now and then LVM is not recognized - LVM2 on RAID10
  2008-07-23 17:35 ` Lars Ellenberg
@ 2008-07-24 15:36   ` exxs.net - matevz sernc-urban
  0 siblings, 0 replies; 3+ messages in thread
From: exxs.net - matevz sernc-urban @ 2008-07-24 15:36 UTC (permalink / raw)
  To: LVM general discussion and development

Lars Ellenberg schrieb:
> On Wed, Jul 23, 2008 at 02:07:32PM +0200, exxs.net - matevz sernc-urban wrote:
>>
>> i am running into a very interesting situation today, that our LVM  
>> metadata is recognized only sometimes. LVM was running fine for some  
>> weeks and now from today on running pvscan 10 times one after another i  
>> get about 4 positive and 6 negative results (its somehow a coincidence  
>> that LVM is recognized).
>>
...
>>
>> Any hints?
>>
>> Where should it look regarding "/dev/md2: No label detected" vs.  
>> "/dev/md2: lvm2 label detected"??
>>
>> It seems like RAID-10 is not sync'ed well maybe? But, mdstat reports all  
>> disks are up.
> 
> just a shot in the dark:
> 
> for dev in /dev/sd{a,b,c,d}5; do
> 	md5=$(dd if=$dev iflag=direct bs=1M count=1 | md5sum)
> 	echo "$dev: $md5"
> done

this is a RAID 10,f2 (with 2 far copies)
so the starting order of blocks is different on all 4 hdd's (and 
therefore all four md5's are different).

> 
> echo check > /sys/block/md2/md/sync_action
> 

i did this, and it finished without any failures - all disks are up and 
sync'ed again. RAID is therefore not a possible problem.

i must add that the logical volumes are in use and working. i just 
really do not want to reboot the machine as i suppose they couldn't be 
brought up again.

Any other hints of why sometimes i get "/dev/md2: No label detected" vs.
  "/dev/md2: lvm2 label detected"?

thanks in advance for any possible hints
mat

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2008-07-24 15:37 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-07-23 12:07 [linux-lvm] Every now and then LVM is not recognized - LVM2 on RAID10 exxs.net - matevz sernc-urban
2008-07-23 17:35 ` Lars Ellenberg
2008-07-24 15:36   ` exxs.net - matevz sernc-urban

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox