linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] imsm: fix checking completion of RAID10 resync
@ 2013-07-30 13:59 Pawel Baldysiak
  2013-07-30 23:22 ` NeilBrown
  0 siblings, 1 reply; 5+ messages in thread
From: Pawel Baldysiak @ 2013-07-30 13:59 UTC (permalink / raw)
  To: neilb; +Cc: linux-raid, lukasz.dorau

If one creates RAID10 with IMSM metadata the is_resync_complete
function returns '1' just when initial resync reaches 50%

IMSM version of the is_resync_complete function has been added
that handles the case of IMSM RAID10 correctly.


Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
---
 super-intel.c |   20 ++++++++++++++++++--
 1 file changed, 18 insertions(+), 2 deletions(-)

diff --git a/super-intel.c b/super-intel.c
index 4df33f4..0371713 100644
--- a/super-intel.c
+++ b/super-intel.c
@@ -1021,6 +1021,22 @@ static int is_failed(struct imsm_disk *disk)
 	return (disk->status & FAILED_DISK) == FAILED_DISK;
 }
 
+/* IMSM version of is_resync_complete helper routine
+ * to determine resync completion
+ * since MaxSector is a moving target
+ */
+static int imsm_is_resync_complete(struct mdinfo *array)
+{
+	if (array->array.level != 10) {
+		if (array->resync_start >= array->component_size)
+			return 1;
+	} else {
+		if (array->resync_start >= 2*array->component_size)
+			return 1;
+	}
+	return 0;
+}
+
 /* try to determine how much space is reserved for metadata from
  * the last get_extents() entry on the smallest active disk,
  * otherwise fallback to the default
@@ -7119,12 +7135,12 @@ static int imsm_set_array_state(struct active_array *a, int consistent)
 		handle_missing(super, dev);
 
 	if (consistent == 2 &&
-	    (!is_resync_complete(&a->info) ||
+	    (!imsm_is_resync_complete(&a->info) ||
 	     map_state != IMSM_T_STATE_NORMAL ||
 	     dev->vol.migr_state))
 		consistent = 0;
 
-	if (is_resync_complete(&a->info)) {
+	if (imsm_is_resync_complete(&a->info)) {
 		/* complete intialization / resync,
 		 * recovery and interrupted recovery is completed in
 		 * ->set_disk


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH] imsm: fix checking completion of RAID10 resync
  2013-07-30 13:59 [PATCH] imsm: fix checking completion of RAID10 resync Pawel Baldysiak
@ 2013-07-30 23:22 ` NeilBrown
  2013-08-01  8:46   ` Dorau, Lukasz
  0 siblings, 1 reply; 5+ messages in thread
From: NeilBrown @ 2013-07-30 23:22 UTC (permalink / raw)
  To: Pawel Baldysiak; +Cc: linux-raid, lukasz.dorau

[-- Attachment #1: Type: text/plain, Size: 3404 bytes --]

On Tue, 30 Jul 2013 15:59:25 +0200 Pawel Baldysiak
<pawel.baldysiak@intel.com> wrote:

> If one creates RAID10 with IMSM metadata the is_resync_complete
> function returns '1' just when initial resync reaches 50%
> 
> IMSM version of the is_resync_complete function has been added
> that handles the case of IMSM RAID10 correctly.
> 
> 
> Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> ---
>  super-intel.c |   20 ++++++++++++++++++--
>  1 file changed, 18 insertions(+), 2 deletions(-)
> 
> diff --git a/super-intel.c b/super-intel.c
> index 4df33f4..0371713 100644
> --- a/super-intel.c
> +++ b/super-intel.c
> @@ -1021,6 +1021,22 @@ static int is_failed(struct imsm_disk *disk)
>  	return (disk->status & FAILED_DISK) == FAILED_DISK;
>  }
>  
> +/* IMSM version of is_resync_complete helper routine
> + * to determine resync completion
> + * since MaxSector is a moving target
> + */
> +static int imsm_is_resync_complete(struct mdinfo *array)
> +{
> +	if (array->array.level != 10) {
> +		if (array->resync_start >= array->component_size)
> +			return 1;
> +	} else {
> +		if (array->resync_start >= 2*array->component_size)
> +			return 1;
> +	}
> +	return 0;
> +}
> +
>  /* try to determine how much space is reserved for metadata from
>   * the last get_extents() entry on the smallest active disk,
>   * otherwise fallback to the default
> @@ -7119,12 +7135,12 @@ static int imsm_set_array_state(struct active_array *a, int consistent)
>  		handle_missing(super, dev);
>  
>  	if (consistent == 2 &&
> -	    (!is_resync_complete(&a->info) ||
> +	    (!imsm_is_resync_complete(&a->info) ||
>  	     map_state != IMSM_T_STATE_NORMAL ||
>  	     dev->vol.migr_state))
>  		consistent = 0;
>  
> -	if (is_resync_complete(&a->info)) {
> +	if (imsm_is_resync_complete(&a->info)) {
>  		/* complete intialization / resync,
>  		 * recovery and interrupted recovery is completed in
>  		 * ->set_disk


Thanks.
However the bug is not specific to intel, so should be fixed in common code.
And "2*" is not really very general.

The following patch should fix it properly.  If you can confirm that it fixes
the problem for you I would appreciate it.

Thanks,
NeilBrown

From 71d68ff62f945254240575cd836f5f2a09ced5d2 Mon Sep 17 00:00:00 2001
From: NeilBrown <neilb@suse.de>
Date: Wed, 31 Jul 2013 09:18:57 +1000
Subject: [PATCH] Fix is_resync_complete for RAID10

For RAID10, 'sync' numbers go up to the array size rather than the
component size.  is_resync_complete() needs to allow for this.

Reported-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

diff --git a/mdmon.h b/mdmon.h
index 60fda38..5a8e120 100644
--- a/mdmon.h
+++ b/mdmon.h
@@ -91,7 +91,21 @@ extern int monitor_loop_cnt;
  */
 static inline int is_resync_complete(struct mdinfo *array)
 {
-	if (array->resync_start >= array->component_size)
-		return 1;
-	return 0;
+	unsigned long long sync_size = 0;
+	int ncopies, l;
+	switch(array->array.level) {
+	case 1:
+	case 4:
+	case 5:
+	case 6:
+		sync_size = array->component_size;
+		break;
+	case 10:
+		l = array->array.layout;
+		ncopies = (l & 0xff) * ((l >> 8) && 0xff);
+		sync_size = array->component_size * array->array.raid_disks;
+		sync_size /= ncopies;
+		break;
+	}
+	return array->resync_start >= sync_size;
 }


[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply related	[flat|nested] 5+ messages in thread

* RE: [PATCH] imsm: fix checking completion of RAID10 resync
  2013-07-30 23:22 ` NeilBrown
@ 2013-08-01  8:46   ` Dorau, Lukasz
  2013-08-01 12:32     ` Dorau, Lukasz
  0 siblings, 1 reply; 5+ messages in thread
From: Dorau, Lukasz @ 2013-08-01  8:46 UTC (permalink / raw)
  To: NeilBrown, Baldysiak, Pawel; +Cc: linux-raid@vger.kernel.org

On Wednesday, July 31, 2013 1:23 AM NeilBrown <neilb@suse.de> wrote:
> On Tue, 30 Jul 2013 15:59:25 +0200 Pawel Baldysiak
> <pawel.baldysiak@intel.com> wrote:
> 
> > If one creates RAID10 with IMSM metadata the is_resync_complete
> > function returns '1' just when initial resync reaches 50%
> >
> > IMSM version of the is_resync_complete function has been added
> > that handles the case of IMSM RAID10 correctly.
> >
> >
> > Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> > ---
> >  super-intel.c |   20 ++++++++++++++++++--
> >  1 file changed, 18 insertions(+), 2 deletions(-)
> >
> > diff --git a/super-intel.c b/super-intel.c
> > index 4df33f4..0371713 100644
> > --- a/super-intel.c
> > +++ b/super-intel.c
> > @@ -1021,6 +1021,22 @@ static int is_failed(struct imsm_disk *disk)
> >  	return (disk->status & FAILED_DISK) == FAILED_DISK;
> >  }
> >
> > +/* IMSM version of is_resync_complete helper routine
> > + * to determine resync completion
> > + * since MaxSector is a moving target
> > + */
> > +static int imsm_is_resync_complete(struct mdinfo *array)
> > +{
> > +	if (array->array.level != 10) {
> > +		if (array->resync_start >= array->component_size)
> > +			return 1;
> > +	} else {
> > +		if (array->resync_start >= 2*array->component_size)
> > +			return 1;
> > +	}
> > +	return 0;
> > +}
> > +
> >  /* try to determine how much space is reserved for metadata from
> >   * the last get_extents() entry on the smallest active disk,
> >   * otherwise fallback to the default
> > @@ -7119,12 +7135,12 @@ static int imsm_set_array_state(struct
> active_array *a, int consistent)
> >  		handle_missing(super, dev);
> >
> >  	if (consistent == 2 &&
> > -	    (!is_resync_complete(&a->info) ||
> > +	    (!imsm_is_resync_complete(&a->info) ||
> >  	     map_state != IMSM_T_STATE_NORMAL ||
> >  	     dev->vol.migr_state))
> >  		consistent = 0;
> >
> > -	if (is_resync_complete(&a->info)) {
> > +	if (imsm_is_resync_complete(&a->info)) {
> >  		/* complete intialization / resync,
> >  		 * recovery and interrupted recovery is completed in
> >  		 * ->set_disk
> 
> 
> Thanks.
> However the bug is not specific to intel, so should be fixed in common code.
> And "2*" is not really very general.
> 
> The following patch should fix it properly.  If you can confirm that it fixes
> the problem for you I would appreciate it.
> 

Hi

Since Pawel is out of office I am responding instead of him.
Your patch does not work for IMSM now, because ncopies is equal 0 and mdmon crashes after dividing by zero - see details below:

> Thanks,
> NeilBrown
> 
> From 71d68ff62f945254240575cd836f5f2a09ced5d2 Mon Sep 17 00:00:00 2001
> From: NeilBrown <neilb@suse.de>
> Date: Wed, 31 Jul 2013 09:18:57 +1000
> Subject: [PATCH] Fix is_resync_complete for RAID10
> 
> For RAID10, 'sync' numbers go up to the array size rather than the
> component size.  is_resync_complete() needs to allow for this.
> 
> Reported-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> Signed-off-by: NeilBrown <neilb@suse.de>
> 
> diff --git a/mdmon.h b/mdmon.h
> index 60fda38..5a8e120 100644
> --- a/mdmon.h
> +++ b/mdmon.h
> @@ -91,7 +91,21 @@ extern int monitor_loop_cnt;
>   */
>  static inline int is_resync_complete(struct mdinfo *array)
>  {
> -	if (array->resync_start >= array->component_size)
> -		return 1;
> -	return 0;
> +	unsigned long long sync_size = 0;
> +	int ncopies, l;
> +	switch(array->array.level) {
> +	case 1:
> +	case 4:
> +	case 5:
> +	case 6:
> +		sync_size = array->component_size;
> +		break;
> +	case 10:
> +		l = array->array.layout;
> +		ncopies = (l & 0xff) * ((l >> 8) && 0xff);
> +		sync_size = array->component_size * array->array.raid_disks;

At this point of code the following variables equal
(I have created RAID10 volume of size z=10G):

array->array.layout = 0 
ncopies = 0 
array->component_size = 20971520 
array->array.raid_disks = 4 
sync_size = 83886080
array->resync_start = 0

I will check why array->array.layout is equal 0.

Regards,
Lukasz

> +		sync_size /= ncopies;
> +		break;
> +	}
> +	return array->resync_start >= sync_size;
>  }


^ permalink raw reply	[flat|nested] 5+ messages in thread

* RE: [PATCH] imsm: fix checking completion of RAID10 resync
  2013-08-01  8:46   ` Dorau, Lukasz
@ 2013-08-01 12:32     ` Dorau, Lukasz
  2013-08-05  5:43       ` NeilBrown
  0 siblings, 1 reply; 5+ messages in thread
From: Dorau, Lukasz @ 2013-08-01 12:32 UTC (permalink / raw)
  To: NeilBrown, Baldysiak, Pawel; +Cc: linux-raid@vger.kernel.org

On Thursday, August 01, 2013 10:47 AM Lukasz Dorau <lukasz.dorau@intel.com> wrote:
> On Wednesday, July 31, 2013 1:23 AM NeilBrown <neilb@suse.de> wrote:
> > On Tue, 30 Jul 2013 15:59:25 +0200 Pawel Baldysiak
> > <pawel.baldysiak@intel.com> wrote:
> >
> > > If one creates RAID10 with IMSM metadata the is_resync_complete
> > > function returns '1' just when initial resync reaches 50%
> > >
> > > IMSM version of the is_resync_complete function has been added
> > > that handles the case of IMSM RAID10 correctly.
> > >
> > >
> > > Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> > > ---
> > >  super-intel.c |   20 ++++++++++++++++++--
> > >  1 file changed, 18 insertions(+), 2 deletions(-)
> > >
> > > diff --git a/super-intel.c b/super-intel.c
> > > index 4df33f4..0371713 100644
> > > --- a/super-intel.c
> > > +++ b/super-intel.c
> > > @@ -1021,6 +1021,22 @@ static int is_failed(struct imsm_disk *disk)
> > >  	return (disk->status & FAILED_DISK) == FAILED_DISK;
> > >  }
> > >
> > > +/* IMSM version of is_resync_complete helper routine
> > > + * to determine resync completion
> > > + * since MaxSector is a moving target
> > > + */
> > > +static int imsm_is_resync_complete(struct mdinfo *array)
> > > +{
> > > +	if (array->array.level != 10) {
> > > +		if (array->resync_start >= array->component_size)
> > > +			return 1;
> > > +	} else {
> > > +		if (array->resync_start >= 2*array->component_size)
> > > +			return 1;
> > > +	}
> > > +	return 0;
> > > +}
> > > +
> > >  /* try to determine how much space is reserved for metadata from
> > >   * the last get_extents() entry on the smallest active disk,
> > >   * otherwise fallback to the default
> > > @@ -7119,12 +7135,12 @@ static int imsm_set_array_state(struct
> > active_array *a, int consistent)
> > >  		handle_missing(super, dev);
> > >
> > >  	if (consistent == 2 &&
> > > -	    (!is_resync_complete(&a->info) ||
> > > +	    (!imsm_is_resync_complete(&a->info) ||
> > >  	     map_state != IMSM_T_STATE_NORMAL ||
> > >  	     dev->vol.migr_state))
> > >  		consistent = 0;
> > >
> > > -	if (is_resync_complete(&a->info)) {
> > > +	if (imsm_is_resync_complete(&a->info)) {
> > >  		/* complete intialization / resync,
> > >  		 * recovery and interrupted recovery is completed in
> > >  		 * ->set_disk
> >
> >
> > Thanks.
> > However the bug is not specific to intel, so should be fixed in common code.
> > And "2*" is not really very general.
> >
> > The following patch should fix it properly.  If you can confirm that it fixes
> > the problem for you I would appreciate it.
> >
> 
> Hi
> 
> Since Pawel is out of office I am responding instead of him.
> Your patch does not work for IMSM now, because ncopies is equal 0 and
> mdmon crashes after dividing by zero - see details below:
> 
> > Thanks,
> > NeilBrown
> >
> > From 71d68ff62f945254240575cd836f5f2a09ced5d2 Mon Sep 17 00:00:00
> 2001
> > From: NeilBrown <neilb@suse.de>
> > Date: Wed, 31 Jul 2013 09:18:57 +1000
> > Subject: [PATCH] Fix is_resync_complete for RAID10
> >
> > For RAID10, 'sync' numbers go up to the array size rather than the
> > component size.  is_resync_complete() needs to allow for this.
> >
> > Reported-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> > Signed-off-by: NeilBrown <neilb@suse.de>
> >
> > diff --git a/mdmon.h b/mdmon.h
> > index 60fda38..5a8e120 100644
> > --- a/mdmon.h
> > +++ b/mdmon.h
> > @@ -91,7 +91,21 @@ extern int monitor_loop_cnt;
> >   */
> >  static inline int is_resync_complete(struct mdinfo *array)
> >  {
> > -	if (array->resync_start >= array->component_size)
> > -		return 1;
> > -	return 0;
> > +	unsigned long long sync_size = 0;
> > +	int ncopies, l;
> > +	switch(array->array.level) {
> > +	case 1:
> > +	case 4:
> > +	case 5:
> > +	case 6:
> > +		sync_size = array->component_size;
> > +		break;
> > +	case 10:
> > +		l = array->array.layout;
> > +		ncopies = (l & 0xff) * ((l >> 8) && 0xff);
> > +		sync_size = array->component_size * array->array.raid_disks;
> 
> At this point of code the following variables equal
> (I have created RAID10 volume of size z=10G):
> 
> array->array.layout = 0
> ncopies = 0
> array->component_size = 20971520
> array->array.raid_disks = 4
> sync_size = 83886080
> array->resync_start = 0
> 
> I will check why array->array.layout is equal 0.
> 

There is another, more serious, problem.
When we stop the array during initial resync (mdadm -Ss) 
and the function is_resync_complete() is entered for the last time, 
array->array.raid_disks already equals 0, because it is zero'ed by manager:
        a->info.array.raid_disks = mdstat->raid_disks;
at managemon.c:454.
As a result sync_size equals 0 and is_resync_complete() incorrectly returns 1 and resync finishes...

It seems to be a race condition between monitor and manager - manager changes value of array.raid_disks too fast.

Regards,
Lukasz

> 
> > +		sync_size /= ncopies;
> > +		break;
> > +	}
> > +	return array->resync_start >= sync_size;
> >  }
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] imsm: fix checking completion of RAID10 resync
  2013-08-01 12:32     ` Dorau, Lukasz
@ 2013-08-05  5:43       ` NeilBrown
  0 siblings, 0 replies; 5+ messages in thread
From: NeilBrown @ 2013-08-05  5:43 UTC (permalink / raw)
  To: Dorau, Lukasz; +Cc: Baldysiak, Pawel, linux-raid@vger.kernel.org

[-- Attachment #1: Type: text/plain, Size: 1878 bytes --]

On Thu, 1 Aug 2013 12:32:50 +0000 "Dorau, Lukasz" <lukasz.dorau@intel.com>
wrote:

> 
> There is another, more serious, problem.
> When we stop the array during initial resync (mdadm -Ss) 
> and the function is_resync_complete() is entered for the last time, 
> array->array.raid_disks already equals 0, because it is zero'ed by manager:
>         a->info.array.raid_disks = mdstat->raid_disks;
> at managemon.c:454.
> As a result sync_size equals 0 and is_resync_complete() incorrectly returns 1 and resync finishes...
> 
> It seems to be a race condition between monitor and manager - manager changes value of array.raid_disks too fast.

Yes - that is a serious problem.  Thanks for reporting it.
I think this is the correct fix.

Thanks,
NeilBrown


From e49a8a80265ab2150c96b636450f5825bcd69d4a Mon Sep 17 00:00:00 2001
From: NeilBrown <neilb@suse.de>
Date: Mon, 5 Aug 2013 15:40:16 +1000
Subject: [PATCH] mdmon: don't use 'ghost' values from an inactive array.

It is possible for mdmon to see (in /proc/mdstat) and array
in 'inactive' state, "mdadm -S" has written "inactive" to
"array_state".

In this state values such as "raid_disk" are not meaningful
and so should be ignored by manage_member().

Reported-by: "Dorau, Lukasz" <lukasz.dorau@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

diff --git a/managemon.c b/managemon.c
index c245655..f40bbdb 100644
--- a/managemon.c
+++ b/managemon.c
@@ -450,9 +450,11 @@ static void manage_member(struct mdstat_ent *mdstat,
 		/* Raced with something */
 		return;
 
-	// FIXME
-	a->info.array.raid_disks = mdstat->raid_disks;
-	// MORE
+	if (mdstat->active) {
+		// FIXME
+		a->info.array.raid_disks = mdstat->raid_disks;
+		// MORE
+	}
 
 	if (sysfs_get_ll(&a->info, NULL, "component_size", &component_size) >= 0)
 		a->info.component_size = component_size << 1;

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply related	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2013-08-05  5:43 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-07-30 13:59 [PATCH] imsm: fix checking completion of RAID10 resync Pawel Baldysiak
2013-07-30 23:22 ` NeilBrown
2013-08-01  8:46   ` Dorau, Lukasz
2013-08-01 12:32     ` Dorau, Lukasz
2013-08-05  5:43       ` NeilBrown

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).