linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: NeilBrown <neilb@suse.de>
To: Adam Kwolek <adam.kwolek@intel.com>
Cc: linux-raid@vger.kernel.org, ed.ciechanowski@intel.com,
	marcin.labun@intel.com, dan.j.williams@intel.com
Subject: Re: [PATCH 1/4] FIX: Takeover mdmon when grow is continued from command line
Date: Mon, 30 Jan 2012 11:24:17 +1100	[thread overview]
Message-ID: <20120130112417.3a30e18e@notabene.brown> (raw)
In-Reply-To: <20120112071231.5168.62971.stgit@gklab-128-013.igk.intel.com>

[-- Attachment #1: Type: text/plain, Size: 4289 bytes --]

On Thu, 12 Jan 2012 08:12:31 +0100 Adam Kwolek <adam.kwolek@intel.com> wrote:

> When reshape with '--continue' option is executed mdadm doesn't know,
> if mdmon is switched to using '--takeover' option to current file system
> context. To resolve this, on reshape continuation when mdmon is run,
> execute mdmon takeover.
> 
> Lack of takeovered mdmon can cause e.g.:
> - multivolume reshape is broken when reshaped volume is changed
> - size of reshaped volume dowsn't increase

I don't think this is right.

In particular, the way we seem to be going with systemd means that we might
not end up using '--takeover' at all.  So performing a --takeover
transparently without being explicitly asked to is not a good idea.

--continue is only used once the final root is mounted and all the early boot
complications are out of the way.
So can't we simple require that --takeover - if it is being used - be
performed before any --continue??

i.e. is there a reason we cannot ensure this works properly simply by making
sure boot scripts do the right things in the right order.

NeilBrown


> 
> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
> ---
> 
>  Grow.c  |    2 ++
>  mdadm.h |    1 +
>  util.c  |   53 ++++++++++++++++++++++++++++++++++++++++++++---------
>  3 files changed, 47 insertions(+), 9 deletions(-)
> 
> diff --git a/Grow.c b/Grow.c
> index b2c1360..f0422d2 100644
> --- a/Grow.c
> +++ b/Grow.c
> @@ -3876,6 +3876,8 @@ int Grow_continue_command(char *devname, int fd,
>  		 */
>  		if (!mdmon_running(container_dev))
>  			start_mdmon(container_dev);
> +		else
> +			takeover_mdmon(container_dev);
>  		ping_monitor(buf);
>  
>  		if (mdmon_running(container_dev))
> diff --git a/mdadm.h b/mdadm.h
> index 381ef86..f274ae7 100644
> --- a/mdadm.h
> +++ b/mdadm.h
> @@ -1223,6 +1223,7 @@ extern int mdmon_pid(int devnum);
>  extern int check_env(char *name);
>  extern __u32 random32(void);
>  extern int start_mdmon(int devnum);
> +extern int takeover_mdmon(int devnum);
>  
>  extern int child_monitor(int afd, struct mdinfo *sra, struct reshape *reshape,
>  			 struct supertype *st, unsigned long stripes,
> diff --git a/util.c b/util.c
> index 6985a70..e95a366 100644
> --- a/util.c
> +++ b/util.c
> @@ -1581,11 +1581,13 @@ int mdmon_running(int devnum)
>  	return 0;
>  }
>  
> -int start_mdmon(int devnum)
> +#define MDMON_TAKEOVER_WAIT_COUNTER 15
> +int execute_mdmon(int devnum, int takeover)
>  {
>  	int i, skipped;
>  	int len;
> -	pid_t pid;	
> +	pid_t pid;
> +	pid_t pid_org;
>  	int status;
>  	char pathbuf[1024];
>  	char *paths[4] = {
> @@ -1611,6 +1613,12 @@ int start_mdmon(int devnum)
>  	} else
>  		pathbuf[0] = '\0';
>  
> +	if (takeover) {
> +		pid_org = mdmon_pid(devnum);
> +		if (pid_org < 0)
> +			takeover = 0;
> +	}
> +
>  	switch(fork()) {
>  	case 0:
>  		/* FIXME yuk. CLOSE_EXEC?? */
> @@ -1620,24 +1628,51 @@ int start_mdmon(int devnum)
>  				skipped++;
>  			else
>  				skipped = 0;
> -
> -		for (i=0; paths[i]; i++)
> -			if (paths[i][0])
> -				execl(paths[i], "mdmon",
> -				      devnum2devname(devnum),
> -				      NULL);
> +		for (i = 0; paths[i]; i++) {
> +			if (paths[i][0]) {
> +				if (!takeover)
> +					execl(paths[i], "mdmon",
> +					      devnum2devname(devnum),
> +					      NULL);
> +				else
> +					execl(paths[i], "mdmon",
> +					      "--takeover",
> +					      devnum2devname(devnum),
> +					      NULL);
> +			}
> +		}
>  		exit(1);
>  	case -1: fprintf(stderr, Name ": cannot run mdmon. "
>  			 "Array remains readonly\n");
>  		return -1;
>  	default: /* parent - good */
> -		pid = wait(&status);
> +		i = 0;
> +		do {
> +			if (i)
> +				sleep(1);
> +			pid = wait(&status);
> +			if (takeover && (pid == pid_org))
> +				pid = -1;
> +			i++;
> +		} while (takeover &&
> +			 (i < MDMON_TAKEOVER_WAIT_COUNTER) &&
> +			 (pid < 0));
>  		if (pid < 0 || status != 0)
>  			return -1;
>  	}
>  	return 0;
>  }
>  
> +int start_mdmon(int devnum)
> +{
> +	return execute_mdmon(devnum, 0);
> +}
> +
> +int takeover_mdmon(int devnum)
> +{
> +	return execute_mdmon(devnum, 1);
> +}
> +
>  int check_env(char *name)
>  {
>  	char *val = getenv(name);


[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

  reply	other threads:[~2012-01-30  0:24 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-01-12  7:12 [PATCH 0/4] Reshape problems during double raid0 processing Adam Kwolek
2012-01-12  7:12 ` [PATCH 1/4] FIX: Takeover mdmon when grow is continued from command line Adam Kwolek
2012-01-30  0:24   ` NeilBrown [this message]
2012-01-30  8:20     ` Kwolek, Adam
2012-01-12  7:12 ` [PATCH 2/4] FIX: External metadata sometimes is not updated Adam Kwolek
2012-01-12  7:12 ` [PATCH 3/4] FIX: mdmon check in reshape_container() can cause a problem Adam Kwolek
2012-01-12  7:12 ` [PATCH 4/4] FIX: Typo error in fprint command Adam Kwolek
2012-01-27 13:58 ` [PATCH 0/4] Reshape problems during double raid0 processing Kwolek, Adam
2012-01-30  0:37 ` NeilBrown

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20120130112417.3a30e18e@notabene.brown \
    --to=neilb@suse.de \
    --cc=adam.kwolek@intel.com \
    --cc=dan.j.williams@intel.com \
    --cc=ed.ciechanowski@intel.com \
    --cc=linux-raid@vger.kernel.org \
    --cc=marcin.labun@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).