* [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications
@ 2014-05-30 13:18 Artur Paszkiewicz
2014-06-02 2:36 ` NeilBrown
0 siblings, 1 reply; 4+ messages in thread
From: Artur Paszkiewicz @ 2014-05-30 13:18 UTC (permalink / raw)
To: neilb; +Cc: linux-raid, pawel.baldysiak, Artur Paszkiewicz
If the checksum verification fails in mdadm and mdmon is running, retry
the load to get a consistent snapshot of the mpb.
Based on db575f3b
Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com>
Reviewed-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
---
super-intel.c | 17 +++++++++++++++++
1 file changed, 17 insertions(+)
diff --git a/super-intel.c b/super-intel.c
index f0a7ab5..037c018 100644
--- a/super-intel.c
+++ b/super-intel.c
@@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
{
struct intel_super *super;
int rv;
+ int retry;
if (test_partition(fd))
/* IMSM not allowed on partitions */
@@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
}
rv = load_and_parse_mpb(fd, super, devname, 0);
+ /* retry the load if we might have raced against mdmon */
+ if (rv == 3) {
+ struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
+
+ if (mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
+ for (retry = 0; retry < 3; retry++) {
+ usleep(3000);
+ rv = load_and_parse_mpb(fd, super, devname, 0);
+ if (rv != 3)
+ break;
+ }
+ }
+
+ free_mdstat(mdstat);
+ }
+
if (rv) {
if (devname)
pr_err("Failed to load all information "
--
1.8.4.5
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications
2014-05-30 13:18 [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications Artur Paszkiewicz
@ 2014-06-02 2:36 ` NeilBrown
2014-06-02 13:02 ` Artur Paszkiewicz
0 siblings, 1 reply; 4+ messages in thread
From: NeilBrown @ 2014-06-02 2:36 UTC (permalink / raw)
To: Artur Paszkiewicz; +Cc: linux-raid, pawel.baldysiak
[-- Attachment #1: Type: text/plain, Size: 1776 bytes --]
On Fri, 30 May 2014 15:18:33 +0200 Artur Paszkiewicz
<artur.paszkiewicz@intel.com> wrote:
> If the checksum verification fails in mdadm and mdmon is running, retry
> the load to get a consistent snapshot of the mpb.
>
> Based on db575f3b
>
> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com>
> Reviewed-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> ---
> super-intel.c | 17 +++++++++++++++++
> 1 file changed, 17 insertions(+)
>
> diff --git a/super-intel.c b/super-intel.c
> index f0a7ab5..037c018 100644
> --- a/super-intel.c
> +++ b/super-intel.c
> @@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
> {
> struct intel_super *super;
> int rv;
> + int retry;
>
> if (test_partition(fd))
> /* IMSM not allowed on partitions */
> @@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
> }
> rv = load_and_parse_mpb(fd, super, devname, 0);
>
> + /* retry the load if we might have raced against mdmon */
> + if (rv == 3) {
> + struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
> +
> + if (mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
> + for (retry = 0; retry < 3; retry++) {
> + usleep(3000);
> + rv = load_and_parse_mpb(fd, super, devname, 0);
> + if (rv != 3)
> + break;
> + }
> + }
The only thing you use from mdstat is devnm, and that is the thing you passed
to mdstat_by_component to get mdstat....
Can you just do
char *devnm = fd2devnm(fd);
if (mdmon_running(devnm) && ......)
??
NeilBrown
> +
> + free_mdstat(mdstat);
> + }
> +
> if (rv) {
> if (devname)
> pr_err("Failed to load all information "
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications
2014-06-02 2:36 ` NeilBrown
@ 2014-06-02 13:02 ` Artur Paszkiewicz
2014-06-02 23:10 ` NeilBrown
0 siblings, 1 reply; 4+ messages in thread
From: Artur Paszkiewicz @ 2014-06-02 13:02 UTC (permalink / raw)
To: NeilBrown; +Cc: linux-raid, Baldysiak, Pawel
On 06/02/2014 04:36 AM, NeilBrown wrote:
> On Fri, 30 May 2014 15:18:33 +0200 Artur Paszkiewicz
> <artur.paszkiewicz@intel.com> wrote:
>
>> If the checksum verification fails in mdadm and mdmon is running, retry
>> the load to get a consistent snapshot of the mpb.
>>
>> Based on db575f3b
>>
>> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com>
>> Reviewed-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
>> ---
>> super-intel.c | 17 +++++++++++++++++
>> 1 file changed, 17 insertions(+)
>>
>> diff --git a/super-intel.c b/super-intel.c
>> index f0a7ab5..037c018 100644
>> --- a/super-intel.c
>> +++ b/super-intel.c
>> @@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
>> {
>> struct intel_super *super;
>> int rv;
>> + int retry;
>>
>> if (test_partition(fd))
>> /* IMSM not allowed on partitions */
>> @@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
>> }
>> rv = load_and_parse_mpb(fd, super, devname, 0);
>>
>> + /* retry the load if we might have raced against mdmon */
>> + if (rv == 3) {
>> + struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
>> +
>> + if (mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
>> + for (retry = 0; retry < 3; retry++) {
>> + usleep(3000);
>> + rv = load_and_parse_mpb(fd, super, devname, 0);
>> + if (rv != 3)
>> + break;
>> + }
>> + }
>
> The only thing you use from mdstat is devnm, and that is the thing you passed
> to mdstat_by_component to get mdstat....
>
> Can you just do
> char *devnm = fd2devnm(fd);
> if (mdmon_running(devnm) && ......)
>
> ??
>
I can't do that because mdmon_running and mdmon_pid need a devnm of a
container device, and the only thing we have here is the file descriptor
of a component device. So I used mdstat_by_component to get the
container devnm. Do you have an idea how to get that reliably without
reading mdstat?
I have overlooked that mdstat_by_component can return NULL here. I've
added a check for this in the patch below.
Thanks,
Artur
From dfb12870a482654b405ec1d4d9d3a8ba69a6290c Mon Sep 17 00:00:00 2001
From: Artur Paszkiewicz <artur.paszkiewicz@intel.com>
Date: Tue, 27 May 2014 15:30:54 +0200
Subject: [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made
modifications
If the checksum verification fails in mdadm and mdmon is running, retry
the load to get a consistent snapshot of the mpb.
Based on db575f3b
Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com>
---
super-intel.c | 17 +++++++++++++++++
1 file changed, 17 insertions(+)
diff --git a/super-intel.c b/super-intel.c
index f0a7ab5..9dd807a 100644
--- a/super-intel.c
+++ b/super-intel.c
@@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
{
struct intel_super *super;
int rv;
+ int retry;
if (test_partition(fd))
/* IMSM not allowed on partitions */
@@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
}
rv = load_and_parse_mpb(fd, super, devname, 0);
+ /* retry the load if we might have raced against mdmon */
+ if (rv == 3) {
+ struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
+
+ if (mdstat && mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
+ for (retry = 0; retry < 3; retry++) {
+ usleep(3000);
+ rv = load_and_parse_mpb(fd, super, devname, 0);
+ if (rv != 3)
+ break;
+ }
+ }
+
+ free_mdstat(mdstat);
+ }
+
if (rv) {
if (devname)
pr_err("Failed to load all information "
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications
2014-06-02 13:02 ` Artur Paszkiewicz
@ 2014-06-02 23:10 ` NeilBrown
0 siblings, 0 replies; 4+ messages in thread
From: NeilBrown @ 2014-06-02 23:10 UTC (permalink / raw)
To: Artur Paszkiewicz; +Cc: linux-raid, Baldysiak, Pawel
[-- Attachment #1: Type: text/plain, Size: 2472 bytes --]
On Mon, 02 Jun 2014 15:02:59 +0200 Artur Paszkiewicz
<artur.paszkiewicz@intel.com> wrote:
> On 06/02/2014 04:36 AM, NeilBrown wrote:
> > On Fri, 30 May 2014 15:18:33 +0200 Artur Paszkiewicz
> > <artur.paszkiewicz@intel.com> wrote:
> >
> >> If the checksum verification fails in mdadm and mdmon is running, retry
> >> the load to get a consistent snapshot of the mpb.
> >>
> >> Based on db575f3b
> >>
> >> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com>
> >> Reviewed-by: Pawel Baldysiak <pawel.baldysiak@intel.com>
> >> ---
> >> super-intel.c | 17 +++++++++++++++++
> >> 1 file changed, 17 insertions(+)
> >>
> >> diff --git a/super-intel.c b/super-intel.c
> >> index f0a7ab5..037c018 100644
> >> --- a/super-intel.c
> >> +++ b/super-intel.c
> >> @@ -4422,6 +4422,7 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
> >> {
> >> struct intel_super *super;
> >> int rv;
> >> + int retry;
> >>
> >> if (test_partition(fd))
> >> /* IMSM not allowed on partitions */
> >> @@ -4444,6 +4445,22 @@ static int load_super_imsm(struct supertype *st, int fd, char *devname)
> >> }
> >> rv = load_and_parse_mpb(fd, super, devname, 0);
> >>
> >> + /* retry the load if we might have raced against mdmon */
> >> + if (rv == 3) {
> >> + struct mdstat_ent *mdstat = mdstat_by_component(fd2devnm(fd));
> >> +
> >> + if (mdmon_running(mdstat->devnm) && getpid() != mdmon_pid(mdstat->devnm)) {
> >> + for (retry = 0; retry < 3; retry++) {
> >> + usleep(3000);
> >> + rv = load_and_parse_mpb(fd, super, devname, 0);
> >> + if (rv != 3)
> >> + break;
> >> + }
> >> + }
> >
> > The only thing you use from mdstat is devnm, and that is the thing you passed
> > to mdstat_by_component to get mdstat....
> >
> > Can you just do
> > char *devnm = fd2devnm(fd);
> > if (mdmon_running(devnm) && ......)
> >
> > ??
> >
> I can't do that because mdmon_running and mdmon_pid need a devnm of a
> container device, and the only thing we have here is the file descriptor
> of a component device. So I used mdstat_by_component to get the
> container devnm. Do you have an idea how to get that reliably without
> reading mdstat?
>
> I have overlooked that mdstat_by_component can return NULL here. I've
> added a check for this in the patch below.
Right, of course, yes.
I've applied your patch - and thanks for the updated version.
NeilBrown
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2014-06-02 23:10 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-05-30 13:18 [PATCH] imsm: retry load_and_parse_mpb if we suspect mdmon has made modifications Artur Paszkiewicz
2014-06-02 2:36 ` NeilBrown
2014-06-02 13:02 ` Artur Paszkiewicz
2014-06-02 23:10 ` NeilBrown
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).