linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed
From: Michael Ellerman <michael@ellerman.id.au>
To: Neil Horman <nhorman@tuxdriver.com>
Cc: Michael Neuling <mikey@neuling.org>,
	linuxppc-dev@ozlabs.org, linux-kernel@vger.kernel.org,
	Neil Horman <nhorman@gmail.com>,
	Arjen Van De Ven <arjanvandeven@gmail.com>,
	arjan@linux.intel.com
Subject: Re: [PATCH] irqbalance, powerpc: add IRQs without settable SMP affinity to banned list
Date: Fri, 24 Sep 2010 15:03:42 +1000	[thread overview]
Message-ID: <1285304622.7637.55.camel@concordia> (raw)
In-Reply-To: <20100923131302.GA6560@shamino.rdu.redhat.com>

[-- Attachment #1: Type: text/plain, Size: 4041 bytes --]

On Thu, 2010-09-23 at 09:13 -0400, Neil Horman wrote:
> On Thu, Sep 23, 2010 at 08:57:20PM +1000, Michael Neuling wrote:
> > 
> > > > +		if (fwrite(line, strlen(line) - 1, 1, file) == 0)
> > > 
> > > if (fputs(line, file) == EOF)
> > 
> > Good point thanks... new patch below
> > 
> > Mikey
> > 
> > irqbalance, powerpc: add IRQs without settable SMP affinity to banned list
> > 
> > On pseries powerpc, IPIs are registered with an IRQ number so
> > /proc/interrupts looks like this on a 2 core/2 thread machine:
> > 
> >            CPU0       CPU1       CPU2       CPU3
> >  16:    3164282    3290514    1138794     983121   XICS             Level        IPI
> >  18:    2605674          0     304994          0   XICS             Level        lan0
> >  30:     400057          0     169209          0   XICS             Level        ibmvscsi
> > LOC:     133734      77250     106425      91951   Local timer interrupts
> > SPU:          0          0          0          0   Spurious interrupts
> > CNT:          0          0          0          0   Performance monitoring interrupts
> > MCE:          0          0          0          0   Machine check exceptions
> > 
> > Unfortunately this means irqbalance attempts to set the affinity of IPIs
> > which is not possible.  So in the above case, when irqbalance is in
> > performance mode due to heavy IPI, lan0 and ibmvscsi activity, it
> > sometimes attempts to put the IPIs on one core (CPU0&1) and lan0 and
> > ibmvscsi on the other core (CPU2&3).  This is suboptimal as we want lan0
> > and ibmvscsi to be on separate cores and IPIs to be ignored.
> > 
> > When irqblance attempts writes to the IPI smp_affinity (ie.
> > /proc/irq/16/smp_affinity in the above example) it fails but irqbalance
> > ignores currently ignores this.
> > 
> > This patch catches these write fails and in this case adds that IRQ
> > number to the banned IRQ list.  This will catch the above IPI case and
> > any other IRQ where the SMP affinity can't be set.
> > 
> > Tested on POWER6, POWER7 and x86.
> > 
> > Signed-off-by: Michael Neuling <mikey@neuling.org>
> > 
> > Index: irqbalance/irqlist.c
> > ===================================================================
> > --- irqbalance.orig/irqlist.c
> > +++ irqbalance/irqlist.c
> > @@ -67,7 +67,7 @@
> >  	DIR *dir;
> >  	struct dirent *entry;
> >  	char *c, *c2;
> > -	int nr , count = 0;
> > +	int nr , count = 0, can_set = 1;
> >  	char buf[PATH_MAX];
> >  	sprintf(buf, "/proc/irq/%i", number);
> >  	dir = opendir(buf);
> > @@ -80,7 +80,7 @@
> >  			size_t size = 0;
> >  			FILE *file;
> >  			sprintf(buf, "/proc/irq/%i/smp_affinity", number);
> > -			file = fopen(buf, "r");
> > +			file = fopen(buf, "r+");
> >  			if (!file)
> >  				continue;
> >  			if (getline(&line, &size, file)==0) {
> > @@ -89,7 +89,14 @@
> >  				continue;
> >  			}
> >  			cpumask_parse_user(line, strlen(line), irq->mask);
> > -			fclose(file);
> > +			/*
> > +			 * Check that we can write the affinity, if
> > +			 * not take it out of the list.
> > +			 */
> > +			if (fputs(line, file) == EOF)
> > +				can_set = 0;

> This is maybe a nit, but writing to the affinity file can fail for a few
> different reasons, some of them permanent, some transient.  For instance, if
> we're in a memory constrained condition temporarily irq_affinity_proc_write
> might return -ENOMEM.  

Yeah true, usually followed shortly by your kernel going so far into
swap you never get it back, or OOMing, but I guess it's possible.

> Might it be better to modify this code so that, instead
> of using fputs to merge the various errors into an EOF, we use some other write
> method that lets us better determine the error and selectively ban the interrupt
> only for those errors which we consider permanent?

Yep. It seems fputs() gives you know way to get the actual error from
write(), so it looks we'll need to switch to open/write, but that's
probably not so terrible.

cheers


[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

  reply	other threads:[~2010-09-24  5:03 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-09-22  6:04 [PATCH] irqbalance, powerpc: add IRQs without settable SMP affinity to banned list Michael Neuling
2010-09-23  8:22 ` Michael Ellerman
2010-09-23 10:57   ` Michael Neuling
2010-09-23 13:13     ` Neil Horman
2010-09-24  5:03       ` Michael Ellerman [this message]
2010-09-24  6:56         ` Michael Neuling
2010-09-24 10:37           ` Neil Horman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1285304622.7637.55.camel@concordia \
    --to=michael@ellerman.id.au \
    --cc=arjan@linux.intel.com \
    --cc=arjanvandeven@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linuxppc-dev@ozlabs.org \
    --cc=mikey@neuling.org \
    --cc=nhorman@gmail.com \
    --cc=nhorman@tuxdriver.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).