netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Shailabh Nagar <nagar@watson.ibm.com>
To: Andrew Morton <akpm@osdl.org>
Cc: Paul Jackson <pj@sgi.com>,
	Valdis.Kletnieks@vt.edu, jlan@engr.sgi.com, balbir@in.ibm.com,
	csturtiv@sgi.com, linux-kernel@vger.kernel.org,
	Jamal <hadi@cyberus.ca>, netdev <netdev@vger.kernel.org>
Subject: Re: [Patch][RFC] Disabling per-tgid stats on task exit in taskstats
Date: Thu, 29 Jun 2006 15:10:31 -0400	[thread overview]
Message-ID: <44A425A7.2060900@watson.ibm.com> (raw)
In-Reply-To: <20060629110107.2e56310b.akpm@osdl.org>

Andrew Morton wrote:

>On Thu, 29 Jun 2006 09:44:08 -0700
>Paul Jackson <pj@sgi.com> wrote:
>
>  
>
>>>You're probably correct on that model. However, it all depends on the actual
>>>workload. Are people who actually have large-CPU (>256) systems actually
>>>running fork()-heavy things like webservers on them, or are they running things
>>>like database servers and computations, which tend to have persistent
>>>processes?
>>>      
>>>
>>It may well be mostly as you say - the large-CPU systems not running
>>the fork() heavy jobs.
>>
>>Sooner or later, someone will want to run a fork()-heavy job on a
>>large-CPU system.  On a 1024 CPU system, it would apparently take
>>just 14 exits/sec/CPU to hit this bottleneck, if Jay's number of
>>14000 applied.
>>
>>Chris Sturdivant's reply is reasonable -- we'll hit it sooner or later,
>>and deal with it then.
>>
>>    
>>
>
>I agree, and I'm viewing this as blocking the taskstats merge.  Because if
>this _is_ a problem then it's a big one because fixing it will be
>intrusive, and might well involve userspace-visible changes.
>  
>
First off, just a reminder that this is inherently a netlink flow 
control issue...which was being exacerbated
earlier by taskstats decision to send per-tgid data (no longer the case).

But I'd like to know whats our target here ? How many messages per 
second do we want to be able to be sent
and received without risking any loss of data ? Netlink will lose 
messages at a high enough rate so the design point
will need to be known a bit.

For statistics type usage of the genetlink/netlink, I would have thought 
that userspace, provided it is reliably informed
about the loss of data through ENOBUFS, could take measures to just 
account for the missing data and carry on ?




>The only ways I can see of fixing the problem generally are to either
>
>a) throw more CPU(s) at stats collection: allow userspace to register for
>   "stats generated by CPU N", then run a stats collection daemon on each
>   CPU or
>  
>
>b) make the kernel recognise when it's getting overloaded and switch to
>   some degraded mode where it stops trying to send all the data to
>   userspace - just send a summary, or a "we goofed" message or something.
>  
>
One of the unused features of genetlink that's meant for high volume 
data output from the kernel is
the "dump" callback of a genetlink connection. Essentially kernel space 
keeps getting provided sk_buffs
to fill which the netlink layer then supplies to user space (over time I 
guess ?)

But whatever we do, there's going to be some limit so its useful to 
decide what the design point should be ?

Adding Jamal for his thoughts on netlink's flow control in the context 
of genetlink.

--Shailabh

       reply	other threads:[~2006-06-29 19:10 UTC|newest]

Thread overview: 52+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <44892610.6040001@watson.ibm.com>
     [not found] ` <20060609042129.ae97018c.akpm@osdl.org>
     [not found]   ` <4489EE7C.3080007@watson.ibm.com>
     [not found]     ` <449999D1.7000403@engr.sgi.com>
     [not found]       ` <44999A98.8030406@engr.sgi.com>
     [not found]         ` <44999F5A.2080809@watson.ibm.com>
     [not found]           ` <4499D7CD.1020303@engr.sgi.com>
     [not found]             ` <449C2181.6000007@watson.ibm.com>
     [not found]               ` <20060623141926.b28a5fc0.akpm@osdl.org>
     [not found]                 ` <449C6620.1020203@engr.sgi.com>
     [not found]                   ` <20060623164743.c894c314.akpm@osdl.org>
     [not found]                     ` <449CAA78.4080902@watson.ibm.com>
     [not found]                       ` <20060623213912.96056b02.akpm@osdl.org>
     [not found]                         ` <449CD4B3.8020300@watson.ibm.com>
     [not found]                           ` <44A01A50.1050403@sgi.com>
     [not found]                             ` <20060626105548.edef4c64.akpm@osdl.org>
     [not found]                               ` <44A020CD.30903@watson.ibm.com>
     [not found]                                 ` <20060626111249.7aece36e.akpm@osdl.org>
     [not found]                                   ` <44A026ED.8080903@sgi.com>
     [not found]                                     ` <20060626113959.839d72bc.akpm@osdl.org>
     [not found]                                       ` <44A2F50D.8030306@engr.sgi.com>
     [not found]                                         ` <20060628145341.529a61ab.akpm@osdl.org>
     [not found]                                           ` <44A2FC72.9090407@engr.sgi.com>
     [not found]                                             ` <20060629014050.d3bf0be4.pj@sgi.com>
     [not found]                                               ` <200606291230.k5TCUg45030710@turing-police.cc.vt.edu>
     [not found]                                               ` <200606291230.k5TCUg45030710@turing-police.cc.vt. edu>
     [not found]                                                 ` <20060629094408.360ac157.pj@sgi.com>
     [not found]                                                   ` <20060629110107.2e56310b.akpm@osdl.org>
2006-06-29 19:10                                                     ` Shailabh Nagar [this message]
2006-06-29 19:23                                                       ` [Patch][RFC] Disabling per-tgid stats on task exit in taskstats Paul Jackson
2006-06-29 19:33                                                       ` Andrew Morton
2006-06-29 19:43                                                         ` Shailabh Nagar
2006-06-29 20:00                                                           ` Andrew Morton
2006-06-29 22:13                                                             ` Shailabh Nagar
2006-06-29 23:00                                                               ` jamal
2006-06-29 20:01                                                         ` Shailabh Nagar
2006-06-29 21:22                                                           ` Paul Jackson
2006-06-29 22:54                                                           ` jamal
2006-06-30  0:38                                                             ` Shailabh Nagar
2006-06-30  1:05                                                               ` Andrew Morton
2006-06-30  1:11                                                                 ` Shailabh Nagar
2006-06-30  1:30                                                                   ` jamal
2006-06-30  3:01                                                                     ` Shailabh Nagar
2006-06-30 12:45                                                                       ` jamal
2006-06-30  2:25                                                                 ` Paul Jackson
2006-06-30  2:35                                                                   ` Andrew Morton
2006-06-30  2:43                                                                     ` Paul Jackson
2006-06-30 18:53                                                     ` Shailabh Nagar
2006-06-30 19:10                                                       ` Shailabh Nagar
2006-06-30 19:19                                                         ` Shailabh Nagar
2006-06-30 20:19                                                         ` jamal
2006-06-30 22:50                                                         ` Andrew Morton
2006-07-01  2:20                                                           ` Shailabh Nagar
2006-07-01  2:43                                                             ` Andrew Morton
2006-07-01  3:37                                                               ` Shailabh Nagar
2006-07-01  3:51                                                                 ` Andrew Morton
2006-07-03 21:11                                                                   ` Shailabh Nagar
2006-07-03 21:41                                                                     ` Andrew Morton
2006-07-04  0:13                                                                       ` Shailabh Nagar
2006-07-04  0:38                                                                         ` Andrew Morton
2006-07-04 20:19                                                                       ` Paul Jackson
2006-07-04 20:22                                                                         ` Paul Jackson
2006-07-04  0:54                                                                     ` Shailabh Nagar
2006-07-04  1:01                                                                       ` Andrew Morton
2006-07-04 13:05                                                                         ` jamal
2006-07-04 15:18                                                                           ` Shailabh Nagar
2006-07-04 16:37                                                                             ` Shailabh Nagar
2006-07-04 19:24                                                                               ` jamal
2006-07-05 14:09                                                                                 ` Shailabh Nagar
2006-07-05 20:25                                                                                   ` Chris Sturtivant
2006-07-05 20:32                                                                                     ` Shailabh Nagar
2006-07-03  4:53                                                                 ` Paul Jackson
2006-07-03 15:02                                                                   ` Shailabh Nagar
2006-07-03 15:55                                                                     ` Paul Jackson
2006-07-03 16:31                                                                     ` Paul Jackson
2006-07-04  0:09                                                                       ` Shailabh Nagar
2006-07-04 19:59                                                                         ` Paul Jackson
2006-07-05 17:20                                                                     ` Jay Lan
2006-07-05 18:18                                                                       ` Shailabh Nagar
2006-06-30 22:56                                                       ` Andrew Morton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=44A425A7.2060900@watson.ibm.com \
    --to=nagar@watson.ibm.com \
    --cc=Valdis.Kletnieks@vt.edu \
    --cc=akpm@osdl.org \
    --cc=balbir@in.ibm.com \
    --cc=csturtiv@sgi.com \
    --cc=hadi@cyberus.ca \
    --cc=jlan@engr.sgi.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=pj@sgi.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).