All of lore.kernel.org
 help / color / mirror / Atom feed
From: "roland" <for_spam@gmx.de>
To: Joern Bredereck <jb@bw-networx.net>,
	user-mode-linux-user@lists.sourceforge.net
Cc: user-mode-linux-devel@lists.sourceforge.net, mingo@elte.hu,
	Jeff Dike <jdike@addtoit.com>
Subject: [uml-devel] Re: [uml-user] Network lags
Date: Fri, 9 Apr 2004 14:47:32 +0200	[thread overview]
Message-ID: <189201c41e30$d434cc70$2000000a@schlepptopp> (raw)
In-Reply-To: 185d01c41e23$5996cc00$2000000a@schlepptopp

hi again,

i did some testing and i`m able to reproduce this with 2.6 based uml on 2.6 host.
after starting some threads like

while true;do find /;done >/dev/zero 2>&1 &
while true;do dd if=/dev/urandom of=/somepath/test.dat >/dev/zero 2>&1 &
while true;do dd íf=/dev/hda of=/dev/zero >/dev/zero 2>&1 &

and thus stressing the I/O on host very much, all seems to run fine for some time - BUT:
from time to time (didn`t examine, if this is periodic - but all runs fine at least >95% of the time) i get the same lags, when
pinging the uml.

since i`m stressing the I/O very heavily, the lags i see are much worse than joerns.
i sometimes get lags >60s and even error-messages from ping: sendms: no buffer space available.

ok - my generated disk I/O uses most of the buffers, but shouln`t it be the host kernels job,to leave some buffers reserved and
"schedule" that appropriately ?  i`m not an expert regarding scheduling - but this is what i expect from a kernel.

does anybody run a more recent (>2.6.0) HOST kernel and is also able to reproduce this?
maybe this is a "scheduling question" or a question for LKML ?
maybe i`m just stupid doing such "nasty things" i shouldn`t do - and i`m expecting too much ? :D

at least, i would be interested in getting to know:

- why this happens
- how to "tune" this, that it doesn`t happen
- if this is expected behavour on "heavy loaded systems" or probably a uml or host-kernel bug

regards
roland




----- Original Message ----- 
From: "roland" <for_spam@gmx.de>
To: "Joern Bredereck" <jb@bw-networx.net>; <user-mode-linux-user@lists.sourceforge.net>
Sent: Friday, April 09, 2004 1:10 PM
Subject: Re: [uml-user] Network lags


> Hi Joern!
> Is the uml 100% idle all the time?
> what`s going on on the host at the same time?
> uml is just another "process" on the host - and the scheduler (especially I/O) of the 2.4 kernel series is not the really best.
2.6
> is MUCH better - so you probably could try 2.6 on HOST and compare, if that makes a difference?
>
> could you run "vmstat 1" on the host and inside the uml while pinging?
>
> could you also ping into the other direction and put the results "side by side", to see if there is a relation ?
>
> as of writing this, i searched the mailing-list archive and found a reference:
>
> http://sourceforge.net/mailarchive/message.php?msg_id=6285243
>
> so, you probably can do "some more" I/O on your host or on your UML  (dd if=....) and study the ping "behaviour" ?
>
>
> regards
> roland
>
>
> ----- Original Message ----- 
> From: "Joern Bredereck" <jb@bw-networx.net>
> To: <user-mode-linux-user@lists.sourceforge.net>
> Sent: Friday, April 09, 2004 12:27 PM
> Subject: [uml-user] Network lags
>
>
> > Hi,
> >
> > I have a strange problem with lagging network connections to my vservers.
> >
> > Here is a typical ping to one of the uml vservers, sent from the uml host
> > itself:
> >
> > 64 bytes from 217.146.142.73: icmp_seq=44 ttl=64 time=0.2 ms
> > 64 bytes from 217.146.142.73: icmp_seq=45 ttl=64 time=0.2 ms
> > 64 bytes from 217.146.142.73: icmp_seq=46 ttl=64 time=0.2 ms
> > 64 bytes from 217.146.142.73: icmp_seq=47 ttl=64 time=235.6 ms
> > 64 bytes from 217.146.142.73: icmp_seq=48 ttl=64 time=0.4 ms
> > 64 bytes from 217.146.142.73: icmp_seq=49 ttl=64 time=0.2 ms
> > 64 bytes from 217.146.142.73: icmp_seq=50 ttl=64 time=0.4 ms
> > 64 bytes from 217.146.142.73: icmp_seq=51 ttl=64 time=0.2 ms
> > 64 bytes from 217.146.142.73: icmp_seq=52 ttl=64 time=0.4 ms
> > 64 bytes from 217.146.142.73: icmp_seq=53 ttl=64 time=2876.6 ms
> > 64 bytes from 217.146.142.73: icmp_seq=54 ttl=64 time=1876.7 ms
> > 64 bytes from 217.146.142.73: icmp_seq=55 ttl=64 time=876.7 ms
> > 64 bytes from 217.146.142.73: icmp_seq=56 ttl=64 time=1.8 ms
> > 64 bytes from 217.146.142.73: icmp_seq=57 ttl=64 time=0.3 ms
> > 64 bytes from 217.146.142.73: icmp_seq=58 ttl=64 time=0.4 ms
> >
> > As you can see, most of the times, the pings are just fine. But every 10
> > to 20 seconds there is a lag. Sometime only 500 ms, sometimes 3000 ms and
> > sometimes even 10 seconds long.
> >
> > You can imagine it's rellay no fun to work on a lagging vserver via ssh.
> > :-(
> >
> > The uml host itself has no networking problems with the outside world.
> > Pings to everywhere are just as they are supposed to be.
> >
> > I use the tap-devices for networking.
> >
> > vserver2:/etc/init.d# ifconfig tap1
> > tap1      Link encap:Ethernet  HWaddr 00:FF:7F:4C:C5:2F
> >           inet addr:217.146.142.84  Bcast:217.146.142.255
> > Mask:255.255.255.255
> >           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
> >           RX packets:3478644 errors:0 dropped:0 overruns:0 frame:0
> >           TX packets:2961598 errors:0 dropped:0 overruns:0 carrier:0
> >           collisions:0 txqueuelen:1000
> >           RX bytes:1503132252 (1.3 GiB)  TX bytes:299835570 (285.9 MiB)
> >
> >
> > The uml, as well as the uml host is a Debian Woody 3.0.
> >
> > Kernels in use:
> >
> > On the host: Vanilla 2.4.23 with the host-skas-patch.
> >
> > On the uml:
> >
> > Linux version 2.4.23-1um (root@vserver1) (gcc version 2.95.4 20011002 (Debian prerelease)) #5 Sun Dec 21 04:26:57 CET 2003
> > On node 0 totalpages: 16384
> > zone(0): 16384 pages.
> > zone(1): 0 pages.
> > zone(2): 0 pages.
> > Kernel command line: ubd0=/dev/vserver/ns1 ubd1=/dev/vserver/swap_ns1 eth0=tuntap,,,217.146.142.84 umid=ns1 mem=64M
root=/dev/ubd0
> > Calibrating delay loop... 3135.26 BogoMIPS
> > Memory: 60772k available
> > Dentry cache hash table entries: 8192 (order: 4, 65536 bytes)
> > Inode cache hash table entries: 4096 (order: 3, 32768 bytes)
> > Mount cache hash table entries: 512 (order: 0, 4096 bytes)
> > Buffer cache hash table entries: 1024 (order: 0, 4096 bytes)
> > Page-cache hash table entries: 16384 (order: 4, 65536 bytes)
> > Checking for host processor cmov support...Yes
> > Checking for host processor xmm support...No
> > Checking that ptrace can change system call numbers...OK
> > Checking that host ptys support output SIGIO...Yes
> > Checking that host ptys support SIGIO on close...No, enabling workaround
> > Checking for /dev/anon on the host...Not available (open failed with errno 2)
> > POSIX conformance testing by UNIFIX
> > Linux NET4.0 for Linux 2.4
> > Based upon Swansea University Computer Society NET3.039
> > Sangoma WANPIPE Router v1.1 (c) 1995-2000 Sangoma Technologies Inc.
> > Initializing RT netlink socket
> > Starting kswapd
> > VFS: Disk quotas vdquot_6.5.1
> > Journalled Block Device driver loaded
> > devfs: v1.12c (20020818) Richard Gooch (rgooch@atnf.csiro.au)
> > devfs: boot_options: 0x1
> > pty: 2048 Unix98 ptys configured
> > RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
> > loop: loaded (max 8 devices)
> > PPP generic driver version 2.4.2
> > Equalizer1996: $Revision: 1.2.1 $ $Date: 1996/09/22 13:52:00 $ Simon Janes (simon@ncm.com)
> > Universal TUN/TAP device driver 1.5 (C)1999-2002 Maxim Krasnyansky
> > ipddp.c:v0.01 8/28/97 Bradford W. Johnson <johns393@maroon.tc.umn.edu>
> > ipddp0: Appletalk-IP Encap. mode by Bradford W. Johnson <johns393@maroon.tc.umn.edu>
> > md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
> > md: Autodetecting RAID arrays.
> > md: autorun ...
> > md: ... autorun DONE.
> > Initializing software serial port version 1
> > mconsole (version 2) initialized on /root/.uml/ns1/mconsole
> > Partition check:
> >  ubda: unknown partition table
> >  ubdb: unknown partition table
> > ubd : Synchronous mode
> > Initializing stdio console driver
> > Netdevice 0 : TUN/TAP backend - IP = 217.146.142.84
> > NET4: Linux TCP/IP 1.0 for NET4.0
> > IP Protocols: ICMP, UDP, TCP, IGMP
> > IP: routing cache hash table of 512 buckets, 4Kbytes
> > TCP: Hash tables configured (established 4096 bind 4096)
> > IPv4 over IPv4 tunneling driver
> > GRE over IPv4 tunneling driver
> > Linux IP multicast router 0.06 plus PIM-SM
> > ip_conntrack version 2.1 (474 buckets, 3792 max) - 292 bytes per conntrack
> > NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
> > IPv6 v0.8 for NET4.0
> > IPv6 over IPv4 tunneling driver
> > NET4: AppleTalk 0.18a for Linux NET4.0
> > EXT3-fs: INFO: recovery required on readonly filesystem.
> > EXT3-fs: write access will be enabled during recovery.
> > kjournald starting.  Commit interval 5 seconds
> > EXT3-fs: recovery complete.
> > EXT3-fs: mounted filesystem with ordered data mode.
> > VFS: Mounted root (ext3 filesystem) readonly.
> > Mounted devfs on /dev
> > Adding Swap: 262136k swap-space (priority -1)
> > EXT3 FS 2.4-0.9.19, 19 August 2002 on ubd(98,0), internal journal
> > Virtual console 1 assigned device '/dev/ptyp1'
> > * insmod tun
> > insmod: tun: no module by that name found
> > * ifconfig tap1 217.146.142.84 netmask 255.255.255.255 up
> > * bash -c echo 1 > /proc/sys/net/ipv4/ip_forward
> > * route add -host 217.146.142.73 dev tap1
> > * bash -c echo 1 > /proc/sys/net/ipv4/conf/tap1/proxy_arp
> > * arp -Ds 217.146.142.73 eth0 pub
> > * route del -host 217.146.142.73 dev tap1
> > * bash -c echo 0 > /proc/sys/net/ipv4/conf/tap1/proxy_arp
> > * arp -i eth0 -d 217.146.142.73 pub
> > * route add -host 217.146.142.73 dev tap1
> > * bash -c echo 1 > /proc/sys/net/ipv4/conf/tap1/proxy_arp
> > * arp -Ds 217.146.142.73 eth0 pub
> > eth0: no IPv6 routers present
> >
> >
> > Any idea what could cause those lags? Any hint for further troubleshooting
> > approaches?
> >
> > Thanks!
> >
> > Joern
> >
> >
> >
> > -------------------------------------------------------
> > This SF.Net email is sponsored by: IBM Linux Tutorials
> > Free Linux tutorial presented by Daniel Robbins, President and CEO of
> > GenToo technologies. Learn everything from fundamentals to system
> > administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
> > _______________________________________________
> > User-mode-linux-user mailing list
> > User-mode-linux-user@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/user-mode-linux-user
> >
>
>
>
> -------------------------------------------------------
> This SF.Net email is sponsored by: IBM Linux Tutorials
> Free Linux tutorial presented by Daniel Robbins, President and CEO of
> GenToo technologies. Learn everything from fundamentals to system
> administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
> _______________________________________________
> User-mode-linux-user mailing list
> User-mode-linux-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/user-mode-linux-user
>



-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

       reply	other threads:[~2004-04-09 12:41 UTC|newest]

Thread overview: 23+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <Pine.LNX.4.59.9.0404091209570.10714@hathi.bw-networx.net>
     [not found] ` <185d01c41e23$5996cc00$2000000a@schlepptopp>
2004-04-09 12:47   ` roland [this message]
2004-04-09 16:26     ` [uml-devel] Re: [uml-user] Network lags Jeff Dike
2004-04-09 18:33       ` [uml-devel] uml responsivenes or "AS vs DEADLINE Scheduler on uml-host" - was: " roland
2004-04-10 12:28         ` BlaisorBlade
2004-04-13 21:26           ` roland
2004-04-14  1:02             ` Henrik Nordstrom
     [not found]       ` <c58kbu$el1$1@sea.gmane.org>
2004-04-10 14:08         ` [uml-devel] " Jeff Dike
2004-04-10 14:00           ` Sven Köhler
2004-04-10 17:07             ` Jeff Dike
2004-04-10 21:22               ` Sven Köhler
2004-04-10 23:29                 ` Henrik Nordstrom
2004-04-11  3:45                 ` attriel
2004-04-15  0:31               ` roland
2004-04-15 12:01                 ` David Cannings
     [not found]                   ` <407EADE0.8010308@cox.net>
2004-04-15 21:11                     ` roland
2004-04-16 15:12                   ` Matthew Bloch
2004-04-18 10:54                     ` roland
2004-04-10 14:50           ` [uml-devel] Re: V=R Michael Koehne
2004-04-11 16:54             ` BlaisorBlade
2004-04-11 17:17             ` Nicholas E. Walker
2004-04-10 15:12           ` [uml-devel] Re: [uml-user] Network lags Nicholas E. Walker
2004-04-10 16:46             ` Steven Pritchard
2004-04-10 23:41               ` Henrik Nordstrom

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='189201c41e30$d434cc70$2000000a@schlepptopp' \
    --to=for_spam@gmx.de \
    --cc=jb@bw-networx.net \
    --cc=jdike@addtoit.com \
    --cc=mingo@elte.hu \
    --cc=user-mode-linux-devel@lists.sourceforge.net \
    --cc=user-mode-linux-user@lists.sourceforge.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.