virtualization.lists.linux-foundation.org archive mirror
 help / color / mirror / Atom feed
From: Paolo Bonzini <pbonzini@redhat.com>
To: Rusty Russell <rusty@rustcorp.com.au>
Cc: virtualization@lists.linux-foundation.org,
	linux-kernel@vger.kernel.org,
	"Michael S. Tsirkin" <mst@redhat.com>
Subject: Re: [PATCH 02/16] virtio_ring: virtqueue_add_sgs, to add multiple sgs.
Date: Thu, 28 Feb 2013 10:24:11 +0100	[thread overview]
Message-ID: <512F223B.4080801@redhat.com> (raw)
In-Reply-To: <87fw0idmu2.fsf@rustcorp.com.au>

Il 27/02/2013 12:21, Rusty Russell ha scritto:
>>> >> Baseline (before add_sgs):
>>> >>         2.840000-3.040000(2.927292)user
>>> >> 
>>> >> After add_sgs:
>>> >>         2.970000-3.150000(3.053750)user
>>> >> 
>>> >> After simplifying add_buf a little:
>>> >>         2.950000-3.210000(3.081458)user
>>> >> 
>>> >> After inlining virtqueue_add/vring_add_indirect:
>>> >>         2.920000-3.150000(3.026875)user
>>> >> 
>>> >> After passing in iteration functions (chained vs unchained):
>>> >>         2.760000-2.970000(2.883542)user
> Oops.  This result (and the next) is bogus.  I was playing with -O3, and
> accidentally left that in :(

Did you check what actually happened that improved speed so much?  Can
we do it ourselves, or use a GCC attribute to turn it on?  Looking at
the GCC manual and source, there's just a bunch of optimizations enabled
by -O3:

    { OPT_LEVELS_3_PLUS, OPT_ftree_loop_distribute_patterns, NULL, 1 },

`-ftree-loop-distribute-patterns'
     This pass distributes the initialization loops and generates a
     call to memset zero.  For example, the loop

Doesn't matter.

    { OPT_LEVELS_3_PLUS, OPT_fpredictive_commoning, NULL, 1 },

Also doesn't matter.

    { OPT_LEVELS_3_PLUS, OPT_funswitch_loops, NULL, 1 },

Can be done by us at the source level.

    { OPT_LEVELS_3_PLUS, OPT_ftree_vectorize, NULL, 1 },

Probably doesn't matter.

    { OPT_LEVELS_3_PLUS, OPT_fipa_cp_clone, NULL, 1 },

`-fipa-cp-clone'
     Perform function cloning to make interprocedural constant
     propagation stronger.  When enabled, interprocedural constant
     propagation will perform function cloning when externally visible
     function can be called with constant arguments.

Can be done by adding new external APIs or marking functions as
always_inline.

    { OPT_LEVELS_3_PLUS, OPT_fgcse_after_reload, NULL, 1 },

`-fgcse-after-reload'
     When `-fgcse-after-reload' is enabled, a redundant load elimination
     pass is performed after reload.  The purpose of this pass is to
     cleanup redundant spilling.

Never saw it have any substantial effect.

    { OPT_LEVELS_3_PLUS_AND_SIZE, OPT_finline_functions, NULL, 1 },

Can be done by us simply by adding more "inline" keywords.

Plus, -O3 will make *full* loop unrolling a bit more aggressive.  But
full loop unrolling requires compile-time-known loop bounds, so I doubt
this is the case.

Paolo

  parent reply	other threads:[~2013-02-28  9:24 UTC|newest]

Thread overview: 60+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-02-19  7:56 [PATCH 00/16] virtio ring rework Rusty Russell
2013-02-19  7:56 ` [PATCH 01/16] scatterlist: introduce sg_unmark_end Rusty Russell
2013-02-19  7:56 ` [PATCH 02/16] virtio_ring: virtqueue_add_sgs, to add multiple sgs Rusty Russell
2013-02-19  9:15   ` Wanlong Gao
2013-02-20  9:18   ` Asias He
2013-02-24 22:12   ` Michael S. Tsirkin
2013-02-26  5:14     ` Rusty Russell
2013-02-26  9:30       ` Michael S. Tsirkin
2013-02-26  7:02     ` Paolo Bonzini
2013-02-27  7:28       ` Rusty Russell
2013-02-27  7:49         ` Michael S. Tsirkin
2013-02-27 11:21           ` Rusty Russell
     [not found]           ` <87fw0idmu2.fsf@rustcorp.com.au>
2013-02-28  9:24             ` Paolo Bonzini [this message]
2013-03-01  1:01               ` Rusty Russell
2013-02-19  7:56 ` [PATCH 03/16] virtio-blk: reorganize virtblk_add_req Rusty Russell
2013-02-19  7:56 ` [PATCH 04/16] virtio-blk: use virtqueue_start_buf on bio path Rusty Russell
2013-02-20  9:19   ` Asias He
2013-02-21  6:23     ` Rusty Russell
2013-02-19  7:56 ` [PATCH 05/16] virtio-blk: use virtqueue_add_sgs on req path Rusty Russell
2013-02-20  9:20   ` Asias He
2013-02-19  7:56 ` [PATCH 06/16] virtio_blk: remove nents member Rusty Russell
2013-02-20  9:20   ` Asias He
2013-02-19  7:56 ` [PATCH 07/16] virtio_ring: don't count elements twice for add_buf path Rusty Russell
2013-02-20 10:09   ` Wanlong Gao
2013-02-19  7:56 ` [PATCH 08/16] virtio_ring: virtqueue_add_outbuf / virtqueue_add_inbuf Rusty Russell
2013-02-20 10:09   ` Wanlong Gao
2013-02-21 17:09   ` Michael S. Tsirkin
2013-02-22  0:02     ` Rusty Russell
2013-02-25 21:35       ` Michael S. Tsirkin
2013-02-28  5:08         ` Rusty Russell
2013-02-28  7:01           ` Michael S. Tsirkin
2013-03-06  6:03             ` Rusty Russell
2013-02-19  7:56 ` [PATCH 09/16] virtio_net: use simplified virtqueue accessors Rusty Russell
2013-02-20 10:09   ` Wanlong Gao
2013-02-19  7:56 ` [PATCH 10/16] virtio_net: use virtqueue_add_sgs[] for command buffers Rusty Russell
2013-02-19  7:56 ` [PATCH 11/16] virtio_rng: use simplified virtqueue accessors Rusty Russell
2013-02-19  7:56 ` [PATCH 12/16] virtio_console: " Rusty Russell
2013-02-19  7:56 ` [PATCH 13/16] caif_virtio: " Rusty Russell
2013-02-19  7:56 ` [PATCH 14/16] virtio_rpmsg_bus: " Rusty Russell
2013-02-19  7:56 ` [PATCH 15/16] virtio_balloon: " Rusty Russell
2013-02-19  7:56 ` [PATCH 16/16] 9p/trans_virtio.c: use virtio_add_sgs[] Rusty Russell
2013-02-19  9:15 ` [PATCH 00/16] virtio ring rework Paolo Bonzini
2013-02-21  6:30   ` Rusty Russell
2013-02-20  8:37 ` [PATCH 17/16] virtio-scsi: use virtqueue_add_sgs for command buffers Wanlong Gao
2013-02-20  9:38   ` Asias He
2013-02-20  9:41     ` Wanlong Gao
2013-02-20  9:47 ` [PATCH 17/16 V2] " Wanlong Gao
2013-02-20 10:54   ` Paolo Bonzini
2013-02-20 12:17   ` Asias He
2013-02-21  6:34     ` Rusty Russell
     [not found] ` <1361260594-601-11-git-send-email-rusty@rustcorp.com.au>
2013-02-20 10:11   ` [PATCH 10/16] virtio_net: use virtqueue_add_sgs[] " Wanlong Gao
2013-02-21  6:27     ` Rusty Russell
2013-02-21  8:30   ` Wanlong Gao
2013-02-21  9:41     ` Jason Wang
2013-02-21  9:43       ` Wanlong Gao
     [not found] ` <1361260594-601-12-git-send-email-rusty@rustcorp.com.au>
2013-02-20 10:12   ` [PATCH 11/16] virtio_rng: use simplified virtqueue accessors Wanlong Gao
     [not found] ` <1361260594-601-13-git-send-email-rusty@rustcorp.com.au>
2013-02-20 10:12   ` [PATCH 12/16] virtio_console: " Wanlong Gao
     [not found] ` <1361260594-601-14-git-send-email-rusty@rustcorp.com.au>
2013-02-20 10:13   ` [PATCH 13/16] caif_virtio: " Wanlong Gao
     [not found] ` <1361260594-601-15-git-send-email-rusty@rustcorp.com.au>
2013-02-20 10:14   ` [PATCH 14/16] virtio_rpmsg_bus: " Wanlong Gao
     [not found] ` <1361260594-601-16-git-send-email-rusty@rustcorp.com.au>
2013-02-20 10:15   ` [PATCH 15/16] virtio_balloon: " Wanlong Gao

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=512F223B.4080801@redhat.com \
    --to=pbonzini@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mst@redhat.com \
    --cc=rusty@rustcorp.com.au \
    --cc=virtualization@lists.linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).