From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier MATZ Subject: Re: [PATCH] mbuf: extend rte_mbuf_prefetch_part* to support more prefetching methods Date: Tue, 31 May 2016 21:28:22 +0200 Message-ID: <574DE5D6.2040001@6wind.com> References: <1464663966-8122-1-git-send-email-jianbo.liu@linaro.org> Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit To: Jianbo Liu , jerin.jacob@caviumnetworks.com, dev@dpdk.org Return-path: Received: from mail-wm0-f41.google.com (mail-wm0-f41.google.com [74.125.82.41]) by dpdk.org (Postfix) with ESMTP id A8C882B94 for ; Tue, 31 May 2016 21:28:23 +0200 (CEST) Received: by mail-wm0-f41.google.com with SMTP id n129so148518695wmn.1 for ; Tue, 31 May 2016 12:28:23 -0700 (PDT) In-Reply-To: <1464663966-8122-1-git-send-email-jianbo.liu@linaro.org> List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" Hi Jianbo, On 05/31/2016 05:06 AM, Jianbo Liu wrote: > Change the inline function to macro with parameters > > Signed-off-by: Jianbo Liu > > [...] > --- a/lib/librte_mbuf/rte_mbuf.h > +++ b/lib/librte_mbuf/rte_mbuf.h > @@ -849,14 +849,15 @@ struct rte_mbuf { > * in the receive path. If the cache line of the architecture is higher than > * 64B, the second part will also be prefetched. > * > + * @param method > + * The prefetch method: prefetch0, prefetch1, prefetch2 or > + * prefetch_non_temporal. > + * > * @param m > * The pointer to the mbuf. > */ > -static inline void > -rte_mbuf_prefetch_part1(struct rte_mbuf *m) > -{ > - rte_prefetch0(&m->cacheline0); > -} > +#define RTE_MBUF_PREFETCH_PART1(method, m) \ > + rte_##method(&(m)->cacheline0) I'm not very fan of this macro, because it allows to really do everything): RTE_MBUF_PREFETCH_PART1(pktmbuf_free, m) would expand as: rte_pktmbuf_free(m) I'd prefer to have a switch case like this, almost similar to what Keith proposed in the initial discussion for my patch: enum rte_mbuf_prefetch_type { PREFETCH0, PREFETCH1, ... }; static inline void rte_mbuf_prefetch_part1(enum rte_mbuf_prefetch_type type, struct rte_mbuf *m) { switch (type) { case PREFETCH0: rte_prefetch0(&m->cacheline0); break; case PREFETCH1: rte_prefetch1(&m->cacheline0); break; ... } Some questions: could you give some details about the use of non-temporal prefetch in ixgbe_vec_neon? What are the pros and cons, and would it be useful in other drivers? Currently all drivers are doing prefetch0 when they prefetch the mbuf structure. Some drivers use prefetch1 for data. By the way, I did not try to apply the patch, but it looks it's on top of dpdk-next-net/rel_16_07, right? Thanks, Olivier