Netdev List

Netdev List
 help / color / mirror / Atom feed

* Re: [PATCH net-next-2.6] vlan: multiqueue vlan devices
From: Eric Dumazet @ 2009-09-02 18:55 UTC (permalink / raw)
  To: Brian Haley
  Cc: David Miller, Patrick McHardy, Stephen Hemminger, jarkao2, netdev
In-Reply-To: <4A9EBB68.2080903@hp.com>

Brian Haley a écrit :
> Eric Dumazet wrote:
>> diff --git a/include/net/rtnetlink.h b/include/net/rtnetlink.h
>> index 3c1895e..0525a1d 100644
>> --- a/include/net/rtnetlink.h
>> +++ b/include/net/rtnetlink.h
>> @@ -70,6 +70,8 @@ struct rtnl_link_ops {
>>  	size_t			(*get_xstats_size)(const struct net_device *dev);
>>  	int			(*fill_xstats)(struct sk_buff *skb,
>>  					       const struct net_device *dev);
>> +	int			(*get_tx_queues)(struct net*, struct nlattr *tb[],
>> +						 int *tx_queues, int *real_tx_queues);
> 
> 	int			(*get_tx_queues)(struct net *net, struct nlattr *tb[],
> 
> Total nitpick, but tbird highlighted it because of the missing "net".
> 

Thats right, and I believe some gcc versions could complain

Thanks Brian

[PATCH net-next-2.6] vlan: multiqueue vlan device

vlan devices are currently not multi-queue capable.

We can do that with a new rtnl_link_ops method,
get_tx_queues(), called from rtnl_create_link()

This new method gets num_tx_queues/real_num_tx_queues
from real device.

register_vlan_device() is also handled.

Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com>
---
 include/net/rtnetlink.h  |    2 ++
 net/8021q/vlan.c         |    5 +++--
 net/8021q/vlan_netlink.c |   20 ++++++++++++++++++++
 net/core/rtnetlink.c     |   10 +++++++++-
 4 files changed, 34 insertions(+), 3 deletions(-)

diff --git a/include/net/rtnetlink.h b/include/net/rtnetlink.h
index 3c1895e..3f4ab54 100644
--- a/include/net/rtnetlink.h
+++ b/include/net/rtnetlink.h
@@ -70,6 +70,8 @@ struct rtnl_link_ops {
 	size_t			(*get_xstats_size)(const struct net_device *dev);
 	int			(*fill_xstats)(struct sk_buff *skb,
 					       const struct net_device *dev);
+	int			(*get_tx_queues)(struct net *net, struct nlattr *tb[],
+						 int *tx_queues, int *real_tx_queues);
 };
 
 extern int	__rtnl_link_register(struct rtnl_link_ops *ops);
diff --git a/net/8021q/vlan.c b/net/8021q/vlan.c
index e814794..8836575 100644
--- a/net/8021q/vlan.c
+++ b/net/8021q/vlan.c
@@ -330,12 +330,13 @@ static int register_vlan_device(struct net_device *real_dev, u16 vlan_id)
 		snprintf(name, IFNAMSIZ, "vlan%.4i", vlan_id);
 	}
 
-	new_dev = alloc_netdev(sizeof(struct vlan_dev_info), name,
-			       vlan_setup);
+	new_dev = alloc_netdev_mq(sizeof(struct vlan_dev_info), name,
+				  vlan_setup, real_dev->num_tx_queues);
 
 	if (new_dev == NULL)
 		return -ENOBUFS;
 
+	new_dev->real_num_tx_queues = real_dev->real_num_tx_queues;
 	dev_net_set(new_dev, net);
 	/* need 4 bytes for extra VLAN header info,
 	 * hope the underlying device can handle it.
diff --git a/net/8021q/vlan_netlink.c b/net/8021q/vlan_netlink.c
index e9c91dc..8ce4122 100644
--- a/net/8021q/vlan_netlink.c
+++ b/net/8021q/vlan_netlink.c
@@ -100,6 +100,25 @@ static int vlan_changelink(struct net_device *dev,
 	return 0;
 }
 
+static int vlan_get_tx_queues(struct net *net, 
+			      struct nlattr *tb[],
+			      int *num_tx_queues,
+			      int *real_num_tx_queues)
+{
+	struct net_device *real_dev;
+
+	if (!tb[IFLA_LINK])
+		return -EINVAL;
+
+	real_dev = __dev_get_by_index(net, nla_get_u32(tb[IFLA_LINK]));
+	if (!real_dev)
+		return -ENODEV;
+
+	*num_tx_queues      = real_dev->num_tx_queues;
+	*real_num_tx_queues = real_dev->real_num_tx_queues;
+	return 0;
+}
+
 static int vlan_newlink(struct net_device *dev,
 			struct nlattr *tb[], struct nlattr *data[])
 {
@@ -216,6 +235,7 @@ struct rtnl_link_ops vlan_link_ops __read_mostly = {
 	.maxtype	= IFLA_VLAN_MAX,
 	.policy		= vlan_policy,
 	.priv_size	= sizeof(struct vlan_dev_info),
+	.get_tx_queues  = vlan_get_tx_queues,
 	.setup		= vlan_setup,
 	.validate	= vlan_validate,
 	.newlink	= vlan_newlink,
diff --git a/net/core/rtnetlink.c b/net/core/rtnetlink.c
index b44775f..5c1fe53 100644
--- a/net/core/rtnetlink.c
+++ b/net/core/rtnetlink.c
@@ -974,12 +974,20 @@ struct net_device *rtnl_create_link(struct net *net, char *ifname,
 {
 	int err;
 	struct net_device *dev;
+	int num_queues = 1;
+	int real_num_queues = 1;
 
+	if (ops->get_tx_queues) {
+		err = ops->get_tx_queues(net, tb, &num_queues, &real_num_queues);
+		if (err)
+			goto err;
+	}
 	err = -ENOMEM;
-	dev = alloc_netdev(ops->priv_size, ifname, ops->setup);
+	dev = alloc_netdev_mq(ops->priv_size, ifname, ops->setup, num_queues);
 	if (!dev)
 		goto err;
 
+	dev->real_num_tx_queues = real_num_queues;
 	if (strchr(dev->name, '%')) {
 		err = dev_alloc_name(dev, dev->name);
 		if (err < 0)

^ permalink raw reply related

* Re: [PATCH net-next-2.6] vlan: multiqueue vlan devices
From: Brian Haley @ 2009-09-02 18:37 UTC (permalink / raw)
  To: Eric Dumazet
  Cc: David Miller, Patrick McHardy, Stephen Hemminger, jarkao2, netdev
In-Reply-To: <4A9EA8C6.2000501@gmail.com>

Eric Dumazet wrote:
> diff --git a/include/net/rtnetlink.h b/include/net/rtnetlink.h
> index 3c1895e..0525a1d 100644
> --- a/include/net/rtnetlink.h
> +++ b/include/net/rtnetlink.h
> @@ -70,6 +70,8 @@ struct rtnl_link_ops {
>  	size_t			(*get_xstats_size)(const struct net_device *dev);
>  	int			(*fill_xstats)(struct sk_buff *skb,
>  					       const struct net_device *dev);
> +	int			(*get_tx_queues)(struct net*, struct nlattr *tb[],
> +						 int *tx_queues, int *real_tx_queues);

	int			(*get_tx_queues)(struct net *net, struct nlattr *tb[],

Total nitpick, but tbird highlighted it because of the missing "net".

-Brian

^ permalink raw reply

* Re: ipw2200: firmware DMA loading rework
From: Bartlomiej Zolnierkiewicz @ 2009-09-02 18:26 UTC (permalink / raw)
  To: Luis R. Rodriguez
  Cc: Tso Ted, Aneesh Kumar K.V, Zhu Yi, Andrew Morton, Mel Gorman,
	Johannes Weiner, Pekka Enberg, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Mel Gorman,
	netdev-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org, James Ketrenos,
	Chatre, Reinette,
	linux-wireless-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	ipw2100-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
In-Reply-To: <43e72e890909021102g7f844c79xefccf305f5f5c5b6-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>

On Wednesday 02 September 2009 20:02:14 Luis R. Rodriguez wrote:
> On Wed, Sep 2, 2009 at 10:48 AM, Bartlomiej
> Zolnierkiewicz<bzolnier-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
> > On Sunday 30 August 2009 14:37:42 Bartlomiej Zolnierkiewicz wrote:
> >> On Friday 28 August 2009 05:42:31 Zhu Yi wrote:
> >> > Bartlomiej Zolnierkiewicz reported an atomic order-6 allocation failure
> >> > for ipw2200 firmware loading in kernel 2.6.30. High order allocation is
> >>
> >> s/2.6.30/2.6.31-rc6/
> >>
> >> The issue has always been there but it was some recent change that
> >> explicitly triggered the allocation failures (after 2.6.31-rc1).
> >
> > ipw2200 fix works fine but yesterday I got the following error while mounting
> > ext4 filesystem (mb_history is optional so the mount succeeded):
> 
> OK so the mount succeeded.
> 
> > EXT4-fs (dm-2): barriers enabled
> > kjournald2 starting: pid 3137, dev dm-2:8, commit interval 5 seconds
> > EXT4-fs (dm-2): internal journal on dm-2:8
> > EXT4-fs (dm-2): delayed allocation enabled
> > EXT4-fs: file extents enabled
> > mount: page allocation failure. order:5, mode:0xc0d0
> > Pid: 3136, comm: mount Not tainted 2.6.31-rc8-00015-gadda766-dirty #78
> > Call Trace:
> >  [<c0394de3>] ? printk+0xf/0x14
> >  [<c016a693>] __alloc_pages_nodemask+0x400/0x442
> >  [<c016a71b>] __get_free_pages+0xf/0x32
> >  [<c01865cf>] __kmalloc+0x28/0xfa
> >  [<c023d96f>] ? __spin_lock_init+0x28/0x4d
> >  [<c01f529d>] ext4_mb_init+0x392/0x460
> >  [<c01e99d2>] ext4_fill_super+0x1b96/0x2012
> >  [<c0239bc8>] ? snprintf+0x15/0x17
> >  [<c01c0b26>] ? disk_name+0x24/0x69
> >  [<c018ba63>] get_sb_bdev+0xda/0x117
> >  [<c01e6711>] ext4_get_sb+0x13/0x15
> >  [<c01e7e3c>] ? ext4_fill_super+0x0/0x2012
> >  [<c018ad2d>] vfs_kern_mount+0x3b/0x76
> >  [<c018adad>] do_kern_mount+0x33/0xbd
> >  [<c019d0af>] do_mount+0x660/0x6b8
> >  [<c016a71b>] ? __get_free_pages+0xf/0x32
> >  [<c019d168>] sys_mount+0x61/0x99
> >  [<c0102908>] sysenter_do_call+0x12/0x36
> > Mem-Info:
> > DMA per-cpu:
> > CPU    0: hi:    0, btch:   1 usd:   0
> > Normal per-cpu:
> > CPU    0: hi:  186, btch:  31 usd:   0
> > Active_anon:25471 active_file:22802 inactive_anon:25812
> >  inactive_file:33619 unevictable:2 dirty:2452 writeback:135 unstable:0
> >  free:4346 slab:4308 mapped:26038 pagetables:912 bounce:0
> > DMA free:2060kB min:84kB low:104kB high:124kB active_anon:1660kB inactive_anon:1848kB active_file:144kB inactive_file:868kB unevictable:0kB present:15788kB pages_scanned:0 all_unreclaimable? no
> > lowmem_reserve[]: 0 489 489
> > Normal free:15324kB min:2788kB low:3484kB high:4180kB active_anon:100224kB inactive_anon:101400kB active_file:91064kB inactive_file:133608kB unevictable:8kB present:501392kB pages_scanned:0 all_unreclaimable? no
> > lowmem_reserve[]: 0 0 0
> > DMA: 1*4kB 1*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 2060kB
> > Normal: 1283*4kB 648*8kB 159*16kB 53*32kB 10*64kB 1*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 15324kB
> > 57947 total pagecache pages
> > 878 pages in swap cache
> > Swap cache stats: add 920, delete 42, find 11/11
> > Free swap  = 1016436kB
> > Total swap = 1020116kB
> > 131056 pages RAM
> > 4233 pages reserved
> > 90573 pages shared
> > 77286 pages non-shared
> > EXT4-fs: mballoc enabled
> > EXT4-fs (dm-2): mounted filesystem with ordered data mode
> >
> > Thus it seems like the original bug is still there and any ideas how to
> > debug the problem further are appreciated..
> >
> > The complete dmesg and kernel config are here:
> >
> > http://www.kernel.org/pub/linux/kernel/people/bart/ext4-paf.dmesg
> > http://www.kernel.org/pub/linux/kernel/people/bart/ext4-paf.config
> 
> This looks very similar to the kmemleak ext4 reports upon a mount. If
> it is the same issue, which from the trace it seems it is, then this
> is due to an extra kmalloc() allocation and this apparently will not
> get fixed on 2.6.31 due to the closeness of the merge window and the
> non-criticalness this issue has been deemed.
> 
> A patch fix is part of the ext4-patchqueue
> http://repo.or.cz/w/ext4-patch-queue.git

Thanks for the pointer but the page allocation failures that I hit seem
to be caused by the memory management itself and the ext4 issue fixed by:

http://repo.or.cz/w/ext4-patch-queue.git?a=blob;f=memory-leak-fix-ext4_group_info-allocation;h=c919fff34e70ec85f96d1833f9ce460c451000de;hb=HEAD

is a different problem (unrelated to this one).

^ permalink raw reply

* Re: [PATCH] tc: Fix unitialized kernel memory leak
From: Jiri Pirko @ 2009-09-02 18:09 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David S. Miller, Linux Netdev List
In-Reply-To: <4A9E67A9.7090205@gmail.com>

Wed, Sep 02, 2009 at 02:40:09PM CEST, eric.dumazet@gmail.com wrote:
>Three bytes of uninitialized kernel memory are currently leaked to user
>
>Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com>

Reviewed-by: Jiri Pirko <jpirko@redhat.com>

>---
>diff --git a/net/sched/sch_api.c b/net/sched/sch_api.c
>index 24d17ce..fdb694e 100644
>--- a/net/sched/sch_api.c
>+++ b/net/sched/sch_api.c
>@@ -1456,6 +1456,8 @@ static int tc_fill_tclass(struct sk_buff *skb, struct Qdisc *q,
> 	nlh = NLMSG_NEW(skb, pid, seq, event, sizeof(*tcm), flags);
> 	tcm = NLMSG_DATA(nlh);
> 	tcm->tcm_family = AF_UNSPEC;
>+	tcm->tcm__pad1 = 0;
>+	tcm->tcm__pad2 = 0;
> 	tcm->tcm_ifindex = qdisc_dev(q)->ifindex;
> 	tcm->tcm_parent = q->handle;
> 	tcm->tcm_handle = q->handle;
>--
>To unsubscribe from this list: send the line "unsubscribe netdev" in
>the body of a message to majordomo@vger.kernel.org
>More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply

* Re: ipw2200: firmware DMA loading rework
From: Luis R. Rodriguez @ 2009-09-02 18:02 UTC (permalink / raw)
  To: Bartlomiej Zolnierkiewicz, Tso Ted, Aneesh Kumar K.V
  Cc: Zhu Yi, Andrew Morton, Mel Gorman, Johannes Weiner, Pekka Enberg,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Mel Gorman, netdev@vger.kernel.org, linux-mm@kvack.org,
	James Ketrenos, Chatre, Reinette, linux-wireless@vger.kernel.org,
	ipw2100-devel@lists.sourceforge.net
In-Reply-To: <200909021948.13262.bzolnier@gmail.com>

On Wed, Sep 2, 2009 at 10:48 AM, Bartlomiej
Zolnierkiewicz<bzolnier@gmail.com> wrote:
> On Sunday 30 August 2009 14:37:42 Bartlomiej Zolnierkiewicz wrote:
>> On Friday 28 August 2009 05:42:31 Zhu Yi wrote:
>> > Bartlomiej Zolnierkiewicz reported an atomic order-6 allocation failure
>> > for ipw2200 firmware loading in kernel 2.6.30. High order allocation is
>>
>> s/2.6.30/2.6.31-rc6/
>>
>> The issue has always been there but it was some recent change that
>> explicitly triggered the allocation failures (after 2.6.31-rc1).
>
> ipw2200 fix works fine but yesterday I got the following error while mounting
> ext4 filesystem (mb_history is optional so the mount succeeded):

OK so the mount succeeded.

> EXT4-fs (dm-2): barriers enabled
> kjournald2 starting: pid 3137, dev dm-2:8, commit interval 5 seconds
> EXT4-fs (dm-2): internal journal on dm-2:8
> EXT4-fs (dm-2): delayed allocation enabled
> EXT4-fs: file extents enabled
> mount: page allocation failure. order:5, mode:0xc0d0
> Pid: 3136, comm: mount Not tainted 2.6.31-rc8-00015-gadda766-dirty #78
> Call Trace:
>  [<c0394de3>] ? printk+0xf/0x14
>  [<c016a693>] __alloc_pages_nodemask+0x400/0x442
>  [<c016a71b>] __get_free_pages+0xf/0x32
>  [<c01865cf>] __kmalloc+0x28/0xfa
>  [<c023d96f>] ? __spin_lock_init+0x28/0x4d
>  [<c01f529d>] ext4_mb_init+0x392/0x460
>  [<c01e99d2>] ext4_fill_super+0x1b96/0x2012
>  [<c0239bc8>] ? snprintf+0x15/0x17
>  [<c01c0b26>] ? disk_name+0x24/0x69
>  [<c018ba63>] get_sb_bdev+0xda/0x117
>  [<c01e6711>] ext4_get_sb+0x13/0x15
>  [<c01e7e3c>] ? ext4_fill_super+0x0/0x2012
>  [<c018ad2d>] vfs_kern_mount+0x3b/0x76
>  [<c018adad>] do_kern_mount+0x33/0xbd
>  [<c019d0af>] do_mount+0x660/0x6b8
>  [<c016a71b>] ? __get_free_pages+0xf/0x32
>  [<c019d168>] sys_mount+0x61/0x99
>  [<c0102908>] sysenter_do_call+0x12/0x36
> Mem-Info:
> DMA per-cpu:
> CPU    0: hi:    0, btch:   1 usd:   0
> Normal per-cpu:
> CPU    0: hi:  186, btch:  31 usd:   0
> Active_anon:25471 active_file:22802 inactive_anon:25812
>  inactive_file:33619 unevictable:2 dirty:2452 writeback:135 unstable:0
>  free:4346 slab:4308 mapped:26038 pagetables:912 bounce:0
> DMA free:2060kB min:84kB low:104kB high:124kB active_anon:1660kB inactive_anon:1848kB active_file:144kB inactive_file:868kB unevictable:0kB present:15788kB pages_scanned:0 all_unreclaimable? no
> lowmem_reserve[]: 0 489 489
> Normal free:15324kB min:2788kB low:3484kB high:4180kB active_anon:100224kB inactive_anon:101400kB active_file:91064kB inactive_file:133608kB unevictable:8kB present:501392kB pages_scanned:0 all_unreclaimable? no
> lowmem_reserve[]: 0 0 0
> DMA: 1*4kB 1*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 2060kB
> Normal: 1283*4kB 648*8kB 159*16kB 53*32kB 10*64kB 1*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 15324kB
> 57947 total pagecache pages
> 878 pages in swap cache
> Swap cache stats: add 920, delete 42, find 11/11
> Free swap  = 1016436kB
> Total swap = 1020116kB
> 131056 pages RAM
> 4233 pages reserved
> 90573 pages shared
> 77286 pages non-shared
> EXT4-fs: mballoc enabled
> EXT4-fs (dm-2): mounted filesystem with ordered data mode
>
> Thus it seems like the original bug is still there and any ideas how to
> debug the problem further are appreciated..
>
> The complete dmesg and kernel config are here:
>
> http://www.kernel.org/pub/linux/kernel/people/bart/ext4-paf.dmesg
> http://www.kernel.org/pub/linux/kernel/people/bart/ext4-paf.config

This looks very similar to the kmemleak ext4 reports upon a mount. If
it is the same issue, which from the trace it seems it is, then this
is due to an extra kmalloc() allocation and this apparently will not
get fixed on 2.6.31 due to the closeness of the merge window and the
non-criticalness this issue has been deemed.

A patch fix is part of the ext4-patchqueue
http://repo.or.cz/w/ext4-patch-queue.git

  Luis

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply

* Re: [PATCH 1/3] netfilter: xt_ipvs (netfilter matcher for IPVS)
From: Patrick McHardy @ 2009-09-02 17:51 UTC (permalink / raw)
  To: Jan Engelhardt
  Cc: Hannes Eder, lvs-devel, linux-kernel, netdev, netfilter-devel,
	Fabien Duchêne, Jean-Luc Fortemaison, Julian Anastasov,
	Julius Volz, Laurent Grawet, Simon Horman, Wensong Zhang
In-Reply-To: <alpine.LSU.2.00.0909021748310.25091@fbirervta.pbzchgretzou.qr>

Jan Engelhardt wrote:
> On Wednesday 2009-09-02 17:36, Patrick McHardy wrote:
>>> Nice, I'll use par->family.
>>>
>>> So in theory I do not even need a check like the following in the beginning?
>>>
>>> 	if (family != NFPROTO_IPV4
>>> #ifdef CONFIG_IP_VS_IPV6
>>> 	    && family != NFPROTO_IPV6
>>> #endif
>>> 		) {
>>> 		match = false;
>>> 		goto out;
>>> 	}
>> With the AF_UNSPEC registration of your match, it might be used
> 
> par->family always contains the NFPROTO of the invoking implementation,
> which can never be UNSPEC (except, in future, xtables2 ;-)

I didn't say it will be UNSPEC, I said it might be something
different than IPV4/IPV6 unless that is checked *somewhere*.

^ permalink raw reply

* Re: [PATCH net-next-2.6] tc: report informations for multiqueue devices
From: Patrick McHardy @ 2009-09-02 17:50 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: Stephen Hemminger, David Miller, jarkao2, cl, netdev
In-Reply-To: <4A9EAC00.4030106@gmail.com>

Eric Dumazet wrote:
> Patrick McHardy a écrit :
>> Just to avoid duplicate work - I'm currently trying to put a patch
>> together that presents multiqueue qdiscs to userspace as regular
>> child qdiscs of a dummy qdisc, which also contains the aggregated
>> statistics. So far it actually looks pretty sane :)
> 
> There is one thing that bothers me with "tc -s -d qdisc", maybe
> its the right time to speak
> 
> qdisc pfifo_fast 0: dev eth0 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 54823 bytes 490 pkt (dropped 0, overlimits 0 requeues 0)
>  rate 0bit 0pps backlog 0b 0p requeues 0
> qdisc pfifo_fast 0: dev eth1 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 468 bytes 6 pkt (dropped 0, overlimits 0 requeues 0)
>  rate 0bit 0pps backlog 0b 0p requeues 0
> 
> 
> The rate estimation is given by kernel even if no rate estimation is performed.
> 
> User space doesnt have an indication saying this 0 numbers are real or fake
> (rate 0bit 0pps).
> 
> Could we set a bit at Qdisc level when a gen_new_estimator() is really
> done one a Qdisc, so that we do not call gnet_stats_copy_rate_est()
> from tc_fill_qdisc() if this bit is not set ?

That should be possible, I'll look into it once I've finished my patch.

^ permalink raw reply

* Re: [PATCH net-next-2.6] vlan: multiqueue vlan devices
From: Patrick McHardy @ 2009-09-02 17:49 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David Miller, Stephen Hemminger, jarkao2, netdev
In-Reply-To: <4A9EA8C6.2000501@gmail.com>

Eric Dumazet wrote:
> vlan devices are currently not multi-queue capable.
> 
> We can do that with a new rtnl_link_ops method,
> get_tx_queues(), called from rtnl_create_link()
> 
> This new method gets num_tx_queues/real_num_tx_queues
> from real device.
> 
> register_vlan_device() is also handled.

Looks great. This will also make it easier to test multiqueue
scheduling without a multiqueue-capable device :)

^ permalink raw reply

* Re: ipw2200: firmware DMA loading rework
From: Bartlomiej Zolnierkiewicz @ 2009-09-02 17:48 UTC (permalink / raw)
  To: Zhu Yi
  Cc: Andrew Morton, Mel Gorman, Johannes Weiner, Pekka Enberg,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Mel Gorman, netdev@vger.kernel.org, linux-mm@kvack.org,
	James Ketrenos, Chatre, Reinette, linux-wireless@vger.kernel.org,
	ipw2100-devel@lists.sourceforge.net
In-Reply-To: <200908301437.42133.bzolnier@gmail.com>

On Sunday 30 August 2009 14:37:42 Bartlomiej Zolnierkiewicz wrote:
> On Friday 28 August 2009 05:42:31 Zhu Yi wrote:
> > Bartlomiej Zolnierkiewicz reported an atomic order-6 allocation failure
> > for ipw2200 firmware loading in kernel 2.6.30. High order allocation is
> 
> s/2.6.30/2.6.31-rc6/
> 
> The issue has always been there but it was some recent change that
> explicitly triggered the allocation failures (after 2.6.31-rc1).

ipw2200 fix works fine but yesterday I got the following error while mounting
ext4 filesystem (mb_history is optional so the mount succeeded):

EXT4-fs (dm-2): barriers enabled
kjournald2 starting: pid 3137, dev dm-2:8, commit interval 5 seconds
EXT4-fs (dm-2): internal journal on dm-2:8
EXT4-fs (dm-2): delayed allocation enabled
EXT4-fs: file extents enabled
mount: page allocation failure. order:5, mode:0xc0d0
Pid: 3136, comm: mount Not tainted 2.6.31-rc8-00015-gadda766-dirty #78
Call Trace:
 [<c0394de3>] ? printk+0xf/0x14
 [<c016a693>] __alloc_pages_nodemask+0x400/0x442
 [<c016a71b>] __get_free_pages+0xf/0x32
 [<c01865cf>] __kmalloc+0x28/0xfa
 [<c023d96f>] ? __spin_lock_init+0x28/0x4d
 [<c01f529d>] ext4_mb_init+0x392/0x460
 [<c01e99d2>] ext4_fill_super+0x1b96/0x2012
 [<c0239bc8>] ? snprintf+0x15/0x17
 [<c01c0b26>] ? disk_name+0x24/0x69
 [<c018ba63>] get_sb_bdev+0xda/0x117
 [<c01e6711>] ext4_get_sb+0x13/0x15
 [<c01e7e3c>] ? ext4_fill_super+0x0/0x2012
 [<c018ad2d>] vfs_kern_mount+0x3b/0x76
 [<c018adad>] do_kern_mount+0x33/0xbd
 [<c019d0af>] do_mount+0x660/0x6b8
 [<c016a71b>] ? __get_free_pages+0xf/0x32
 [<c019d168>] sys_mount+0x61/0x99
 [<c0102908>] sysenter_do_call+0x12/0x36
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
Normal per-cpu:
CPU    0: hi:  186, btch:  31 usd:   0
Active_anon:25471 active_file:22802 inactive_anon:25812
 inactive_file:33619 unevictable:2 dirty:2452 writeback:135 unstable:0
 free:4346 slab:4308 mapped:26038 pagetables:912 bounce:0
DMA free:2060kB min:84kB low:104kB high:124kB active_anon:1660kB inactive_anon:1848kB active_file:144kB inactive_file:868kB unevictable:0kB present:15788kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 489 489
Normal free:15324kB min:2788kB low:3484kB high:4180kB active_anon:100224kB inactive_anon:101400kB active_file:91064kB inactive_file:133608kB unevictable:8kB present:501392kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 0
DMA: 1*4kB 1*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 2060kB
Normal: 1283*4kB 648*8kB 159*16kB 53*32kB 10*64kB 1*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 15324kB
57947 total pagecache pages
878 pages in swap cache
Swap cache stats: add 920, delete 42, find 11/11
Free swap  = 1016436kB
Total swap = 1020116kB
131056 pages RAM
4233 pages reserved
90573 pages shared
77286 pages non-shared
EXT4-fs: mballoc enabled
EXT4-fs (dm-2): mounted filesystem with ordered data mode

Thus it seems like the original bug is still there and any ideas how to
debug the problem further are appreciated..

The complete dmesg and kernel config are here:

http://www.kernel.org/pub/linux/kernel/people/bart/ext4-paf.dmesg
http://www.kernel.org/pub/linux/kernel/people/bart/ext4-paf.config

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply

* Re: [PATCH net-next-2.6] tc: report informations for multiqueue devices
From: Eric Dumazet @ 2009-09-02 17:31 UTC (permalink / raw)
  To: Patrick McHardy; +Cc: Stephen Hemminger, David Miller, jarkao2, cl, netdev
In-Reply-To: <4A9E9D8A.50406@trash.net>

Patrick McHardy a écrit :
> Stephen Hemminger wrote:
>> Even if the kernel keeps the value in something else for the API
>> QINDEX needs to be a fixed size unsigned type like u32.
> 
> Just to avoid duplicate work - I'm currently trying to put a patch
> together that presents multiqueue qdiscs to userspace as regular
> child qdiscs of a dummy qdisc, which also contains the aggregated
> statistics. So far it actually looks pretty sane :)

There is one thing that bothers me with "tc -s -d qdisc", maybe
its the right time to speak

qdisc pfifo_fast 0: dev eth0 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
 Sent 54823 bytes 490 pkt (dropped 0, overlimits 0 requeues 0)
 rate 0bit 0pps backlog 0b 0p requeues 0
qdisc pfifo_fast 0: dev eth1 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
 Sent 468 bytes 6 pkt (dropped 0, overlimits 0 requeues 0)
 rate 0bit 0pps backlog 0b 0p requeues 0


The rate estimation is given by kernel even if no rate estimation is performed.

User space doesnt have an indication saying this 0 numbers are real or fake
(rate 0bit 0pps).

Could we set a bit at Qdisc level when a gen_new_estimator() is really
done one a Qdisc, so that we do not call gnet_stats_copy_rate_est()
from tc_fill_qdisc() if this bit is not set ?

->

qdisc pfifo_fast 0: dev eth0 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
 Sent 54823 bytes 490 pkt (dropped 0, overlimits 0 requeues 0)
 backlog 0b 0p requeues 0
qdisc pfifo_fast 0: dev eth1 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
 Sent 468 bytes 6 pkt (dropped 0, overlimits 0 requeues 0)
 backlog 0b 0p requeues 0


^ permalink raw reply

* Re: Subject: [PATCH 3/9] bna: Brocade 10Gb Ethernet device driver
From: Ajit Khaparde @ 2009-09-02 17:29 UTC (permalink / raw)
  To: Rasesh Mody; +Cc: netdev, amathur
In-Reply-To: <200908290518.n7T5IdmQ031881@blc-10-10.brocade.com>

checkpatch.pl also fails on the patches - I picked just a couple of them though.

-Ajit

On 28/08/09 22:18 -0700, Rasesh Mody wrote:
> From: Rasesh Mody <rmody@brocade.com>
> 
> This is patch 3/9 which contains linux driver source for 
> Brocade's BR1010/BR1020 10Gb CEE capable ethernet adapter.
> 
> We wish this patch to be considered for inclusion in 2.6.30 
> 
> Signed-off-by: Rasesh Mody <rmody@brocade.com>
> ---
> 
> 
> diff -ruP linux-2.6.30.5-orig/drivers/net/bna/bfa_ioc.c linux-2.6.30.5-mod/drivers/net/bna/bfa_ioc.c
> --- linux-2.6.30.5-orig/drivers/net/bna/bfa_ioc.c	1969-12-31 16:00:00.000000000 -0800
> +++ linux-2.6.30.5-mod/drivers/net/bna/bfa_ioc.c	2009-08-28 21:09:22.769890000 -0700
> @@ -0,0 +1,2384 @@
> +/*
> + * Copyright (c) 2005-2009 Brocade Communications Systems, Inc.
> + * All rights reserved
> + * www.brocade.com
> + *
> + * Linux network driver for Brocade Converged Network Adapter.
> + *
> + * This program is free software; you can redistribute it and/or modify it
> + * under the terms of the GNU General Public License (GPL) Version 2 as
> + * published by the Free Software Foundation
> + *
> + * This program is distributed in the hope that it will be useful, but
> + * WITHOUT ANY WARRANTY; without even the implied warranty of
> + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> + * General Public License for more details.
> + */
> +/*
> + * Copyright (c) 2005-2009 Brocade Communications Systems, Inc.
> + * All rights reserved
> + * www.brocade.com
> + *
> + * See LICENSE.bna for copyright and licensing details.
> + */
> +
> +#include <bfa.h>
> +#include <bfa_ioc.h>
> +#include <bfa_fwimg_priv.h>
> +#include <bfa_trcmod_priv.h>
> +#include <cs/bfa_debug.h>
> +#include <bfi/bfi_ioc.h>
> +#include <bfi/bfi_ctreg.h>
> +#include <aen/bfa_aen_ioc.h>
> +#include <aen/bfa_aen.h>
> +#include <log/bfa_log_hal.h>
> +#include <defs/bfa_defs_pci.h>
> +
> +BFA_TRC_FILE(HAL, IOC);
> +
> +/**
> + * IOC local definitions
> + */
> +#define BFA_IOC_TOV		2000	/* msecs */
> +#define BFA_IOC_HB_TOV		1000	/* msecs */
> +#define BFA_IOC_HB_FAIL_MAX	4
> +#define BFA_IOC_HWINIT_MAX	2
> +#define BFA_IOC_FWIMG_MINSZ     (16 * 1024)
> +#define BFA_IOC_TOV_RECOVER	(BFA_IOC_HB_FAIL_MAX * BFA_IOC_HB_TOV \
> +				+ BFA_IOC_TOV)
> +
> +#define bfa_ioc_timer_start(__ioc)					\
> +	bfa_timer_begin((__ioc)->timer_mod, &(__ioc)->ioc_timer,	\
> +			bfa_ioc_timeout, (__ioc), BFA_IOC_TOV)
> +#define bfa_ioc_timer_stop(__ioc)   bfa_timer_stop(&(__ioc)->ioc_timer)
> +
> +#define BFA_DBG_FWTRC_ENTS	(BFI_IOC_TRC_ENTS)
> +#define BFA_DBG_FWTRC_LEN					\
> +	(BFA_DBG_FWTRC_ENTS * sizeof(struct bfa_trc_s) +	\
> +	 (sizeof(struct bfa_trc_mod_s) -			\
> +	  BFA_TRC_MAX * sizeof(struct bfa_trc_s)))
> +#define BFA_DBG_FWTRC_OFF(_fn)	(BFI_IOC_TRC_OFF + BFA_DBG_FWTRC_LEN * (_fn))
> +#define bfa_ioc_stats(_ioc, _stats)	(_ioc)->stats._stats ++
> +
> +#define BFA_FLASH_CHUNK_NO(off)         (off / BFI_FLASH_CHUNK_SZ_WORDS)
> +#define BFA_FLASH_OFFSET_IN_CHUNK(off)  (off % BFI_FLASH_CHUNK_SZ_WORDS)
> +#define BFA_FLASH_CHUNK_ADDR(chunkno)   (chunkno * BFI_FLASH_CHUNK_SZ_WORDS)
> +bfa_boolean_t bfa_auto_recover = BFA_FALSE;
> +
> +/*
> + * forward declarations
> + */
> +static void bfa_ioc_aen_post(struct bfa_ioc_s *bfa,
> +			     enum bfa_ioc_aen_event event);
> +static void bfa_ioc_hw_sem_get(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_hw_sem_release(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_hw_sem_get_cancel(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_hwinit(struct bfa_ioc_s *ioc, bfa_boolean_t force);
> +static void bfa_ioc_timeout(void *ioc);
> +static void bfa_ioc_send_enable(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_send_disable(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_send_getattr(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_hb_monitor(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_hb_stop(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_reset(struct bfa_ioc_s *ioc, bfa_boolean_t force);
> +static void bfa_ioc_mbox_poll(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_mbox_hbfail(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_recover(struct bfa_ioc_s *ioc);
> +static bfa_boolean_t bfa_ioc_firmware_lock(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_firmware_unlock(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_disable_comp(struct bfa_ioc_s *ioc);
> +static void bfa_ioc_lpu_stop(struct bfa_ioc_s *ioc);
> +
> +/**
> + *  bfa_ioc_sm
> + */
> +
> +/**
> + * IOC state machine events
> + */
> +enum ioc_event {
> +	IOC_E_ENABLE = 1,	/*  IOC enable request */
> +	IOC_E_DISABLE = 2,	/*  IOC disable request */
> +	IOC_E_TIMEOUT = 3,	/*  f/w response timeout */
> +	IOC_E_FWREADY = 4,	/*  f/w initialization done */
> +	IOC_E_FWRSP_GETATTR = 5,	/*  IOC get attribute response */
> +	IOC_E_FWRSP_ENABLE = 6,	/*  enable f/w response */
> +	IOC_E_FWRSP_DISABLE = 7,	/*  disable f/w response */
> +	IOC_E_HBFAIL = 8,	/*  heartbeat failure */
> +	IOC_E_HWERROR = 9,	/*  hardware error interrupt */
> +	IOC_E_SEMLOCKED = 10,	/*  h/w semaphore is locked */
> +	IOC_E_DETACH = 11,	/*  driver detach cleanup */
> +};
> +
> +bfa_fsm_state_decl(bfa_ioc, reset, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, fwcheck, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, mismatch, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, semwait, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, hwinit, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, enabling, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, getattr, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, op, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, initfail, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, hbfail, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, disabling, struct bfa_ioc_s, enum ioc_event);
> +bfa_fsm_state_decl(bfa_ioc, disabled, struct bfa_ioc_s, enum ioc_event);
> +
> +static struct bfa_sm_table_s ioc_sm_table[] = {
> +	{BFA_SM(bfa_ioc_sm_reset), BFA_IOC_RESET},
> +	{BFA_SM(bfa_ioc_sm_fwcheck), BFA_IOC_FWMISMATCH},
> +	{BFA_SM(bfa_ioc_sm_mismatch), BFA_IOC_FWMISMATCH},
> +	{BFA_SM(bfa_ioc_sm_semwait), BFA_IOC_SEMWAIT},
> +	{BFA_SM(bfa_ioc_sm_hwinit), BFA_IOC_HWINIT},
> +	{BFA_SM(bfa_ioc_sm_enabling), BFA_IOC_HWINIT},
> +	{BFA_SM(bfa_ioc_sm_getattr), BFA_IOC_GETATTR},
> +	{BFA_SM(bfa_ioc_sm_op), BFA_IOC_OPERATIONAL},
> +	{BFA_SM(bfa_ioc_sm_initfail), BFA_IOC_INITFAIL},
> +	{BFA_SM(bfa_ioc_sm_hbfail), BFA_IOC_HBFAIL},
> +	{BFA_SM(bfa_ioc_sm_disabling), BFA_IOC_DISABLING},
> +	{BFA_SM(bfa_ioc_sm_disabled), BFA_IOC_DISABLED},
> +};
> +
> +/**
> + * Reset entry actions -- initialize state machine
> + */
> +static void
> +bfa_ioc_sm_reset_entry(struct bfa_ioc_s *ioc)
> +{
> +	ioc->retry_count = 0;
> +	ioc->auto_recover = bfa_auto_recover;
> +}
> +
> +/**
> + * Beginning state. IOC is in reset state.
> + */
> +static void
> +bfa_ioc_sm_reset(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_ENABLE:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_fwcheck);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_disable_comp(ioc);
> +		break;
> +
> +	case IOC_E_DETACH:
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +/**
> + * Semaphore should be acquired for version check.
> + */
> +static void
> +bfa_ioc_sm_fwcheck_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_hw_sem_get(ioc);
> +}
> +
> +/**
> + * Awaiting h/w semaphore to continue with version check.
> + */
> +static void
> +bfa_ioc_sm_fwcheck(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_SEMLOCKED:
> +		if (bfa_ioc_firmware_lock(ioc)) {
> +			ioc->retry_count = 0;
> +			bfa_fsm_set_state(ioc, bfa_ioc_sm_hwinit);
> +		} else {
> +			bfa_ioc_hw_sem_release(ioc);
> +			bfa_fsm_set_state(ioc, bfa_ioc_sm_mismatch);
> +		}
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_disable_comp(ioc);
> +		/*
> +		 * fall through
> +		 */
> +
> +	case IOC_E_DETACH:
> +		bfa_ioc_hw_sem_get_cancel(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_reset);
> +		break;
> +
> +	case IOC_E_FWREADY:
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +/**
> + * Notify enable completion callback and generate mismatch AEN.
> + */
> +static void
> +bfa_ioc_sm_mismatch_entry(struct bfa_ioc_s *ioc)
> +{
> +	/**
> +	 * Provide enable completion callback and AEN notification only once.
> +	 */
> +	if (ioc->retry_count == 0) {
> +		ioc->cbfn->enable_cbfn(ioc->bfa, BFA_STATUS_IOC_FAILURE);
> +		bfa_ioc_aen_post(ioc, BFA_IOC_AEN_FWMISMATCH);
> +	}
> +	ioc->retry_count++;
> +	bfa_ioc_timer_start(ioc);
> +}
> +
> +/**
> + * Awaiting firmware version match.
> + */
> +static void
> +bfa_ioc_sm_mismatch(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_TIMEOUT:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_fwcheck);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_disable_comp(ioc);
> +		/*
> +		 * fall through
> +		 */
> +
> +	case IOC_E_DETACH:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_reset);
> +		break;
> +
> +	case IOC_E_FWREADY:
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +/**
> + * Request for semaphore.
> + */
> +static void
> +bfa_ioc_sm_semwait_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_hw_sem_get(ioc);
> +}
> +
> +/**
> + * Awaiting semaphore for h/w initialzation.
> + */
> +static void
> +bfa_ioc_sm_semwait(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_SEMLOCKED:
> +		ioc->retry_count = 0;
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_hwinit);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_hw_sem_get_cancel(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_hwinit_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_timer_start(ioc);
> +	bfa_ioc_reset(ioc, BFA_FALSE);
> +}
> +
> +/**
> + * Hardware is being initialized. Interrupts are enabled.
> + * Holding hardware semaphore lock.
> + */
> +static void
> +bfa_ioc_sm_hwinit(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_FWREADY:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_enabling);
> +		break;
> +
> +	case IOC_E_HWERROR:
> +		bfa_ioc_timer_stop(ioc);
> +		/*
> +		 * fall through
> +		 */
> +
> +	case IOC_E_TIMEOUT:
> +		ioc->retry_count++;
> +		if (ioc->retry_count < BFA_IOC_HWINIT_MAX) {
> +			bfa_ioc_timer_start(ioc);
> +			bfa_ioc_reset(ioc, BFA_TRUE);
> +			break;
> +		}
> +
> +		bfa_ioc_hw_sem_release(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_initfail);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_hw_sem_release(ioc);
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_enabling_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_timer_start(ioc);
> +	bfa_ioc_send_enable(ioc);
> +}
> +
> +/**
> + * Host IOC function is being enabled, awaiting response from firmware.
> + * Semaphore is acquired.
> + */
> +static void
> +bfa_ioc_sm_enabling(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_FWRSP_ENABLE:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_ioc_hw_sem_release(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_getattr);
> +		break;
> +
> +	case IOC_E_HWERROR:
> +		bfa_ioc_timer_stop(ioc);
> +		/*
> +		 * fall through
> +		 */
> +
> +	case IOC_E_TIMEOUT:
> +		ioc->retry_count++;
> +		if (ioc->retry_count < BFA_IOC_HWINIT_MAX) {
> +			bfa_reg_write(ioc->ioc_regs.ioc_fwstate,
> +				      BFI_IOC_UNINIT);
> +			bfa_fsm_set_state(ioc, bfa_ioc_sm_hwinit);
> +			break;
> +		}
> +
> +		bfa_ioc_hw_sem_release(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_initfail);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_ioc_hw_sem_release(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	case IOC_E_FWREADY:
> +		bfa_ioc_send_enable(ioc);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_getattr_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_timer_start(ioc);
> +	bfa_ioc_send_getattr(ioc);
> +}
> +
> +/**
> + * IOC configuration in progress. Timer is active.
> + */
> +static void
> +bfa_ioc_sm_getattr(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_FWRSP_GETATTR:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_op);
> +		break;
> +
> +	case IOC_E_HWERROR:
> +		bfa_ioc_timer_stop(ioc);
> +		/*
> +		 * fall through
> +		 */
> +
> +	case IOC_E_TIMEOUT:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_initfail);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_op_entry(struct bfa_ioc_s *ioc)
> +{
> +	ioc->cbfn->enable_cbfn(ioc->bfa, BFA_STATUS_OK);
> +	bfa_ioc_hb_monitor(ioc);
> +	bfa_ioc_aen_post(ioc, BFA_IOC_AEN_ENABLE);
> +}
> +
> +static void
> +bfa_ioc_sm_op(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_ENABLE:
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		bfa_ioc_hb_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabling);
> +		break;
> +
> +	case IOC_E_HWERROR:
> +	case IOC_E_FWREADY:
> +		/**
> +		 * Hard error or IOC recovery by other function.
> +		 * Treat it same as heartbeat failure.
> +		 */
> +		bfa_ioc_hb_stop(ioc);
> +		/*
> +		 * !!! fall through !!!
> +		 */
> +
> +	case IOC_E_HBFAIL:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_hbfail);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_disabling_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_aen_post(ioc, BFA_IOC_AEN_DISABLE);
> +	bfa_ioc_timer_start(ioc);
> +	bfa_ioc_send_disable(ioc);
> +}
> +
> +/**
> + * IOC is being disabled
> + */
> +static void
> +bfa_ioc_sm_disabling(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_HWERROR:
> +	case IOC_E_FWRSP_DISABLE:
> +		bfa_ioc_timer_stop(ioc);
> +		/*
> +		 * !!! fall through !!!
> +		 */
> +
> +	case IOC_E_TIMEOUT:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +/**
> + * IOC disable completion entry.
> + */
> +static void
> +bfa_ioc_sm_disabled_entry(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_disable_comp(ioc);
> +}
> +
> +static void
> +bfa_ioc_sm_disabled(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_ENABLE:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_semwait);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		ioc->cbfn->disable_cbfn(ioc->bfa);
> +		break;
> +
> +	case IOC_E_FWREADY:
> +		break;
> +
> +	case IOC_E_DETACH:
> +		bfa_ioc_firmware_unlock(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_reset);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_initfail_entry(struct bfa_ioc_s *ioc)
> +{
> +	ioc->cbfn->enable_cbfn(ioc->bfa, BFA_STATUS_IOC_FAILURE);
> +	bfa_ioc_timer_start(ioc);
> +}
> +
> +/**
> + * Hardware initialization failed.
> + */
> +static void
> +bfa_ioc_sm_initfail(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +	case IOC_E_DISABLE:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	case IOC_E_DETACH:
> +		bfa_ioc_timer_stop(ioc);
> +		bfa_ioc_firmware_unlock(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_reset);
> +		break;
> +
> +	case IOC_E_TIMEOUT:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_semwait);
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +static void
> +bfa_ioc_sm_hbfail_entry(struct bfa_ioc_s *ioc)
> +{
> +	struct list_head *qe;
> +	struct bfa_ioc_hbfail_notify_s *notify;
> +
> +	/**
> +	 * Mark IOC as failed in hardware and stop firmware.
> +	 */
> +	bfa_ioc_lpu_stop(ioc);
> +	bfa_reg_write(ioc->ioc_regs.ioc_fwstate, BFI_IOC_HBFAIL);
> +
> +	if (ioc->pcidev.device_id == BFA_PCI_DEVICE_ID_CT) {
> +		bfa_reg_write(ioc->ioc_regs.ll_halt, __FW_INIT_HALT_P);
> +		/*
> +		 * Wait for halt to take effect
> +		 */
> +		bfa_reg_read(ioc->ioc_regs.ll_halt);
> +	}
> +
> +	/**
> +	 * Notify driver and common modules registered for notification.
> +	 */
> +	ioc->cbfn->hbfail_cbfn(ioc->bfa);
> +	list_for_each(qe, &ioc->hb_notify_q) {
> +		notify = (struct bfa_ioc_hbfail_notify_s *) qe;
> +		notify->cbfn(notify->cbarg);
> +	}
> +
> +	/**
> +	 * Flush any queued up mailbox requests.
> +	 */
> +	bfa_ioc_mbox_hbfail(ioc);
> +	bfa_ioc_aen_post(ioc, BFA_IOC_AEN_HBFAIL);
> +
> +	/**
> +	 * Trigger auto-recovery after a delay.
> +	 */
> +	if (ioc->auto_recover) {
> +		bfa_timer_begin(ioc->timer_mod, &ioc->ioc_timer,
> +				bfa_ioc_timeout, ioc, BFA_IOC_TOV_RECOVER);
> +	}
> +}
> +
> +/**
> + * IOC heartbeat failure.
> + */
> +static void
> +bfa_ioc_sm_hbfail(struct bfa_ioc_s *ioc, enum ioc_event event)
> +{
> +	bfa_trc(ioc, event);
> +
> +	switch (event) {
> +
> +	case IOC_E_ENABLE:
> +		ioc->cbfn->enable_cbfn(ioc->bfa, BFA_STATUS_IOC_FAILURE);
> +		break;
> +
> +	case IOC_E_DISABLE:
> +		if (ioc->auto_recover)
> +			bfa_ioc_timer_stop(ioc);
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_disabled);
> +		break;
> +
> +	case IOC_E_TIMEOUT:
> +		bfa_fsm_set_state(ioc, bfa_ioc_sm_semwait);
> +		break;
> +
> +	case IOC_E_FWREADY:
> +		/**
> +		 * Recovery is already initiated by other function.
> +		 */
> +		break;
> +
> +	default:
> +		bfa_sm_fault(ioc, event);
> +	}
> +}
> +
> +
> +
> +/**
> + *  bfa_ioc_pvt BFA IOC private functions
> + */
> +
> +static void
> +bfa_ioc_disable_comp(struct bfa_ioc_s *ioc)
> +{
> +	struct list_head *qe;
> +	struct bfa_ioc_hbfail_notify_s *notify;
> +
> +	ioc->cbfn->disable_cbfn(ioc->bfa);
> +
> +	/**
> +	 * Notify common modules registered for notification.
> +	 */
> +	list_for_each(qe, &ioc->hb_notify_q) {
> +		notify = (struct bfa_ioc_hbfail_notify_s *) qe;
> +		notify->cbfn(notify->cbarg);
> +	}
> +}
> +
> +static void
> +bfa_ioc_sem_timeout(void *ioc_arg)
> +{
> +	struct bfa_ioc_s *ioc = (struct bfa_ioc_s *) ioc_arg;
> +
> +	bfa_ioc_hw_sem_get(ioc);
> +}
> +
> +static void
> +bfa_ioc_usage_sem_get(struct bfa_ioc_s *ioc)
> +{
> +	u32 r32;
> +	int cnt = 0;
> +#define BFA_SEM_SPINCNT	1000
> +
> +	do {
> +		r32 = bfa_reg_read(ioc->ioc_regs.ioc_usage_sem_reg);
> +		cnt++;
> +		if (cnt > BFA_SEM_SPINCNT)
> +			break;
> +	} while (r32 != 0);
> +	bfa_assert(cnt < BFA_SEM_SPINCNT);
> +}
> +
> +static void
> +bfa_ioc_usage_sem_release(struct bfa_ioc_s *ioc)
> +{
> +	bfa_reg_write(ioc->ioc_regs.ioc_usage_sem_reg, 1);
> +}
> +
> +static void
> +bfa_ioc_hw_sem_get(struct bfa_ioc_s *ioc)
> +{
> +	u32 r32;
> +
> +	/**
> +	 * First read to the semaphore register will return 0, subsequent reads
> +	 * will return 1. Semaphore is released by writing 0 to the register
> +	 */
> +	r32 = bfa_reg_read(ioc->ioc_regs.ioc_sem_reg);
> +	if (r32 == 0) {
> +		bfa_fsm_send_event(ioc, IOC_E_SEMLOCKED);
> +		return;
> +	}
> +
> +	bfa_timer_begin(ioc->timer_mod, &ioc->sem_timer, bfa_ioc_sem_timeout,
> +			ioc, BFA_IOC_TOV);
> +}
> +
> +static void
> +bfa_ioc_hw_sem_release(struct bfa_ioc_s *ioc)
> +{
> +	bfa_reg_write(ioc->ioc_regs.ioc_sem_reg, 1);
> +}
> +
> +static void
> +bfa_ioc_hw_sem_get_cancel(struct bfa_ioc_s *ioc)
> +{
> +	bfa_timer_stop(&ioc->sem_timer);
> +}
> +
> +/**
> + * Initialize LPU local memory (aka secondary memory / SRAM)
> + */
> +static void
> +bfa_ioc_lmem_init(struct bfa_ioc_s *ioc)
> +{
> +	u32 pss_ctl;
> +	int i;
> +#define PSS_LMEM_INIT_TIME  10000
> +
> +	pss_ctl = bfa_reg_read(ioc->ioc_regs.pss_ctl_reg);
> +	pss_ctl &= ~__PSS_LMEM_RESET;
> +	pss_ctl |= __PSS_LMEM_INIT_EN;
> +	pss_ctl |= __PSS_I2C_CLK_DIV(3UL);	/* i2c workaround 12.5khz clock */
> +	bfa_reg_write(ioc->ioc_regs.pss_ctl_reg, pss_ctl);
> +
> +	/**
> +	 * wait for memory initialization to be complete
> +	 */
> +	i = 0;
> +	do {
> +		pss_ctl = bfa_reg_read(ioc->ioc_regs.pss_ctl_reg);
> +		i++;
> +	} while (!(pss_ctl & __PSS_LMEM_INIT_DONE) && (i < PSS_LMEM_INIT_TIME));
> +
> +	/**
> +	 * If memory initialization is not successful, IOC timeout will catch
> +	 * such failures.
> +	 */
> +	bfa_assert(pss_ctl & __PSS_LMEM_INIT_DONE);
> +	bfa_trc(ioc, pss_ctl);
> +
> +	pss_ctl &= ~(__PSS_LMEM_INIT_DONE | __PSS_LMEM_INIT_EN);
> +	bfa_reg_write(ioc->ioc_regs.pss_ctl_reg, pss_ctl);
> +}
> +
> +static void
> +bfa_ioc_lpu_start(struct bfa_ioc_s *ioc)
> +{
> +	u32 pss_ctl;
> +
> +	/**
> +	 * Take processor out of reset.
> +	 */
> +	pss_ctl = bfa_reg_read(ioc->ioc_regs.pss_ctl_reg);
> +	pss_ctl &= ~__PSS_LPU0_RESET;
> +
> +	bfa_reg_write(ioc->ioc_regs.pss_ctl_reg, pss_ctl);
> +}
> +
> +static void
> +bfa_ioc_lpu_stop(struct bfa_ioc_s *ioc)
> +{
> +	u32 pss_ctl;
> +
> +	/**
> +	 * Put processors in reset.
> +	 */
> +	pss_ctl = bfa_reg_read(ioc->ioc_regs.pss_ctl_reg);
> +	pss_ctl |= (__PSS_LPU0_RESET | __PSS_LPU1_RESET);
> +
> +	bfa_reg_write(ioc->ioc_regs.pss_ctl_reg, pss_ctl);
> +}
> +
> +/**
> + * Get driver and firmware versions.
> + */
> +static void
> +bfa_ioc_fwver_get(struct bfa_ioc_s *ioc, struct bfi_ioc_image_hdr_s *fwhdr)
> +{
> +	u32 pgnum, pgoff;
> +	u32 loff = 0;
> +	int i;
> +	u32 *fwsig = (u32 *) fwhdr;
> +
> +	pgnum = bfa_ioc_smem_pgnum(ioc, loff);
> +	pgoff = bfa_ioc_smem_pgoff(ioc, loff);
> +	bfa_reg_write(ioc->ioc_regs.host_page_num_fn, pgnum);
> +
> +	for (i = 0; i < (sizeof(struct bfi_ioc_image_hdr_s) / sizeof(u32)); i++) {
> +		fwsig[i] = bfa_mem_read(ioc->ioc_regs.smem_page_start, loff);
> +		loff += sizeof(u32);
> +	}
> +}
> +
> +static u32 *
> +bfa_ioc_fwimg_get_chunk(struct bfa_ioc_s *ioc, u32 off)
> +{
> +	if (ioc->ctdev)
> +		return bfi_image_ct_get_chunk(off);
> +	return bfi_image_cb_get_chunk(off);
> +}
> +
> +static u32
> +bfa_ioc_fwimg_get_size(struct bfa_ioc_s *ioc)
> +{
> +	return (ioc->ctdev) ? bfi_image_ct_size : bfi_image_cb_size;
> +}
> +
> +/**
> + * Returns TRUE if same.
> + */
> +static bfa_boolean_t
> +bfa_ioc_fwver_cmp(struct bfa_ioc_s *ioc, struct bfi_ioc_image_hdr_s *fwhdr)
> +{
> +	struct bfi_ioc_image_hdr_s *drv_fwhdr;
> +	int i;
> +
> +	drv_fwhdr =
> +		(struct bfi_ioc_image_hdr_s *) bfa_ioc_fwimg_get_chunk(ioc, 0);
> +
> +	for (i = 0; i < BFI_IOC_MD5SUM_SZ; i++) {
> +		if (fwhdr->md5sum[i] != drv_fwhdr->md5sum[i]) {
> +			bfa_trc(ioc, i);
> +			bfa_trc(ioc, fwhdr->md5sum[i]);
> +			bfa_trc(ioc, drv_fwhdr->md5sum[i]);
> +			return BFA_FALSE;
> +		}
> +	}
> +
> +	bfa_trc(ioc, fwhdr->md5sum[0]);
> +	return BFA_TRUE;
> +}
> +
> +/**
> + * Return true if current running version is valid. Firmware signature and
> + * execution context (driver/bios) must match.
> + */
> +static bfa_boolean_t
> +bfa_ioc_fwver_valid(struct bfa_ioc_s *ioc)
> +{
> +	struct bfi_ioc_image_hdr_s fwhdr, *drv_fwhdr;
> +
> +	/**
> +	 * If bios/efi boot (flash based) -- return true
> +	 */
> +	if (bfa_ioc_fwimg_get_size(ioc) < BFA_IOC_FWIMG_MINSZ)
> +		return BFA_TRUE;
> +
> +	bfa_ioc_fwver_get(ioc, &fwhdr);
> +	drv_fwhdr =
> +		(struct bfi_ioc_image_hdr_s *) bfa_ioc_fwimg_get_chunk(ioc, 0);
> +
> +	if (fwhdr.signature != drv_fwhdr->signature) {
> +		bfa_trc(ioc, fwhdr.signature);
> +		bfa_trc(ioc, drv_fwhdr->signature);
> +		return BFA_FALSE;
> +	}
> +
> +	if (fwhdr.exec != drv_fwhdr->exec) {
> +		bfa_trc(ioc, fwhdr.exec);
> +		bfa_trc(ioc, drv_fwhdr->exec);
> +		return BFA_FALSE;
> +	}
> +
> +	return bfa_ioc_fwver_cmp(ioc, &fwhdr);
> +}
> +
> +/**
> + * Return true if firmware of current driver matches the running firmware.
> + */
> +static bfa_boolean_t
> +bfa_ioc_firmware_lock(struct bfa_ioc_s *ioc)
> +{
> +	enum bfi_ioc_state ioc_fwstate;
> +	u32 usecnt;
> +	struct bfi_ioc_image_hdr_s fwhdr;
> +
> +	/**
> +	 * Firmware match check is relevant only for CNA.
> +	 */
> +	if (!ioc->cna)
> +		return BFA_TRUE;
> +
> +	/**
> +	 * If bios boot (flash based) -- do not increment usage count
> +	 */
> +	if (bfa_ioc_fwimg_get_size(ioc) < BFA_IOC_FWIMG_MINSZ)
> +		return BFA_TRUE;
> +
> +	bfa_ioc_usage_sem_get(ioc);
> +	usecnt = bfa_reg_read(ioc->ioc_regs.ioc_usage_reg);
> +
> +	/**
> +	 * If usage count is 0, always return TRUE.
> +	 */
> +	if (usecnt == 0) {
> +		bfa_reg_write(ioc->ioc_regs.ioc_usage_reg, 1);
> +		bfa_ioc_usage_sem_release(ioc);
> +		bfa_trc(ioc, usecnt);
> +		return BFA_TRUE;
> +	}
> +
> +	ioc_fwstate = bfa_reg_read(ioc->ioc_regs.ioc_fwstate);
> +	bfa_trc(ioc, ioc_fwstate);
> +
> +	/**
> +	 * Use count cannot be non-zero and chip in uninitialized state.
> +	 */
> +	bfa_assert(ioc_fwstate != BFI_IOC_UNINIT);
> +
> +	/**
> +	 * Check if another driver with a different firmware is active
> +	 */
> +	bfa_ioc_fwver_get(ioc, &fwhdr);
> +	if (!bfa_ioc_fwver_cmp(ioc, &fwhdr)) {
> +		bfa_ioc_usage_sem_release(ioc);
> +		bfa_trc(ioc, usecnt);
> +		return BFA_FALSE;
> +	}
> +
> +	/**
> +	 * Same firmware version. Increment the reference count.
> +	 */
> +	usecnt++;
> +	bfa_reg_write(ioc->ioc_regs.ioc_usage_reg, usecnt);
> +	bfa_ioc_usage_sem_release(ioc);
> +	bfa_trc(ioc, usecnt);
> +	return BFA_TRUE;
> +}
> +
> +static void
> +bfa_ioc_firmware_unlock(struct bfa_ioc_s *ioc)
> +{
> +	u32 usecnt;
> +
> +	/**
> +	 * Firmware lock is relevant only for CNA.
> +	 * If bios boot (flash based) -- do not decrement usage count
> +	 */
> +	if (!ioc->cna || (bfa_ioc_fwimg_get_size(ioc) < BFA_IOC_FWIMG_MINSZ))
> +		return;
> +
> +	/**
> +	 * decrement usage count
> +	 */
> +	bfa_ioc_usage_sem_get(ioc);
> +	usecnt = bfa_reg_read(ioc->ioc_regs.ioc_usage_reg);
> +	bfa_assert(usecnt > 0);
> +
> +	usecnt--;
> +	bfa_reg_write(ioc->ioc_regs.ioc_usage_reg, usecnt);
> +	bfa_trc(ioc, usecnt);
> +
> +	bfa_ioc_usage_sem_release(ioc);
> +}
> +
> +/**
> + * Conditionally flush any pending message from firmware at start.
> + */
> +static void
> +bfa_ioc_msgflush(struct bfa_ioc_s *ioc)
> +{
> +	u32 r32;
> +
> +	r32 = bfa_reg_read(ioc->ioc_regs.lpu_mbox_cmd);
> +	if (r32)
> +		bfa_reg_write(ioc->ioc_regs.lpu_mbox_cmd, 1);
> +}
> +
> +
> +static void
> +bfa_ioc_hwinit(struct bfa_ioc_s *ioc, bfa_boolean_t force)
> +{
> +	enum bfi_ioc_state ioc_fwstate;
> +	bfa_boolean_t fwvalid;
> +
> +	ioc_fwstate = bfa_reg_read(ioc->ioc_regs.ioc_fwstate);
> +
> +	if (force)
> +		ioc_fwstate = BFI_IOC_UNINIT;
> +
> +	bfa_trc(ioc, ioc_fwstate);
> +
> +	/**
> +	 * check if firmware is valid
> +	 */
> +	fwvalid = (ioc_fwstate == BFI_IOC_UNINIT) ?
> +		BFA_FALSE : bfa_ioc_fwver_valid(ioc);
> +
> +	if (!fwvalid) {
> +		bfa_ioc_boot(ioc, BFI_BOOT_TYPE_NORMAL, ioc->pcidev.device_id);
> +		return;
> +	}
> +
> +	/**
> +	 * If hardware initialization is in progress (initialized by other IOC),
> +	 * just wait for an initialization completion interrupt.
> +	 */
> +	if (ioc_fwstate == BFI_IOC_INITING) {
> +		bfa_trc(ioc, ioc_fwstate);
> +		ioc->cbfn->reset_cbfn(ioc->bfa);
> +		return;
> +	}
> +
> +	/**
> +	 * If IOC function is disabled and firmware version is same,
> +	 * just re-enable IOC.
> +	 */
> +	if (ioc_fwstate == BFI_IOC_DISABLED || ioc_fwstate == BFI_IOC_OP) {
> +		bfa_trc(ioc, ioc_fwstate);
> +
> +		/**
> +		 * When using MSI-X any pending firmware ready event should
> +		 * be flushed. Otherwise MSI-X interrupts are not delivered.
> +		 */
> +		bfa_ioc_msgflush(ioc);
> +		ioc->cbfn->reset_cbfn(ioc->bfa);
> +		bfa_fsm_send_event(ioc, IOC_E_FWREADY);
> +		return;
> +	}
> +
> +	/**
> +	 * Initialize the h/w for any other states.
> +	 */
> +	bfa_ioc_boot(ioc, BFI_BOOT_TYPE_NORMAL, ioc->pcidev.device_id);
> +}
> +
> +static void
> +bfa_ioc_timeout(void *ioc_arg)
> +{
> +	struct bfa_ioc_s *ioc = (struct bfa_ioc_s *) ioc_arg;
> +
> +	bfa_trc(ioc, 0);
> +	bfa_fsm_send_event(ioc, IOC_E_TIMEOUT);
> +}
> +
> +void
> +bfa_ioc_mbox_send(struct bfa_ioc_s *ioc, void *ioc_msg, int len)
> +{
> +	u32 *msgp = (u32 *) ioc_msg;
> +	u32 i;
> +
> +	bfa_trc(ioc, msgp[0]);
> +	bfa_trc(ioc, len);
> +
> +	bfa_assert(len <= BFI_IOC_MSGLEN_MAX);
> +
> +	/*
> +	 * first write msg to mailbox registers
> +	 */
> +	for (i = 0; i < len / sizeof(u32); i++)
> +		bfa_reg_write(ioc->ioc_regs.hfn_mbox + i * sizeof(u32),
> +			      bfa_os_wtole(msgp[i]));
> +
> +	for (; i < BFI_IOC_MSGLEN_MAX / sizeof(u32); i++)
> +		bfa_reg_write(ioc->ioc_regs.hfn_mbox + i * sizeof(u32), 0);
> +
> +	/*
> +	 * write 1 to mailbox CMD to trigger LPU event
> +	 */
> +	bfa_reg_write(ioc->ioc_regs.hfn_mbox_cmd, 1);
> +	(void) bfa_reg_read(ioc->ioc_regs.hfn_mbox_cmd);
> +}
> +
> +static void
> +bfa_ioc_send_enable(struct bfa_ioc_s *ioc)
> +{
> +	struct bfi_ioc_ctrl_req_s enable_req;
> +
> +	bfi_h2i_set(enable_req.mh, BFI_MC_IOC, BFI_IOC_H2I_ENABLE_REQ,
> +		    bfa_ioc_portid(ioc));
> +	enable_req.ioc_class = ioc->ioc_mc;
> +	bfa_ioc_mbox_send(ioc, &enable_req, sizeof(struct bfi_ioc_ctrl_req_s));
> +}
> +
> +static void
> +bfa_ioc_send_disable(struct bfa_ioc_s *ioc)
> +{
> +	struct bfi_ioc_ctrl_req_s disable_req;
> +
> +	bfi_h2i_set(disable_req.mh, BFI_MC_IOC, BFI_IOC_H2I_DISABLE_REQ,
> +		    bfa_ioc_portid(ioc));
> +	bfa_ioc_mbox_send(ioc, &disable_req, sizeof(struct bfi_ioc_ctrl_req_s));
> +}
> +
> +static void
> +bfa_ioc_send_getattr(struct bfa_ioc_s *ioc)
> +{
> +	struct bfi_ioc_getattr_req_s attr_req;
> +
> +	bfi_h2i_set(attr_req.mh, BFI_MC_IOC, BFI_IOC_H2I_GETATTR_REQ,
> +		    bfa_ioc_portid(ioc));
> +	bfa_dma_be_addr_set(attr_req.attr_addr, ioc->attr_dma.pa);
> +	bfa_ioc_mbox_send(ioc, &attr_req, sizeof(attr_req));
> +}
> +
> +static void
> +bfa_ioc_hb_check(void *cbarg)
> +{
> +	struct bfa_ioc_s *ioc = cbarg;
> +	u32 hb_count;
> +
> +	hb_count = bfa_reg_read(ioc->ioc_regs.heartbeat);
> +	if (ioc->hb_count == hb_count) {
> +		ioc->hb_fail++;
> +	} else {
> +		ioc->hb_count = hb_count;
> +		ioc->hb_fail = 0;
> +	}
> +
> +	if (ioc->hb_fail >= BFA_IOC_HB_FAIL_MAX) {
> +		bfa_log(ioc->logm, BFA_LOG_HAL_HEARTBEAT_FAILURE, hb_count);
> +		ioc->hb_fail = 0;
> +		bfa_ioc_recover(ioc);
> +		return;
> +	}
> +
> +	bfa_ioc_mbox_poll(ioc);
> +	bfa_timer_begin(ioc->timer_mod, &ioc->ioc_timer, bfa_ioc_hb_check, ioc,
> +			BFA_IOC_HB_TOV);
> +}
> +
> +static void
> +bfa_ioc_hb_monitor(struct bfa_ioc_s *ioc)
> +{
> +	ioc->hb_fail = 0;
> +	ioc->hb_count = bfa_reg_read(ioc->ioc_regs.heartbeat);
> +	bfa_timer_begin(ioc->timer_mod, &ioc->ioc_timer, bfa_ioc_hb_check, ioc,
> +			BFA_IOC_HB_TOV);
> +}
> +
> +static void
> +bfa_ioc_hb_stop(struct bfa_ioc_s *ioc)
> +{
> +	bfa_timer_stop(&ioc->ioc_timer);
> +}
> +
> +/**
> + * Host to LPU mailbox message addresses
> + */
> +static struct {
> +	u32 hfn_mbox, lpu_mbox, hfn_pgn;
> +} iocreg_fnreg[] = {
> +	{
> +	HOSTFN0_LPU_MBOX0_0, LPU_HOSTFN0_MBOX0_0, HOST_PAGE_NUM_FN0}, {
> +	HOSTFN1_LPU_MBOX0_8, LPU_HOSTFN1_MBOX0_8, HOST_PAGE_NUM_FN1}, {
> +	HOSTFN2_LPU_MBOX0_0, LPU_HOSTFN2_MBOX0_0, HOST_PAGE_NUM_FN2}, {
> +	HOSTFN3_LPU_MBOX0_8, LPU_HOSTFN3_MBOX0_8, HOST_PAGE_NUM_FN3}
> +};
> +
> +/**
> + * Host <-> LPU mailbox command/status registers - port 0
> + */
> +static struct {
> +	u32 hfn, lpu;
> +} iocreg_mbcmd_p0[] = {
> +	{
> +	HOSTFN0_LPU0_MBOX0_CMD_STAT, LPU0_HOSTFN0_MBOX0_CMD_STAT}, {
> +	HOSTFN1_LPU0_MBOX0_CMD_STAT, LPU0_HOSTFN1_MBOX0_CMD_STAT}, {
> +	HOSTFN2_LPU0_MBOX0_CMD_STAT, LPU0_HOSTFN2_MBOX0_CMD_STAT}, {
> +	HOSTFN3_LPU0_MBOX0_CMD_STAT, LPU0_HOSTFN3_MBOX0_CMD_STAT}
> +};
> +
> +/**
> + * Host <-> LPU mailbox command/status registers - port 1
> + */
> +static struct {
> +	u32 hfn, lpu;
> +} iocreg_mbcmd_p1[] = {
> +	{
> +	HOSTFN0_LPU1_MBOX0_CMD_STAT, LPU1_HOSTFN0_MBOX0_CMD_STAT}, {
> +	HOSTFN1_LPU1_MBOX0_CMD_STAT, LPU1_HOSTFN1_MBOX0_CMD_STAT}, {
> +	HOSTFN2_LPU1_MBOX0_CMD_STAT, LPU1_HOSTFN2_MBOX0_CMD_STAT}, {
> +	HOSTFN3_LPU1_MBOX0_CMD_STAT, LPU1_HOSTFN3_MBOX0_CMD_STAT}
> +};
> +
> +/**
> + * Shared IRQ handling in INTX mode
> + */
> +static struct {
> +	u32 isr, msk;
> +} iocreg_shirq_next[] = {
> +	{
> +	HOSTFN1_INT_STATUS, HOSTFN1_INT_MSK}, {
> +	HOSTFN2_INT_STATUS, HOSTFN2_INT_MSK}, {
> +	HOSTFN3_INT_STATUS, HOSTFN3_INT_MSK}, {
> +HOSTFN0_INT_STATUS, HOSTFN0_INT_MSK},};
> +
> +static void
> +bfa_ioc_reg_init(struct bfa_ioc_s *ioc)
> +{
> +	bfa_os_addr_t rb;
> +	int pcifn = bfa_ioc_pcifn(ioc);
> +
> +	rb = bfa_ioc_bar0(ioc);
> +
> +	ioc->ioc_regs.hfn_mbox = rb + iocreg_fnreg[pcifn].hfn_mbox;
> +	ioc->ioc_regs.lpu_mbox = rb + iocreg_fnreg[pcifn].lpu_mbox;
> +	ioc->ioc_regs.host_page_num_fn = rb + iocreg_fnreg[pcifn].hfn_pgn;
> +
> +	if (ioc->port_id == 0) {
> +		ioc->ioc_regs.heartbeat = rb + BFA_IOC0_HBEAT_REG;
> +		ioc->ioc_regs.ioc_fwstate = rb + BFA_IOC0_STATE_REG;
> +		ioc->ioc_regs.hfn_mbox_cmd = rb + iocreg_mbcmd_p0[pcifn].hfn;
> +		ioc->ioc_regs.lpu_mbox_cmd = rb + iocreg_mbcmd_p0[pcifn].lpu;
> +		ioc->ioc_regs.ll_halt = rb + FW_INIT_HALT_P0;
> +	} else {
> +		ioc->ioc_regs.heartbeat = (rb + BFA_IOC1_HBEAT_REG);
> +		ioc->ioc_regs.ioc_fwstate = (rb + BFA_IOC1_STATE_REG);
> +		ioc->ioc_regs.hfn_mbox_cmd = rb + iocreg_mbcmd_p1[pcifn].hfn;
> +		ioc->ioc_regs.lpu_mbox_cmd = rb + iocreg_mbcmd_p1[pcifn].lpu;
> +		ioc->ioc_regs.ll_halt = rb + FW_INIT_HALT_P1;
> +	}
> +
> +	/**
> +	 * Shared IRQ handling in INTX mode
> +	 */
> +	ioc->ioc_regs.shirq_isr_next = rb + iocreg_shirq_next[pcifn].isr;
> +	ioc->ioc_regs.shirq_msk_next = rb + iocreg_shirq_next[pcifn].msk;
> +
> +	/*
> +	 * PSS control registers
> +	 */
> +	ioc->ioc_regs.pss_ctl_reg = (rb + PSS_CTL_REG);
> +	ioc->ioc_regs.app_pll_fast_ctl_reg = (rb + APP_PLL_425_CTL_REG);
> +	ioc->ioc_regs.app_pll_slow_ctl_reg = (rb + APP_PLL_312_CTL_REG);
> +
> +	/*
> +	 * IOC semaphore registers and serialization
> +	 */
> +	ioc->ioc_regs.ioc_sem_reg = (rb + HOST_SEM0_REG);
> +	ioc->ioc_regs.ioc_usage_sem_reg = (rb + HOST_SEM1_REG);
> +	ioc->ioc_regs.ioc_usage_reg = (rb + BFA_FW_USE_COUNT);
> +
> +	/**
> +	 * sram memory access
> +	 */
> +	ioc->ioc_regs.smem_page_start = (rb + PSS_SMEM_PAGE_START);
> +	ioc->ioc_regs.smem_pg0 = BFI_IOC_SMEM_PG0_CB;
> +	if (ioc->pcidev.device_id == BFA_PCI_DEVICE_ID_CT)
> +		ioc->ioc_regs.smem_pg0 = BFI_IOC_SMEM_PG0_CT;
> +}
> +
> +/**
> + *      Initiate a full firmware download.
> + */
> +static void
> +bfa_ioc_download_fw(struct bfa_ioc_s *ioc, u32 boot_type, u32 boot_param)
> +{
> +	u32 *fwimg;
> +	u32 pgnum, pgoff;
> +	u32 loff = 0;
> +	u32 chunkno = 0;
> +	u32 i;
> +
> +	/**
> +	 * Initialize LMEM first before code download
> +	 */
> +	bfa_ioc_lmem_init(ioc);
> +
> +	/**
> +	 * Flash based firmware boot
> +	 */
> +	bfa_trc(ioc, bfa_ioc_fwimg_get_size(ioc));
> +	if (bfa_ioc_fwimg_get_size(ioc) < BFA_IOC_FWIMG_MINSZ)
> +		boot_type = BFI_BOOT_TYPE_FLASH;
> +	fwimg = bfa_ioc_fwimg_get_chunk(ioc, chunkno);
> +	fwimg[BFI_BOOT_TYPE_OFF / sizeof(u32)] = bfa_os_swap32(boot_type);
> +	fwimg[BFI_BOOT_PARAM_OFF / sizeof(u32)] = bfa_os_swap32(boot_param);
> +
> +	pgnum = bfa_ioc_smem_pgnum(ioc, loff);
> +	pgoff = bfa_ioc_smem_pgoff(ioc, loff);
> +
> +	bfa_reg_write(ioc->ioc_regs.host_page_num_fn, pgnum);
> +
> +	for (i = 0; i < bfa_ioc_fwimg_get_size(ioc); i++) {
> +
> +		if (BFA_FLASH_CHUNK_NO(i) != chunkno) {
> +			chunkno = BFA_FLASH_CHUNK_NO(i);
> +			fwimg = bfa_ioc_fwimg_get_chunk(ioc,
> +							BFA_FLASH_CHUNK_ADDR
> +							(chunkno));
> +		}
> +
> +		/**
> +		 * write smem
> +		 */
> +		bfa_mem_write(ioc->ioc_regs.smem_page_start, loff,
> +			      fwimg[BFA_FLASH_OFFSET_IN_CHUNK(i)]);
> +
> +		loff += sizeof(u32);
> +
> +		/**
> +		 * handle page offset wrap around
> +		 */
> +		loff = PSS_SMEM_PGOFF(loff);
> +		if (loff == 0) {
> +			pgnum++;
> +			bfa_reg_write(ioc->ioc_regs.host_page_num_fn, pgnum);
> +		}
> +	}
> +
> +	bfa_reg_write(ioc->ioc_regs.host_page_num_fn,
> +		      bfa_ioc_smem_pgnum(ioc, 0));
> +}
> +
> +static void
> +bfa_ioc_reset(struct bfa_ioc_s *ioc, bfa_boolean_t force)
> +{
> +	bfa_ioc_hwinit(ioc, force);
> +}
> +
> +/**
> + * Update BFA configuration from firmware configuration.
> + */
> +static void
> +bfa_ioc_getattr_reply(struct bfa_ioc_s *ioc)
> +{
> +	struct bfi_ioc_attr_s *attr = ioc->attr;
> +
> +	attr->adapter_prop = bfa_os_ntohl(attr->adapter_prop);
> +	attr->maxfrsize = bfa_os_ntohs(attr->maxfrsize);
> +
> +	bfa_fsm_send_event(ioc, IOC_E_FWRSP_GETATTR);
> +}
> +
> +/**
> + * Attach time initialization of mbox logic.
> + */
> +static void
> +bfa_ioc_mbox_attach(struct bfa_ioc_s *ioc)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +	int mc;
> +
> +	INIT_LIST_HEAD(&mod->cmd_q);
> +	for (mc = 0; mc < BFI_MC_MAX; mc++) {
> +		mod->mbhdlr[mc].cbfn = NULL;
> +		mod->mbhdlr[mc].cbarg = ioc->bfa;
> +	}
> +}
> +
> +/**
> + * Mbox poll timer -- restarts any pending mailbox requests.
> + */
> +static void
> +bfa_ioc_mbox_poll(struct bfa_ioc_s *ioc)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +	struct bfa_mbox_cmd_s *cmd;
> +	u32 stat;
> +
> +	/**
> +	 * If no command pending, do nothing
> +	 */
> +	if (list_empty(&mod->cmd_q))
> +		return;
> +
> +	/**
> +	 * If previous command is not yet fetched by firmware, do nothing
> +	 */
> +	stat = bfa_reg_read(ioc->ioc_regs.hfn_mbox_cmd);
> +	if (stat)
> +		return;
> +
> +	/**
> +	 * Enqueue command to firmware.
> +	 */
> +	bfa_q_deq(&mod->cmd_q, &cmd);
> +	bfa_ioc_mbox_send(ioc, cmd->msg, sizeof(cmd->msg));
> +}
> +
> +/**
> + * Cleanup any pending requests.
> + */
> +static void
> +bfa_ioc_mbox_hbfail(struct bfa_ioc_s *ioc)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +	struct bfa_mbox_cmd_s *cmd;
> +
> +	while (!list_empty(&mod->cmd_q))
> +		bfa_q_deq(&mod->cmd_q, &cmd);
> +}
> +
> +/**
> + * Initialize IOC to port mapping.
> + */
> +
> +#define FNC_PERS_FN_SHIFT(__fn)	((__fn) * 8)
> +static void
> +bfa_ioc_map_port(struct bfa_ioc_s *ioc)
> +{
> +	bfa_os_addr_t rb = ioc->pcidev.pci_bar_kva;
> +	u32 r32;
> +
> +	/**
> +	 * For crossbow, port id is same as pci function.
> +	 */
> +	if (ioc->pcidev.device_id != BFA_PCI_DEVICE_ID_CT) {
> +		ioc->port_id = bfa_ioc_pcifn(ioc);
> +		return;
> +	}
> +
> +	/**
> +	 * For catapult, base port id on personality register and IOC type
> +	 */
> +	r32 = bfa_reg_read(rb + FNC_PERS_REG);
> +	r32 >>= FNC_PERS_FN_SHIFT(bfa_ioc_pcifn(ioc));
> +	ioc->port_id = (r32 & __F0_PORT_MAP_MK) >> __F0_PORT_MAP_SH;
> +
> +	bfa_trc(ioc, bfa_ioc_pcifn(ioc));
> +	bfa_trc(ioc, ioc->port_id);
> +}
> +
> +
> +
> +/**
> + *  bfa_ioc_public
> + */
> +
> +/**
> +* Set interrupt mode for a function: INTX or MSIX
> + */
> +void
> +bfa_ioc_isr_mode_set(struct bfa_ioc_s *ioc, bfa_boolean_t msix)
> +{
> +	bfa_os_addr_t rb = ioc->pcidev.pci_bar_kva;
> +	u32 r32, mode;
> +
> +	r32 = bfa_reg_read(rb + FNC_PERS_REG);
> +	bfa_trc(ioc, r32);
> +
> +	mode = (r32 >> FNC_PERS_FN_SHIFT(bfa_ioc_pcifn(ioc))) &
> +		__F0_INTX_STATUS;
> +
> +	/**
> +	 * If already in desired mode, do not change anything
> +	 */
> +	if (!msix && mode)
> +		return;
> +
> +	if (msix)
> +		mode = __F0_INTX_STATUS_MSIX;
> +	else
> +		mode = __F0_INTX_STATUS_INTA;
> +
> +	r32 &= ~(__F0_INTX_STATUS << FNC_PERS_FN_SHIFT(bfa_ioc_pcifn(ioc)));
> +	r32 |= (mode << FNC_PERS_FN_SHIFT(bfa_ioc_pcifn(ioc)));
> +	bfa_trc(ioc, r32);
> +
> +	bfa_reg_write(rb + FNC_PERS_REG, r32);
> +}
> +
> +bfa_status_t
> +bfa_ioc_pll_init(struct bfa_ioc_s *ioc)
> +{
> +	bfa_os_addr_t rb = ioc->pcidev.pci_bar_kva;
> +	u32 pll_sclk, pll_fclk, r32;
> +
> +	if (ioc->pcidev.device_id == BFA_PCI_DEVICE_ID_CT) {
> +		pll_sclk =
> +			__APP_PLL_312_ENABLE | __APP_PLL_312_LRESETN |
> +			__APP_PLL_312_RSEL200500 | __APP_PLL_312_P0_1(0U) |
> +			__APP_PLL_312_JITLMT0_1(3U) |
> +			__APP_PLL_312_CNTLMT0_1(1U);
> +		pll_fclk =
> +			__APP_PLL_425_ENABLE | __APP_PLL_425_LRESETN |
> +			__APP_PLL_425_RSEL200500 | __APP_PLL_425_P0_1(0U) |
> +			__APP_PLL_425_JITLMT0_1(3U) |
> +			__APP_PLL_425_CNTLMT0_1(1U);
> +
> +		/**
> +		 * 	For catapult, choose operational mode FC/FCoE
> +		 */
> +		if (ioc->fcmode) {
> +			bfa_reg_write((rb + OP_MODE), 0);
> +			bfa_reg_write((rb + ETH_MAC_SER_REG),
> +				      __APP_EMS_CMLCKSEL | __APP_EMS_REFCKBUFEN2
> +				      | __APP_EMS_CHANNEL_SEL);
> +		} else {
> +			ioc->pllinit = BFA_TRUE;
> +			bfa_reg_write((rb + OP_MODE), __GLOBAL_FCOE_MODE);
> +			bfa_reg_write((rb + ETH_MAC_SER_REG),
> +				      __APP_EMS_REFCKBUFEN1);
> +		}
> +	} else {
> +		pll_sclk =
> +			__APP_PLL_312_ENABLE | __APP_PLL_312_LRESETN |
> +			__APP_PLL_312_P0_1(3U) | __APP_PLL_312_JITLMT0_1(3U) |
> +			__APP_PLL_312_CNTLMT0_1(3U);
> +		pll_fclk =
> +			__APP_PLL_425_ENABLE | __APP_PLL_425_LRESETN |
> +			__APP_PLL_425_RSEL200500 | __APP_PLL_425_P0_1(3U) |
> +			__APP_PLL_425_JITLMT0_1(3U) |
> +			__APP_PLL_425_CNTLMT0_1(3U);
> +	}
> +
> +	bfa_reg_write((rb + BFA_IOC0_STATE_REG), BFI_IOC_UNINIT);
> +	bfa_reg_write((rb + BFA_IOC1_STATE_REG), BFI_IOC_UNINIT);
> +
> +	bfa_reg_write((rb + HOSTFN0_INT_MSK), 0xffffffffU);
> +	bfa_reg_write((rb + HOSTFN1_INT_MSK), 0xffffffffU);
> +	bfa_reg_write((rb + HOSTFN0_INT_STATUS), 0xffffffffU);
> +	bfa_reg_write((rb + HOSTFN1_INT_STATUS), 0xffffffffU);
> +	bfa_reg_write((rb + HOSTFN0_INT_MSK), 0xffffffffU);
> +	bfa_reg_write((rb + HOSTFN1_INT_MSK), 0xffffffffU);
> +
> +	bfa_reg_write(ioc->ioc_regs.app_pll_slow_ctl_reg,
> +		      __APP_PLL_312_LOGIC_SOFT_RESET);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_slow_ctl_reg,
> +		      __APP_PLL_312_BYPASS | __APP_PLL_312_LOGIC_SOFT_RESET);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_fast_ctl_reg,
> +		      __APP_PLL_425_LOGIC_SOFT_RESET);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_fast_ctl_reg,
> +		      __APP_PLL_425_BYPASS | __APP_PLL_425_LOGIC_SOFT_RESET);
> +	bfa_os_udelay(2);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_slow_ctl_reg,
> +		      __APP_PLL_312_LOGIC_SOFT_RESET);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_fast_ctl_reg,
> +		      __APP_PLL_425_LOGIC_SOFT_RESET);
> +
> +	bfa_reg_write(ioc->ioc_regs.app_pll_slow_ctl_reg,
> +		      pll_sclk | __APP_PLL_312_LOGIC_SOFT_RESET);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_fast_ctl_reg,
> +		      pll_fclk | __APP_PLL_425_LOGIC_SOFT_RESET);
> +
> +	/**
> +	 * Wait for PLLs to lock.
> +	 */
> +	bfa_os_udelay(2000);
> +	bfa_reg_write((rb + HOSTFN0_INT_STATUS), 0xffffffffU);
> +	bfa_reg_write((rb + HOSTFN1_INT_STATUS), 0xffffffffU);
> +
> +	bfa_reg_write(ioc->ioc_regs.app_pll_slow_ctl_reg, pll_sclk);
> +	bfa_reg_write(ioc->ioc_regs.app_pll_fast_ctl_reg, pll_fclk);
> +
> +	if (ioc->pcidev.device_id == BFA_PCI_DEVICE_ID_CT) {
> +		bfa_reg_write((rb + MBIST_CTL_REG), __EDRAM_BISTR_START);
> +		bfa_os_udelay(1000);
> +		r32 = bfa_reg_read((rb + MBIST_STAT_REG));
> +		bfa_trc(ioc, r32);
> +	}
> +
> +	return BFA_STATUS_OK;
> +}
> +
> +/**
> + * Interface used by diag module to do firmware boot with memory test
> + * as the entry vector.
> + */
> +void
> +bfa_ioc_boot(struct bfa_ioc_s *ioc, u32 boot_type, u32 boot_param)
> +{
> +	bfa_os_addr_t rb;
> +
> +	bfa_ioc_stats(ioc, ioc_boots);
> +
> +	if (bfa_ioc_pll_init(ioc) != BFA_STATUS_OK)
> +		return;
> +
> +	/**
> +	 * Initialize IOC state of all functions on a chip reset.
> +	 */
> +	rb = ioc->pcidev.pci_bar_kva;
> +	if (boot_param == BFI_BOOT_TYPE_MEMTEST) {
> +		bfa_reg_write((rb + BFA_IOC0_STATE_REG), BFI_IOC_MEMTEST);
> +		bfa_reg_write((rb + BFA_IOC1_STATE_REG), BFI_IOC_MEMTEST);
> +	} else {
> +		bfa_reg_write((rb + BFA_IOC0_STATE_REG), BFI_IOC_INITING);
> +		bfa_reg_write((rb + BFA_IOC1_STATE_REG), BFI_IOC_INITING);
> +	}
> +
> +	bfa_ioc_download_fw(ioc, boot_type, boot_param);
> +
> +	/**
> +	 * Enable interrupts just before starting LPU
> +	 */
> +	ioc->cbfn->reset_cbfn(ioc->bfa);
> +	bfa_ioc_lpu_start(ioc);
> +}
> +
> +/**
> + * Enable/disable IOC failure auto recovery.
> + */
> +void
> +bfa_ioc_auto_recover(bfa_boolean_t auto_recover)
> +{
> +	bfa_auto_recover = BFA_FALSE;
> +}
> +
> +
> +bfa_boolean_t
> +bfa_ioc_is_operational(struct bfa_ioc_s *ioc)
> +{
> +	return bfa_fsm_cmp_state(ioc, bfa_ioc_sm_op);
> +}
> +
> +void
> +bfa_ioc_msgget(struct bfa_ioc_s *ioc, void *mbmsg)
> +{
> +	u32 *msgp = mbmsg;
> +	u32 r32;
> +	int i;
> +
> +	/**
> +	 * read the MBOX msg
> +	 */
> +	for (i = 0; i < (sizeof(union bfi_ioc_i2h_msg_u) / sizeof(u32)); i++) {
> +		r32 = bfa_reg_read(ioc->ioc_regs.lpu_mbox + i * sizeof(u32));
> +		msgp[i] = bfa_os_htonl(r32);
> +	}
> +
> +	/**
> +	 * turn off mailbox interrupt by clearing mailbox status
> +	 */
> +	bfa_reg_write(ioc->ioc_regs.lpu_mbox_cmd, 1);
> +	bfa_reg_read(ioc->ioc_regs.lpu_mbox_cmd);
> +}
> +
> +void
> +bfa_ioc_isr(struct bfa_ioc_s *ioc, struct bfi_mbmsg_s *m)
> +{
> +	union bfi_ioc_i2h_msg_u *msg;
> +
> +	msg = (union bfi_ioc_i2h_msg_u *) m;
> +
> +	bfa_ioc_stats(ioc, ioc_isrs);
> +
> +	switch (msg->mh.msg_id) {
> +	case BFI_IOC_I2H_HBEAT:
> +		break;
> +
> +	case BFI_IOC_I2H_READY_EVENT:
> +		bfa_fsm_send_event(ioc, IOC_E_FWREADY);
> +		break;
> +
> +	case BFI_IOC_I2H_ENABLE_REPLY:
> +		bfa_fsm_send_event(ioc, IOC_E_FWRSP_ENABLE);
> +		break;
> +
> +	case BFI_IOC_I2H_DISABLE_REPLY:
> +		bfa_fsm_send_event(ioc, IOC_E_FWRSP_DISABLE);
> +		break;
> +
> +	case BFI_IOC_I2H_GETATTR_REPLY:
> +		bfa_ioc_getattr_reply(ioc);
> +		break;
> +
> +	default:
> +		bfa_trc(ioc, msg->mh.msg_id);
> +		bfa_assert(0);
> +	}
> +}
> +
> +/**
> + * IOC attach time initialization and setup.
> + *
> + * @param[in]	ioc	memory for IOC
> + * @param[in]	bfa	driver instance structure
> + * @param[in]	trcmod	kernel trace module
> + * @param[in]	aen	kernel aen event module
> + * @param[in]	logm	kernel logging module
> + */
> +void
> +bfa_ioc_attach(struct bfa_ioc_s *ioc, void *bfa, struct bfa_ioc_cbfn_s *cbfn,
> +	       struct bfa_timer_mod_s *timer_mod,
> +	       struct bfa_trc_mod_s *trcmod, struct bfa_aen_s *aen,
> +	       struct bfa_log_mod_s *logm)
> +{
> +	ioc->bfa = bfa;
> +	ioc->cbfn = cbfn;
> +	ioc->timer_mod = timer_mod;
> +	ioc->trcmod = trcmod;
> +	ioc->aen = aen;
> +	ioc->logm = logm;
> +	ioc->fcmode = BFA_FALSE;
> +	ioc->pllinit = BFA_FALSE;
> +	ioc->dbg_fwsave_once = BFA_TRUE;
> +
> +	bfa_ioc_mbox_attach(ioc);
> +	INIT_LIST_HEAD(&ioc->hb_notify_q);
> +
> +	bfa_fsm_set_state(ioc, bfa_ioc_sm_reset);
> +}
> +
> +/**
> + * Driver detach time IOC cleanup.
> + */
> +void
> +bfa_ioc_detach(struct bfa_ioc_s *ioc)
> +{
> +	bfa_fsm_send_event(ioc, IOC_E_DETACH);
> +}
> +
> +/**
> + * Setup IOC PCI properties.
> + *
> + * @param[in]	pcidev	PCI device information for this IOC
> + */
> +void
> +bfa_ioc_pci_init(struct bfa_ioc_s *ioc, struct bfa_pcidev_s *pcidev,
> +		 enum bfi_mclass mc)
> +{
> +	ioc->ioc_mc = mc;
> +	ioc->pcidev = *pcidev;
> +	ioc->ctdev = (ioc->pcidev.device_id == BFA_PCI_DEVICE_ID_CT);
> +	ioc->cna = ioc->ctdev && !ioc->fcmode;
> +
> +	bfa_ioc_map_port(ioc);
> +	bfa_ioc_reg_init(ioc);
> +}
> +
> +/**
> + * Initialize IOC dma memory
> + *
> + * @param[in]	dm_kva	kernel virtual address of IOC dma memory
> + * @param[in]	dm_pa	physical address of IOC dma memory
> + */
> +void
> +bfa_ioc_mem_claim(struct bfa_ioc_s *ioc, u8 * dm_kva, u64 dm_pa)
> +{
> +	/**
> +	 * dma memory for firmware attribute
> +	 */
> +	ioc->attr_dma.kva = dm_kva;
> +	ioc->attr_dma.pa = dm_pa;
> +	ioc->attr = (struct bfi_ioc_attr_s *) dm_kva;
> +}
> +
> +/**
> + * Return size of dma memory required.
> + */
> +u32
> +bfa_ioc_meminfo(void)
> +{
> +	return BFA_ROUNDUP(sizeof(struct bfi_ioc_attr_s), BFA_DMA_ALIGN_SZ);
> +}
> +
> +void
> +bfa_ioc_enable(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_stats(ioc, ioc_enables);
> +	ioc->dbg_fwsave_once = BFA_TRUE;
> +
> +	bfa_fsm_send_event(ioc, IOC_E_ENABLE);
> +}
> +
> +void
> +bfa_ioc_disable(struct bfa_ioc_s *ioc)
> +{
> +	bfa_ioc_stats(ioc, ioc_disables);
> +	bfa_fsm_send_event(ioc, IOC_E_DISABLE);
> +}
> +
> +/**
> + * Returns memory required for saving firmware trace in case of crash.
> + * Driver must call this interface to allocate memory required for
> + * automatic saving of firmware trace. Driver should call
> + * bfa_ioc_debug_memclaim() right after bfa_ioc_attach() to setup this
> + * trace memory.
> + */
> +int
> +bfa_ioc_debug_trcsz(bfa_boolean_t auto_recover)
> +{
> +	return (auto_recover) ? BFA_DBG_FWTRC_LEN : 0;
> +}
> +
> +/**
> + * Initialize memory for saving firmware trace. Driver must initialize
> + * trace memory before call bfa_ioc_enable().
> + */
> +void
> +bfa_ioc_debug_memclaim(struct bfa_ioc_s *ioc, void *dbg_fwsave)
> +{
> +	bfa_assert(ioc->auto_recover);
> +	ioc->dbg_fwsave = dbg_fwsave;
> +	ioc->dbg_fwsave_len = bfa_ioc_debug_trcsz(ioc->auto_recover);
> +}
> +
> +u32
> +bfa_ioc_smem_pgnum(struct bfa_ioc_s *ioc, u32 fmaddr)
> +{
> +	return PSS_SMEM_PGNUM(ioc->ioc_regs.smem_pg0, fmaddr);
> +}
> +
> +u32
> +bfa_ioc_smem_pgoff(struct bfa_ioc_s * ioc, u32 fmaddr)
> +{
> +	return PSS_SMEM_PGOFF(fmaddr);
> +}
> +
> +/**
> + * Register mailbox message handler functions
> + *
> + * @param[in]	ioc		IOC instance
> + * @param[in]	mcfuncs		message class handler functions
> + */
> +void
> +bfa_ioc_mbox_register(struct bfa_ioc_s *ioc, bfa_ioc_mbox_mcfunc_t * mcfuncs)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +	int mc;
> +
> +	for (mc = 0; mc < BFI_MC_MAX; mc++)
> +		mod->mbhdlr[mc].cbfn = mcfuncs[mc];
> +}
> +
> +/**
> + * Register mailbox message handler function, to be called by common modules
> + */
> +void
> +bfa_ioc_mbox_regisr(struct bfa_ioc_s *ioc, enum bfi_mclass mc,
> +		    bfa_ioc_mbox_mcfunc_t cbfn, void *cbarg)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +
> +	mod->mbhdlr[mc].cbfn = cbfn;
> +	mod->mbhdlr[mc].cbarg = cbarg;
> +}
> +
> +/**
> + * Queue a mailbox command request to firmware. Waits if mailbox is busy.
> + * Responsibility of caller to serialize
> + *
> + * @param[in]	ioc	IOC instance
> + * @param[i]	cmd	Mailbox command
> + */
> +void
> +bfa_ioc_mbox_queue(struct bfa_ioc_s *ioc, struct bfa_mbox_cmd_s *cmd)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +	u32 stat;
> +
> +	/**
> +	 * If a previous command is pending, queue new command
> +	 */
> +	if (!list_empty(&mod->cmd_q)) {
> +		list_add_tail(&cmd->qe, &mod->cmd_q);
> +		return;
> +	}
> +
> +	/**
> +	 * If mailbox is busy, queue command for poll timer
> +	 */
> +	stat = bfa_reg_read(ioc->ioc_regs.hfn_mbox_cmd);
> +	if (stat) {
> +		list_add_tail(&cmd->qe, &mod->cmd_q);
> +		return;
> +	}
> +
> +	/**
> +	 * mailbox is free -- queue command to firmware
> +	 */
> +	bfa_ioc_mbox_send(ioc, cmd->msg, sizeof(cmd->msg));
> +}
> +
> +/**
> + * Handle mailbox interrupts
> + */
> +void
> +bfa_ioc_mbox_isr(struct bfa_ioc_s *ioc)
> +{
> +	struct bfa_ioc_mbox_mod_s *mod = &ioc->mbox_mod;
> +	struct bfi_mbmsg_s m;
> +	int mc;
> +
> +	bfa_ioc_msgget(ioc, &m);
> +
> +	/**
> +	 * Treat IOC message class as special.
> +	 */
> +	mc = m.mh.msg_class;
> +	if (mc == BFI_MC_IOC) {
> +		bfa_ioc_isr(ioc, &m);
> +		return;
> +	}
> +
> +	if ((mc > BFI_MC_MAX) || (mod->mbhdlr[mc].cbfn == NULL))
> +		return;
> +
> +	mod->mbhdlr[mc].cbfn(mod->mbhdlr[mc].cbarg, &m);
> +}
> +
> +void
> +bfa_ioc_error_isr(struct bfa_ioc_s *ioc)
> +{
> +	bfa_fsm_send_event(ioc, IOC_E_HWERROR);
> +}
> +
> +#ifndef BFA_BIOS_BUILD
> +
> +/**
> + * return true if IOC is disabled
> + */
> +bfa_boolean_t
> +bfa_ioc_is_disabled(struct bfa_ioc_s *ioc)
> +{
> +	return (bfa_fsm_cmp_state(ioc, bfa_ioc_sm_disabling)
> +		|| bfa_fsm_cmp_state(ioc, bfa_ioc_sm_disabled));
> +}
> +
> +/**
> + * return true if IOC firmware is different.
> + */
> +bfa_boolean_t
> +bfa_ioc_fw_mismatch(struct bfa_ioc_s * ioc)
> +{
> +	return (bfa_fsm_cmp_state(ioc, bfa_ioc_sm_reset)
> +		|| bfa_fsm_cmp_state(ioc, bfa_ioc_sm_fwcheck)
> +		|| bfa_fsm_cmp_state(ioc, bfa_ioc_sm_mismatch));
> +}
> +
> +#define bfa_ioc_state_disabled(__sm)		\
> +	(((__sm) == BFI_IOC_UNINIT) ||		\
> +	 ((__sm) == BFI_IOC_INITING) ||		\
> +	 ((__sm) == BFI_IOC_HWINIT) ||		\
> +	 ((__sm) == BFI_IOC_DISABLED) ||	\
> +	 ((__sm) == BFI_IOC_HBFAIL) ||		\
> +	 ((__sm) == BFI_IOC_CFG_DISABLED))
> +
> +/**
> + * Check if adapter is disabled -- both IOCs should be in a disabled
> + * state.
> + */
> +bfa_boolean_t
> +bfa_ioc_adapter_is_disabled(struct bfa_ioc_s * ioc)
> +{
> +	u32 ioc_state;
> +	bfa_os_addr_t rb = ioc->pcidev.pci_bar_kva;
> +
> +	if (!bfa_fsm_cmp_state(ioc, bfa_ioc_sm_disabled))
> +		return BFA_FALSE;
> +
> +	ioc_state = bfa_reg_read(rb + BFA_IOC0_STATE_REG);
> +	if (!bfa_ioc_state_disabled(ioc_state))
> +		return BFA_FALSE;
> +
> +	ioc_state = bfa_reg_read(rb + BFA_IOC1_STATE_REG);
> +	if (!bfa_ioc_state_disabled(ioc_state))
> +		return BFA_FALSE;
> +
> +	return BFA_TRUE;
> +}
> +
> +/**
> + * Add to IOC heartbeat failure notification queue. To be used by common
> + * modules such as
> + */
> +void
> +bfa_ioc_hbfail_register(struct bfa_ioc_s *ioc,
> +			struct bfa_ioc_hbfail_notify_s *notify)
> +{
> +	list_add_tail(&notify->qe, &ioc->hb_notify_q);
> +}
> +
> +#define BFA_MFG_NAME "Brocade"
> +void
> +bfa_ioc_get_adapter_attr(struct bfa_ioc_s *ioc,
> +			 struct bfa_adapter_attr_s *ad_attr)
> +{
> +	struct bfi_ioc_attr_s *ioc_attr;
> +	char model[BFA_ADAPTER_MODEL_NAME_LEN];
> +
> +	ioc_attr = ioc->attr;
> +	bfa_os_memcpy((void *) &ad_attr->serial_num,
> +		      (void *) ioc_attr->brcd_serialnum,
> +		      BFA_ADAPTER_SERIAL_NUM_LEN);
> +
> +	bfa_os_memcpy(&ad_attr->fw_ver, ioc_attr->fw_version, BFA_VERSION_LEN);
> +	bfa_os_memcpy(&ad_attr->optrom_ver, ioc_attr->optrom_version,
> +		      BFA_VERSION_LEN);
> +	bfa_os_memcpy(&ad_attr->manufacturer, BFA_MFG_NAME,
> +		      BFA_ADAPTER_MFG_NAME_LEN);
> +	bfa_os_memcpy(&ad_attr->vpd, &ioc_attr->vpd,
> +		      sizeof(struct bfa_mfg_vpd_s));
> +
> +	ad_attr->nports = BFI_ADAPTER_GETP(NPORTS, ioc_attr->adapter_prop);
> +	ad_attr->max_speed = BFI_ADAPTER_GETP(SPEED, ioc_attr->adapter_prop);
> +
> +	/**
> +	 * model name
> +	 */
> +	if (BFI_ADAPTER_GETP(SPEED, ioc_attr->adapter_prop) == 10) {
> +		strcpy(model, "BR-10?0");
> +		model[5] = '0' + ad_attr->nports;
> +	} else {
> +		strcpy(model, "Brocade-??5");
> +		model[8] =
> +			'0' + BFI_ADAPTER_GETP(SPEED, ioc_attr->adapter_prop);
> +		model[9] = '0' + ad_attr->nports;
> +	}
> +
> +	if (BFI_ADAPTER_IS_SPECIAL(ioc_attr->adapter_prop))
> +		ad_attr->prototype = 1;
> +	else
> +		ad_attr->prototype = 0;
> +
> +	bfa_os_memcpy(&ad_attr->model, model, BFA_ADAPTER_MODEL_NAME_LEN);
> +	bfa_os_memcpy(&ad_attr->model_descr, &ad_attr->model,
> +		      BFA_ADAPTER_MODEL_NAME_LEN);
> +
> +	ad_attr->pwwn = bfa_ioc_get_pwwn(ioc);
> +	ad_attr->mac = bfa_ioc_get_mac(ioc);
> +
> +	ad_attr->pcie_gen = ioc_attr->pcie_gen;
> +	ad_attr->pcie_lanes = ioc_attr->pcie_lanes;
> +	ad_attr->pcie_lanes_orig = ioc_attr->pcie_lanes_orig;
> +	ad_attr->asic_rev = ioc_attr->asic_rev;
> +	ad_attr->hw_ver[0] = 'R';
> +	ad_attr->hw_ver[1] = 'e';
> +	ad_attr->hw_ver[2] = 'v';
> +	ad_attr->hw_ver[3] = '-';
> +	ad_attr->hw_ver[4] = ioc_attr->asic_rev;
> +	ad_attr->hw_ver[5] = '\0';
> +
> +	ad_attr->cna_capable = ioc->cna;
> +}
> +
> +void
> +bfa_ioc_get_attr(struct bfa_ioc_s *ioc, struct bfa_ioc_attr_s *ioc_attr)
> +{
> +	bfa_os_memset((void *) ioc_attr, 0, sizeof(struct bfa_ioc_attr_s));
> +
> +	ioc_attr->state = bfa_sm_to_state(ioc_sm_table, ioc->fsm);
> +	ioc_attr->port_id = ioc->port_id;
> +
> +	if (!ioc->ctdev)
> +		ioc_attr->ioc_type = BFA_IOC_TYPE_FC;
> +	else if (ioc->ioc_mc == BFI_MC_IOCFC)
> +		ioc_attr->ioc_type = BFA_IOC_TYPE_FCoE;
> +	else if (ioc->ioc_mc == BFI_MC_LL)
> +		ioc_attr->ioc_type = BFA_IOC_TYPE_LL;
> +
> +	bfa_ioc_get_adapter_attr(ioc, &ioc_attr->adapter_attr);
> +
> +	ioc_attr->pci_attr.device_id = ioc->pcidev.device_id;
> +	ioc_attr->pci_attr.pcifn = ioc->pcidev.pci_func;
> +	ioc_attr->pci_attr.chip_rev[0] = 'R';
> +	ioc_attr->pci_attr.chip_rev[1] = 'e';
> +	ioc_attr->pci_attr.chip_rev[2] = 'v';
> +	ioc_attr->pci_attr.chip_rev[3] = '-';
> +	ioc_attr->pci_attr.chip_rev[4] = ioc_attr->adapter_attr.asic_rev;
> +	ioc_attr->pci_attr.chip_rev[5] = '\0';
> +}
> +
> +/**
> + *  hal_wwn_public
> + */
> +wwn_t
> +bfa_ioc_get_pwwn(struct bfa_ioc_s *ioc)
> +{
> +	union {
> +		wwn_t wwn;
> +		u8 byte[sizeof(wwn_t)];
> +	}
> +	w;
> +
> +	w.wwn = ioc->attr->mfg_wwn;
> +
> +	if (bfa_ioc_portid(ioc) == 1)
> +		w.byte[7]++;
> +
> +	return w.wwn;
> +}
> +
> +wwn_t
> +bfa_ioc_get_nwwn(struct bfa_ioc_s * ioc)
> +{
> +	union {
> +		wwn_t wwn;
> +		u8 byte[sizeof(wwn_t)];
> +	}
> +	w;
> +
> +	w.wwn = ioc->attr->mfg_wwn;
> +
> +	if (bfa_ioc_portid(ioc) == 1)
> +		w.byte[7]++;
> +
> +	w.byte[0] = 0x20;
> +
> +	return w.wwn;
> +}
> +
> +wwn_t
> +bfa_ioc_get_wwn_naa5(struct bfa_ioc_s * ioc, u16 inst)
> +{
> +	union {
> +		wwn_t wwn;
> +		u8 byte[sizeof(wwn_t)];
> +	}
> +	w, w5;
> +
> +	bfa_trc(ioc, inst);
> +
> +	w.wwn = ioc->attr->mfg_wwn;
> +	w5.byte[0] = 0x50 | w.byte[2] >> 4;
> +	w5.byte[1] = w.byte[2] << 4 | w.byte[3] >> 4;
> +	w5.byte[2] = w.byte[3] << 4 | w.byte[4] >> 4;
> +	w5.byte[3] = w.byte[4] << 4 | w.byte[5] >> 4;
> +	w5.byte[4] = w.byte[5] << 4 | w.byte[6] >> 4;
> +	w5.byte[5] = w.byte[6] << 4 | w.byte[7] >> 4;
> +	w5.byte[6] = w.byte[7] << 4 | (inst & 0x0f00) >> 8;
> +	w5.byte[7] = (inst & 0xff);
> +
> +	return w5.wwn;
> +}
> +
> +u64
> +bfa_ioc_get_adid(struct bfa_ioc_s * ioc)
> +{
> +	return ioc->attr->mfg_wwn;
> +}
> +
> +mac_t
> +bfa_ioc_get_mac(struct bfa_ioc_s * ioc)
> +{
> +	mac_t mac;
> +
> +	mac = ioc->attr->mfg_mac;
> +	mac.mac[MAC_ADDRLEN - 1] += bfa_ioc_pcifn(ioc);
> +
> +	return mac;
> +}
> +
> +void
> +bfa_ioc_set_fcmode(struct bfa_ioc_s *ioc)
> +{
> +	ioc->fcmode = BFA_TRUE;
> +	ioc->port_id = bfa_ioc_pcifn(ioc);
> +}
> +
> +bfa_boolean_t
> +bfa_ioc_get_fcmode(struct bfa_ioc_s *ioc)
> +{
> +	return ioc->fcmode || (ioc->pcidev.device_id != BFA_PCI_DEVICE_ID_CT);
> +}
> +
> +/**
> + * Return true if interrupt should be claimed.
> + */
> +bfa_boolean_t
> +bfa_ioc_intx_claim(struct bfa_ioc_s * ioc)
> +{
> +	u32 isr, msk;
> +
> +	/**
> +	 * Always claim if not catapult.
> +	 */
> +	if (!ioc->ctdev)
> +		return BFA_TRUE;
> +
> +	/**
> +	 * FALSE if next device is claiming interrupt.
> +	 * TRUE if next device is not interrupting or not present.
> +	 */
> +	msk = bfa_reg_read(ioc->ioc_regs.shirq_msk_next);
> +	isr = bfa_reg_read(ioc->ioc_regs.shirq_isr_next);
> +	return !(isr & ~msk);
> +}
> +
> +/**
> + * Send AEN notification
> + */
> +static void
> +bfa_ioc_aen_post(struct bfa_ioc_s *ioc, enum bfa_ioc_aen_event event)
> +{
> +	union bfa_aen_data_u aen_data;
> +	struct bfa_log_mod_s *logmod = ioc->logm;
> +	s32 inst_num = 0;
> +	struct bfa_ioc_attr_s ioc_attr;
> +
> +	switch (event) {
> +	case BFA_IOC_AEN_HBGOOD:
> +		bfa_log(logmod, BFA_AEN_IOC_HBGOOD, inst_num);
> +		break;
> +	case BFA_IOC_AEN_HBFAIL:
> +		bfa_log(logmod, BFA_AEN_IOC_HBFAIL, inst_num);
> +		break;
> +	case BFA_IOC_AEN_ENABLE:
> +		bfa_log(logmod, BFA_AEN_IOC_ENABLE, inst_num);
> +		break;
> +	case BFA_IOC_AEN_DISABLE:
> +		bfa_log(logmod, BFA_AEN_IOC_DISABLE, inst_num);
> +		break;
> +	case BFA_IOC_AEN_FWMISMATCH:
> +		bfa_log(logmod, BFA_AEN_IOC_FWMISMATCH, inst_num);
> +		break;
> +	default:
> +		break;
> +	}
> +
> +	memset(&aen_data.ioc.pwwn, 0, sizeof(aen_data.ioc.pwwn));
> +	memset(&aen_data.ioc.mac, 0, sizeof(aen_data.ioc.mac));
> +	bfa_ioc_get_attr(ioc, &ioc_attr);
> +	switch (ioc_attr.ioc_type) {
> +	case BFA_IOC_TYPE_FC:
> +		aen_data.ioc.pwwn = bfa_ioc_get_pwwn(ioc);
> +		break;
> +	case BFA_IOC_TYPE_FCoE:
> +		aen_data.ioc.pwwn = bfa_ioc_get_pwwn(ioc);
> +		aen_data.ioc.mac = bfa_ioc_get_mac(ioc);
> +		break;
> +	case BFA_IOC_TYPE_LL:
> +		aen_data.ioc.mac = bfa_ioc_get_mac(ioc);
> +		break;
> +	default:
> +		bfa_assert(ioc_attr.ioc_type == BFA_IOC_TYPE_FC);
> +		break;
> +	}
> +	aen_data.ioc.ioc_type = ioc_attr.ioc_type;
> +}
> +
> +/**
> + * Retrieve saved firmware trace from a prior IOC failure.
> + */
> +bfa_status_t
> +bfa_ioc_debug_fwsave(struct bfa_ioc_s *ioc, void *trcdata, int *trclen)
> +{
> +	int tlen;
> +
> +	if (ioc->dbg_fwsave_len == 0)
> +		return BFA_STATUS_ENOFSAVE;
> +
> +	tlen = *trclen;
> +	if (tlen > ioc->dbg_fwsave_len)
> +		tlen = ioc->dbg_fwsave_len;
> +
> +	bfa_os_memcpy(trcdata, ioc->dbg_fwsave, tlen);
> +	*trclen = tlen;
> +	return BFA_STATUS_OK;
> +}
> +
> +/**
> + * Retrieve saved firmware trace from a prior IOC failure.
> + */
> +bfa_status_t
> +bfa_ioc_debug_fwtrc(struct bfa_ioc_s * ioc, void *trcdata, int *trclen)
> +{
> +	u32 pgnum;
> +	u32 loff = BFA_DBG_FWTRC_OFF(bfa_ioc_portid(ioc));
> +	int i, tlen;
> +	u32 *tbuf = trcdata, r32;
> +
> +	bfa_trc(ioc, *trclen);
> +
> +	pgnum = bfa_ioc_smem_pgnum(ioc, loff);
> +	loff = bfa_ioc_smem_pgoff(ioc, loff);
> +	bfa_reg_write(ioc->ioc_regs.host_page_num_fn, pgnum);
> +
> +	tlen = *trclen;
> +	if (tlen > BFA_DBG_FWTRC_LEN)
> +		tlen = BFA_DBG_FWTRC_LEN;
> +	tlen /= sizeof(u32);
> +
> +	bfa_trc(ioc, tlen);
> +
> +	for (i = 0; i < tlen; i++) {
> +		r32 = bfa_mem_read(ioc->ioc_regs.smem_page_start, loff);
> +		tbuf[i] = bfa_os_ntohl(r32);
> +		loff += sizeof(u32);
> +
> +		/**
> +		 * handle page offset wrap around
> +		 */
> +		loff = PSS_SMEM_PGOFF(loff);
> +		if (loff == 0) {
> +			pgnum++;
> +			bfa_reg_write(ioc->ioc_regs.host_page_num_fn, pgnum);
> +		}
> +	}
> +	bfa_reg_write(ioc->ioc_regs.host_page_num_fn,
> +		      bfa_ioc_smem_pgnum(ioc, 0));
> +	bfa_trc(ioc, pgnum);
> +
> +	*trclen = tlen * sizeof(u32);
> +	return BFA_STATUS_OK;
> +}
> +
> +/**
> + * Save firmware trace if configured.
> + */
> +static void
> +bfa_ioc_debug_save(struct bfa_ioc_s *ioc)
> +{
> +	int tlen;
> +
> +	if (ioc->dbg_fwsave_len) {
> +		tlen = ioc->dbg_fwsave_len;
> +		bfa_ioc_debug_fwtrc(ioc, ioc->dbg_fwsave, &tlen);
> +	}
> +}
> +
> +/**
> + * Firmware failure detected. Start recovery actions.
> + */
> +static void
> +bfa_ioc_recover(struct bfa_ioc_s *ioc)
> +{
> +	if (ioc->dbg_fwsave_once) {
> +		ioc->dbg_fwsave_once = BFA_FALSE;
> +		bfa_ioc_debug_save(ioc);
> +	}
> +
> +	bfa_ioc_stats(ioc, ioc_hbfails);
> +	bfa_fsm_send_event(ioc, IOC_E_HBFAIL);
> +}
> +
> +#else
> +
> +static void
> +bfa_ioc_aen_post(struct bfa_ioc_s *ioc, enum bfa_ioc_aen_event event)
> +{
> +}
> +
> +static void
> +bfa_ioc_recover(struct bfa_ioc_s *ioc)
> +{
> +	bfa_assert(0);
> +}
> +
> +#endif
> diff -ruP linux-2.6.30.5-orig/drivers/net/bna/bfa_ioc.h linux-2.6.30.5-mod/drivers/net/bna/bfa_ioc.h
> --- linux-2.6.30.5-orig/drivers/net/bna/bfa_ioc.h	1969-12-31 16:00:00.000000000 -0800
> +++ linux-2.6.30.5-mod/drivers/net/bna/bfa_ioc.h	2009-08-28 21:09:22.833926000 -0700
> @@ -0,0 +1,266 @@
> +/*
> + * Copyright (c) 2005-2009 Brocade Communications Systems, Inc.
> + * All rights reserved
> + * www.brocade.com
> + *
> + * Linux network driver for Brocade Converged Network Adapter.
> + *
> + * This program is free software; you can redistribute it and/or modify it
> + * under the terms of the GNU General Public License (GPL) Version 2 as
> + * published by the Free Software Foundation
> + *
> + * This program is distributed in the hope that it will be useful, but
> + * WITHOUT ANY WARRANTY; without even the implied warranty of
> + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> + * General Public License for more details.
> + */
> +/*
> + * Copyright (c) 2005-2009 Brocade Communications Systems, Inc.
> + * All rights reserved
> + * www.brocade.com
> + *
> + * See LICENSE.bna for copyright and licensing details.
> + */
> +
> +#ifndef __BFA_IOC_H__
> +#define __BFA_IOC_H__
> +
> +#include <cs/bfa_sm.h>
> +#include <bfi/bfi.h>
> +#include <bfi/bfi_ioc.h>
> +#include <bfi/bfi_boot.h>
> +#include <bfa_timer.h>
> +
> +/**
> + * PCI device information required by IOC
> + */
> +struct bfa_pcidev_s {
> +	int pci_slot;
> +	u8 pci_func;
> +	u16 device_id;
> +	bfa_os_addr_t pci_bar_kva;
> +};
> +
> +/**
> + * Structure used to remember the DMA-able memory block's KVA and Physical
> + * Address
> + */
> +struct bfa_dma_s {
> +	void *kva;		/*! Kernel virtual address      */
> +	u64 pa;			/*! Physical address            */
> +};
> +
> +#define BFA_DMA_ALIGN_SZ	256
> +#define BFA_ROUNDUP(_l, _s)	(((_l) + ((_s) - 1)) & ~((_s) - 1))
> +
> +
> +
> +#define bfa_dma_addr_set(dma_addr, pa)	\
> +		__bfa_dma_addr_set(&dma_addr, (u64)pa)
> +
> +static inline void
> +__bfa_dma_addr_set(union bfi_addr_u *dma_addr, u64 pa)
> +{
> +	dma_addr->a32.addr_lo = (u32) pa;
> +	dma_addr->a32.addr_hi = (u32) (bfa_os_u32(pa));
> +}
> +
> +
> +#define bfa_dma_be_addr_set(dma_addr, pa)	\
> +		__bfa_dma_be_addr_set(&dma_addr, (u64)pa)
> +static inline void
> +__bfa_dma_be_addr_set(union bfi_addr_u *dma_addr, u64 pa)
> +{
> +	dma_addr->a32.addr_lo = (u32) bfa_os_htonl(pa);
> +	dma_addr->a32.addr_hi = (u32) bfa_os_htonl(bfa_os_u32(pa));
> +}
> +
> +struct bfa_ioc_regs_s {
> +	bfa_os_addr_t hfn_mbox_cmd;
> +	bfa_os_addr_t hfn_mbox;
> +	bfa_os_addr_t lpu_mbox_cmd;
> +	bfa_os_addr_t lpu_mbox;
> +	bfa_os_addr_t pss_ctl_reg;
> +	bfa_os_addr_t app_pll_fast_ctl_reg;
> +	bfa_os_addr_t app_pll_slow_ctl_reg;
> +	bfa_os_addr_t ioc_sem_reg;
> +	bfa_os_addr_t ioc_usage_sem_reg;
> +	bfa_os_addr_t ioc_usage_reg;
> +	bfa_os_addr_t host_page_num_fn;
> +	bfa_os_addr_t heartbeat;
> +	bfa_os_addr_t ioc_fwstate;
> +	bfa_os_addr_t ll_halt;
> +	bfa_os_addr_t shirq_isr_next;
> +	bfa_os_addr_t shirq_msk_next;
> +	bfa_os_addr_t smem_page_start;
> +	u32 smem_pg0;
> +};
> +
> +#define bfa_reg_read(_raddr)	bfa_os_reg_read(_raddr)
> +#define bfa_reg_write(_raddr, _val)	bfa_os_reg_write(_raddr, _val)
> +#define bfa_mem_read(_raddr, _off)	bfa_os_mem_read(_raddr, _off)
> +#define bfa_mem_write(_raddr, _off, _val)	\
> +					bfa_os_mem_write(_raddr, _off, _val)
> +/**
> + * IOC Mailbox structures
> + */
> +struct bfa_mbox_cmd_s {
> +	struct list_head qe;
> +	u32 msg[BFI_IOC_MSGSZ];
> +};
> +
> +/**
> + * IOC mailbox module
> + */
> +typedef void (*bfa_ioc_mbox_mcfunc_t) (void *cbarg, struct bfi_mbmsg_s * m);
> +struct bfa_ioc_mbox_mod_s {
> +	struct list_head cmd_q;	/*  pending mbox queue  */
> +	int nmclass;		/*  number of handlers */
> +	struct {
> +		bfa_ioc_mbox_mcfunc_t cbfn;	/*  message handlers    */
> +		void *cbarg;
> +	} mbhdlr[BFI_MC_MAX];
> +};
> +
> +/**
> + * IOC callback function interfaces
> + */
> +typedef void (*bfa_ioc_enable_cbfn_t) (void *bfa, enum bfa_status status);
> +typedef void (*bfa_ioc_disable_cbfn_t) (void *bfa);
> +typedef void (*bfa_ioc_hbfail_cbfn_t) (void *bfa);
> +typedef void (*bfa_ioc_reset_cbfn_t) (void *bfa);
> +struct bfa_ioc_cbfn_s {
> +	bfa_ioc_enable_cbfn_t enable_cbfn;
> +	bfa_ioc_disable_cbfn_t disable_cbfn;
> +	bfa_ioc_hbfail_cbfn_t hbfail_cbfn;
> +	bfa_ioc_reset_cbfn_t reset_cbfn;
> +};
> +
> +/**
> + * Heartbeat failure notification queue element.
> + */
> +struct bfa_ioc_hbfail_notify_s {
> +	struct list_head qe;
> +	bfa_ioc_hbfail_cbfn_t cbfn;
> +	void *cbarg;
> +};
> +
> +/**
> + * Initialize a heartbeat failure notification structure
> + */
> +#define bfa_ioc_hbfail_init(__notify, __cbfn, __cbarg) do {	\
> +	(__notify)->cbfn = (__cbfn);      \
> +	(__notify)->cbarg = (__cbarg);      \
> +} while (0)
> +
> +struct bfa_ioc_s {
> +	bfa_fsm_t fsm;
> +	struct bfa_s *bfa;
> +	struct bfa_pcidev_s pcidev;
> +	struct bfa_timer_mod_s *timer_mod;
> +	struct bfa_timer_s ioc_timer;
> +	struct bfa_timer_s sem_timer;
> +	u32 hb_count;
> +	u32 hb_fail;
> +	u32 retry_count;
> +	struct list_head hb_notify_q;
> +	void *dbg_fwsave;
> +	int dbg_fwsave_len;
> +	bfa_boolean_t dbg_fwsave_once;
> +	enum bfi_mclass ioc_mc;
> +	struct bfa_ioc_regs_s ioc_regs;
> +	struct bfa_trc_mod_s *trcmod;
> +	struct bfa_aen_s *aen;
> +	struct bfa_log_mod_s *logm;
> +	struct bfa_ioc_drv_stats_s stats;
> +	bfa_boolean_t auto_recover;
> +	bfa_boolean_t fcmode;
> +	bfa_boolean_t ctdev;
> +	bfa_boolean_t cna;
> +	bfa_boolean_t pllinit;
> +	u8 port_id;
> +
> +	struct bfa_dma_s attr_dma;
> +	struct bfi_ioc_attr_s *attr;
> +	struct bfa_ioc_cbfn_s *cbfn;
> +	struct bfa_ioc_mbox_mod_s mbox_mod;
> +};
> +
> +#define bfa_ioc_pcifn(__ioc)		(__ioc)->pcidev.pci_func
> +#define bfa_ioc_devid(__ioc)		(__ioc)->pcidev.device_id
> +#define bfa_ioc_bar0(__ioc)		(__ioc)->pcidev.pci_bar_kva
> +#define bfa_ioc_portid(__ioc)		((__ioc)->port_id)
> +#define bfa_ioc_fetch_stats(__ioc, __stats) \
> +		((__stats)->drv_stats) = (__ioc)->stats
> +#define bfa_ioc_clr_stats(__ioc)	\
> +		bfa_os_memset(&(__ioc)->stats, 0, sizeof((__ioc)->stats))
> +#define bfa_ioc_maxfrsize(__ioc)	(__ioc)->attr->maxfrsize
> +#define bfa_ioc_rx_bbcredit(__ioc)	(__ioc)->attr->rx_bbcredit
> +#define bfa_ioc_speed_sup(__ioc)	\
> +	BFI_ADAPTER_GETP(SPEED, (__ioc)->attr->adapter_prop)
> +
> +/**
> + * IOC mailbox interface
> + */
> +void bfa_ioc_mbox_queue(struct bfa_ioc_s *ioc, struct bfa_mbox_cmd_s *cmd);
> +void bfa_ioc_mbox_register(struct bfa_ioc_s *ioc,
> +			   bfa_ioc_mbox_mcfunc_t * mcfuncs);
> +void bfa_ioc_mbox_isr(struct bfa_ioc_s *ioc);
> +void bfa_ioc_mbox_send(struct bfa_ioc_s *ioc, void *ioc_msg, int len);
> +void bfa_ioc_msgget(struct bfa_ioc_s *ioc, void *mbmsg);
> +void bfa_ioc_mbox_regisr(struct bfa_ioc_s *ioc, enum bfi_mclass mc,
> +			 bfa_ioc_mbox_mcfunc_t cbfn, void *cbarg);
> +
> +/**
> + * IOC interfaces
> + */
> +void bfa_ioc_attach(struct bfa_ioc_s *ioc, void *bfa,
> +		    struct bfa_ioc_cbfn_s *cbfn,
> +		    struct bfa_timer_mod_s *timer_mod,
> +		    struct bfa_trc_mod_s *trcmod, struct bfa_aen_s *aen,
> +		    struct bfa_log_mod_s *logm);
> +void bfa_ioc_detach(struct bfa_ioc_s *ioc);
> +void bfa_ioc_pci_init(struct bfa_ioc_s *ioc, struct bfa_pcidev_s *pcidev,
> +		      enum bfi_mclass mc);
> +u32 bfa_ioc_meminfo(void);
> +void bfa_ioc_mem_claim(struct bfa_ioc_s *ioc, u8 * dm_kva, u64 dm_pa);
> +void bfa_ioc_enable(struct bfa_ioc_s *ioc);
> +void bfa_ioc_disable(struct bfa_ioc_s *ioc);
> +bfa_boolean_t bfa_ioc_intx_claim(struct bfa_ioc_s *ioc);
> +
> +void bfa_ioc_boot(struct bfa_ioc_s *ioc, u32 boot_type, u32 boot_param);
> +void bfa_ioc_isr(struct bfa_ioc_s *ioc, struct bfi_mbmsg_s *msg);
> +void bfa_ioc_error_isr(struct bfa_ioc_s *ioc);
> +void bfa_ioc_isr_mode_set(struct bfa_ioc_s *ioc, bfa_boolean_t intx);
> +bfa_status_t bfa_ioc_pll_init(struct bfa_ioc_s *ioc);
> +bfa_boolean_t bfa_ioc_is_operational(struct bfa_ioc_s *ioc);
> +bfa_boolean_t bfa_ioc_is_disabled(struct bfa_ioc_s *ioc);
> +bfa_boolean_t bfa_ioc_fw_mismatch(struct bfa_ioc_s *ioc);
> +bfa_boolean_t bfa_ioc_adapter_is_disabled(struct bfa_ioc_s *ioc);
> +void bfa_ioc_cfg_complete(struct bfa_ioc_s *ioc);
> +void bfa_ioc_get_attr(struct bfa_ioc_s *ioc, struct bfa_ioc_attr_s *ioc_attr);
> +void bfa_ioc_get_adapter_attr(struct bfa_ioc_s *ioc,
> +			      struct bfa_adapter_attr_s *ad_attr);
> +int bfa_ioc_debug_trcsz(bfa_boolean_t auto_recover);
> +void bfa_ioc_debug_memclaim(struct bfa_ioc_s *ioc, void *dbg_fwsave);
> +bfa_status_t bfa_ioc_debug_fwsave(struct bfa_ioc_s *ioc, void *trcdata,
> +				  int *trclen);
> +bfa_status_t bfa_ioc_debug_fwtrc(struct bfa_ioc_s *ioc, void *trcdata,
> +				 int *trclen);
> +u32 bfa_ioc_smem_pgnum(struct bfa_ioc_s *ioc, u32 fmaddr);
> +u32 bfa_ioc_smem_pgoff(struct bfa_ioc_s *ioc, u32 fmaddr);
> +void bfa_ioc_set_fcmode(struct bfa_ioc_s *ioc);
> +bfa_boolean_t bfa_ioc_get_fcmode(struct bfa_ioc_s *ioc);
> +void bfa_ioc_hbfail_register(struct bfa_ioc_s *ioc,
> +			     struct bfa_ioc_hbfail_notify_s *notify);
> +
> +/*
> + * bfa mfg wwn API functions
> + */
> +wwn_t bfa_ioc_get_pwwn(struct bfa_ioc_s *ioc);
> +wwn_t bfa_ioc_get_nwwn(struct bfa_ioc_s *ioc);
> +wwn_t bfa_ioc_get_wwn_naa5(struct bfa_ioc_s *ioc, u16 inst);
> +mac_t bfa_ioc_get_mac(struct bfa_ioc_s *ioc);
> +u64 bfa_ioc_get_adid(struct bfa_ioc_s *ioc);
> +
> +#endif /* __BFA_IOC_H__ */
> diff -ruP linux-2.6.30.5-orig/drivers/net/bna/bna_hwreg.h linux-2.6.30.5-mod/drivers/net/bna/bna_hwreg.h
> --- linux-2.6.30.5-orig/drivers/net/bna/bna_hwreg.h	1969-12-31 16:00:00.000000000 -0800
> +++ linux-2.6.30.5-mod/drivers/net/bna/bna_hwreg.h	2009-08-28 21:09:22.805949000 -0700
> @@ -0,0 +1,1004 @@
> +/*
> + * Copyright (c) 2005-2009 Brocade Communications Systems, Inc.
> + * All rights reserved
> + * www.brocade.com
> + *
> + * Linux network driver for Brocade Converged Network Adapter.
> + *
> + * This program is free software; you can redistribute it and/or modify it
> + * under the terms of the GNU General Public License (GPL) Version 2 as
> + * published by the Free Software Foundation
> + *
> + * This program is distributed in the hope that it will be useful, but
> + * WITHOUT ANY WARRANTY; without even the implied warranty of
> + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> + * General Public License for more details.
> + */
> +/*
> + * Copyright (c) 2005-2009 Brocade Communications Systems, Inc.
> + * All rights reserved
> + * www.brocade.com
> + *
> + * See <license_file> for copyright and licensing details.
> + */
> +
> +/*
> + *  bna_hwreg.h Catapult host block register definitions
> + *
> + */
> +
> +#ifndef __BNA_HWREG_H__
> +#define __BNA_HWREG_H__
> +
> +#include <bfi/bfi_ctreg.h>
> +
> +/**************************************************************
> + *               REGISTER ACCESS
> + **************************************************************/
> +/**
> + * CPE Block Register Address Offset from BAR0
> + */
> +	/* TODO */
> +/**
> + * RME Block Register Address Offset from BAR0
> + */
> +	/* TODO */
> +/**
> + * SGM Block Register Address Offset from BAR0
> + */
> +	/* TODO */
> +
> +/**
> + * DMA Block Register Host Window Start Address
> + */
> +#define DMA_BLK_REG_ADDR		0x00013000
> +/**
> + * DMA Block Internal Registers
> + */
> +#define DMA_CTRL_REG0			(DMA_BLK_REG_ADDR + 0x000)
> +#define DMA_CTRL_REG1			(DMA_BLK_REG_ADDR + 0x004)
> +#define DMA_ERR_INT_STATUS		(DMA_BLK_REG_ADDR + 0x008)
> +#define DMA_ERR_INT_ENABLE		(DMA_BLK_REG_ADDR + 0x00c)
> +#define DMA_ERR_INT_STATUS_SET		(DMA_BLK_REG_ADDR + 0x010)
> +
> +/**
> + * APP Block Register Address Offset from BAR0
> + */
> +#define APP_BLK_REG_ADDR		0x00014000
> +
> +/**
> + * Host Function Interrupt Mask Registers
> + */
> +#define HOSTFN0_INT_MASK		(APP_BLK_REG_ADDR + 0x004)
> +#define HOSTFN1_INT_MASK		(APP_BLK_REG_ADDR + 0x104)
> +#define HOSTFN2_INT_MASK		(APP_BLK_REG_ADDR + 0x304)
> +#define HOSTFN3_INT_MASK		(APP_BLK_REG_ADDR + 0x404)
> +
> +/**
> + * Host Function PCIe Error Registers
> + * Duplicates "Correctable" & "Uncorrectable"
> + * registers in PCIe Config space.
> + */
> +#define FN0_PCIE_ERR_REG		(APP_BLK_REG_ADDR + 0x014)
> +#define FN1_PCIE_ERR_REG		(APP_BLK_REG_ADDR + 0x114)
> +#define FN2_PCIE_ERR_REG		(APP_BLK_REG_ADDR + 0x314)
> +#define FN3_PCIE_ERR_REG		(APP_BLK_REG_ADDR + 0x414)
> +
> +/**
> + * Host Function Error Type Status Registers
> + */
> +#define FN0_ERR_TYPE_STATUS_REG		(APP_BLK_REG_ADDR + 0x018)
> +#define FN1_ERR_TYPE_STATUS_REG		(APP_BLK_REG_ADDR + 0x118)
> +#define FN2_ERR_TYPE_STATUS_REG		(APP_BLK_REG_ADDR + 0x318)
> +#define FN3_ERR_TYPE_STATUS_REG		(APP_BLK_REG_ADDR + 0x418)
> +/**
> + * Host Function Error Type Mask Registers
> + */
> +#define FN0_ERR_TYPE_MSK_STATUS_REG	(APP_BLK_REG_ADDR + 0x01c)
> +#define FN1_ERR_TYPE_MSK_STATUS_REG	(APP_BLK_REG_ADDR + 0x11c)
> +#define FN2_ERR_TYPE_MSK_STATUS_REG	(APP_BLK_REG_ADDR + 0x31c)
> +#define FN3_ERR_TYPE_MSK_STATUS_REG	(APP_BLK_REG_ADDR + 0x41c)
> +
> +/**
> + * Catapult Host Semaphore Status Registers (App block)
> + */
> +#define HOST_SEM_STS0_REG		(APP_BLK_REG_ADDR + 0x630)
> +#define HOST_SEM_STS1_REG		(APP_BLK_REG_ADDR + 0x634)
> +#define HOST_SEM_STS2_REG		(APP_BLK_REG_ADDR + 0x638)
> +#define HOST_SEM_STS3_REG		(APP_BLK_REG_ADDR + 0x63c)
> +#define HOST_SEM_STS4_REG		(APP_BLK_REG_ADDR + 0x640)
> +#define HOST_SEM_STS5_REG		(APP_BLK_REG_ADDR + 0x644)
> +#define HOST_SEM_STS6_REG		(APP_BLK_REG_ADDR + 0x648)
> +#define HOST_SEM_STS7_REG		(APP_BLK_REG_ADDR + 0x64c)
> +
> +/**
> + * PCIe Misc Register
> + * Check catapult_spec.pdf for details
> + */
> +#define PCIE_MISC_REG			(APP_BLK_REG_ADDR + 0x200)
> +/**
> + * Temp Sensor Control Registers
> + */
> +#define TEMPSENSE_CNTL_REG		(APP_BLK_REG_ADDR + 0x250)
> +#define TEMPSENSE_STAT_REG		(APP_BLK_REG_ADDR + 0x254)
> +
> +/**
> + * APP Block local error registers
> + */
> +#define APP_LOCAL_ERR_STAT		(APP_BLK_REG_ADDR + 0x258)
> +#define APP_LOCAL_ERR_MSK		(APP_BLK_REG_ADDR + 0x25c)
> +
> +/**
> + * PCIe Link Error registers
> + */
> +#define PCIE_LNK_ERR_STAT		(APP_BLK_REG_ADDR + 0x260)
> +#define PCIE_LNK_ERR_MSK		(APP_BLK_REG_ADDR + 0x264)
> +
> +/**
> + * FCoE/FIP Ethertype Register
> + * 31:16 -- Chip wide value for FIP type
> + * 15:0  -- Chip wide value for FCoE type
> + */
> +#define FCOE_FIP_ETH_TYPE		(APP_BLK_REG_ADDR + 0x280)
> +
> +/**
> + * Reserved Ethertype Register
> + * 31:16 -- Reserved
> + * 15:0  -- Other ethertype
> + */
> +#define RESV_ETH_TYPE			(APP_BLK_REG_ADDR + 0x284)
> +
> +/**
> + * Host Command Status Registers
> + * Each set consists of 3 registers :
> + * clear, set, cmd
> + * 16 such register sets in all
> + * See catapult_spec.pdf for detailed functionality
> + * Put each type in a single macro accessed by _num ?
> + */
> +#define HOST_CMDSTS0_CLR_REG		(APP_BLK_REG_ADDR + 0x500)
> +#define HOST_CMDSTS0_SET_REG		(APP_BLK_REG_ADDR + 0x504)
> +#define HOST_CMDSTS0_REG		(APP_BLK_REG_ADDR + 0x508)
> +#define HOST_CMDSTS1_CLR_REG		(APP_BLK_REG_ADDR + 0x510)
> +#define HOST_CMDSTS1_SET_REG		(APP_BLK_REG_ADDR + 0x514)
> +#define HOST_CMDSTS1_REG		(APP_BLK_REG_ADDR + 0x518)
> +#define HOST_CMDSTS2_CLR_REG		(APP_BLK_REG_ADDR + 0x520)
> +#define HOST_CMDSTS2_SET_REG		(APP_BLK_REG_ADDR + 0x524)
> +#define HOST_CMDSTS2_REG		(APP_BLK_REG_ADDR + 0x528)
> +#define HOST_CMDSTS3_CLR_REG		(APP_BLK_REG_ADDR + 0x530)
> +#define HOST_CMDSTS3_SET_REG		(APP_BLK_REG_ADDR + 0x534)
> +#define HOST_CMDSTS3_REG		(APP_BLK_REG_ADDR + 0x538)
> +#define HOST_CMDSTS4_CLR_REG		(APP_BLK_REG_ADDR + 0x540)
> +#define HOST_CMDSTS4_SET_REG		(APP_BLK_REG_ADDR + 0x544)
> +#define HOST_CMDSTS4_REG		(APP_BLK_REG_ADDR + 0x548)
> +#define HOST_CMDSTS5_CLR_REG		(APP_BLK_REG_ADDR + 0x550)
> +#define HOST_CMDSTS5_SET_REG		(APP_BLK_REG_ADDR + 0x554)
> +#define HOST_CMDSTS5_REG		(APP_BLK_REG_ADDR + 0x558)
> +#define HOST_CMDSTS6_CLR_REG		(APP_BLK_REG_ADDR + 0x560)
> +#define HOST_CMDSTS6_SET_REG		(APP_BLK_REG_ADDR + 0x564)
> +#define HOST_CMDSTS6_REG		(APP_BLK_REG_ADDR + 0x568)
> +#define HOST_CMDSTS7_CLR_REG		(APP_BLK_REG_ADDR + 0x570)
> +#define HOST_CMDSTS7_SET_REG		(APP_BLK_REG_ADDR + 0x574)
> +#define HOST_CMDSTS7_REG		(APP_BLK_REG_ADDR + 0x578)
> +#define HOST_CMDSTS8_CLR_REG		(APP_BLK_REG_ADDR + 0x580)
> +#define HOST_CMDSTS8_SET_REG		(APP_BLK_REG_ADDR + 0x584)
> +#define HOST_CMDSTS8_REG		(APP_BLK_REG_ADDR + 0x588)
> +#define HOST_CMDSTS9_CLR_REG		(APP_BLK_REG_ADDR + 0x590)
> +#define HOST_CMDSTS9_SET_REG		(APP_BLK_REG_ADDR + 0x594)
> +#define HOST_CMDSTS9_REG		(APP_BLK_REG_ADDR + 0x598)
> +#define HOST_CMDSTS10_CLR_REG		(APP_BLK_REG_ADDR + 0x5A0)
> +#define HOST_CMDSTS10_SET_REG		(APP_BLK_REG_ADDR + 0x5A4)
> +#define HOST_CMDSTS10_REG		(APP_BLK_REG_ADDR + 0x5A8)
> +#define HOST_CMDSTS11_CLR_REG		(APP_BLK_REG_ADDR + 0x5B0)
> +#define HOST_CMDSTS11_SET_REG		(APP_BLK_REG_ADDR + 0x5B4)
> +#define HOST_CMDSTS11_REG		(APP_BLK_REG_ADDR + 0x5B8)
> +#define HOST_CMDSTS12_CLR_REG		(APP_BLK_REG_ADDR + 0x5C0)
> +#define HOST_CMDSTS12_SET_REG		(APP_BLK_REG_ADDR + 0x5C4)
> +#define HOST_CMDSTS12_REG		(APP_BLK_REG_ADDR + 0x5C8)
> +#define HOST_CMDSTS13_CLR_REG		(APP_BLK_REG_ADDR + 0x5D0)
> +#define HOST_CMDSTS13_SET_REG		(APP_BLK_REG_ADDR + 0x5D4)
> +#define HOST_CMDSTS13_REG		(APP_BLK_REG_ADDR + 0x5D8)
> +#define HOST_CMDSTS14_CLR_REG		(APP_BLK_REG_ADDR + 0x5E0)
> +#define HOST_CMDSTS14_SET_REG		(APP_BLK_REG_ADDR + 0x5E4)
> +#define HOST_CMDSTS14_REG		(APP_BLK_REG_ADDR + 0x5E8)
> +#define HOST_CMDSTS15_CLR_REG		(APP_BLK_REG_ADDR + 0x5F0)
> +#define HOST_CMDSTS15_SET_REG		(APP_BLK_REG_ADDR + 0x5F4)
> +#define HOST_CMDSTS15_REG		(APP_BLK_REG_ADDR + 0x5F8)
> +
> +/**
> + * IOQ Block Register Address Offset from BAR0
> + */
> +	/* TODO ? */
> +/**
> + * IOH Block Register Address Offset from BAR0
> + */
> +	/* Required only in FC mode */
> +/**
> + * DIP Block Register Address Offset from BAR0
> + */
> +	/* Required only in FC mode */
> +/**
> + * SIP Block Register Address Offset from BAR0
> + */
> +	/* Required only in FC mode */
> +
> +/**
> + * LPU0 Block Register Address Offset from BAR0
> + * Range 0x18000 - 0x18033
> + */
> +#define LPU0_BLK_REG_ADDR		0x00018000
> +
> +/**
> + * LPU0 Registers
> + * Should they be directly used from host,
> + * except for diagnostics ?
> + * CTL_REG : Control register
> + * CMD_REG : Triggers exec. of cmd. in
> + *           Mailbox memory
> + */
> +#define LPU0_MBOX_CTL_REG		(LPU0_BLK_REG_ADDR + 0x000)
> +#define LPU0_MBOX_CMD_REG		(LPU0_BLK_REG_ADDR + 0x004)
> +#define LPU0_MBOX_LINK_0REG		(LPU0_BLK_REG_ADDR + 0x008)
> +#define LPU1_MBOX_LINK_0REG		(LPU0_BLK_REG_ADDR + 0x00c)
> +#define LPU0_MBOX_STATUS_0REG		(LPU0_BLK_REG_ADDR + 0x010)
> +#define LPU1_MBOX_STATUS_0REG		(LPU0_BLK_REG_ADDR + 0x014)
> +#define LPU0_ERR_STATUS_REG		(LPU0_BLK_REG_ADDR + 0x018)
> +#define LPU0_ERR_SET_REG		(LPU0_BLK_REG_ADDR + 0x020)
> +
> +/**
> + * LPU1 Block Register Address Offset from BAR0
> + * Range 0x18400 - 0x18433
> + */
> +#define LPU1_BLK_REG_ADDR		0x00018400
> +
> +/**
> + * LPU1 Registers
> + * Same as LPU0 registers above
> + */
> +#define LPU1_MBOX_CTL_REG		(LPU1_BLK_REG_ADDR + 0x000)
> +#define LPU1_MBOX_CMD_REG		(LPU1_BLK_REG_ADDR + 0x004)
> +#define LPU0_MBOX_LINK_1REG		(LPU1_BLK_REG_ADDR + 0x008)
> +#define LPU1_MBOX_LINK_1REG		(LPU1_BLK_REG_ADDR + 0x00c)
> +#define LPU0_MBOX_STATUS_1REG		(LPU1_BLK_REG_ADDR + 0x010)
> +#define LPU1_MBOX_STATUS_1REG		(LPU1_BLK_REG_ADDR + 0x014)
> +#define LPU1_ERR_STATUS_REG		(LPU1_BLK_REG_ADDR + 0x018)
> +#define LPU1_ERR_SET_REG		(LPU1_BLK_REG_ADDR + 0x020)
> +
> +/**
> + * PSS Block Register Address Offset from BAR0
> + * Range 0x18800 - 0x188DB
> + */
> +#define PSS_BLK_REG_ADDR		0x00018800
> +
> +/**
> + * PSS Registers
> + * For details, see catapult_spec.pdf
> + * ERR_STATUS_REG : Indicates error in PSS module
> + * RAM_ERR_STATUS_REG : Indicates RAM module that detected error
> + */
> +#define PSS_ERR_STATUS_REG		(PSS_BLK_REG_ADDR + 0x010)
> +#define ERR_STATUS_SET			(PSS_BLK_REG_ADDR + 0x018)
> +#define PSS_RAM_ERR_STATUS_REG		(PSS_BLK_REG_ADDR + 0x01C)
> +
> +/**
> + * PSS Semaphore Lock Registers, total 16
> + * First read when unlocked returns 0,
> + * and is set to 1, atomically.
> + * Subsequent reads returns 1.
> + * To clear set the value to 0.
> + * Range : 0x20 to 0x5c
> + */
> +#define PSS_SEM_LOCK_REG(_num) 		\
> + 	(PSS_BLK_REG_ADDR + 0x020 + ((_num) << 2))
> +
> +/**
> + * PSS Semaphore Status Registers,
> + * corresponding to the lock registers above
> + */
> +#define PSS_SEM_STATUS_REG(_num) 		\
> + 	(PSS_BLK_REG_ADDR + 0x060 + ((_num) << 2))
> +
> +/**
> + * Catapult CPQ Registers
> + * Defines for Mailbox Registers
> + * Used to send mailbox commands to firmware from
> + * host. The data part is written to the MBox
> + * memory, registers are used to indicate that
> + * a commnad is resident in memory.
> + *
> + * Note : LPU0<->LPU1 mailboxes are not listed here
> + */
> +#define CPQ_BLK_REG_ADDR		0x00019000
> +
> +#define HOSTFN0_LPU0_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x130)
> +#define HOSTFN0_LPU1_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x134)
> +#define LPU0_HOSTFN0_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x138)
> +#define LPU1_HOSTFN0_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x13C)
> +
> +#define HOSTFN1_LPU0_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x140)
> +#define HOSTFN1_LPU1_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x144)
> +#define LPU0_HOSTFN1_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x148)
> +#define LPU1_HOSTFN1_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x14C)
> +
> +#define HOSTFN2_LPU0_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x170)
> +#define HOSTFN2_LPU1_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x174)
> +#define LPU0_HOSTFN2_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x178)
> +#define LPU1_HOSTFN2_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x17C)
> +
> +#define HOSTFN3_LPU0_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x180)
> +#define HOSTFN3_LPU1_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x184)
> +#define LPU0_HOSTFN3_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x188)
> +#define LPU1_HOSTFN3_MBOX1_CMD_STAT	(CPQ_BLK_REG_ADDR + 0x18C)
> +
> +/**
> + * Host Function Force Parity Error Registers
> + */
> +#define HOSTFN0_LPU_FORCE_PERR		(CPQ_BLK_REG_ADDR + 0x120)
> +#define HOSTFN1_LPU_FORCE_PERR		(CPQ_BLK_REG_ADDR + 0x124)
> +#define HOSTFN2_LPU_FORCE_PERR		(CPQ_BLK_REG_ADDR + 0x128)
> +#define HOSTFN3_LPU_FORCE_PERR		(CPQ_BLK_REG_ADDR + 0x12C)
> +
> +/**
> + * LL Port[0|1] Halt Mask Registers
> + */
> +#define LL_HALT_MSK_P0			(CPQ_BLK_REG_ADDR + 0x1A0)
> +#define LL_HALT_MSK_P1			(CPQ_BLK_REG_ADDR + 0x1B0)
> +
> +/**
> + * LL Port[0|1] Error Mask Registers
> + */
> +#define LL_ERR_MSK_P0			(CPQ_BLK_REG_ADDR + 0x1D0)
> +#define LL_ERR_MSK_P1			(CPQ_BLK_REG_ADDR + 0x1D4)
> +
> +/**
> + * TXQ Block Register Address Offset from BAR0
> + */
> +	/* TODO :
> +	 * For per-port registers, offsets look
> +	 * skewed, need clarification
> +	 */
> +/**
> + * RDS (Rx Frame Data Store) Block Register Address Offset from BAR0
> + */
> +	/* Host does not need to access these */
> +/**
> + * TDS (Tx Frame Data Store) Block Register Address Offset from BAR0
> + */
> +	/* Host does not need to access these */
> +/**
> + * BAL Block Register Address Offset from BAR0
> + */
> +	/* TODO : Some of these may be needed */
> +
> +/**
> + * EMC FLI (Flash Controller) Block Register Address Offset from BAR0
> + */
> +#define FLI_BLK_REG_ADDR		0x0001D000
> +
> +/**
> + * EMC FLI Registers
> + */
> +#define FLI_CMD_REG			(FLI_BLK_REG_ADDR + 0x000)
> +#define FLI_ADDR_REG			(FLI_BLK_REG_ADDR + 0x004)
> +#define FLI_CTL_REG			(FLI_BLK_REG_ADDR + 0x008)
> +#define FLI_WRDATA_REG			(FLI_BLK_REG_ADDR + 0x00C)
> +#define FLI_RDDATA_REG			(FLI_BLK_REG_ADDR + 0x010)
> +#define FLI_DEV_STATUS_REG		(FLI_BLK_REG_ADDR + 0x014)
> +#define FLI_SIG_WD_REG			(FLI_BLK_REG_ADDR + 0x018)
> +/*
> + * H/W document lists the following vendor ID register as
> + * FLI_ERR_VENDOR_REG -- typo ?
> + * RO register
> + * 31:16 -- Vendor Id
> + * 15:0  -- Device Id
> + */
> +#define FLI_DEV_VENDOR_REG		(FLI_BLK_REG_ADDR + 0x01C)
> +#define FLI_ERR_STATUS_REG		(FLI_BLK_REG_ADDR + 0x020)
> +
> +/**
> + * FPG (FC Port Group) Block Register Address Offset from BAR0
> + */
> +	/* FC driver specific */
> +
> +/**
> + * RAD (RxAdm) Block Register Address Offset from BAR0
> + * RAD0 Range : 0x20000 - 0x203FF
> + * RAD1 Range : 0x20400 - 0x207FF
> + */
> +#define RAD0_BLK_REG_ADDR		0x00020000
> +#define RAD1_BLK_REG_ADDR		0x00020400
> +
> +/**
> + * RAD0 Registers
> + */
> +#define RAD0_CTL_REG			(RAD0_BLK_REG_ADDR + 0x000)
> +#define RAD0_PE_PARM_REG		(RAD0_BLK_REG_ADDR + 0x004)
> +#define RAD0_BCN_REG			(RAD0_BLK_REG_ADDR + 0x008)
> +/*
> + * Default function ID register
> + */
> +#define RAD0_DEFAULT_REG		(RAD0_BLK_REG_ADDR + 0x00C)
> +/*
> + * Default promiscuous ID register
> + */
> +#define RAD0_PROMISC_REG		(RAD0_BLK_REG_ADDR + 0x010)
> +
> +#define RAD0_BCNQ_REG			(RAD0_BLK_REG_ADDR + 0x014)
> +/*
> + * This register selects 1 of 8 PM Q's using
> + * VLAN pri, for non-BCN packets without a VLAN tag
> + */
> +#define RAD0_DEFAULTQ_REG		(RAD0_BLK_REG_ADDR + 0x018)
> +
> +#define RAD0_ERR_STS			(RAD0_BLK_REG_ADDR + 0x01C)
> +#define RAD0_SET_ERR_STS		(RAD0_BLK_REG_ADDR + 0x020)
> +#define RAD0_ERR_INT_EN			(RAD0_BLK_REG_ADDR + 0x024)
> +#define RAD0_FIRST_ERR          (RAD0_BLK_REG_ADDR + 0x028)
> +#define RAD0_FORCE_ERR          (RAD0_BLK_REG_ADDR + 0x02C)
> +
> +#define RAD0_IF_RCVD            (RAD0_BLK_REG_ADDR + 0x030)
> +#define RAD0_IF_RCVD_OCTETS_HIGH (RAD0_BLK_REG_ADDR + 0x034)
> +#define RAD0_IF_RCVD_OCTETS_LOW (RAD0_BLK_REG_ADDR + 0x038)
> +#define RAD0_IF_RCVD_VLAN       (RAD0_BLK_REG_ADDR + 0x03C)
> +#define RAD0_IF_RCVD_UCAST       (RAD0_BLK_REG_ADDR + 0x040)
> +#define RAD0_IF_RCVD_UCAST_OCTETS_HIGH  (RAD0_BLK_REG_ADDR + 0x044)
> +#define RAD0_IF_RCVD_UCAST_OCTETS_LOW   (RAD0_BLK_REG_ADDR + 0x048)
> +#define RAD0_IF_RCVD_UCAST_VLAN (RAD0_BLK_REG_ADDR + 0x04C)
> +#define RAD0_IF_RCVD_MCAST      (RAD0_BLK_REG_ADDR + 0x050)
> +#define RAD0_IF_RCVD_MCAST_OCTETS_HIGH  (RAD0_BLK_REG_ADDR + 0x054)
> +#define RAD0_IF_RCVD_MCAST_OCTETS_LOW   (RAD0_BLK_REG_ADDR + 0x058)
> +#define RAD0_IF_RCVD_MCAST_VLAN (RAD0_BLK_REG_ADDR + 0x05C)
> +#define RAD0_IF_RCVD_BCAST      (RAD0_BLK_REG_ADDR + 0x060)
> +#define RAD0_IF_RCVD_BCAST_OCTETS_HIGH  (RAD0_BLK_REG_ADDR + 0x064)
> +#define RAD0_IF_RCVD_BCAST_OCTETS_LOW   (RAD0_BLK_REG_ADDR + 0x068)
> +#define RAD0_IF_RCVD_BCAST_VLAN (RAD0_BLK_REG_ADDR + 0x06C)
> +#define RAD0_DROPPED_FRAMES     (RAD0_BLK_REG_ADDR + 0x070)
> +
> +#define RAD0_MAC_MAN_1H         (RAD0_BLK_REG_ADDR + 0x080)
> +#define RAD0_MAC_MAN_1L         (RAD0_BLK_REG_ADDR + 0x084)
> +#define RAD0_MAC_MAN_2H         (RAD0_BLK_REG_ADDR + 0x088)
> +#define RAD0_MAC_MAN_2L         (RAD0_BLK_REG_ADDR + 0x08C)
> +#define RAD0_MAC_MAN_3H         (RAD0_BLK_REG_ADDR + 0x090)
> +#define RAD0_MAC_MAN_3L         (RAD0_BLK_REG_ADDR + 0x094)
> +#define RAD0_MAC_MAN_4H         (RAD0_BLK_REG_ADDR + 0x098)
> +#define RAD0_MAC_MAN_4L         (RAD0_BLK_REG_ADDR + 0x09C)
> +
> +
> +#define RAD0_LAST4_IP           (RAD0_BLK_REG_ADDR + 0x100)
> +
> +/**
> + * RAD1 Registers
> + */
> +#define RAD1_CTL_REG			(RAD1_BLK_REG_ADDR + 0x000)
> +#define RAD1_PE_PARM_REG		(RAD1_BLK_REG_ADDR + 0x004)
> +#define RAD1_BCN_REG			(RAD1_BLK_REG_ADDR + 0x008)
> +/*
> + * Default function ID register
> + */
> +#define RAD1_DEFAULT_REG		(RAD1_BLK_REG_ADDR + 0x00C)
> +/*
> + * Promiscuous function ID register
> + */
> +#define RAD1_PROMISC_REG		(RAD1_BLK_REG_ADDR + 0x010)
> +
> +#define RAD1_BCNQ_REG			(RAD1_BLK_REG_ADDR + 0x014)
> +/*
> + * This register selects 1 of 8 PM Q's using
> + * VLAN pri, for non-BCN packets without a VLAN tag
> + */
> +#define RAD1_DEFAULTQ_REG		(RAD1_BLK_REG_ADDR + 0x018)
> +
> +#define RAD1_ERR_STS			(RAD1_BLK_REG_ADDR + 0x01C)
> +#define RAD1_SET_ERR_STS		(RAD1_BLK_REG_ADDR + 0x020)
> +#define RAD1_ERR_INT_EN			(RAD1_BLK_REG_ADDR + 0x024)
> +/*
> + * RxAdm Statistics Registers:
> + * F/W will access these, not required immediately for host
> + */
> +
> +/**
> + * RLB (Rx Loop Back) Block Register Address Offset from BAR0
> + */
> +	/* TODO : ?? */
> +/**
> + * TXA Block Register Address Offset from BAR0
> + * TXA0 Range : 0x21000 - 0x213FF
> + * TXA1 Range : 0x21400 - 0x217FF
> + */
> +#define TXA0_BLK_REG_ADDR		0x00021000
> +#define TXA1_BLK_REG_ADDR		0x00021400
> +
> +/**
> + * TXA Registers
> + */
> +#define TXA0_CTRL_REG			(TXA0_BLK_REG_ADDR + 0x000)
> +#define TXA1_CTRL_REG			(TXA1_BLK_REG_ADDR + 0x000)
> +
> +/**
> + * TSO Sequence # Registers (RO)
> + * Total 8 (for 8 queues)
> + * Holds the last seq.# for TSO frames
> + * See catapult_spec.pdf for more details
> + */
> +#define TXA0_TSO_TCP_SEQ_REG(_num)		\
> +	(TXA0_BLK_REG_ADDR + 0x020 + ((_num) << 2))
> +
> +#define TXA1_TSO_TCP_SEQ_REG(_num)		\
> +	(TXA1_BLK_REG_ADDR + 0x020 + ((_num) << 2))
> +
> +/**
> + * TSO IP ID # Registers (RO)
> + * Total 8 (for 8 queues)
> + * Holds the last IP ID for TSO frames
> + * See catapult_spec.pdf for more details
> + */
> +#define TXA0_TSO_IP_INFO_REG(_num)		\
> +	(TXA0_BLK_REG_ADDR + 0x040 + ((_num) << 2))
> +
> +#define TXA1_TSO_IP_INFO_REG(_num)		\
> +	(TXA1_BLK_REG_ADDR + 0x040 + ((_num) << 2))
> +
> +/**
> + * RXA Block Register Address Offset from BAR0
> + * RXA0 Range : 0x21800 - 0x21BFF
> + * RXA1 Range : 0x21C00 - 0x21FFF
> + */
> +#define RXA0_BLK_REG_ADDR		0x00021800
> +#define RXA1_BLK_REG_ADDR		0x00021C00
> +
> +/**
> + * RXA Registers
> + */
> +#define RXA0_CTL_REG			(RXA0_BLK_REG_ADDR + 0x040)
> +#define RXA1_CTL_REG			(RXA1_BLK_REG_ADDR + 0x040)
> +
> +/**
> + * PPLB Block Register Address Offset from BAR0
> + * PPLB0 Range : 0x22000 - 0x223FF
> + * PPLB1 Range : 0x22400 - 0x227FF
> + */
> +#define PLB0_BLK_REG_ADDR		0x00022000
> +#define PLB1_BLK_REG_ADDR		0x00022400
> +
> +/**
> + * PLB Registers
> + */
> +/**
> + * Holds RL timer used time stamps in RLT tagged frames
> + */
> +#define PLB0_ECM_TIMER_REG		(PLB0_BLK_REG_ADDR + 0x05C)
> +#define PLB1_ECM_TIMER_REG		(PLB1_BLK_REG_ADDR + 0x05C)
> +/**
> + * Controls the rate-limiter on each of the priority class
> + */
> +#define PLB0_RL_CTL			(PLB0_BLK_REG_ADDR + 0x060)
> +#define PLB1_RL_CTL			(PLB1_BLK_REG_ADDR + 0x060)
> +/**
> + * Max byte register, total 8, 0-7
> + * see catapult_spec.pdf for details
> + */
> +#define PLB0_RL_MAX_BC(_num)			\
> +	(PLB0_BLK_REG_ADDR + 0x064 + ((_num) << 2))
> +#define PLB1_RL_MAX_BC(_num)			\
> +	(PLB1_BLK_REG_ADDR + 0x064 + ((_num) << 2))
> +/**
> + * RL Time Unit Register for priority 0-7
> + * 4 bits per priority
> + * (2^rl_unit)*1us is the actual time period
> + */
> +#define PLB0_RL_TU_PRIO			(PLB0_BLK_REG_ADDR + 0x084)
> +#define PLB1_RL_TU_PRIO			(PLB1_BLK_REG_ADDR + 0x084)
> +/**
> + * RL byte count register,
> + * bytes transmitted in (rl_unit*1)us time period
> + * 1 per priority, 8 in all, 0-7.
> + */
> +#define PLB0_RL_BYTE_CNT(_num)			\
> +	(PLB0_BLK_REG_ADDR + 0x088 + ((_num) << 2))
> +#define PLB1_RL_BYTE_CNT(_num)			\
> +	(PLB1_BLK_REG_ADDR + 0x088 + ((_num) << 2))
> +/**
> + * RL Min factor register
> + * 2 bits per priority,
> + * 4 factors possible: 1, 0.5, 0.25, 0
> + * 2'b00 - 0; 2'b01 - 0.25; 2'b10 - 0.5; 2'b11 - 1
> + */
> +#define PLB0_RL_MIN_REG			(PLB0_BLK_REG_ADDR + 0x0A8)
> +#define PLB1_RL_MIN_REG			(PLB1_BLK_REG_ADDR + 0x0A8)
> +/**
> + * RL Max factor register
> + * 2 bits per priority,
> + * 4 factors possible: 1, 0.5, 0.25, 0
> + * 2'b00 - 0; 2'b01 - 0.25; 2'b10 - 0.5; 2'b11 - 1
> + */
> +#define PLB0_RL_MAX_REG			(PLB0_BLK_REG_ADDR + 0x0AC)
> +#define PLB1_RL_MAX_REG			(PLB1_BLK_REG_ADDR + 0x0AC)
> +
> +/**
> + * MAC SERDES Address Paging register
> + */
> +#define PLB0_EMS_ADD_REG		(PLB0_BLK_REG_ADDR + 0xD0)
> +#define PLB1_EMS_ADD_REG		(PLB1_BLK_REG_ADDR + 0xD0)
> +
> +/**
> + * LL EMS Registers
> + */
> +#define LL_EMS0_BLK_REG_ADDR		0x00026800
> +#define LL_EMS1_BLK_REG_ADDR		0x00026C00
> +
> +/**
> + * BPC Block Register Address Offset from BAR0
> + * BPC0 Range : 0x23000 - 0x233FF
> + * BPC1 Range : 0x23400 - 0x237FF
> + */
> +#define BPC0_BLK_REG_ADDR		0x00023000
> +#define BPC1_BLK_REG_ADDR		0x00023400
> +
> +/**
> + * BPC Registers
> + */
> +	/* TODO */
> +/**
> + * PMM Block Register Address Offset from BAR0
> + * PMM0 Range : 0x23800 - 0x23BFF
> + * PMM1 Range : 0x23C00 - 0x23FFF
> + */
> +#define PMM0_BLK_REG_ADDR		0x00023800
> +#define PMM1_BLK_REG_ADDR		0x00023C00
> +/**
> + * PMM Registers
> + */
> +	/* TODO */
> +
> +/**
> + * HQM Block Register Address Offset from BAR0
> + * HQM0 Range : 0x24000 - 0x243FF
> + * HQM1 Range : 0x24400 - 0x247FF
> + */
> +#define HQM0_BLK_REG_ADDR		0x00024000
> +#define HQM1_BLK_REG_ADDR		0x00024400
> +/**
> + * HQM Control Register
> + * Controls some aspects of IB
> + * See catapult_spec.pdf for details
> + */
> +#define HQM0_CTL_REG			(HQM0_BLK_REG_ADDR + 0x000)
> +#define HQM1_CTL_REG			(HQM1_BLK_REG_ADDR + 0x000)
> +/**
> + * HQM Stop Q Semaphore Registers.
> + * Only one Queue resource can be stopped at
> + * any given time. This register controls access
> + * to the single stop Q resource.
> + * See catapult_spec.pdf for details
> + */
> +#define HQM0_RXQ_STOP_SEM		(HQM0_BLK_REG_ADDR + 0x028)
> +#define HQM0_TXQ_STOP_SEM		(HQM0_BLK_REG_ADDR + 0x02C)
> +#define HQM1_RXQ_STOP_SEM		(HQM1_BLK_REG_ADDR + 0x028)
> +#define HQM1_TXQ_STOP_SEM		(HQM1_BLK_REG_ADDR + 0x02C)
> +
> +/**
> + * LUT Block Register Address Offset from BAR0
> + * LUT0 Range : 0x25800 - 0x25BFF
> + * LUT1 Range : 0x25C00 - 0x25FFF
> + */
> +#define LUT0_BLK_REG_ADDR		0x00025800
> +#define LUT1_BLK_REG_ADDR		0x00025C00
> +/**
> + * LUT Registers
> + * See catapult_spec.pdf for details
> + */
> +#define LUT0_ERR_STS			(LUT0_BLK_REG_ADDR + 0x000)
> +#define LUT1_ERR_STS			(LUT1_BLK_REG_ADDR + 0x000)
> +#define LUT0_SET_ERR_STS		(LUT0_BLK_REG_ADDR + 0x004)
> +#define LUT1_SET_ERR_STS		(LUT1_BLK_REG_ADDR + 0x004)
> +
> +/**
> + * LDM (Link Layer DMA) Registers
> + */
> +	/* TODO */
> +
> +/**
> + * SGD (Scatter/Gather Database) Registers
> + */
> +	/* TODO */
> +
> +/**
> + * TRC (Debug/Trace) Register Offset from BAR0
> + * Range : 0x26000 -- 0x263FFF
> + */
> +#define TRC_BLK_REG_ADDR		0x00026000
> +/**
> + * TRC Registers
> + * See catapult_spec.pdf for details of each
> + */
> +#define TRC_CTL_REG			(TRC_BLK_REG_ADDR + 0x000)
> +#define TRC_MODS_REG			(TRC_BLK_REG_ADDR + 0x004)
> +#define TRC_TRGC_REG			(TRC_BLK_REG_ADDR + 0x008)
> +#define TRC_CNT1_REG			(TRC_BLK_REG_ADDR + 0x010)
> +#define TRC_CNT2_REG			(TRC_BLK_REG_ADDR + 0x014)
> +#define TRC_NXTS_REG			(TRC_BLK_REG_ADDR + 0x018)
> +#define TRC_DIRR_REG			(TRC_BLK_REG_ADDR + 0x01C)
> +/**
> + * TRC Trigger match filters, total 10
> + * Determines the trigger condition
> + */
> +#define TRC_TRGM_REG(_num)		\
> +	(TRC_BLK_REG_ADDR + 0x040 + ((_num) << 2))
> +/**
> + * TRC Next State filters, total 10
> + * Determines the next state conditions
> + */
> +#define TRC_NXTM_REG(_num)		\
> +	(TRC_BLK_REG_ADDR + 0x080 + ((_num) << 2))
> +/**
> + * TRC Store Match filters, total 10
> + * Determines the store conditions
> + */
> +#define TRC_STRM_REG(_num)		\
> +	(TRC_BLK_REG_ADDR + 0x0C0 + ((_num) << 2))
> +
> +
> +
> +/**************************************************************
> + *               DOORBELLS ACCESS
> + **************************************************************/
> +/**
> + * Catapult doorbells
> + * Each doorbell-queue set has
> + * 1 RxQ, 1 TxQ, 2 IBs in that order
> + * Size of each entry in 32 bytes, even though only 1 word
> + * is used. For Non-VM case each doorbell-q set is
> + * separated by 128 bytes, for VM case it is separated
> + * by 4K bytes
> + * Non VM case Range : 0x38000 - 0x39FFF
> + * VM case Range     : 0x100000 - 0x11FFFF
> + * The range applies to both HQMs
> + */
> +#define HQM_DOORBELL_BLK_BASE_ADDR	0x00038000
> +#define HQM_DOORBELL_VM_BLK_BASE_ADDR	0x00100000
> +
> +/**************************************************************
> + *               MEMORY ACCESS
> + **************************************************************/
> +/**
> + * Catapult H/W Block Memory Access Address
> + * To the host a memory space of 32K (page) is visible
> + * at a time. The address range is from 0x08000 to 0x0FFFF
> + */
> +#define HW_BLK_HOST_MEM_ADDR		0x08000
> +
> +/**
> + * Catapult LUT Memory Access Page Numbers
> + * Range : LUT0 0xa0-0xa1
> + *         LUT1 0xa2-0xa3
> + */
> +#define LUT0_MEM_BLK_BASE_PG_NUM	0x000000A0
> +#define LUT1_MEM_BLK_BASE_PG_NUM	0x000000A2
> +
> +/**
> + * Catapult RxFn Database Memory Block Base Offset
> + *
> + * The Rx function database exists in LUT block.
> + * In PCIe space this is accessible as a 256x32
> + * bit block. Each entry in this database is 4
> + * (4 byte) words. Max. entries is 64.
> + * Address of an entry corresponding to a function
> + * = base_addr + (function_no. * 16)
> + */
> +#define RX_FNDB_RAM_BASE_OFFSET		0x0000B400
> +
> +/**
> + * Catapult TxFn Database Memory Block Base Offset Address
> + *
> + * The Tx function database exists in LUT block.
> + * In PCIe space this is accessible as a 64x32
> + * bit block. Each entry in this database is 1
> + * (4 byte) word. Max. entries is 64.
> + * Address of an entry corresponding to a function
> + * = base_addr + (function_no. * 4)
> + */
> +#define TX_FNDB_RAM_BASE_OFFSET		0x0000B800
> +
> +/**
> + * Catapult Unicast CAM Base Offset Address
> + *
> + * Exists in LUT memory space.
> + * Shared by both the LL & FCoE driver.
> + * Size is 256x48 bits; mapped to PCIe space
> + * 512x32 bit blocks. For each address, bits
> + * are written in the order : [47:32] and then
> + * [31:0].
> + */
> +#define UCAST_CAM_BASE_OFFSET		0x0000A800
> +
> +/**
> + * Catapult Unicast RAM Base Offset Address
> + *
> + * Exists in LUT memory space.
> + * Shared by both the LL & FCoE driver.
> + * Size is 256x9 bits.
> + */
> +#define UCAST_RAM_BASE_OFFSET		0x0000B000
> +
> +/**
> + * Catapult Mulicast CAM Base Offset Address
> + *
> + * Exists in LUT memory space.
> + * Shared by both the LL & FCoE driver.
> + * Size is 256x48 bits; mapped to PCIe space
> + * 512x32 bit blocks. For each address, bits
> + * are written in the order : [47:32] and then
> + * [31:0].
> + */
> +#define MCAST_CAM_BASE_OFFSET		0x0000A000
> +
> +/**
> + * Catapult VLAN RAM Base Offset Address
> + *
> + * Exists in LUT memory space.
> + * Size is 4096x66 bits; mapped to PCIe space as
> + * 8192x32 bit blocks.
> + * All the 4K entries are within the address range
> + * 0x0000 to 0x8000, so in the first LUT page.
> + */
> +#define VLAN_RAM_BASE_OFFSET		0x00000000
> +
> +/**
> + * Catapult Tx Stats RAM Base Offset Address
> + *
> + * Exists in LUT memory space.
> + * Size is 1024x33 bits;
> + * Each Tx function has 64 bytes of space
> + */
> +#define TX_STATS_RAM_BASE_OFFSET	0x00009000
> +
> +/**
> + * Catapult Rx Stats RAM Base Offset Address
> + *
> + * Exists in LUT memory space.
> + * Size is 1024x33 bits;
> + * Each Rx function has 64 bytes of space
> + */
> +#define RX_STATS_RAM_BASE_OFFSET	0x00008000
> +/**
> + * Catapult RXA Memory Access Page Numbers
> + */
> +#define RXA0_MEM_BLK_BASE_PG_NUM	0x0000008C
> +#define RXA1_MEM_BLK_BASE_PG_NUM	0x0000008D
> +
> +/**
> + * Catapult Multicast Vector Table Base Offset Address
> + *
> + * Exists in RxA memory space.
> + * Organized as 512x65 bit block.
> + * However for each entry 16 bytes allocated (power of 2)
> + * Total size 512*16 bytes.
> + * There are two logical divisions, 256 entries each :
> + * a) Entries 0x00 to 0xff (256) -- Approx. MVT
> + *    Offset 0x000 to 0xFFF
> + * b) Entries 0x100 to 0x1ff (256) -- Exact MVT
> + *    Offsets 0x1000 to 0x1FFF
> + */
> +#define MCAST_APPROX_MVT_BASE_OFFSET	0x00000000
> +#define MCAST_EXACT_MVT_BASE_OFFSET	0x00001000
> +/**
> + * Catapult RxQ Translate Table (RIT) Base Offset Address
> + *
> + * Exists in RxA memory space
> + * Total no. of entries 64
> + * Each entry is 1 (4 byte) word.
> + * 31:12 -- Reserved
> + * 11:0  -- Two 6 bit RxQ Ids
> + */
> +#define FUNCTION_TO_RXQ_TRANSLATE	0x00002000
> +
> +/**
> + * Catapult RxAdm (RAD) Memory Access Page Numbers
> + */
> +#define RAD0_MEM_BLK_BASE_PG_NUM	0x00000086
> +#define RAD1_MEM_BLK_BASE_PG_NUM	0x00000087
> +
> +/**
> + * Catapult RSS Table Base Offset Address
> + *
> + * Exists in RAD memory space.
> + * Each entry is 352 bits, but alligned on
> + * 64 byte (512 bit) boundary. Accessed
> + * 4 byte words, the whole entry can be
> + * broken into 11 word accesses.
> + */
> +#define RSS_TABLE_BASE_OFFSET		0x00000800
> +
> +/**
> + * Catapult CPQ Block Page Number
> + * This value is written to the page number registers
> + * to access the memory associated with the mailboxes.
> + */
> +#define CPQ_BLK_PG_NUM			0x00000005
> +
> +/**
> + * Clarification :
> + * LL functions are 2 & 3; can HostFn0/HostFn1
> + * <-> LPU0/LPU1 memories be used ?
> + */
> +/**
> + * Catapult HostFn0/HostFn1 to LPU0/LPU1 Mbox memory
> + * Per catapult_spec.pdf, the offset of the mbox
> + * memory is in the register space at an offset of 0x200
> + */
> +#define CPQ_BLK_REG_MBOX_ADDR		(CPQ_BLK_REG_ADDR + 0x200)
> +
> +#define HOSTFN_LPU_MBOX			(CPQ_BLK_REG_MBOX_ADDR + 0x000)
> +/**
> + * Catapult LPU0/LPU1 to HostFn0/HostFn1 Mbox memory
> + */
> +#define LPU_HOSTFN_MBOX			(CPQ_BLK_REG_MBOX_ADDR + 0x080)
> +
> +/**
> + * Catapult HQM Block Page Number
> + * This is written to the page number register for
> + * the appropriate function to access the memory
> + * associated with HQM
> + */
> +#define HQM0_BLK_PG_NUM			0x00000096
> +#define HQM1_BLK_PG_NUM			0x00000097
> +
> +/**
> + * Note that TxQ and RxQ entries are interlaced
> + * the HQM memory, i.e RXQ0, TXQ0, RXQ1, TXQ1.. etc.
> + */
> +
> +#define HQM_RXTX_Q_RAM_BASE_OFFSET	0x00004000
> +
> +/**
> + * CQ Memory
> + * Exists in HQM Memory space
> + * Each entry is 16 (4 byte) words of which
> + * only 12 words are used for configuration
> + * Total 64 entries per HQM memory space
> + */
> +#define HQM_CQ_RAM_BASE_OFFSET		0x00006000
> +
> +/**
> + * Interrupt Block (IB) Memory
> + * Exists in HQM Memory space
> + * Each entry is 8 (4 byte) words of which
> + * only 5 words are used for configuration
> + * Total 128 entries per HQM memory space
> + */
> +#define HQM_IB_RAM_BASE_OFFSET		0x00001000
> +
> +/**
> + * Index Table (IT) Memory
> + * Exists in HQM Memory space
> + * Each entry is 1 (4 byte) word which
> + * is used for configuration
> + * Total 128 entries per HQM memory space
> + */
> +#define HQM_INDX_TBL_RAM_BASE_OFFSET	0x00002000
> +
> +/**
> + * PSS Block Memory Page Number
> + * This is written to the appropriate page number
> + * register to access the CPU memory.
> + * Also known as the PSS secondary memory (SMEM).
> + * Range : 0x180 to 0x1CF
> + * See catapult_spec.pdf for details
> + */
> +#define PSS_BLK_PG_NUM			0x00000180
> +
> +/**
> + * Offsets of different instances of PSS SMEM
> + * 2.5M of continuous 1T memory space : 2 blocks
> + * of 1M each (32 pages each, page=32KB) and 4 smaller
> + * blocks of 128K each (4 pages each, page=32KB)
> + * PSS_LMEM_INST0 is used for firmware download
> + */
> +#define PSS_LMEM_INST0			0x00000000
> +#define PSS_LMEM_INST1			0x00100000
> +#define PSS_LMEM_INST2			0x00200000
> +#define PSS_LMEM_INST3			0x00220000
> +#define PSS_LMEM_INST4			0x00240000
> +#define PSS_LMEM_INST5			0x00260000
> +
> +#endif /* __BNA_HWREG_H__ */
> --
> To unsubscribe from this list: send the line "unsubscribe netdev" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply

* [PATCH net-next-2.6] vlan: multiqueue vlan devices
From: Eric Dumazet @ 2009-09-02 17:17 UTC (permalink / raw)
  To: David Miller; +Cc: Patrick McHardy, Stephen Hemminger, jarkao2, netdev
In-Reply-To: <4A9EA250.2010209@gmail.com>

vlan devices are currently not multi-queue capable.

We can do that with a new rtnl_link_ops method,
get_tx_queues(), called from rtnl_create_link()

This new method gets num_tx_queues/real_num_tx_queues
from real device.

register_vlan_device() is also handled.

Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com>
---
 include/net/rtnetlink.h  |    2 ++
 net/8021q/vlan.c         |    5 +++--
 net/8021q/vlan_netlink.c |   20 ++++++++++++++++++++
 net/core/rtnetlink.c     |   10 +++++++++-
 4 files changed, 34 insertions(+), 3 deletions(-)

diff --git a/include/net/rtnetlink.h b/include/net/rtnetlink.h
index 3c1895e..0525a1d 100644
--- a/include/net/rtnetlink.h
+++ b/include/net/rtnetlink.h
@@ -70,6 +70,8 @@ struct rtnl_link_ops {
 	size_t			(*get_xstats_size)(const struct net_device *dev);
 	int			(*fill_xstats)(struct sk_buff *skb,
 					       const struct net_device *dev);
+	int			(*get_tx_queues)(struct net*, struct nlattr *tb[],
+						 int *tx_queues, int *real_tx_queues);
 };
 
 extern int	__rtnl_link_register(struct rtnl_link_ops *ops);
diff --git a/net/8021q/vlan.c b/net/8021q/vlan.c
index e814794..8836575 100644
--- a/net/8021q/vlan.c
+++ b/net/8021q/vlan.c
@@ -330,12 +330,13 @@ static int register_vlan_device(struct net_device *real_dev, u16 vlan_id)
 		snprintf(name, IFNAMSIZ, "vlan%.4i", vlan_id);
 	}
 
-	new_dev = alloc_netdev(sizeof(struct vlan_dev_info), name,
-			       vlan_setup);
+	new_dev = alloc_netdev_mq(sizeof(struct vlan_dev_info), name,
+				  vlan_setup, real_dev->num_tx_queues);
 
 	if (new_dev == NULL)
 		return -ENOBUFS;
 
+	new_dev->real_num_tx_queues = real_dev->real_num_tx_queues;
 	dev_net_set(new_dev, net);
 	/* need 4 bytes for extra VLAN header info,
 	 * hope the underlying device can handle it.
diff --git a/net/8021q/vlan_netlink.c b/net/8021q/vlan_netlink.c
index e9c91dc..8ce4122 100644
--- a/net/8021q/vlan_netlink.c
+++ b/net/8021q/vlan_netlink.c
@@ -100,6 +100,25 @@ static int vlan_changelink(struct net_device *dev,
 	return 0;
 }
 
+static int vlan_get_tx_queues(struct net *net, 
+			      struct nlattr *tb[],
+			      int *num_tx_queues,
+			      int *real_num_tx_queues)
+{
+	struct net_device *real_dev;
+
+	if (!tb[IFLA_LINK])
+		return -EINVAL;
+
+	real_dev = __dev_get_by_index(net, nla_get_u32(tb[IFLA_LINK]));
+	if (!real_dev)
+		return -ENODEV;
+
+	*num_tx_queues      = real_dev->num_tx_queues;
+	*real_num_tx_queues = real_dev->real_num_tx_queues;
+	return 0;
+}
+
 static int vlan_newlink(struct net_device *dev,
 			struct nlattr *tb[], struct nlattr *data[])
 {
@@ -216,6 +235,7 @@ struct rtnl_link_ops vlan_link_ops __read_mostly = {
 	.maxtype	= IFLA_VLAN_MAX,
 	.policy		= vlan_policy,
 	.priv_size	= sizeof(struct vlan_dev_info),
+	.get_tx_queues  = vlan_get_tx_queues,
 	.setup		= vlan_setup,
 	.validate	= vlan_validate,
 	.newlink	= vlan_newlink,
diff --git a/net/core/rtnetlink.c b/net/core/rtnetlink.c
index b44775f..5c1fe53 100644
--- a/net/core/rtnetlink.c
+++ b/net/core/rtnetlink.c
@@ -974,12 +974,20 @@ struct net_device *rtnl_create_link(struct net *net, char *ifname,
 {
 	int err;
 	struct net_device *dev;
+	int num_queues = 1;
+	int real_num_queues = 1;
 
+	if (ops->get_tx_queues) {
+		err = ops->get_tx_queues(net, tb, &num_queues, &real_num_queues);
+		if (err)
+			goto err;
+	}
 	err = -ENOMEM;
-	dev = alloc_netdev(ops->priv_size, ifname, ops->setup);
+	dev = alloc_netdev_mq(ops->priv_size, ifname, ops->setup, num_queues);
 	if (!dev)
 		goto err;
 
+	dev->real_num_tx_queues = real_num_queues;
 	if (strchr(dev->name, '%')) {
 		err = dev_alloc_name(dev, dev->name);
 		if (err < 0)

^ permalink raw reply related

* Re: [RFC][PATCH][v5] net/ethtool: Add support for the ethtool feature to flash firmware image from a specified file.
From: Ben Hutchings @ 2009-09-02 17:00 UTC (permalink / raw)
  To: Jeff Garzik; +Cc: Ajit Khaparde, davem, netdev
In-Reply-To: <4A9EA1D7.8090002@pobox.com>

On Wed, 2009-09-02 at 12:48 -0400, Jeff Garzik wrote:
> On 09/02/2009 12:22 PM, Ajit Khaparde wrote:
> > On 21/08/09 22:24 +0530, Ajit Khaparde wrote:
> >> This patch adds support to flash a firmware image to a device using ethtool.
> >> The driver gets the filename of the firmware image and flashes the image
> >> using the request firmware path.
> >>
> >> The region "on the chip" to be flashed can be specified by an option.
> >> It is upto the device driver to enumerate the region number passed by ethtool,
> >> to the region to be flashed.
> >>
> >> The default behavior is to flash all the regions on the chip.
> > David,
> > Is this patch good enough for being accepted? Please let me know if any more
> > work has to be done on this.
> >
> > I want to send the changes to the be2net driver which would use this
> > new ethtool feature.  Any ETA for applying the patch to the git tree?
> > I will schedule my driver patch according to that.
> 
> BTW, don't forget to update the ethtool userland utility as well,
> git://git.kernel.org/pub/scm/network/ethtool/ethtool.git

Ajit also posted a patch for ethtool on 21 August:
<http://article.gmane.org/gmane.linux.network/135785>.

Also, I sent you a series of patches back in April that you have
apparently ignored, although the corresponding kernel patches are in
2.6.31: <http://thread.gmane.org/gmane.linux.network/127025>.

Ben.

-- 
Ben Hutchings, Senior Software Engineer, Solarflare Communications
Not speaking for my employer; that's the marketing department's job.
They asked us to note that Solarflare product names are trademarked.

^ permalink raw reply

* Re: [PATCH net-next-2.6] tc: report informations for multiqueue devices
From: Eric Dumazet @ 2009-09-02 16:50 UTC (permalink / raw)
  To: Patrick McHardy; +Cc: Stephen Hemminger, David Miller, jarkao2, cl, netdev
In-Reply-To: <4A9E9D8A.50406@trash.net>

Patrick McHardy a écrit :
> Stephen Hemminger wrote:
>> Even if the kernel keeps the value in something else for the API
>> QINDEX needs to be a fixed size unsigned type like u32.
> 
> Just to avoid duplicate work - I'm currently trying to put a patch
> together that presents multiqueue qdiscs to userspace as regular
> child qdiscs of a dummy qdisc, which also contains the aggregated
> statistics. So far it actually looks pretty sane :)

Excellent !

I am cooking a patch to make vlan devices multiqueue aware
(ie having same tx_queue numbers than their real_dev)


^ permalink raw reply

* Re: [RFC][PATCH][v5] net/ethtool: Add support for the ethtool feature to flash firmware image from a specified file.
From: Jeff Garzik @ 2009-09-02 16:48 UTC (permalink / raw)
  To: Ajit Khaparde; +Cc: davem, netdev
In-Reply-To: <20090902162241.GB23228@serverengines.com>

On 09/02/2009 12:22 PM, Ajit Khaparde wrote:
> On 21/08/09 22:24 +0530, Ajit Khaparde wrote:
>> This patch adds support to flash a firmware image to a device using ethtool.
>> The driver gets the filename of the firmware image and flashes the image
>> using the request firmware path.
>>
>> The region "on the chip" to be flashed can be specified by an option.
>> It is upto the device driver to enumerate the region number passed by ethtool,
>> to the region to be flashed.
>>
>> The default behavior is to flash all the regions on the chip.
> David,
> Is this patch good enough for being accepted? Please let me know if any more
> work has to be done on this.
>
> I want to send the changes to the be2net driver which would use this
> new ethtool feature.  Any ETA for applying the patch to the git tree?
> I will schedule my driver patch according to that.

BTW, don't forget to update the ethtool userland utility as well,
git://git.kernel.org/pub/scm/network/ethtool/ethtool.git

	Jeff




^ permalink raw reply

* Re: [RFC][PATCH][v5] net/ethtool: Add support for the ethtool feature to flash firmware image from a specified file.
From: Ajit Khaparde @ 2009-09-02 16:22 UTC (permalink / raw)
  To: davem, jgarzik, netdev
In-Reply-To: <20090821165416.GA17102@serverengines.com>

On 21/08/09 22:24 +0530, Ajit Khaparde wrote:
> This patch adds support to flash a firmware image to a device using ethtool.
> The driver gets the filename of the firmware image and flashes the image
> using the request firmware path.
> 
> The region "on the chip" to be flashed can be specified by an option.
> It is upto the device driver to enumerate the region number passed by ethtool,
> to the region to be flashed.
> 
> The default behavior is to flash all the regions on the chip.
David, 
Is this patch good enough for being accepted? Please let me know if any more 
work has to be done on this. 

I want to send the changes to the be2net driver which would use this 
new ethtool feature.  Any ETA for applying the patch to the git tree? 
I will schedule my driver patch according to that. 

Thanks 
-Ajit

^ permalink raw reply

* Re: [PATCH net-next-2.6] tc: report informations for multiqueue devices
From: Patrick McHardy @ 2009-09-02 16:30 UTC (permalink / raw)
  To: Stephen Hemminger; +Cc: Eric Dumazet, David Miller, jarkao2, cl, netdev
In-Reply-To: <20090902092351.11649796@nehalam>

Stephen Hemminger wrote:
> Even if the kernel keeps the value in something else for the API
> QINDEX needs to be a fixed size unsigned type like u32.

Just to avoid duplicate work - I'm currently trying to put a patch
together that presents multiqueue qdiscs to userspace as regular
child qdiscs of a dummy qdisc, which also contains the aggregated
statistics. So far it actually looks pretty sane :)

^ permalink raw reply

* Re: [PATCH net-next-2.6] tc: report informations for multiqueue devices
From: Stephen Hemminger @ 2009-09-02 16:23 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David Miller, jarkao2, cl, kaber, netdev
In-Reply-To: <4A9E699B.7080400@gmail.com>

On Wed, 02 Sep 2009 14:48:27 +0200
Eric Dumazet <eric.dumazet@gmail.com> wrote:

> Eric Dumazet a écrit :
> > David Miller a écrit :
> >> From: Eric Dumazet <eric.dumazet@gmail.com>
> >> Date: Wed, 02 Sep 2009 10:28:55 +0200
> >>
> >>> What naming convention should we choose for multiqueue devices ?
> >> We could give an index field to multiple root qdiscs assigned
> >> to a device.
> > 
> > Here is a patch then :)
> > 
> > Only point is that I am iterating from 0 to dev->real_num_tx_queues
> > instead of dev->num_tx_queues. I hope it's fine, because there are
> > allocated qdisc, but not really used.
> > 
> > Next patches to allow selective qdisc change/fetch (providing a TCA_QINDEX
> > selector value to kernel)
> > 
> > Thanks
> > 
> > 
> > [PATCH net-next-2.6] tc: report informations for multiqueue devices
> > 
> > qdisc and classes are not yet displayed by "tc -s -d {qdisc|class} show"
> > for multiqueue devices.
> > 
> > We use a new TCA_QINDEX attribute, to report queue index to user space.
> > iproute2 tc should be changed to eventually display this queue index as in :
> > 
> > $ tc -s -d qdisc
> > qdisc pfifo_fast 0: dev eth0 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
> >  Sent 52498 bytes 465 pkt (dropped 0, overlimits 0 requeues 0)
> >  rate 0bit 0pps backlog 0b 0p requeues 0
> > qdisc pfifo_fast 0: dev eth0 qindex 1 root bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
> >  Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0)
> >  rate 0bit 0pps backlog 0b 0p requeues 0
> > 
> 
> 
> Here is the iproute2 patch as well, to display queue indexes
> 
> Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com>
> ---
> 
> diff --git a/include/linux/rtnetlink.h b/include/linux/rtnetlink.h
> index ba3254e..b80e0f6 100644
> --- a/include/linux/rtnetlink.h
> +++ b/include/linux/rtnetlink.h
> @@ -490,6 +490,7 @@ enum
>  	TCA_FCNT,
>  	TCA_STATS2,
>  	TCA_STAB,
> +	TCA_QINDEX,
>  	__TCA_MAX
>  };
>  
> diff --git a/tc/tc_class.c b/tc/tc_class.c
> index 9d4eea5..1bc4bc6 100644
> --- a/tc/tc_class.c
> +++ b/tc/tc_class.c
> @@ -196,6 +196,13 @@ int print_class(const struct sockaddr_nl *who,
>  	if (filter_ifindex == 0)
>  		fprintf(fp, "dev %s ", ll_index_to_name(t->tcm_ifindex));
>  
> +	if (tb[TCA_QINDEX]) {
> +		int qindex = 0;
> +
> +		memcpy(&qindex, RTA_DATA(tb[TCA_QINDEX]), MIN(RTA_PAYLOAD(tb[TCA_QINDEX]), sizeof(int)));
> +		fprintf(fp, "qindex %d ", qindex);
> +	}
> +
>  	if (t->tcm_parent == TC_H_ROOT)
>  		fprintf(fp, "root ");
>  	else {

Even if the kernel keeps the value in something else for the API
QINDEX needs to be a fixed size unsigned type like u32.



^ permalink raw reply

* Re: [PATCH net-next-2.6] ip: Report qdisc packet drops
From: Eric Dumazet @ 2009-09-02 16:20 UTC (permalink / raw)
  To: Sridhar Samudrala; +Cc: David Miller, cl, dlstevens, netdev, niv, mtk.manpages
In-Reply-To: <1251907915.6468.15.camel@w-sridhar.beaverton.ibm.com>

Sridhar Samudrala a écrit :
> On Wed, 2009-09-02 at 16:43 +0200, Eric Dumazet wrote:
>> David Miller a écrit :
>>> From: Eric Dumazet <eric.dumazet@gmail.com>
>>> Date: Mon, 31 Aug 2009 14:09:50 +0200
>>>
>>>> Re-reading again this stuff, I realized ip6_push_pending_frames()
>>>> was not updating IPSTATS_MIB_OUTDISCARDS, even if IP_RECVERR was set.
>>>>
>>>> May I suggest following path :
>>>>
>>>> 1) Correct ip6_push_pending_frames() to properly
>>>> account for dropped-by-qdisc frames when IP_RECVERR is set
>>> Your patch is  applied to net-next-2.6, thanks!
>>>
>>>> 2) Submit a patch to account for qdisc-dropped frames in SNMP counters
>>>> but still return a OK to user application, to not break them ?
>>> Sounds good.
>>>
>>> I think if you sample random UDP applications, you will find that such
>>> errors will upset them terribly, make them log tons of crap to
>>> /var/log/messages et al., and consume tons of CPU.
>>>
>>> And in such cases silent ignoring of drops is entirely appropriate and
>>> optimal, which supports our current behavior.
>>>
>>> If we are to make such applications "more sophisticated" such
>>> converted apps can be indicated simply their use of IP_RECVERR.
>>>
>>> If you want to be notified of all asynchronous errors we can detect,
>>> you use this, end of story.  It is the only way to handle this
>>> situation without breaking the world.
>>>
>>> As usual, Alexey Kuznetsov's analysis of this situation is timeless,
>>> accurate, and wise.  And he understood all of this 10+ years ago.
>> Thanks David, here is the 2nd patch then :
>>
>>
>> [PATCH net-next-2.6] ip: Report qdisc packet drops
>>
>> Christoph Lameter pointed out that packet drops at qdisc level where not
>> accounted in SNMP counters. Only if application sets IP_RECVERR, drops
>> are reported to user (-ENOBUFS errors) and SNMP counters updated.
>>
>> IP_RECVERR is used to enable extended reliable error message passing,
>> but these are not needed to update system wide SNMP stats.
>>
>> This patch changes things a bit to allow SNMP counters to be updated,
>> regardless of IP_RECVERR being set or not on the socket.
>>
>> Example after an UDP tx flood
>> # netstat -s 
>> ...
>> IP:
>>     1487048 outgoing packets dropped
>> ...
>> Udp:
>> ...
>>     SndbufErrors: 1487048
>>
> 
> Didn't we agree that qdisc drops should not be counted as IP or UDP 
> drops as David Stevens pointed out?
> I would say that even when IP_RECVERR is set, SNMP counters at IP and
> UDP should not be counted when a packet is dropped at qdisc level,
> but the error can be reported to user.
> 
> Now that qdisc stats issue is figured out and they can be accounted
> and seen at qdisc level, doesn't it confuse if we count the same drop 
> at IP, UDP and qdisc level?
> 
> Thanks
> Sridhar
>

Yes, I am aware of David point, but its already not true with current kernel.

Current kernels and an UDP frame sent by application :

if IP_RECVERR not set, no SNMP error logged, IP or UDP level

if IP_RECVERR is set, qdisc drops are reported both to IP and UDP
SNMP counters.



udp_sendmsg()
{
...
out:
        ip_rt_put(rt);
        if (free)
                kfree(ipc.opt);
        if (!err)
                return len;
        /*
         * ENOBUFS = no kernel mem, SOCK_NOSPACE = no sndbuf space.  Reporting
         * ENOBUFS might not be good (it's not tunable per se), but otherwise
         * we don't have a good statistic (IpOutDiscards but it can be too many
         * things).  We could add another new stat but at least for now that
         * seems like overkill.
         */
        if (err == -ENOBUFS || test_bit(SOCK_NOSPACE, &sk->sk_socket->flags)) {
                UDP_INC_STATS_USER(sock_net(sk),
                                UDP_MIB_SNDBUFERRORS, is_udplite);
        }
        return err;
...
}


So what shall we do ?

IMHO, one should not add MIB counters for different domains (IP / UDP), this
makes no sense.


^ permalink raw reply

* Re: [PATCH net-next-2.6] ip: Report qdisc packet drops
From: Sridhar Samudrala @ 2009-09-02 16:11 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David Miller, cl, dlstevens, netdev, niv, mtk.manpages
In-Reply-To: <4A9E849A.30105@gmail.com>

On Wed, 2009-09-02 at 16:43 +0200, Eric Dumazet wrote:
> David Miller a écrit :
> > From: Eric Dumazet <eric.dumazet@gmail.com>
> > Date: Mon, 31 Aug 2009 14:09:50 +0200
> > 
> >> Re-reading again this stuff, I realized ip6_push_pending_frames()
> >> was not updating IPSTATS_MIB_OUTDISCARDS, even if IP_RECVERR was set.
> >>
> >> May I suggest following path :
> >>
> >> 1) Correct ip6_push_pending_frames() to properly
> >> account for dropped-by-qdisc frames when IP_RECVERR is set
> > 
> > Your patch is  applied to net-next-2.6, thanks!
> > 
> >> 2) Submit a patch to account for qdisc-dropped frames in SNMP counters
> >> but still return a OK to user application, to not break them ?
> > 
> > Sounds good.
> > 
> > I think if you sample random UDP applications, you will find that such
> > errors will upset them terribly, make them log tons of crap to
> > /var/log/messages et al., and consume tons of CPU.
> > 
> > And in such cases silent ignoring of drops is entirely appropriate and
> > optimal, which supports our current behavior.
> > 
> > If we are to make such applications "more sophisticated" such
> > converted apps can be indicated simply their use of IP_RECVERR.
> > 
> > If you want to be notified of all asynchronous errors we can detect,
> > you use this, end of story.  It is the only way to handle this
> > situation without breaking the world.
> > 
> > As usual, Alexey Kuznetsov's analysis of this situation is timeless,
> > accurate, and wise.  And he understood all of this 10+ years ago.
> 
> Thanks David, here is the 2nd patch then :
> 
> 
> [PATCH net-next-2.6] ip: Report qdisc packet drops
> 
> Christoph Lameter pointed out that packet drops at qdisc level where not
> accounted in SNMP counters. Only if application sets IP_RECVERR, drops
> are reported to user (-ENOBUFS errors) and SNMP counters updated.
> 
> IP_RECVERR is used to enable extended reliable error message passing,
> but these are not needed to update system wide SNMP stats.
> 
> This patch changes things a bit to allow SNMP counters to be updated,
> regardless of IP_RECVERR being set or not on the socket.
> 
> Example after an UDP tx flood
> # netstat -s 
> ...
> IP:
>     1487048 outgoing packets dropped
> ...
> Udp:
> ...
>     SndbufErrors: 1487048
> 

Didn't we agree that qdisc drops should not be counted as IP or UDP 
drops as David Stevens pointed out?
I would say that even when IP_RECVERR is set, SNMP counters at IP and
UDP should not be counted when a packet is dropped at qdisc level,
but the error can be reported to user.

Now that qdisc stats issue is figured out and they can be accounted
and seen at qdisc level, doesn't it confuse if we count the same drop 
at IP, UDP and qdisc level?

Thanks
Sridhar

> 
> send() syscalls, do however still return an OK status, to not
> break applications.
> 
> Note : send() manual page explicitly says for -ENOBUFS error :
> 
>  "The output queue for a network interface was full.
>   This generally indicates that the interface has stopped sending,
>   but may be caused by transient congestion.
>   (Normally, this does not occur in Linux. Packets are just silently
>   dropped when a device queue overflows.) "
> 
> This is not true for IP_RECVERR enabled sockets : a send() syscall
> that hit a qdisc drop returns an ENOBUFS error.
> 
> Many thanks to Christoph, David, and last but not least, Alexey !
> 
> Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com>
> ---
>  include/net/ip.h      |    2 +-
>  include/net/ipv6.h    |    2 +-
>  include/net/udp.h     |    2 +-
>  net/ipv4/icmp.c       |    2 +-
>  net/ipv4/ip_output.c  |   19 ++++++++++---------
>  net/ipv4/raw.c        |   14 ++++++++++----
>  net/ipv4/udp.c        |   20 +++++++++++++-------
>  net/ipv6/icmp.c       |    2 +-
>  net/ipv6/ip6_output.c |   18 +++++++++++-------
>  net/ipv6/raw.c        |   15 ++++++++++-----
>  net/ipv6/udp.c        |   14 ++++++++++----
>  11 files changed, 69 insertions(+), 41 deletions(-)
> 
> diff --git a/include/net/ip.h b/include/net/ip.h
> index 72c3692..9dd19a8 100644
> --- a/include/net/ip.h
> +++ b/include/net/ip.h
> @@ -116,7 +116,7 @@ extern int		ip_append_data(struct sock *sk,
>  extern int		ip_generic_getfrag(void *from, char *to, int offset, int len, int odd, struct sk_buff *skb);
>  extern ssize_t		ip_append_page(struct sock *sk, struct page *page,
>  				int offset, size_t size, int flags);
> -extern int		ip_push_pending_frames(struct sock *sk);
> +extern int		ip_push_pending_frames(struct sock *sk, int recverr);
>  extern void		ip_flush_pending_frames(struct sock *sk);
> 
>  /* datagram.c */
> diff --git a/include/net/ipv6.h b/include/net/ipv6.h
> index ad9a511..f514257 100644
> --- a/include/net/ipv6.h
> +++ b/include/net/ipv6.h
> @@ -498,7 +498,7 @@ extern int			ip6_append_data(struct sock *sk,
>  						struct rt6_info *rt,
>  						unsigned int flags);
> 
> -extern int			ip6_push_pending_frames(struct sock *sk);
> +extern int			ip6_push_pending_frames(struct sock *sk, int recverr);
> 
>  extern void			ip6_flush_pending_frames(struct sock *sk);
> 
> diff --git a/include/net/udp.h b/include/net/udp.h
> index 5fb029f..a60ef10 100644
> --- a/include/net/udp.h
> +++ b/include/net/udp.h
> @@ -145,7 +145,7 @@ extern int 	udp_lib_getsockopt(struct sock *sk, int level, int optname,
>  			           char __user *optval, int __user *optlen);
>  extern int 	udp_lib_setsockopt(struct sock *sk, int level, int optname,
>  				   char __user *optval, int optlen,
> -				   int (*push_pending_frames)(struct sock *));
> +				   int (*push_pending_frames)(struct sock *, int));
> 
>  extern struct sock *udp4_lib_lookup(struct net *net, __be32 saddr, __be16 sport,
>  				    __be32 daddr, __be16 dport,
> diff --git a/net/ipv4/icmp.c b/net/ipv4/icmp.c
> index 97c410e..f46a53c 100644
> --- a/net/ipv4/icmp.c
> +++ b/net/ipv4/icmp.c
> @@ -345,7 +345,7 @@ static void icmp_push_reply(struct icmp_bxm *icmp_param,
>  						 icmp_param->head_len, csum);
>  		icmph->checksum = csum_fold(csum);
>  		skb->ip_summed = CHECKSUM_NONE;
> -		ip_push_pending_frames(sk);
> +		ip_push_pending_frames(sk, 0);
>  	}
>  }
> 
> diff --git a/net/ipv4/ip_output.c b/net/ipv4/ip_output.c
> index 7d08210..8f81dab 100644
> --- a/net/ipv4/ip_output.c
> +++ b/net/ipv4/ip_output.c
> @@ -1216,7 +1216,7 @@ static void ip_cork_release(struct inet_sock *inet)
>   *	Combined all pending IP fragments on the socket as one IP datagram
>   *	and push them out.
>   */
> -int ip_push_pending_frames(struct sock *sk)
> +int ip_push_pending_frames(struct sock *sk, int recverr)
>  {
>  	struct sk_buff *skb, *tmp_skb;
>  	struct sk_buff **tail_skb;
> @@ -1301,19 +1301,20 @@ int ip_push_pending_frames(struct sock *sk)
>  	/* Netfilter gets whole the not fragmented skb. */
>  	err = ip_local_out(skb);
>  	if (err) {
> -		if (err > 0)
> -			err = inet->recverr ? net_xmit_errno(err) : 0;
> +		if (err > 0) {
> +			err = net_xmit_errno(err);
> +			if (err && !recverr) {
> +				IP_INC_STATS(net, IPSTATS_MIB_OUTDISCARDS);
> +				err = 0;
> +			}
> +		}
>  		if (err)
> -			goto error;
> +			IP_INC_STATS(net, IPSTATS_MIB_OUTDISCARDS);
>  	}
> 
>  out:
>  	ip_cork_release(inet);
>  	return err;
> -
> -error:
> -	IP_INC_STATS(net, IPSTATS_MIB_OUTDISCARDS);
> -	goto out;
>  }
> 
>  /*
> @@ -1412,7 +1413,7 @@ void ip_send_reply(struct sock *sk, struct sk_buff *skb, struct ip_reply_arg *ar
>  			  arg->csumoffset) = csum_fold(csum_add(skb->csum,
>  								arg->csum));
>  		skb->ip_summed = CHECKSUM_NONE;
> -		ip_push_pending_frames(sk);
> +		ip_push_pending_frames(sk, 0);
>  	}
> 
>  	bh_unlock_sock(sk);
> diff --git a/net/ipv4/raw.c b/net/ipv4/raw.c
> index 2979f14..444c465 100644
> --- a/net/ipv4/raw.c
> +++ b/net/ipv4/raw.c
> @@ -374,8 +374,13 @@ static int raw_send_hdrinc(struct sock *sk, void *from, size_t length,
> 
>  	err = NF_HOOK(PF_INET, NF_INET_LOCAL_OUT, skb, NULL, rt->u.dst.dev,
>  		      dst_output);
> -	if (err > 0)
> -		err = inet->recverr ? net_xmit_errno(err) : 0;
> +	if (err > 0) {
> +		err = net_xmit_errno(err);
> +		if (!inet->recverr && err) {
> +			IP_INC_STATS(net, IPSTATS_MIB_OUTDISCARDS);
> +			err = 0;
> +		}
> +	}
>  	if (err)
>  		goto error;
>  out:
> @@ -576,8 +581,9 @@ back_from_confirm:
>  					&ipc, &rt, msg->msg_flags);
>  		if (err)
>  			ip_flush_pending_frames(sk);
> -		else if (!(msg->msg_flags & MSG_MORE))
> -			err = ip_push_pending_frames(sk);
> +		else if (!(msg->msg_flags & MSG_MORE)) {
> +			err = ip_push_pending_frames(sk, inet->recverr);
> +		}
>  		release_sock(sk);
>  	}
>  done:
> diff --git a/net/ipv4/udp.c b/net/ipv4/udp.c
> index 29ebb0d..6a6bf1d 100644
> --- a/net/ipv4/udp.c
> +++ b/net/ipv4/udp.c
> @@ -513,7 +513,7 @@ static void udp4_hwcsum_outgoing(struct sock *sk, struct sk_buff *skb,
>  /*
>   * Push out all pending data as one UDP datagram. Socket is locked.
>   */
> -static int udp_push_pending_frames(struct sock *sk)
> +static int udp_push_pending_frames(struct sock *sk, int recverr)
>  {
>  	struct udp_sock  *up = udp_sk(sk);
>  	struct inet_sock *inet = inet_sk(sk);
> @@ -560,7 +560,7 @@ static int udp_push_pending_frames(struct sock *sk)
>  		uh->check = CSUM_MANGLED_0;
> 
>  send:
> -	err = ip_push_pending_frames(sk);
> +	err = ip_push_pending_frames(sk, recverr);
>  out:
>  	up->len = 0;
>  	up->pending = 0;
> @@ -752,8 +752,14 @@ do_append_data:
>  			corkreq ? msg->msg_flags|MSG_MORE : msg->msg_flags);
>  	if (err)
>  		udp_flush_pending_frames(sk);
> -	else if (!corkreq)
> -		err = udp_push_pending_frames(sk);
> +	else if (!corkreq) {
> +		err = udp_push_pending_frames(sk, 1);
> +		if (err == -ENOBUFS && !inet->recverr) {
> +			UDP_INC_STATS_USER(sock_net(sk),
> +					   UDP_MIB_SNDBUFERRORS, is_udplite);
> +			err = 0;
> +		}
> +	}
>  	else if (unlikely(skb_queue_empty(&sk->sk_write_queue)))
>  		up->pending = 0;
>  	release_sock(sk);
> @@ -826,7 +832,7 @@ int udp_sendpage(struct sock *sk, struct page *page, int offset,
> 
>  	up->len += size;
>  	if (!(up->corkflag || (flags&MSG_MORE)))
> -		ret = udp_push_pending_frames(sk);
> +		ret = udp_push_pending_frames(sk, inet_sk(sk)->recverr);
>  	if (!ret)
>  		ret = size;
>  out:
> @@ -1354,7 +1360,7 @@ void udp_destroy_sock(struct sock *sk)
>   */
>  int udp_lib_setsockopt(struct sock *sk, int level, int optname,
>  		       char __user *optval, int optlen,
> -		       int (*push_pending_frames)(struct sock *))
> +		       int (*push_pending_frames)(struct sock *, int))
>  {
>  	struct udp_sock *up = udp_sk(sk);
>  	int val;
> @@ -1374,7 +1380,7 @@ int udp_lib_setsockopt(struct sock *sk, int level, int optname,
>  		} else {
>  			up->corkflag = 0;
>  			lock_sock(sk);
> -			(*push_pending_frames)(sk);
> +			(*push_pending_frames)(sk, 0);
>  			release_sock(sk);
>  		}
>  		break;
> diff --git a/net/ipv6/icmp.c b/net/ipv6/icmp.c
> index e2325f6..a9c54c2 100644
> --- a/net/ipv6/icmp.c
> +++ b/net/ipv6/icmp.c
> @@ -253,7 +253,7 @@ static int icmpv6_push_pending_frames(struct sock *sk, struct flowi *fl, struct
>  						      len, fl->proto,
>  						      tmp_csum);
>  	}
> -	ip6_push_pending_frames(sk);
> +	ip6_push_pending_frames(sk, 0);
>  out:
>  	return err;
>  }
> diff --git a/net/ipv6/ip6_output.c b/net/ipv6/ip6_output.c
> index a931229..ade5707 100644
> --- a/net/ipv6/ip6_output.c
> +++ b/net/ipv6/ip6_output.c
> @@ -1440,7 +1440,7 @@ static void ip6_cork_release(struct inet_sock *inet, struct ipv6_pinfo *np)
>  	memset(&inet->cork.fl, 0, sizeof(inet->cork.fl));
>  }
> 
> -int ip6_push_pending_frames(struct sock *sk)
> +int ip6_push_pending_frames(struct sock *sk, int recverr)
>  {
>  	struct sk_buff *skb, *tmp_skb;
>  	struct sk_buff **tail_skb;
> @@ -1510,18 +1510,22 @@ int ip6_push_pending_frames(struct sock *sk)
> 
>  	err = ip6_local_out(skb);
>  	if (err) {
> -		if (err > 0)
> -			err = np->recverr ? net_xmit_errno(err) : 0;
> +		if (err > 0) {
> +			err = net_xmit_errno(err);
> +			if (err && !recverr) {
> +				IP6_INC_STATS(net, rt->rt6i_idev,
> +					      IPSTATS_MIB_OUTDISCARDS);
> +				err = 0;
> +			}
> +		}
>  		if (err)
> -			goto error;
> +			IP6_INC_STATS(net, rt->rt6i_idev,
> +				      IPSTATS_MIB_OUTDISCARDS);
>  	}
> 
>  out:
>  	ip6_cork_release(inet, np);
>  	return err;
> -error:
> -	IP6_INC_STATS(net, rt->rt6i_idev, IPSTATS_MIB_OUTDISCARDS);
> -	goto out;
>  }
> 
>  void ip6_flush_pending_frames(struct sock *sk)
> diff --git a/net/ipv6/raw.c b/net/ipv6/raw.c
> index 5068410..d054fa2 100644
> --- a/net/ipv6/raw.c
> +++ b/net/ipv6/raw.c
> @@ -523,7 +523,7 @@ csum_copy_err:
>  }
> 
>  static int rawv6_push_pending_frames(struct sock *sk, struct flowi *fl,
> -				     struct raw6_sock *rp)
> +				     struct raw6_sock *rp, int recverr)
>  {
>  	struct sk_buff *skb;
>  	int err = 0;
> @@ -595,7 +595,7 @@ static int rawv6_push_pending_frames(struct sock *sk, struct flowi *fl,
>  		BUG();
> 
>  send:
> -	err = ip6_push_pending_frames(sk);
> +	err = ip6_push_pending_frames(sk, recverr);
>  out:
>  	return err;
>  }
> @@ -641,8 +641,13 @@ static int rawv6_send_hdrinc(struct sock *sk, void *from, int length,
>  	IP6_UPD_PO_STATS(sock_net(sk), rt->rt6i_idev, IPSTATS_MIB_OUT, skb->len);
>  	err = NF_HOOK(PF_INET6, NF_INET_LOCAL_OUT, skb, NULL, rt->u.dst.dev,
>  		      dst_output);
> -	if (err > 0)
> -		err = np->recverr ? net_xmit_errno(err) : 0;
> +	if (err > 0) {
> +		err = net_xmit_errno(err);
> +		if (!np->recverr && err) {
> +			IP6_INC_STATS(sock_net(sk), rt->rt6i_idev, IPSTATS_MIB_OUTDISCARDS);
> +			err = 0;
> +		}
> +	}
>  	if (err)
>  		goto error;
>  out:
> @@ -895,7 +900,7 @@ back_from_confirm:
>  		if (err)
>  			ip6_flush_pending_frames(sk);
>  		else if (!(msg->msg_flags & MSG_MORE))
> -			err = rawv6_push_pending_frames(sk, &fl, rp);
> +			err = rawv6_push_pending_frames(sk, &fl, rp, np->recverr);
>  		release_sock(sk);
>  	}
>  done:
> diff --git a/net/ipv6/udp.c b/net/ipv6/udp.c
> index 20d2ffc..963dd0a 100644
> --- a/net/ipv6/udp.c
> +++ b/net/ipv6/udp.c
> @@ -683,7 +683,7 @@ static void udp6_hwcsum_outgoing(struct sock *sk, struct sk_buff *skb,
>   *	Sending
>   */
> 
> -static int udp_v6_push_pending_frames(struct sock *sk)
> +static int udp_v6_push_pending_frames(struct sock *sk, int recverr)
>  {
>  	struct sk_buff *skb;
>  	struct udphdr *uh;
> @@ -723,7 +723,7 @@ static int udp_v6_push_pending_frames(struct sock *sk)
>  		uh->check = CSUM_MANGLED_0;
> 
>  send:
> -	err = ip6_push_pending_frames(sk);
> +	err = ip6_push_pending_frames(sk, recverr);
>  out:
>  	up->len = 0;
>  	up->pending = 0;
> @@ -975,8 +975,14 @@ do_append_data:
>  		corkreq ? msg->msg_flags|MSG_MORE : msg->msg_flags);
>  	if (err)
>  		udp_v6_flush_pending_frames(sk);
> -	else if (!corkreq)
> -		err = udp_v6_push_pending_frames(sk);
> +	else if (!corkreq) {
> +		err = udp_v6_push_pending_frames(sk, 1);
> +		if (err == -ENOBUFS && !np->recverr) {
> +			UDP6_INC_STATS_USER(sock_net(sk),
> +					   UDP_MIB_SNDBUFERRORS, is_udplite);
> +			err = 0;
> +		}
> +	}
>  	else if (unlikely(skb_queue_empty(&sk->sk_write_queue)))
>  		up->pending = 0;
> 
> --
> To unsubscribe from this list: send the line "unsubscribe netdev" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


^ permalink raw reply

* Re: [PATCH net-next-2.6] ip: Report qdisc packet drops
From: Eric Dumazet @ 2009-09-02 16:05 UTC (permalink / raw)
  To: Christoph Lameter; +Cc: David Miller, sri, dlstevens, netdev, niv, mtk.manpages
In-Reply-To: <alpine.DEB.1.10.0909021413490.16364@V090114053VZO-1>

Christoph Lameter a écrit :
> The patch is smaller if you remove the handling of recverr completely from
> ip_push_pending_frames() and return NET_RX_DROP etc. Two of the callers
> never even inspect the return code. For them this is useless processing.

We must check NET_XMIT_CN before doing the update, but you are probably right.

I'll cook another patch ASAP, thanks !



^ permalink raw reply

* Re: [PATCH 1/3] netfilter: xt_ipvs (netfilter matcher for IPVS)
From: Hannes Eder @ 2009-09-02 16:05 UTC (permalink / raw)
  To: Jan Engelhardt
  Cc: Patrick McHardy, lvs-devel, linux-kernel, netdev, netfilter-devel,
	Fabien Duchêne, Jean-Luc Fortemaison, Julian Anastasov,
	Julius Volz, Laurent Grawet, Simon Horman, Wensong Zhang
In-Reply-To: <alpine.LSU.2.00.0909021748310.25091@fbirervta.pbzchgretzou.qr>

On Wed, Sep 2, 2009 at 17:49, Jan Engelhardt<jengelh@medozas.de> wrote:
>
> On Wednesday 2009-09-02 17:36, Patrick McHardy wrote:
>>>
>>> Nice, I'll use par->family.
>>>
>>> So in theory I do not even need a check like the following in the beginning?
>>>
>>>      if (family != NFPROTO_IPV4
>>> #ifdef CONFIG_IP_VS_IPV6
>>>          && family != NFPROTO_IPV6
>>> #endif
>>>              ) {
>>>              match = false;
>>>              goto out;
>>>      }
>>
>>With the AF_UNSPEC registration of your match, it might be used
>
> par->family always contains the NFPROTO of the invoking implementation,
> which can never be UNSPEC (except, in future, xtables2 ;-)
>
> par->match->family however may be UNSPEC if the module works that way.
> Which is why we have par->family.
>

I'll a check_entry function:

static bool ipvs_mt_check(const struct xt_mtchk_param *par)
{
	if (par->family != NFPROTO_IPV4
#ifdef CONFIG_IP_VS_IPV6
	    && par->family != NFPROTO_IPV6
#endif
		)
		return false;

	return true;
}

and remove the runtime check in ipvs_mt.

^ permalink raw reply

* [PATCH] tc35815: Disable PM capability
From: Atsushi Nemoto @ 2009-09-02 15:51 UTC (permalink / raw)
  To: netdev; +Cc: David Miller

This chip may report existance of PM registers though they are not
supported.  Disable PM features by clearing pdev->pm_cap.

Signed-off-by: Atsushi Nemoto <anemo@mba.ocn.ne.jp>
---
 drivers/net/tc35815.c |    5 +++++
 1 files changed, 5 insertions(+), 0 deletions(-)

diff --git a/drivers/net/tc35815.c b/drivers/net/tc35815.c
index d2ca47f..71e4482 100644
--- a/drivers/net/tc35815.c
+++ b/drivers/net/tc35815.c
@@ -909,6 +909,11 @@ static int __devinit tc35815_init_one(struct pci_dev *pdev,
 	lp = netdev_priv(dev);
 	lp->dev = dev;
 
+	/* This device may have PM registers but they are not supported. */
+	if (pdev->pm_cap) {
+		dev_info(&pdev->dev, "Disable PM\n");
+		pdev->pm_cap = 0;
+	}
 	/* enable device (incl. PCI PM wakeup), and bus-mastering */
 	rc = pcim_enable_device(pdev);
 	if (rc)
-- 
1.5.6.5


^ permalink raw reply related

* Re: [RFC][PATCH][v5] net/ethtool: Add support for the ethtool feature to flash firmware image from a specified file.
From: Ajit Khaparde @ 2009-09-02 15:50 UTC (permalink / raw)
  To: davem, jgarzik, netdev
In-Reply-To: <20090821165416.GA17102@serverengines.com>

On 21/08/09 22:24 +0530, Ajit Khaparde wrote:
> This patch adds support to flash a firmware image to a device using ethtool.
> The driver gets the filename of the firmware image and flashes the image
> using the request firmware path.
> 
> The region "on the chip" to be flashed can be specified by an option.
> It is upto the device driver to enumerate the region number passed by ethtool,
> to the region to be flashed.
> 
> The default behavior is to flash all the regions on the chip.
David,
Is this patch good enough for being accepted? Please let me know if any more
work has to be done on this. Also, I want to send the changes to the be2net
driver which would use this ethtool feature.

Any ETA for applying the patch to the git tree? I will schedule my driver patch according to that.

Thanks
-Ajit

^ permalink raw reply

* Re: [PATCH 1/3] netfilter: xt_ipvs (netfilter matcher for IPVS)
From: Jan Engelhardt @ 2009-09-02 15:49 UTC (permalink / raw)
  To: Patrick McHardy
  Cc: Hannes Eder, lvs-devel, linux-kernel, netdev, netfilter-devel,
	Fabien Duchêne, Jean-Luc Fortemaison, Julian Anastasov,
	Julius Volz, Laurent Grawet, Simon Horman, Wensong Zhang
In-Reply-To: <4A9E90E4.9080805@trash.net>


On Wednesday 2009-09-02 17:36, Patrick McHardy wrote:
>> 
>> Nice, I'll use par->family.
>> 
>> So in theory I do not even need a check like the following in the beginning?
>> 
>> 	if (family != NFPROTO_IPV4
>> #ifdef CONFIG_IP_VS_IPV6
>> 	    && family != NFPROTO_IPV6
>> #endif
>> 		) {
>> 		match = false;
>> 		goto out;
>> 	}
>
>With the AF_UNSPEC registration of your match, it might be used

par->family always contains the NFPROTO of the invoking implementation,
which can never be UNSPEC (except, in future, xtables2 ;-)

par->match->family however may be UNSPEC if the module works that way.
Which is why we have par->family.

^ permalink raw reply

page: next (older) | prev (newer) | latest
- recent:[subjects (threaded)|topics (new)|topics (active)]

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox