From: Pavlos Parissis <pavlos.parissis@gmail.com>
To: "Paweł Staszewski" <pstaszewski@itcare.pl>
Cc: Alexander Duyck <alexander.duyck@gmail.com>,
"Anders K. Pedersen | Cohaesio" <akp@cohaesio.com>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
"intel-wired-lan@lists.osuosl.org"
<intel-wired-lan@lists.osuosl.org>,
"alexander.h.duyck@intel.com" <alexander.h.duyck@intel.com>
Subject: Re: Linux 4.12+ memory leak on router with i40e NICs
Date: Thu, 19 Oct 2017 13:41:59 +0200
Message-ID: <CABOTfnOtw=XdhPL4n1JM16CS02ekoy6EPugOvtSbO6LGp2XOXA@mail.gmail.com>
In-Reply-To: <57579746-77e1-4603-12ed-7d999fdfeabf@itcare.pl>
On 19 October 2017 at 01:40, Paweł Staszewski <pstaszewski@itcare.pl> wrote:
>
>
> On 2017-10-19 at 01:29, Alexander Duyck wrote:
>
>> On Mon, Oct 16, 2017 at 10:51 PM, Vitezslav Samel <vitezslav@samel.cz>
>> wrote:
>>>
>>> On Tue, Oct 17, 2017 at 01:34:29AM +0200, Paweł Staszewski wrote:
>>>>
>>>> On 2017-10-16 at 18:26, Paweł Staszewski wrote:
>>>>>
>>>>> On 2017-10-16 at 13:20, Pavlos Parissis wrote:
>>>>>>
>>>>>> On 15/10/2017 02:58 AM, Alexander Duyck wrote:
>>>>>>>
>>>>>>> Hi Pawel,
>>>>>>>
>>>>>>> To clarify, is that Dave Miller's tree or Linus's that you are talking
>>>>>>> about? If it is Dave's tree, how long ago did you pull it? I think
>>>>>>> the fix was pushed by Jeff Kirsher just a few days ago.
>>>>>>>
>>>>>>> The issue should be fixed in the following commit:
>>>>>>>
>>>>>>> https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/commit/drivers/net/ethernet/intel/i40e/i40e_txrx.c?id=2b9478ffc550f17c6cd8c69057234e91150f5972
>>>>>>
>>>>>> Do you know when it is going to be available on net-next and
>>>>>> linux-stable repos?
>>>>>>
>>>>>> Cheers,
>>>>>> Pavlos
>>>>>>
>>>>>>
>>>>> I will run some tests tonight with the "net" git tree, where this
>>>>> patch is included, starting from 0:00 CET :)
>>>>>
>>>>>
>>>> Upgraded, and it looks like the problem is not solved by that patch.
>>>> Currently running the system with the
>>>> https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/
>>>> kernel.
>>>>
>>>> About 0.5GB of memory is still leaking somewhere.
>>>>
>>>> I can also confirm that the latest kernel where memory is not leaking
>>>> (using the i40e driver with Intel 710 cards) is 4.11.12.
>>>> With kernel 4.11.12 there is no change in memory usage after an hour.
>>>>
>>>> I also checked that with ixgbe instead of i40e, on the same net.git
>>>> kernel, there is no memleak: memory usage is the same after an hour.
>>>> So this is 100% an i40e driver problem.
>>>
>>> I have (probably) the same problem here, but with X520 cards: booting
>>> 4.12.x gives me an oops after about 20 minutes of our workload. Booting
>>> 4.9.y is OK. This machine is in production, so any testing is very
>>> limited.
>>>
>>> The machine was stable for >2 months (on the desk, before it went into
>>> production) with 4.12.8, but with no traffic on the X520 cards.
>>>
>>> Cheers,
>>>
>>> Vita
>>
>> Sorry, but it can't be the same issue, since we are discussing a
>> different driver (i40e) running different hardware (X710 or XL710).
>> You might want to start a new thread for your issue, and/or, if
>> possible, file a bug on e1000.sf.net.
>>
>> Thanks.
>>
>> - Alex
>>
> Sorry, but bugs reported on e1000.sf.net are handled with long delays, some
> after six months or more. When I reported my first bug there, I got a reply
> after a year about no activity :):) haha, and that bug is still active :)
> For me it is now better (and certainly cheaper from the clients'
> perspective :) ) to change NICs to Mellanox, or just to replace them and
> use ixgbe, which does not have this bug. I have many servers with Mellanox
> and ixgbe NICs running the same configuration, and only the one with i40e,
> on the same configuration, has the memleak.
>
> If nobody from Intel wants to reproduce this, cool: this is not my problem
> but Intel's :) There are now many good NICs to use, like Mellanox, or I can
> just stick with the many 10G cards based on ixgbe, which is a really good
> driver. But really? Do the Intel guys have no XL710 cards? I don't want to
> buy more buggy cards only to do kernel bisects... sorry...
> To bisect this bug properly you may need 200-300 bisect steps, and
> confirming each one takes maybe 30 minutes, so count how much time you
> need. That may cost more than 100 Mellanox cards :)
>
I have similar issues to yours regarding the stability of the i40e
driver. I will need to open another thread about them, but I would
like to mention that you are not the only one who suffers from
problems related to the i40e driver. In my case I can't simply change
NICs, so it is even worse.
Cheers,
Pavlos