From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@bugzilla.kernel.org
Subject: [Bug 112241] Under heavy load FC TARGET going to Oops
Date: Tue, 01 Mar 2016 05:16:10 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
Return-path:
Received: from mail.kernel.org ([198.145.29.136]:52187 "EHLO mail.kernel.org"
rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
id S1750777AbcCAFQO (ORCPT );
Tue, 1 Mar 2016 00:16:14 -0500
Received: from mail.kernel.org (localhost [127.0.0.1])
by mail.kernel.org (Postfix) with ESMTP id 29F492034C
for ; Tue, 1 Mar 2016 05:16:13 +0000 (UTC)
Received: from bugzilla2.web.kernel.org (bugzilla2.web.kernel.org [172.20.200.52])
by mail.kernel.org (Postfix) with ESMTP id 75D562034E
for ; Tue, 1 Mar 2016 05:16:10 +0000 (UTC)
In-Reply-To:
Sender: linux-scsi-owner@vger.kernel.org
List-Id: linux-scsi@vger.kernel.org
To: linux-scsi@vger.kernel.org
https://bugzilla.kernel.org/show_bug.cgi?id=112241
--- Comment #3 from nab ---
Hi Anthony,
On Mon, 2016-02-29 at 02:26 +0000, bugzilla-daemon@bugzilla.kernel.org
wrote:
> https://bugzilla.kernel.org/show_bug.cgi?id=112241
>
> Anthony changed:
>
> What |Removed |Added
> ----------------------------------------------------------------------------
> Kernel Version|4.3.3 |4.5.0
>
> --- Comment #2 from Anthony ---
> With kernel 4.5.0 on target, system hang after clients connects to target.
>
So there are two things going on here.
First, the BUG_ON your ESX <-> LIO FC setup triggered has been addressed
recently in v4.5-rc4 and later kernels with the following series:
http://www.spinics.net/lists/target-devel/msg11822.html
Note these patches will be making it back to earlier stable kernels over
the next weeks.
However, this specific bug is a final consequence of larger ESX v5.5u2+
host side issue of AtomicTestandSet (ATS) heartbeat being enabled (by
default) for all VMFS5 mounts:
https://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2113956
Other folks have been hitting this recently, here's some extra
background:
http://permalink.gmane.org/gmane.linux.scsi.target.devel/11574
http://www.spinics.net/lists/target-devel/msg12124.html
Note this effects all targets w/ VAAI ATS (including EMC, IBM, 3PAR,
SolidFire, etc) and the current solution for ESX v5.5u2+ is to either:
- Explicitly disable ATS heartbeat usage on all VMFS5 mounts as
described in the VMWare -kb article, or:
- Explicitly disable all ATS logic completely from LIO using
emulate_caw=0 on all backends connected to ESX v5.5u2+ hosts
with VMFS5.
You can google for 'esx ats heartbeat bug' to see the gory details.
Thanks for reporting!
--nab
--
You are receiving this mail because:
You are watching the assignee of the bug.