All of lore.kernel.org
 help / color / mirror / Atom feed
* Bug report: ref tag error while running IO > MDTS on nvme device
@ 2017-10-03  9:48 Max Gurtovoy
  2017-10-03 14:25 ` Keith Busch
  0 siblings, 1 reply; 3+ messages in thread
From: Max Gurtovoy @ 2017-10-03  9:48 UTC (permalink / raw)


Hi guys,
I would like to report a bug while running fio on local nvme device 
(SAMSUNG MZPLL1T6HEHP-00003) that was formated to use integrity:

nvme format /dev/nvme2n1 -n 1 -l 1 -i 2 -p 0 -m 0
nvme id-ns /dev/nvme2n1:

NVME Identify Namespace 1:
nsze    : 0xba4d4ab0
ncap    : 0xba4d4ab0
nuse    : 0xba4d4ab0
nsfeat  : 0
nlbaf   : 3
flbas   : 0x1
mc      : 0x3
dpc     : 0x1f
dps     : 0x2
nmic    : 0x1
rescap  : 0x7f
fpi     : 0x80
nawun   : 0
nawupf  : 0
nacwu   : 0
nabsn   : 0
nabo    : 0
nabspf  : 0
noiob   : 0
nvmcap  : 1600321314816
nguid   : 334842304a6000840025384100000002
eui64   : 002538e671000457
lbaf  0 : ms:0   lbads:9  rp:0x1
lbaf  1 : ms:8   lbads:9  rp:0x3 (in use)
lbaf  2 : ms:0   lbads:12 rp:0
lbaf  3 : ms:8   lbads:12 rp:0x2

running traffic up to IO size <= 128KB works fine but running bigger IO 
that the MDTS caused many ref tag errors (for example):

[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 324043008 
(rcvd 0)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2929058096 
(rcvd 1241579520)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2292844816 
(rcvd 2483027968)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2523145472 
(rcvd 243859455)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 1177656832 
(rcvd 15400959)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 187312640 
(rcvd 243859455)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2344798720 
(rcvd 126418943)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2892037376 
(rcvd 126418943)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2292845056 
(rcvd 143196159)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 1237630256 
(rcvd 1241579520)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 1237630208 
(rcvd 143196159)
[Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2986034432 
(rcvd 243859455)
[Tue Oct  3 12:34:45 2017] nvme2n1: ref tag error at location 881629440 
(rcvd 15400959)
[Tue Oct  3 12:34:45 2017] nvme2n1: ref tag error at location 1889589504 
(rcvd 243859455)
[Tue Oct  3 12:34:45 2017] nvme2n1: ref tag error at location 1889589504 
(rcvd 243859455)

haven't debugged it yet, but if anyone has an idea for the root cause, 
it would be great.

Thanks,
-Max.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Bug report: ref tag error while running IO > MDTS on nvme device
  2017-10-03  9:48 Bug report: ref tag error while running IO > MDTS on nvme device Max Gurtovoy
@ 2017-10-03 14:25 ` Keith Busch
  2017-10-03 15:19   ` Max Gurtovoy
  0 siblings, 1 reply; 3+ messages in thread
From: Keith Busch @ 2017-10-03 14:25 UTC (permalink / raw)


Hi Max,

Is this behavior new, or was this always broken?

Since you mention MDTS, I wonder if this device is counting the separate
metadata as part of the transfer and doing the wrong thing as a result?
We're only supposed to count the metadata size in consideration of MDTS
if metadata is interleaved.

On Tue, Oct 03, 2017@12:48:34PM +0300, Max Gurtovoy wrote:
> Hi guys,
> I would like to report a bug while running fio on local nvme device (SAMSUNG
> MZPLL1T6HEHP-00003) that was formated to use integrity:
> 
> nvme format /dev/nvme2n1 -n 1 -l 1 -i 2 -p 0 -m 0
> nvme id-ns /dev/nvme2n1:
> 
> NVME Identify Namespace 1:
> nsze    : 0xba4d4ab0
> ncap    : 0xba4d4ab0
> nuse    : 0xba4d4ab0
> nsfeat  : 0
> nlbaf   : 3
> flbas   : 0x1
> mc      : 0x3
> dpc     : 0x1f
> dps     : 0x2
> nmic    : 0x1
> rescap  : 0x7f
> fpi     : 0x80
> nawun   : 0
> nawupf  : 0
> nacwu   : 0
> nabsn   : 0
> nabo    : 0
> nabspf  : 0
> noiob   : 0
> nvmcap  : 1600321314816
> nguid   : 334842304a6000840025384100000002
> eui64   : 002538e671000457
> lbaf  0 : ms:0   lbads:9  rp:0x1
> lbaf  1 : ms:8   lbads:9  rp:0x3 (in use)
> lbaf  2 : ms:0   lbads:12 rp:0
> lbaf  3 : ms:8   lbads:12 rp:0x2
> 
> running traffic up to IO size <= 128KB works fine but running bigger IO that
> the MDTS caused many ref tag errors (for example):
> 
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 324043008
> (rcvd 0)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2929058096
> (rcvd 1241579520)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2292844816
> (rcvd 2483027968)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2523145472
> (rcvd 243859455)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 1177656832
> (rcvd 15400959)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 187312640
> (rcvd 243859455)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2344798720
> (rcvd 126418943)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2892037376
> (rcvd 126418943)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2292845056
> (rcvd 143196159)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 1237630256
> (rcvd 1241579520)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 1237630208
> (rcvd 143196159)
> [Tue Oct  3 12:34:25 2017] nvme2n1: ref tag error at location 2986034432
> (rcvd 243859455)
> [Tue Oct  3 12:34:45 2017] nvme2n1: ref tag error at location 881629440
> (rcvd 15400959)
> [Tue Oct  3 12:34:45 2017] nvme2n1: ref tag error at location 1889589504
> (rcvd 243859455)
> [Tue Oct  3 12:34:45 2017] nvme2n1: ref tag error at location 1889589504
> (rcvd 243859455)
> 
> haven't debugged it yet, but if anyone has an idea for the root cause, it
> would be great.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Bug report: ref tag error while running IO > MDTS on nvme device
  2017-10-03 14:25 ` Keith Busch
@ 2017-10-03 15:19   ` Max Gurtovoy
  0 siblings, 0 replies; 3+ messages in thread
From: Max Gurtovoy @ 2017-10-03 15:19 UTC (permalink / raw)




On 10/3/2017 5:25 PM, Keith Busch wrote:
> Hi Max,

Hi Keith,

> 
> Is this behavior new, or was this always broken?

Good question.
Haven't tried older kernels (only 4.13.0).

> 
> Since you mention MDTS, I wonder if this device is counting the separate
> metadata as part of the transfer and doing the wrong thing as a result?
> We're only supposed to count the metadata size in consideration of MDTS
> if metadata is interleaved.

mdts is 5 (max_hw_sectors_kb is 128) so I guess it's ok from device 
point of view.

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2017-10-03 15:19 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-10-03  9:48 Bug report: ref tag error while running IO > MDTS on nvme device Max Gurtovoy
2017-10-03 14:25 ` Keith Busch
2017-10-03 15:19   ` Max Gurtovoy

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.