All of lore.kernel.org
 help / color / mirror / Atom feed
* status of spdk
@ 2016-11-08 23:31 Yehuda Sadeh-Weinraub
  2016-11-08 23:40 ` Sage Weil
  2016-11-09  4:45 ` Haomai Wang
  0 siblings, 2 replies; 18+ messages in thread
From: Yehuda Sadeh-Weinraub @ 2016-11-08 23:31 UTC (permalink / raw)
  To: Wang, Haomai, Weil, Sage; +Cc: ceph-devel

I just started looking at spdk, and have a few comments and questions.

First, it's not clear to me how we should handle build. At the moment
the spdk code resides as a submodule in the ceph tree, but it depends
on dpdk, which currently needs to be downloaded separately. We can add
it as a submodule (upstream is here: git://dpdk.org/dpdk). That been
said, getting it to build was a bit tricky and I think it might be
broken with cmake. In order to get it working I resorted to building a
system library and use that.

The way to currently configure an osd to use bluestore with spdk is by
creating a symbolic link that replaces the bluestore 'block' device to
point to a file that has a name that is prefixed with 'spdk:'.
Originally I assumed that the suffix would be the nvme device id, but
it seems that it's not really needed, however, the file itself needs
to contain the device id (see
https://github.com/yehudasa/ceph/tree/wip-yehuda-spdk for a couple of
minor fixes).

As I understand it, in order to support multiple osds on the same NVMe
device we have a few options. We can leverage NVMe namespaces, but
that's not supported on all devices. We can configure bluestore to
only use part of the device (device sharding? not sure if it supports
it). I think it's best if we could keep bluestore out of the loop
there and have the NVMe driver abstract multiple partitions of the
NVMe device. The idea is to be able to define multiple partitions on
the device (e.g., each partition will be defined by the offset, size,
and namespace), and have the osd set to use a specific partition.
We'll probably need a special tool to manage it, and potentially keep
the partition table information on the device itself. The tool could
also manage the creation of the block link. We should probably rethink
how the link is structure and what it points at.

Any thoughts?

Yehuda

^ permalink raw reply	[flat|nested] 18+ messages in thread

end of thread, other threads:[~2016-11-10 23:55 UTC | newest]

Thread overview: 18+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-11-08 23:31 status of spdk Yehuda Sadeh-Weinraub
2016-11-08 23:40 ` Sage Weil
2016-11-09  0:06   ` Yehuda Sadeh-Weinraub
2016-11-09  0:21     ` LIU, Fei
2016-11-09  2:45       ` Dong Wu
2016-11-09 20:53         ` Moreno, Orlando
2016-11-09 20:58           ` Sage Weil
2016-11-09 21:00             ` Gohad, Tushar
2016-11-09 21:10             ` Gohad, Tushar
2016-11-10 22:39               ` Walker, Benjamin
2016-11-10 22:59                 ` Sage Weil
2016-11-10 23:54                   ` Walker, Benjamin
2016-11-09  4:59       ` Haomai Wang
2016-11-09  5:02         ` LIU, Fei
2016-11-09  5:09           ` Liu, Changpeng
2016-11-09  5:23             ` LIU, Fei
2016-11-09  4:49   ` Haomai Wang
2016-11-09  4:45 ` Haomai Wang

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.