All of lore.kernel.org
 help / color / mirror / Atom feed
* Adding a proprietary key value store to CEPH
@ 2015-02-24 13:20 Varada Kari
  2015-02-24 14:29 ` Loic Dachary
  2015-02-24 16:51 ` Sage Weil
  0 siblings, 2 replies; 16+ messages in thread
From: Varada Kari @ 2015-02-24 13:20 UTC (permalink / raw)
  To: Ceph Development

Hi Sage,

We are trying to integrate a new proprietary key value store to CEPH. To integrate this KV-store, which is a closed source shared library, we propose a new class to CEPH called PropDBStore which does a dlopen and imports the required symbols. This framework will help in integrating vendor specific extensions to CEPH.

The gist of the implementation is as follows.

1. Implement a wrapper around the proprietary KVStore. Let us call it as KVExtension. This is a shared library which implements all interfaces required by CEPH KeyValueStore.
2. A new class is derived from KeyValueDB called PropDBStore, which honors the semantics of KeyvalueStore and KeyValueDB. This class acts as mediator between CEPH and KVExtension.  This class transforms bufferlist etc... to const char pointers or strings for the extension to understand.
3. PropDBStore, loads (dlopen) the KVExtension during OSD initialization.  Path to the KVExtension can be mentioned in ceph.conf.
4. Interfaces that needs to be implemented in KVExtension, which are imported by the PropDBStore are added in a new header called PropDBWrapper.h.  This header contains the signatures for the necessary interfaces like init(), close(), submit_transaction(), get() and get_iterator(). Similarly for Iterator functionality, PropDBIterator.h, which specifies the signatures of seek_to_first (), seek_to_last(), lower_bound() and upper_bound() etc...  PropDBStore includes these headers to import the symbols, using dlsym().
5. Choosing the proprietary DB as Backend to the OSD is controlled/managed by config options of the ceph (/etc/ceph/ceph.conf) like rocksdb or leveldb.
6. Rest of the existing functionality is not disturbed by this change. Changing the osd backend option will change backend implementation. But this change is not dynamic. The type of the backend should be chosen at osd creation time and osd will continue use that backend till that osd is reformatted again.
7. The new KVStore we are trying to integrate works on a raw partition, so we divided the osd drive into two partitions. One partition is given to osd Meta data (super block, fsid etc...), and the other is given to the new db to manage it. OSD partition is now not the entire disk, but 2-4GB which needed for the metadata.

Please share your thoughts around this.
Thanks,
Varada



________________________________

PLEASE NOTE: The information contained in this electronic mail message is intended only for the use of the designated recipient(s) named above. If the reader of this message is not the intended recipient, you are hereby notified that you have received this message in error and that any review, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this communication in error, please notify the sender by telephone or e-mail (as shown above) immediately and destroy any and all copies of this message in your possession (whether hard copies or electronically stored copies).


^ permalink raw reply	[flat|nested] 16+ messages in thread
[parent not found: <531793771.166.1424908519960.JavaMail.root@thunderbeast.private.linuxbox.com>]

end of thread, other threads:[~2015-03-23 21:07 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-02-24 13:20 Adding a proprietary key value store to CEPH Varada Kari
2015-02-24 14:29 ` Loic Dachary
2015-02-24 16:13   ` Somnath Roy
2015-02-24 16:27     ` Loic Dachary
2015-02-24 16:50       ` Varada Kari
2015-02-24 20:01         ` Loic Dachary
2015-02-24 16:51 ` Sage Weil
2015-02-25  8:10   ` Varada Kari
2015-02-25 14:45     ` Sage Weil
2015-02-25 22:40       ` Somnath Roy
2015-02-25 23:25         ` Sage Weil
2015-02-25 23:30           ` Somnath Roy
2015-03-23 13:42   ` Varada Kari
2015-03-23 21:07     ` Sage Weil
     [not found] <531793771.166.1424908519960.JavaMail.root@thunderbeast.private.linuxbox.com>
2015-02-25 23:56 ` Matt W. Benjamin
2015-03-09 13:00   ` Varada Kari

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.