Ceph erasure code : ready for alpha testing

The addition of erasure code in Ceph started in april 2013 and was discussed during the first Ceph Developer Summit. The implementation reached an important milestone a few days ago and it is now ready for alpha testing.
For the record, here is the simplest way to store and retrieve an object in an erasure coded pool as of today:

parameters="erasure-code-k=2 erasure-code-m=1"
./ceph osd crush rule create-erasure ecruleset \
  $parameters \
  erasure-code-ruleset-failure-domain=osd
./ceph osd pool create ecpool 12 12 erasure \
  crush_ruleset=ecruleset \
  $parameters
./rados --pool ecpool put SOMETHING /etc/group
./rados --pool ecpool get SOMETHING /tmp/group
$ tail -3 /tmp/group
postfix:x:133:
postdrop:x:134:
_cvsadmin:x:135:

The chunks are stored in three objects and it can be reconstructed if any of them are lost.

find dev | grep SOMETHING
dev/osd4/current/3.7s0_head/SOMETHING__head_847441D7__3_ffffffffffffffff_0
dev/osd6/current/3.7s1_head/SOMETHING__head_847441D7__3_ffffffffffffffff_1
dev/osd9/current/3.7s2_head/SOMETHING__head_847441D7__3_ffffffffffffffff_2

6 Replies to “Ceph erasure code : ready for alpha testing”

    1. Yes, the erasure code pools are going to be significantly slower than the replicated pools. It is one of the tradeoff : slower storage using less space versus faster storage using more space.

      1. Ok, thank you for your answer!

        Will the replicated pools be faster even if the erasure code is systematic, and there are only reads(no recovery activity or writes)?

  1. We use rados to bench the erasure-code pool and replicated pool, there are no big difference of performance in seq read and write tests. But when we test erasure-code + cache tier in Cephfs way, it apparently slow than repliacted pools.

Leave a Reply to Anonymous Cancel reply

Your email address will not be published. Required fields are marked *