Roads Less Taken

A blend of programming, boats and life.

RethinkDB - Yet Another NoSQL?

| Comments

15-18 years ago my passion was in OODBs. As a Smalltalker GemStone was one of the most fascinating to work with, but as we all know OODBs never got really popular, despite their fantastic qualities. But the new NoSQL databases in many respects offer OODB-ish characteristics, although they of course also bring a whole new menu to the table.

In my eternal quest for “database bliss” my next stop is RethinkDB, but let me tell you how I got here.

When it comes to NoSQL databases I have both worked and dabbled with a few:

  1. Tokyo Tyrant. Yeah, it was extremely fast (still is) and I did implement a Squeak binding for its binary protocol, but nah, the use cases are very limited. But it was fun!

  2. CouchDB. A lot of nice design decisions with a very friendly HTTP oriented API, all JSON and interesting map/reduce mechanisms. CouchDB was one of the primary NoSQL databases to start the whole “movement” but although it’s brilliant in many ways it also wasn’t built to scale and it quickly got side tracked by the competition. Most other NoSQL databases have picked up quite a few tricks from Couch though so it has been very influential.

  3. Riak. Riak feels like “CouchDB done right”, same HTTP friendliness but a very robust Dynamo-inspired fully distributed architecture. If one wants a system that doesn’t trade availability for consistency - Riak is it. But there is a cost - you need to put a lot of effort into conflict resolution mechanisms (resolving siblings) - and that is not trivial to do. While Riak is very “seductive” in all its 100% Buzzword Compliance-ness, you might come to the conclusion that your use case actually isn’t the next Amazon or Facebook needing to serve millions of people at the same time with 99.99999 availability.

  4. HyperDex. This is a really fast database with several interesting mechanisms like hyper hashing. The only API is going through the C library, which can be a bit of a hard time from Pharo/Squeak. I started out with NativeBoost FFI (cool stuff Igor) for this but got stuck. Still very interesting and did I mention super fast? A bit immature (docs lacking) and it also depends on fixed schemas, not sure how to deal with schema migration. So, fascinating and FAST, with lots of cool functionality but… doesn’t feel practical - there must be alternatives not needing fixed schemas? Why isn’t the “spaces” definition grammar documented? Who knows, HyperDex may be the Next Big Thing, but as of now I think I am moving on.

So where to go from here? Riak is great, but I want more consistency guarantees to eliminate all that conflict resolution work. This blog post nails it pretty good - it probably turns out I want something that is CP instead of AP like Riak is.

And in case you wonder - MongoDB is not for me, I know… some people love it, but let me quote (and there are many other sources too) the internetz:

I would suggest you don’t try to use MongoDB in a high-availability mode (or at all if you can help it). It’s quite buggy, lacks useful features and isn’t in fact Consistent (even though its design might suggest it is). It’s just a bad database with too much marketing :( There are a few decent-looking Consistent databases out there (HBase, Couchbase, RethinkDB, Hyperdex) and several decent Eventually Consistent databases (Cassandra, Riak, Dynamo).

So Google to the rescue, there must be more consistent NoSQL dbs out there…

  1. FoundationDB. Sounds and looks very impressive “on paper” but it is not yet available and AFAICT it is not going to be open source either, so… nope.

  2. Couchbase. Looks fairly interesting, Membase married to CouchDB, but their description of the “community edition” doesn’t feel that inspiring to me. They don’t really seem committed to open source, this is more a straight up commercial offering with “Enterprise trash talk”. So no, not me, and the muddled merge with CouchDB has at least left me utterly confused over this product and what it actually “is”.

  3. RethinkDB. Aha. I think this is my next NoSQL database to look closer at, I really like what I see, both technically and how they feel as a company - very open and they seem to really understand how open source works, culturally. And installing it and playing with the web console was trivial… fun stuff!

Now I just need to implement protobuf in Pharo… :)

Comments