Relaxo is a transactional database built on top of git. It's aim is to provide a robust interface for document storage and sorted indexes. If you prefer a higher level interface, you can try relaxo-model.
Installation
Add this line to your application's Gemfile:
gem 'relaxo'
And then execute:
$ bundle
Or install it yourself as:
$ gem install relaxo
Usage
Connect to a local database and manipulate some documents.
require 'relaxo'
require 'msgpack'
DB = Relaxo.connect("test")
DB.commit(message: "Create test data") do |dataset|
object = dataset.append(MessagePack.dump({bob: 'dole'}))
dataset.write("doc1.msgpack", object)
end
DB.commit(message: "Update test data") do |dataset|
doc = MessagePack.load dataset.read('doc1.msgpack').data
doc[:foo] = 'bar'
object = dataset.append(MessagePack.dump(doc))
dataset.write("doc2.msgpack", object)
end
doc = MessagePack.load DB.current['doc2.msgpack'].data
puts doc
# => {"bob"=>"dole", "foo"=>"bar"}
Document Storage
Relaxo uses the git persistent data structure for storing documents. This data structure exposes a file-system like interface, which stores any kind of data. This means that you are free to use JSON, or BSON, or MessagePack, or JPEG, or XML, or any combination of those.
Relaxo has a transactional model for both reading and writing.
Authors
By default, Relaxo sets up the repository author using the login name and hostname of the current session. You can explicitly change this by modifying database.config
. Additionally, you can set this per-commit:
database.commit(message: "Testing Enumeration", author: {user: "Alice", email: "alice@localhost"}) do |dataset|
object = dataset.append("Hello World!")
dataset.write("hello.txt", object)
end
Reading Files
path = "path/to/document"
DB.current do |dataset|
object = dataset.read(path)
puts "The object id: #{object.oid}"
puts "The object data size: #{object.size}"
puts "The object data: #{object.data.inspect}"
end
Writing Files
path = "path/to/document"
data = MessagePack.dump(document)
DB.commit(message: "Adding document") do |changeset|
object = changeset.append(data)
changeset.write(path, object)
end
Datasets and Transactions
Dataset
s and Changeset
s are important concepts. Relaxo doesn't allow arbitrary access to data, but instead exposes the git persistent model for both reading and writing. The implications of this are that when reading or writing, you always see a consistent snapshot of the data store.
Suitability
Relaxo is designed to scale to the hundreds of thousands of documents. It's designed around the git persistent data store, and therefore has some performance and concurrency limitations due to the underlying implementation.
Because it maintains a full history of all changes, the repository would continue to grow over time by default, but there are mechanisms to deal with that.
Performance
Relaxo can do anywhere from 1000-10,000 inserts per second depending on how you structure the workload.
Relaxo Performance
Warming up --------------------------------------
single 129.000 i/100ms
Calculating -------------------------------------
single 6.224k (±14.7%) i/s - 114.036k in 20.000025s
single transaction should be fast
Warming up --------------------------------------
multiple 152.000 i/100ms
Calculating -------------------------------------
multiple 1.452k (±15.2%) i/s - 28.120k in 20.101831s
multiple transactions should be fast
Reading data is lighting fast as it's loaded directly from disk and cached.
Loading Data
As Relaxo is unapologetically based on git, you can use git directly with a non-bare working directory to add any files you like. You can even point Relaxo at an existing git repository.
Durability
Relaxo is based on libgit2
and asserts that it is a transactional database. We base this assertion on:
- All writes into the object store using
libgit2
are atomic and synchronized to disk. - All updates to refs are atomic and synchronized to disk.
Provided these two invariants are maintained, the operation of Relaxo will be safe, even if there are unexpected interruptions to the program.
The durability guarantees of Relaxo depend on libgit2
calling fsync
, and this being respected by the underlying hardware. Otherwise, durability cannot be guaranteed.
Contributing
We welcome contributions to this project.
- Fork it.
- Create your feature branch (
git checkout -b my-new-feature
). - Commit your changes (
git commit -am 'Add some feature'
). - Push to the branch (
git push origin my-new-feature
). - Create new Pull Request.
Developer Certificate of Origin
This project uses the Developer Certificate of Origin. All contributors to this project must agree to this document to have their contributions accepted.
Contributor Covenant
This project is governed by the Contributor Covenant. All contributors and participants agree to abide by its terms.