No commit activity in last 3 years
No release in over 3 years
Transforms the data of a database from an old schema to a new one.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
 Dependencies

Development

Runtime

 Project Readme

Database Transform

Build Status Coverage Status Code Climate

This gem will allow you to transform a database's contents across schemas with a simple DSL. This is useful when migrating from an application written in another framework to Rails, allowing programmers to reason about how an old schema maps to a new one.

Similar gems exist:

However, they have not been updated for a few years.

Installation

Add this line to your application's Gemfile:

gem 'database_transform'

And then execute:

$ bundle

Or install it yourself as:

$ gem install database_transform

Usage

Database Transform is built with ActiveRecord in mind. First, define a new database connection to the old database in database.yml:

my_old_app_production:
  adapter: postgresql
  host: old_server
  database: my_old_app

Then, define a transform in db/transforms/my_old_app_schema.rb (or db/transforms/my_old_app_schema/my_old_app_schema.rb if there are many files and you want to organize them in a folder):

class MyOldAppSchema < DatabaseTransform::Schema
  transform_table :users, to: ::User, default_scope: proc { where('uid <> 0') } do
    primary_key :uid
    column :mail, to: :email
    column :pass, to: :password do |password|
      self.password_confirmation = password
    end
  end
  
  transform_table :posts, to: ::Post do
    primary_key :post_id
    column :uid, to: :user, null: false do |uid|
      Source::User.transform(uid)
    end
    column :content
    save unless: proc { content.empty? }
  end
end

A summary of methods:

  • transform_table tells Database Transform to perform the given transform over records in the given source table.
    • The first argument is the table to transform. This can be a symbol, string, or an ActiveRecord model.
    • to specifies the new table to transform to. This can be a symbol, string, or an ActiveRecord model.
      • If either argument is a symbol or string, an ActiveRecord model is generated which allows access to the record's data.
        • Source models are found in the Source namespace, and can be used as the posts.uid column above.
        • Destination models are found in the Destination namespace.
      • In all cases, the model will have extra accessory methods:
        • transform(old_primary_key): This takes a primary key in the source table, and returns the transformed object. This only returns a valid result after the object has been transformed.
        • transformed?(old_primary_key): This checks if the object has been transformed.
    • default_scope allows the programmer to specify the records to transform
  • primary_key declares the name of column with the primary key. This allows later access when relations need to be mapped.
    • Use the transform and transformed? methods on the model to obtain the transformed object.
  • column declares how to transform the contents of that column from the old database to the new one.
    • If to: is omitted, then it is assumed that the transfer function is the identity function, and the column would map across as the same name.
    • If null: false is specified, the value assigned to the column (in to) will be checked for nullity.
    • A block can be provided.
      • If so, then the data from the old record is passed to the block as the first argument
      • In the context of the block, self refers to the new record.
      • self has an additional attribute source_record which refers to the old record.
      • self has an additional attribute schema which refers to the transformation schema.
  • save declares whether the new record should be saved.
    • if and unless accepts a block which will be evaluated to determine if the record should be saved.
    • validate will allow the record to be saved bypassing validations. This defaults to true.

Finally, execute the Rake task:

$ rake db:transform[my_old_app]

And the schema (MyOldAppSchema) and database connection (via my_old_app_production) will be established for you. A few variants of the schema name will be checked:

  • my_old_app
  • my_old_app_production

Only the source schema will be annotated to use the other connection. The destination schema will be used through the application's normal configuration (i.e. depends on the value of ENV['RAILS_ENV'].)

Additional arguments can be passed to the schema. All arguments specified in the Rake command following the schema name will be passed to the initialiser of the schema.

class MyOldAppWithArgumentsSchema < DatabaseTransform::Schema
  def initialize(uploads_path)
    @uploads_path = uploads_path
  end
end
$ rake db:transform[my_old_app_with_arguments,/home/joel/server/dumps/path]

These arguments then can be used from within any transform blocks by accessing the schema property.

Contributing

  1. Fork it ( https://github.com/[my-github-username]/database_transform/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request