RnDB
RnDB is a procedurally-generated fake database. Read the blog post for details.
Video Overview
Usage
First, create tables with columns that may have a pre-determined distribution of
values (which can be queried on), or which may have a lambda for generating a
random value, or both (such as for the weight column below).
class Widget < RnDB::Table
column :colour, { red: 0.3, green: 0.12, brown: 0.01, blue: 0.5, orange: 0.07 }
column :weight, { light: 0.3, medium: 0.64, heavy: 0.06 }, -> value do
range =
case value
when :light
(0.0..5.0)
when :medium
(6.0..9.0)
when :heavy
(10.0..20.0)
end
self.rand(range)
end
column :name, -> { Faker::Games::Pokemon.name }
end
Next, create a database with an optional random seed (137 in the example
below), and add the table to the database, specifying the number of records to
simulate (in this case, one trillion).
DB = RnDB::Database.new(137)
DB.add_table(Widget, 1e12)
Finally, fetch some records!
puts Widget.count
puts Widget[1234567890].name
puts Widget.find { |widget| (3.1415..3.1416).include?(widget.weight) }.attributes
Which will display the following:
1000000000000
Charmander
{:id=>61520, :weight=>3.1415121332762386, :colour=>:red, :name=>"Exeggcute"}
Note that the find command tested over sixty thousand records in just a second
or two without needing to generate all attributes of each record first. But an
even faster way of honing in on a particular record is to run a query, such as:
query = Widget.where(colour: [:brown, :orange], :weight => :heavy)
You can then retrieve random records that match the query with sample, use
pluck to retrieve specific attributes without generating all of them, and use
find or filter to further refine your search, like this:
puts query.count
puts query.sample.pluck(:colour, :weight)
puts query.lazy.filter { |ball| ball.name == 'Pikachu' }.map(&:id).take(10).to_a
Which will display the following:
4800000000
{:colour=>:orange, :weight=>16.096085279047017}
[429400000068, 429400000087, 429400000875, 429400000885, 429400000914, 429400001036, 429400001062, 429400001330, 429400001341, 429400001438]
Note that we used the lazy enumerator when filtering records to prevent
running the block on all records before performing the map and taking the
first ten results.
Release Process
rake standard:fixrake version:bump:whateverrake gemspec:release BRANCH=mainrake git:release BRANCH=main- Create new release on GitHub to trigger ship workflow
Copyright
Copyright (c) 2021 Jason Hutchens. See LICENSE for further details.