Ruby Marks
A simple OMR (Optical Mark Recognition) gem for ruby.
Requirements
This gem uses ImageMagick (version 6.8+) to manipulate the given images.
You can verify if this utility is installed by the command line which convert
, which should return the current ImageMagick path.
For example, /usr/local/bin/convert
.
If not installed:
MacOS X
If you're on Mac OS X, Homebrew may be your best option:
brew install imagemagick
Ubuntu
On Ubuntu, the apt-get
should be enough:
apt-get install imagemagick
Supported versions
- Ruby 2.3.x
- Ruby 2.4.x
- Ruby 2.5.x
- Ruby 2.6.x
Install
If you are using Bundler
, just put this line in your Gemfile:
gem 'ruby_marks'
Then run bundle install command:
bundle
If not, you still can run a default gem installation method:
gem install ruby_marks
And require it in your ruby code:
require 'ruby_marks'
How it Works
Using a template document, you should especify the expected area where each group is. By applying an edge detect algorithm it will discover where the groups are, and will check if they are near the expected position. After the groups being found, the gem will perform a scan in each group in order to recognize their marks. In the end, returns a hash with each correspondent mark found in the group.
The gem will not perform deskew in your documents. If the document have a huge skew, then you should apply your own deskew method on the file before.
Usage
Unfortunatelly, this gem will require a bit more configuration to work, since the implementation depends a lot of your document sizes, positions, brightness, etc...
That said, lets describe it's basic structure. The example will assume a directory with some png images like this one:
First, we will need to get the pixels coordinates, using one document as template, of the areas where the expected groups are. This image can explain where to pick each position:
The threshold level should be adjusted too, in order to don't get a too bright or too polluted marks. See:
Then, we write a basic code to scan it and print result on console (each option available are described bellow):
# Instantiate the Recognizer
recognizer = RubyMarks::Recognizer.new
# Configuring the document aspects
recognizer.configure do |config|
config.threshold_level = 90
config.default_expected_lines = 5
config.define_group :first do |group|
group.expected_coordinates = {x1: 34, y1: 6, x2: 160, y2: 134}
end
config.define_group :second do |group|
group.expected_coordinates = {x1: 258, y1: 6, x2: 388, y2: 134}
end
config.define_group :third do |group|
group.expected_coordinates = {x1: 486, y1: 6, x2: 614, y2: 134}
end
config.define_group :fourth do |group|
group.expected_coordinates = {x1: 714, y1: 6, x2: 844, y2: 134}
end
config.define_group :fifth do |group|
group.expected_coordinates = {x1: 942, y1: 6, x2: 1068, y2: 134}
end
end
Then we need to adjust the edge level to make sure the groups are being highlighted enough to being recognized. You can see the image after the edge algorithm is applied if you write the file after submit it to Recognizer. Like this:
recognizer.file = 'example.png'
file = @recognizer.file
filename = "temp_image.png"
file.write(filename)
The result image should be like this one (note that all the groups are separated from the rest of the document these white blocks):
There's a method you can call to help you to identify how the document is being recognized. This method return the image with the showing where is the expected groups coordinates are, where are the actual groups coordinates, and where the marks is being recognized in each group.
Example:
flagged_document = recognizer.flag_all_marks
flagged_document.write(temp_filename)
Will return the image below:
With all this configured, we can submit our images to a scan:
# Read all documents in directory thats in a png format
Dir["./*.png"].each do |file|
recognizer.file = file
puts recognizer.scan
end
And, this should puts each scan in a hash, like this:
{
first: {
1 => ['A'],
2 => ['C'],
3 => ['B'],
4 => ['B'],
5 => ['D']
},
second: {
1 => ['E'],
2 => ['A'],
3 => ['B'],
4 => ['A'],
5 => ['B']
},
three: {
1 => ['B'],
2 => ['B'],
3 => ['D'],
4 => ['B'],
5 => ['B']
},
four: {
1 => ['B'],
2 => ['E'],
3 => ['A'],
4 => ['C'],
5 => ['D']
},
five: {
1 => ['B'],
2 => ['A'],
3 => ['A'],
4 => ['C'],
5 => ['D']
}
}
General Configuration Options
As you may see, it's necessary configure some document aspects to make this work properly. So, lets describe each general configuration option available:
Edge level
# The size of the edge to apply in the edge detect algorithm.
# The default value is 4, but is very important you verify the algorithm result and adjust it to work.
config.edge_level = 4
Threshold level
# Applies the given percentual in the image in order to get it back with only black and white pixels.
# Low percentuals will result in a bright image, as High percentuals will result in a more darken image.
# The default value is 60, but is very important you verify the algorithm result and adjust it to work.
config.threshold_level = 60
Scan timeout
# Sets a timeout in seconds, to break long time scans.
# The default value is 0 (zero) and means there's no timeout. Any value will quit the scan and raise timed_out_watcher
config.scan_timeout = 0
Expected lines
# The scan will raise the incorrect group watcher if one or more group don't have the expected number of lines
# Here, this configuration becomes valid to all groups.
# The default value is 20, but is very
config.default_expected_lines = 20
Default blocks sizes tolerances
# Defines the tolerance in width and height of the group blocks (in pixels) to compare with de expected block size.
# This tolerance is intended to make the scan recognizes the correct whole group. The scanned area + this tolerance
# should be greater than the expected sizes of the block, in order to consider it.
# In small size groups, you should reduce this parameter in order to don't capture weong groups positions, such a
# not marked bubble as a group, for example.
# The default values is 100 for both width and height, and should be adjusted for your documents.
config.default_block_width_tolerance = 100
config.default_block_height_tolerance = 100
Default mark sizes
# Defines the expected width and height of the marks (in pixels). With the tolerance, if the recognized
# mark exceeds or stricts those values, it will be ignored.
# The default values is 20 to width and 20 to height. Since the marks can be different, you SHOULD
# calculate those sizes for your documents.
config.default_mark_width = 20
config.default_mark_height = 20
Default mark sizes tolerances
# Defines the tolerance in width and height of the marks (in pixels). With default_mark_width and default_mark_width
# size, if the recognized mark exceeds or stricts those values, it will be ignored.
# The default values is 4 for both width and height.
config.default_mark_width_tolerance = 4
config.default_mark_height_tolerance = 4
Intensity percentual
# Set the intensity sensitivity (in percentual) expected to recognize a mark as a marked one.
# When the scan find some potential marked area, then it will analyse if the count of valid pixels (black pixels)
# have this minimun percentage.
# Increasing this value, the recognition becomes more sensitive and can ignore valid weaker markings.
# Decreasing this value, recognition becomes less sensitive and can recognize false markings.
# The default value is 50.
config.intensity_percentual = 50
Default marks options
# Set the marks options that the groups represents. When the scan recognizes a mark in some position,
# it will return they value in the result hash.
# The default value is the [A, B, C, D, E] array.
config.default_marks_options = %w{A B C D E}
Default distance between each mark in group
# Defines the distance (in pixel) between the middle of a mark and the middle of the next mark in the same group.
# This option is used to try suppose not found marks.
# The default value is 25
config.default_distance_between_marks = 25
Group Configuration Options
The General Configuration Options is more generic for the entire document. So, you can have some particularities when defining a group. So:
Expected coordinates
# This configuration defines the area coordinate where the group is expected to be.
group.expected_coordinates = {x1: 145, y1: 780, x2: 270, y2: 1290}
Block sizes tolerances
# It overwrites the default_block_width_tolerance and default_block_height_tolerance values for the group you configure it.
group.block_width_tolerance = RubyMarks.default_block_width_tolerance
group.block_height_tolerance = RubyMarks.default_block_height_tolerance
Mark sizes
# It overwrites the default_mark_width and default_mark_height values for the group you configure it.
group.mark_width = RubyMarks.default_mark_width
group.mark_height = RubyMarks.default_mark_height
Mark sizes tolerances
# It overwrites the default_mark_width_tolerance and default_mark_height_tolerance values for the group you configure it.
group.mark_width_tolerance = RubyMarks.default_mark_width_tolerance
group.mark_height_tolerance = RubyMarks.default_mark_height_tolerance
Marks options
# It overwrites the default_marks_options values for the group you configure it.
group.marks_options = RubyMarks.default_marks_options
Distance Between Marks
# It overwrites the default_distance_between_marks values for the group you configure it.
group.distance_between_marks = RubyMarks.default_distance_between_marks
Expected lines
# It overwrites the default_expected_lines values for the group you configure it.
group.expected_lines = @recognizer.config.default_expected_lines
Watchers
Sometimes, due some image flaws, the scan can't recognize some group, or a mark, or even recognize
more than one mark in a clock row in the same group when it is not expected. Then, you can place some
watchers, that will perform some custom code made by yourself in those cases, such applies a deskew
in image and re-run the scan, for example.
But, be advised, if you call the scan method again inside the watcher, you should make sure that you
have a way to leave the watcher to avoid a endless loop. You always can check how many times the watcher
got raised by checking in recognizer.raised_watchers[:watcher_name]
hash.
Scan Mark Watcher
# Will execute your custom code if didn't recognizes some mark or recognizes more than one mark in a clock
# row and the same group.
# It returns the recognizer object, the result of scan hash, a boolean value if this watcher was raised by unmarked
# options and a boolean value if the watcher was raised by a multiple marks options
recognizer.add_watcher :scan_mark_watcher do |recognizer, result, unmarked_group_found, multiple_marked_found|
# place your custom code
end
Scan Unmarked Watcher
# Will execute your custom code if didn't recognizes some mark.
# It returns the recognizer object, the result of scan hash.
recognizer.add_watcher :scan_unmarked_watcher do |recognizer, result|
# place your custom code
end
Scan Multiple Marked Watcher
# Will execute your custom code if recognizes more than one mark in a clock row and the same group.
# It returns the recognizer object, the result of scan hash.
recognizer.add_watcher :scan_multiple_marked_watcher do |recognizer, result|
# place your custom code
end
Incorrect Group Watcher
# Will execute your custom code if didn't a group isn't found, or it have a line count different than expected,
# or in one or more lines the options marks found are different of the specified in marks options.
# It returns the recognizer object, a boolean value to incorrect expected lines count, and a hash with the
# incorrect bubble lines found, and a hash with the coordinates of bubbles adjusted.
# Pay attention on incorrect_expected_lines and incorrect_bubble_line_found, because if some of this variables
# becomes present, then your document may have an incorrect result scan...
recognizer.add_watcher :incorrect_group_watcher do |recognizer, incorrect_expected_lines, incorrect_bubble_line_found, bubbles_adjusted|
# place your custom code
end
Timed Out Watcher
# Will execute your custom code if your scan outrun the specified timeout in configuration. It returns you recognizer
# object.
recognizer.add_watcher :timed_out_watcher do |recognizer|
# place your custom code
end
Contributing
- Fork it
- Make your implementations
- Send me a pull request
Thank you!
License
Copyright © 2012 André Rodrigues, Ronaldo Araújo, Rodrigo Virgilio, Lucas Correa. See MIT-LICENSE for further details.