Project

ocr

0.01
No commit activity in last 3 years
No release in over 3 years
Recognize text and characters from image files using web services.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Development

~> 2.7.0

Runtime

>= 1.5.0
>= 0.9.9
 Project Readme

#OCR OCR web services gateway for Ruby.

#Description Recognize text and characters from image files using web services.

##Web services supported

  • WeOCR project
    • update WeOCR servers list
    • autoselect server
    • no identify
    • languages: that's supported by server. TODO: select server by languaje requested.
    • output formats: :txt
  • OCR Web Service(Online OCR)
    • identify: Username and license code as password
    • languages: :brazilian, :byelorussian, :bulgarian, :catalan, :croatian, :czech, :danish, :dutch, :english, :estonian, :finnish, :french, :german, :greek, :hungarian, :indonesian, :italian, :latin, :latvian, :lithuanian, :moldavian, :polish, :portuguese, :romanian, :russian, :serbian, :slovakian, :slovenian, :spanish, :swedish, :turkish, :ukrainian
    • output formats: :doc, :pdf, :excel, :html, :txt, :rtf
  • Free OCR online webservice
    • identify: Username
    • No tested for images more than 100x100px in size.
    • Free service is limited to 100x100px images.

#Installation ##From the command line

  gem install ocr

##Using Gemfile

1 Add to your application Gemfile

gem 'ocr'

2 Type

  bundle install

Using

  • Get a OCR: ocr = OCR.use OCR_NAME
  • Set the login Credentials: ocr.login YOUR_USER, YOUR_PASSWORD, [EXTRA_LOGIN_DATA]
  • Set proxy configuration: ocr.proxy p_addr, p_port = nil, p_user = nil, p_pass = nil
  • Set image to work: ocr.file= FILE_NAME_AND_PATH
  • Set languaje: ocr.file= FILE_NAME_AND_PATH
  • Set output format: ocr.format= FORMAT_NAME
  • Set output file: ocr.outputfile= FILE_NAME_AND_PATH
  • Test error: error = ocr.error if ocr.error?
  • Results: text = ocr.text unless ocr.error?
  • Disable auto rescue exceptions: ocr.rescue_exceptions= false

WeOCR project

More info at WeOCR project.

Extra properties outputencoding=NAME.

  ocr = OCR.use :weocr

  ocr.file= 'text_image.jpg'
  ocr.format= :txt
  ocr.outputencoding="utf-8"
  ocr.recognize

  puts "ERROR: #{ocr.error}" if ocr.error?
  puts "RESULT: #{ocr.text}" unless ocr.error?

OCR Web Service

More info at OCR Web Service.

Extra properties convert_to_bw=BOOLEAN, multi_page_doc=BOOLEAN.

  ocr = OCR.use :onlineocr

  ocr.login <YOUR_USER>, <YOUR_LICENSE_CODE>
  ocr.file= 'text_image.jpg'
  ocr.format= :pdf
  ocr.outputfile= 'text_doc.pdf'
  ocr.recognize

  puts "ERROR: #{ocr.error}" if ocr.error?
  puts "RESULT: #{ocr.text}" unless ocr.error?

Free OCR online webservice

More info at Free OCR online webservice. No tested for images larger than 100x100px. Free service is limited to 100x100px images.

  ocr = OCR.use :free_ocr

  ocr.login <YOUR_USER_NAME>
  ocr.file= 'text_image.jpg'
  ocr.recognize

  puts "ERROR: #{ocr.error}" if ocr.error?
  puts "RESULT: #{ocr.text}" unless ocr.error?

Credits

  • Thanks to @ylluminate for request auto rescue exception feature

License

Released under the MIT license: http://www.opensource.org/licenses/MIT