Project

cogibara

0.0
No commit activity in last 3 years
No release in over 3 years
A framework for combining natural speech processing tools with public APIs. Basic functions work out of the box, and with a bit of configuration you can get weather information, manage your google calendar, or access wolfram alpha, all using your voice or natural language text. If you want more functionality, it's easy to associate your own code with a keyword or speech category. Try the demo interface by tweeting @Cogibara
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
 Dependencies

Development

~> 1.3
>= 0

Runtime

 Project Readme

Cogibara

This repository is no longer maintained or updated; see https://github.com/Cogibara/cogibara

Your friendly natural language interface to computer and the internet, leveraging the power of NLP techniques used in applications such as Siri! Why let your someone else decide what you need your personal assistant software to do? Cogibara uses free and open source libraries and APIs to let you ask general knowledge or math questions, manage your calendar, or just chat when you're bored. It's easy to add new capabilites or integrate other tools; the gem handles the infrastructure and language processing so you can focus on making your Cogibara do whatever awesome things you want it to.

For a demo, try tweeting @Cogibara, or check out the demo client and its github page. Its just something I put together with ExtJS and Rails, so it's got some bugs at the moment, but it should give you an idea of what the gem can do.

NOTE
This gem is still in development, there is no documentation besides this readme, I haven't written any tests, and important things may be implemented in very silly ways. Feel free tell me where this is so (I'd appreciate it in fact), but you've been warned. I was planning on keeping this private for a few more months of development, however other projects have come up for the summer, so I'm releasing what I have now in case I don't get much time to work on it.

Mainly because of the way the speech2text gem is implemented, this will only work with MRI, and likely just linux as well.

Installation

Add this line to your application's Gemfile:

gem 'cogibara'

And then execute:

$ bundle

Or install it yourself as:

$ gem install cogibara

Usage

Gem API
You can use the gem as a part of any Ruby program

require 'cogibara'
Cogibara::dispatcher.config_from_yaml('./config.yml')
Cogibara::message_handler.handle("hello!") 
    # => "How Are you?"
Cogibara::message_handler.handle("what is the square root of 9954") 
    #=> "3 sqrt(1106)" 
Cogibara::message_handler.handle("how's the weather?")
    #=> "Currently Overcast, 65 degrees, cloud cover 99%, then 
        Sprinkling for the hour." 

Executable
The gem will install the cogibara executable. You can run it without arguments for to get to a basic interactive loop, but many functions require an API key or account credentials to work properly. See the Configuration section for more information.

cogibara -m "hows it going?"
  #=> cogibara: Good and you?

cogibara -c config.yml -m "what time is it in the netherlands?"
  #=> cogibara: 7:30:40 am CEST  |  Wednesday, May 29, 2013

The cogibara executable is a command line interface for the gem.

Usage: cogibara [options]
-v, --verbose      Verbose output
-m, --message      Message
-n, --name         Name
-c, --config       Configuration file (.yml or .rb)
-r, --redis        Standalone Redis mode

If a message is specifed, the executable will print the response then exit. Otherwise, it will go into a loop until you close it.

You can also run cogibara on a yaml or Ruby file to start the interactive loop using the configuration it specifies, or on an audio or video file (less than 10 seconds seems to work best), in which case it will extract the speech from it using google's Speech To Text API, then pass the result to the gem.

Cogibara Redis
Running cogibara with the -r flag will start the executable in Redis mode, allowing you to design your own interfaces for the gem. For now it requires a local Redis server to be installed and running, so its probably better just to use the gem in a program, but in the near future it will allow remote Redis connections as well. This could also be used to split up the work of running the gem and serving the responses between computers, giving you the ability to install a lightweight interface on something like a Raspberry Pi and leave the heavy lifting to a desktop PC.

For more on how to use this executable, see the example Rails client here.

The demo client at cogibara.com is, at the time of writing, running both server and client on a Raspberry Pi sitting in my living room.

More details coming soon.

API Keys

Many operators require an API key to communicate with a remote service. The gems for the individual operators will have better documentation, but here's where you can get a key for the built-in modules:

Wolfram Alpha (gem): http://developer.wolframalpha.com/portal/apisignup.html
Maluuba (gem): http://dev.maluuba.com/
Forecast_IO (gem): https://developer.forecast.io/

Configuration

You have three options for configuring the gem; using a YAML file, modifying the configuration at runtime, or adding a configuration block to your code.

YAML

Although using the gem's configuration functions is more flexible, the simplest way to get everything working is to configure the gem using YAML. This can be accomplished by calling

Cogibara::dispatcher.config_from_yaml('./some_file.yml')

See here for an example yaml file which will help you set up all of the built-in modules: https://gist.github.com/wstrinz/5666591

Runtime Configuration

You can call to Cogibara::config to fetch or update your configuration, and Cogibara::dispatcher to add new external operators using the following syntax:

Cogibara::dispatcher.register_operator([mod_keywords], {name: mod_name, file_name: mod_file, class_name: mod_class_name, config: mod})

If you do not provide an option, the dispatcher will attempt to fill it in based on the other options and various naming conventions.

Configuration Block

Explanation coming soon, see examples -

require 'cogibara'
Cogibara.setup do |config|
  config.name = "Mr Robot"
  config.verbose = true
end
Cogibara.setup_dispatcher do |dispatcher|
  dispatcher.register_operator(["REMINDER"],{name: "Reminder Setter"})
  dispatcher.register_operator(["chat"],{name: "Chat Bot", class_name:  "Chat", file_name: "chat.rb"})
end

Extending

It's easy to add new functionality to your Cogibara, just add a process(message) method to your code, and add your class or file to the list of modules using Cogibara::dispatcher.register_operator. You can also extend the Cogibara::OperatorBase class, which will allow you to utilize some built in helper methods for things like sending multiple messages and handling confirmation callbacks.

Your operator must be bound to a list of keywords or natural language categories. The Maluuba Natural Speech API, which the gem current uses for language processing, places speech into a number of different categories and extracts information such as dates and times. You can bind your operator to any category, or to any individual 'action' in a category, then handle queries and the structured information Maluuba extracts in whatver way you need to.

Cogibara can recognize both individual keywords and categories of natural speech. Keyword queries are of the form "[name] [keyword] message", and natural language queries can be hooked into using the categories for the Maluuba API. To add a new module, either from a file or an existing Class object, simply call Cogibara::dispatcher.register_operator(['keyword','list'],{name: "Operator Name"}), which will attempt to load the operator based on standard Ruby naming conventions. In this case, it will be looking for a operator_name.rb file, or a class object called OperatorName. By specifying class_name or file_name options, you can override this behavior to define your own custom naming.

As an example, here's how you might go about adding an operator that tells you your current location:

require 'cogibara'
require 'geocoder'
require 'json'
require 'open-uri'
class Locator < Cogibara::OperatorBase
  def process(message)
    ip = 0
    open( 'http://jsonip.com/ ' ){ |s| ip = JSON::parse( s.string())['ip'] }
    loc = Geocoder.search(ip)
    "You are in #{loc[0].city}, #{loc[0].state}!"
  end
end

Cogibara.default_config

# Register operator under the 'where am I?' Maluuba speech category, and the 'location' keyword.
Cogibara::dispatcher.register_operator(['NAVIGATION_WHERE_AM_I', 'location'], {name: 'Locator'})

# natural language query
Cogibara::message_handler.handle("so where am I now?")
  #=> "You are in Madison, Wisconsin!"

# keyword query
Cogibara::message_handler.handle("cogibara location")
  #=> "You are in Madison, Wisconsin!"

Credit

This project would not be possible without the awesome gems and APIs available for free online. Here is a list of all the built in services, please let me know if I'm missing someone or you'd like your gem or service to be removed from this project.

speech2text gem, by Todd Fisher
wolfram gem, by Gabriel Horner
Maluuba Natural Language API, by Maluuba Inc
maluuba_napi gem, by Maluuba Inc
Forecastio API, by Dark Sky Company
forecast_io gem, by David Czarnecki
Cleverbot
cleverbot gem, by Benjamin Manns
bing_translator gem, by Ricky Elrod
wikipedia-client gem, by Ken Pratt
sanitize gem, by Ryan Grove
Redis
wikicloth gem, by David Ricciardi
google_calendar gem, by Northworld LLC
geocoder gem, by Alex Reisner
slop gem, by Lee Jarvis

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

A few things I'll try to get in place in the next week
-Fix file handling (will only do transcription for now)
-An executable to listen on a local source (eg microphone). I have code for this from an earlier version of the gem, I just need to update it.
-Add Text to Speech output to configuration. Again, I also have some work done on this, so it shouldn't take too long to add back in. Espeak will be the default, but you'll be able to specify alternate engines if they're available. Currently I'm rather partial to Pico.

Things that may have to wait/wishlist for contributions
-RSpec tests
-Documentation; RDocs would be nice, but pages for the wiki or more details for this document would also be good.
-A better client to use by default. Preferably using a lighter weight framework such as Sinatra, and allowing easy access for mobile devices.
-Find replacements for C Extensions; there are a lot of good language processing tools for Java, so it'd be nice for this gem to be JRuby compatible.
-More offline operators. A simple request now often goes through 3 APIs; Google Speech Recognition, Maluuba's natural language processing, and whatever keyword function it hits after that. By adding support for PocketSphinx and/or something link LingPipe would speed up response times enormously.
-More operators in general, and improve the current ones. For example, the weather operator only gives current conditions and the days forecast, but it could easily be extended to handle more specific queries using the information extracted by Maluuba.