Project

linnaeus

0.02
No commit activity in last 3 years
No release in over 3 years
Linnaeus provides a redis-backed Bayesian classifier. Words are stemmed, stopwords are stopped, and redis is used to allow for persistent and concurrent training and classification.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Development

~> 3.12
~> 2.11.0
~> 0.7

Runtime

~> 3.0.0
~> 1.0.0
 Project Readme

Linnaeus Build Status

Carl Linnaeus

Linnaeus is a redis-backed naive Bayesian classification system. Please see the rdoc for more information. Ruby 1.9 is required.

Examples

lt = Linnaeus::Trainer.new      # Used to train documents
lc = Linnaeus::Classifier.new   # Used to classify documents

lt.train 'language', 'Ruby is a dynamic, reflective, general-purpose object-oriented programming language that combines syntax inspired by Perl with Smalltalk-like features.'
lt.train 'database', 'PostgreSQL, often simply Postgres, is an object-relational database management system (ORDBMS) available for many platforms including Linux, FreeBSD, Solaris, Microsoft Windows and Mac OS X.'

lc.classify 'Perl is a high-level, general-purpose, interpreted, dynamic programming language.' # returns "language"

Contributing to linnaeus

  • Submit bugs to the github issue tracker: https://github.com/djcp/linnaeus/issues
  • If you'd like to add a feature, please submit a description of it to the issue tracker so we can discuss.
  • If the feature makes sense, fork the github repository. Write rspec tests and issue a pull request when your change is done.

The Future

  • Create additional storage backends - sqlite, postgresql, mongodb, etc.
  • Allow for weighting tweaks.

Copyright

Copyright (c) 2012 Dan Collis-Puro. See LICENSE.txt for further details.

Credits