0.0
No commit activity in last 3 years
No release in over 3 years
This gem uses unicode_utils to lowercase text, removes non-letters, strips and squeezes whitespace, then optionally uses stemwords (from libstemming-tools) to stem every word.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Development

~> 1.7
~> 10.0

Runtime

 Project Readme

PristineText

This gem uses unicode_utils to lowercase text, removes non-letters, strips and squeezes whitespace, then optionally uses stemwords (from libstemming-tools) to stem every word.

Installation

Add this line to your application's Gemfile:

gem 'pristine_text'

And then execute:

$ bundle

Or install it yourself as:

$ gem install pristine_text

Usage

require "pristine_text"

puts PristineText.clean("haberler geliyorlar gidiyorlar", :tr)

Contributing

  1. Fork it ( https://github.com/nurettin/pristine_text/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request