Low commit activity in last 3 years
A long-lived project that still receives updates
This is a ChupaText decomposer plugin for to extract text and meta-data from HTML. You can use `html` decomposer.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Runtime

 Project Readme

README

Name

chupa-text-decomposer-html

Description

This is a ChupaText decomposer plugin for to extract text and meta-data from HTML.

You can use html decomposer.

Install

Install chupa-text-decomposer-html gem:

% gem install chupa-text-decomposer-html

Now, you can extract text and meta-data from HTML:

% chupa-text index.html

Author

  • Kouhei Sutou <kou@clear-code.com>

License

LGPL 2.1 or later.

(Kouhei Sutou has a right to change the license including contributed patches.)