No commit activity in last 3 years
No release in over 3 years
A simple spider
 Dependencies

Development

~> 1.3
>= 10.1.0

Runtime

>= 0.3.13
>= 1.6.0
 Project Readme

FreeSpider

A simple spider

Installation

Add this line to your application's Gemfile:

gem 'free_spider'

And then execute:

$ bundle

Or install it yourself as:

$ gem install free_spider

Usage

require 'free_spider'
spider = FreeSpider::Begin.new
spider.plan do
  site 'http://www.example.com/'
end
spider.crawl

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

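Features

  • Skips external links that point outside the target site while crawling, and filters out certain special links, such as search links
  • Can use a queue when a site has too many links
  • Multi-threaded concurrency to speed up crawling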
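
The three features above can be illustrated with a minimal sketch. This is not FreeSpider's actual implementation: `crawlable?`, `crawl`, the `fetch_links` callable, and the `/search` path prefix are all hypothetical names and assumptions chosen for illustration. The sketch also assumes every link is already an absolute URL, and it processes the site one frontier at a time so the worker threads always terminate.

```ruby
require 'uri'
require 'set'

# Hypothetical helper (not part of FreeSpider's API): true when `link`
# stays on `site_host` and does not look like a search URL.
def crawlable?(link, site_host)
  uri = URI.parse(link)
  # Drop external links and anything that is not absolute http(s).
  return false unless uri.is_a?(URI::HTTP) && uri.host == site_host
  # Drop search links; the '/search' prefix is an assumed pattern.
  return false if uri.path.start_with?('/search')
  true
rescue URI::InvalidURIError
  false
end

# Hypothetical crawl loop. `fetch_links` is a caller-supplied callable
# that takes a URL and returns the links found on that page.
# Returns the Set of every URL visited.
def crawl(start_url, fetch_links, workers: 4)
  host = URI.parse(start_url).host
  seen = Set.new([start_url])
  frontier = [start_url]

  until frontier.empty?
    queue = Queue.new            # URLs waiting to be fetched in this round
    frontier.each { |url| queue << url }
    found = Queue.new            # links discovered by the workers

    threads = Array.new(workers) do
      Thread.new do
        loop do
          url = begin
            queue.pop(true)      # non-blocking pop; raises ThreadError when empty
          rescue ThreadError
            break                # nothing left in this round
          end
          fetch_links.call(url).each { |link| found << link }
        end
      end
    end
    threads.each(&:join)

    # Filter and de-duplicate on the main thread, so `seen` needs no lock.
    frontier = []
    until found.empty?
      link = found.pop
      frontier << link if crawlable?(link, host) && seen.add?(link)
    end
  end

  seen
end
```

Passing the page fetcher in as a callable keeps the sketch independent of any HTTP library; in practice it would download the page and extract its anchors, while a test can pass a lambda backed by an in-memory hash of pages.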