0.0
No commit activity in last 3 years
No release in over 3 years
Ruby library for reading sitemaps
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Development

~> 1.3
>= 0

Runtime

 Project Readme

sitemap_reader

Build Status

Ruby library for reading sitemaps

Installation

Add this line to your application's Gemfile:

gem 'sitemap_reader'

And then execute:

$ bundle

Or install it yourself as:

$ gem install sitemap_reader

Usage

require 'sitemap_reader'

sm = SitemapReader.new('http://example.com/sitemap.xml')
sm.get_urls

will return list of ruby hashes representing urls

[
  {
    :loc=>"http://example.com/page1",
    :lastmod=>"2013-08-17 23:00:00",
    :changefreq=>'monthly',
    :priority=>0.8
  },
  {
    :loc=>"http://example.com/page2",
    :lastmod=>nil,
    :changefreq=>nil,
    :priority=>nil
  }
]

The loc attribute of the url in the sitemap cannot be empty, but the rest can be nil if not set or can't be parsed.

TODO

  • optimize code for large sitemaps
  • read sitemapindexes
  • create benchmark tests

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request