Project

gdor-indexer

0.0

No commit activity in last 3 years

No release in over 3 years

gdor-indexer sul-dlss/gdor-indexer Homepage Documentation Source Code Bug Tracker Wiki

PURL doc => Solr hash logic

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

2025

Popularity

21,766

2

0

8

Releases

Current version

0.8.0

9

2015-10-26

2016-09-22

Issues

2

18

20

Issue Closure Rate

90%

Pull Requests

Open Pull Requests

0

Closed Pull Requests

5

Merged Pull Requests

55

Pull Request Acceptance Rate

91%

Development

Primary Language

Ruby

Licenses

Apache 2

Average date of last 50 commits

2016-04-14

Reverse Dependencies

1

Dependencies

Development

~> 1.5

equivalent-xml

~> 0.5

>= 0

rake

>= 0

rdoc

>= 0

~> 3.1

rspec-rails

>= 0

>= 0

rubocop-rspec

>= 0

>= 0

vcr

>= 0

>= 0

yard

>= 0

Runtime

activesupport

>= 0

harvestdor-indexer

>= 0

>= 0

mail

>= 0

>= 0

>= 0

stanford-mods

>= 2.2.1, ~> 2.2

>= 0

Project Readme

gdor-indexer

Code to harvest DOR druids via DOR Fetcher service, mods from PURL, and use it to index items into a Solr index, such as that for SearchWorks.

Prerequisites

ruby 2.x+
bundler gem must be installed

Install steps for running locally

Add this line to your application's Gemfile:

gem 'harvestdor-indexer'

Then execute:

bundle

Configuration

Create a collections folder in the config directory:

cd /path/to/gdor-indexer/config
mkdir collections

Create a yml config file for your collection(s) to be harvested and indexed.

See spec/config/walters_integration_spec.yml for an example. Copy that file to config/collections and change the following settings:

whitelist
dor_fetcher service_url
harvestdor log_dir and log_name
solr_url

whitelist

The whitelist is how you specify which objects to index. The whitelist can be:

an Array of druids inline in the config yml file
a filename containing a list of druids (one per line)

If a druid, per the object's identityMetadata at purl page, is for a:

collection record: then we process all the item druids in that collection (as if they were included individually in the whitelist)
non-collection record: then we process the druid as an individual item

Run the indexer script

$ cd /path/to/gdor-indexer
$ nohup ./bin/indexer -c my_collection &>path/to/nohup.output

Running the tests

rake

Contributing

Fork it (https://help.github.com/articles/fork-a-repo/)
Create your feature branch (git checkout -b my-new-feature)
Write code and tests.
Commit your changes (git commit -am 'Added some feature')
Push to the branch (git push origin my-new-feature)
Create new Pull Request (https://help.github.com/articles/creating-a-pull-request/)