The project is in a healthy, maintained state
Show a bot challenge interstitial for Rails, usually using Cloudflare Turnstile
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
 Dependencies

Development

Runtime

~> 5.2
>= 7.1, < 8.1
 Project Readme

BotChallengePage

CI Gem Version

BotChallengePage lets you protect certain routes in your Rails app with CloudFlare Turnstile "CAPTHCA alternate" bot detector. Rather than the typical form submission use case for Turnstile, the user will be redirected to an interstitial challenge page, and automatically redirected back immediately on success.

The motivating use case is fairly dumb (probably AI-related) crawlers, rather than targetted attacks, although we have tried to pay attention to security. Many of our use cases were crawlers getting caught following every combination of voluminous facet values in search results in a near "infinite space", and causing us resource usage issues.

challenge page screenshot

  • You can optionally configure a rate limit that is allowed BEFORE the challenge is triggered

    • Uses rack-attack to track rate, requires Rails.cache or Rack::Attack.cache.store to be set to a persistent shared high-performance cache, probably redis or memcached.
  • Once a challenge is passed, the pass is stored in a cookie, and a challenge won't be redisplayed for a configurable amount of time, so long as cookie is present

  • Note: User-agent does always need both cookies and javascript enabled to be able to pass challenge and get through!

Installation and Configuration

  • Get a CloudFlare account and Turnstile widget set up, which should give you a turnstile sitekey and secret_key you will need later in configuration.

  • bundle add bot_challenge_page, bundle install

  • Run the installer

    • if you want to use rack-attack for some permissive pre-challenge rate, rails g bot_challenge_page:install

    • If you do not want to use rack-attack and want challenge on FIRST request, rails g bot_challenge_page:install --no-rack-attack

  • If you are not using rack-attack, you need to add a before_action to the controller(s) you'd like to protect, eg:

      before_action only: :index do |controller|
        BotChallengePage::BotChallengePageController.bot_challenge_enforce_filter(controller, immediate: true)
      end
    
  • Configure in the generated ./config/initializers/bot_challenge_page.rb

    • At a minimum you need to configure your Cloudflare Turnstile keys, and some paths to protect!

      • Note that we can only protect GET paths, and also think about making sure you DON'T protect any path your front-end needs JS fetch access to, as this would block it (at least without custom front-end code we haven't really explored)

      • If you are tempted to just protect / that may work, but worth thinking about any hearbeat paths, front-end requestable paths, or other machine-access-desired paths.

    • Some other configuration options are offered -- more advanced/specialized ones are available that are not mentioned in generated config file, see Config class

Customize challenge page display

Some of the default challenge page html uses bootstrap alert classes. You may want to provide custom CSS if you aren't using bootstrap. You can see the default challenge page html at challenge.html.erb. You may wish to CSS-style other parts too!

You can customize all text via I18n, see keys in bot_challenge_page.en.yml

The challenge page by default will be displayed in your app's default rails layout.

To customize the layout or challenge page HTML more further, you can use configuration to supply a render method for the controller pointing to your own templates or other layouts. You will probably want to re-use the partials we use in our default template, for standard functionality. And you'll want to provide <template> elements with the same id's for those elements, but can put whatever you want inside the templates!

BotChallengePage::BotChallengePageController.bot_challenge_config.challenge_renderer = ()->  {
  render "my_local_view_folder/whatever", layout "another_layout"
  render layout: "another_layout" # default html but change layout. etc.
}

Example possible Blacklight config

Many of us in my professional community use blacklight. Here's a possible sample blacklight config to:

  • Protect default catalog controller, including search results and any other actions
  • But give the user 3 free searches in a 36 hour period before challenged
  • For the #facet action used for "facet… more" links -- exempt from protection if the request is being made by a browser JS fetch, we just let those through. (Which means a determined attacker could do that on purpose, not defense against on purpose DDoS)
Rails.application.config.to_prepare do
  BotChallengePage::BotChallengePageController.bot_challenge_config.enabled = true

  # Get from CloudFlare Turnstile: https://www.cloudflare.com/application-services/products/turnstile/
  BotChallengePage::BotChallengePageController.bot_challenge_config.cf_turnstile_sitekey = "MUST GET"
  BotChallengePage::BotChallengePageController.bot_challenge_config.cf_turnstile_secret_key = "MUST GET"

  BotChallengePage::BotChallengePageController.bot_challenge_config.rate_limited_locations = [
    "/catalog"
  ]

  # allow rate_limit_count requests in rate_limit_period, before issuing challenge
  BotChallengePage::BotChallengePageController.bot_challenge_config.rate_limit_period = 36.hour
  BotChallengePage::BotChallengePageController.bot_challenge_config.rate_limit_count = 3

  BotChallengePage::BotChallengePageController.allow_exempt = ->(controller) {
    # Excempt any Catalog #facet action that looks like an ajax/fetch request, the redirect
    # ain't gonna work there, we just exempt it.
    #
    # sec-fetch-dest is set to 'empty' by browser on fetch requests, to limit us further;
    # sure an attacker could fake it, we don't mind if someone determined can avoid
    # bot challenge on this one action
    ( controller.params[:action] == "facet" &&
      controller.request.headers["sec-fetch-dest"] == "empty" &&
      controller.kind_of?(CatalogController)
    )
  }

  BotChallengePage::BotChallengePageController.rack_attack_init
end

Development and automated testing

All logic and config hangs off a controller, with the idea that you could sub-class the controller to override any functionality -- or even have multiple sub-classes in your app with different configuration or customized config. But this hasn't really been tested/fleshed out yet.

Run tests with bundle exec rspec.

We test with a checked-into-repo dummy app at ./spec/dummy, and use Appraisal to test under different rails versions.

Locally one way to test with a specific rails version appraisal is bundle exec appraisal rails-7.2 rspec

If you make any changes to Gemfile you may need to run bundle exec appraisal install and commit changes.

One reason tests are slow is I think we're running system tests with real turnstile proof-of-work bot detection JS code? (Or is it, when we are are using a CF turnstile testing key that always passes?). There aren't many tests so it's no big deal, but this is something that could be investigated/optmized more potentially.

Possible future features?

  • allow regex in default location_matcher? Easy to do if you want it, just say so.

  • We could support swap-in Turnstile-alternatives, like hCAPTHCA, Google reCAPTCHA v3, or even open source proof of work implementations like ALTCHA, pow-bot-deterrent, or Friendly Captcha. But the (free) cost/benefit of Turnstile are pretty good, so I don't myself have a lot of motivation to add this complexity.

  • Something to make it easier to switch the challenge on only based on signals that server/app is under some defined heavy load?

License

The gem is available as open source under the terms of the MIT License.

See also/Acknowledgements