BlueFactory 🏭
A Ruby gem for hosting custom feeds for Bluesky.
Note
ATProto Ruby gems collection: skyfall | blue_factory | minisky | didkit
What does it do
BlueFactory is a Ruby library which helps you build a web service that hosts custom feeds a.k.a. "feed generators" for the Bluesky social network. It implements a simple HTTP server based on Sinatra which provides the required endpoints for the feed generator interface. You need to provide the content for the feed by making a query to your preferred local database.
A feed server will usually be run together with a second piece of code that streams posts from the Bluesky "firehose" stream, runs them through some kind of filter and saves some or all of them to the database. To build that part, you can use my other Ruby gem Skyfall.
Installation
gem install blue_factory
Usage
The server is configured through the BlueFactory
module. The two required settings are:
-
publisher_did
- DID identifier of the account that you will publish the feed on (the string that starts withdid:plc:...
) -
hostname
- the hostname on which the feed service will be run
You also need to configure at least one feed by passing a feed key and a feed object. The key is the identifier that will appear at the end of the feed URI - it must only contain characters that are valid in URLs (preferably all lowercase) and it can't be longer than 15 characters. The object is anything that implements the single required method get_posts
(could be a class, a module or an instance).
So a simple setup could look like this:
require 'blue_factory'
BlueFactory.set :publisher_did, 'did:plc:loremipsumqwerty'
BlueFactory.set :hostname, 'feeds.example.com'
BlueFactory.add_feed 'starwars', StarWarsFeed.new
The feed object
The get_posts
method of the feed object should:
- accept a
params
argument which is a hash with fields::feed
,:cursor
and:limit
(the last two are optional) - optionally, accept a second
current_user
argument which is a string with the authenticated user's DID (depends on authentication config - see below) - return a hash with two fields:
:cursor
and:posts
The :feed
is the at://
URI of the feed. The :cursor
param, if included, should be a cursor returned by your feed from one of the previous requests, so it should be in the format used by the same function - but anyone can call the endpoint with any params, so you should validate it. The cursor is used for pagination to provide more pages further down in the feed (the first request to load the top of the feed doesn't include a cursor).
The :limit
, if included, should be a numeric value specifying the number of posts to return, and you should return at most that many posts in response. According to the spec, the maximum allowed value for the limit is 100, but again, you should verify this. The default limit is 50.
The :cursor
that you return is some kind of string that encodes the offset in the feed for a request for the next page. The structure of the cursor is something for you to decide, and it could possibly be a very long string (the actual length limit is uncertain). See the readme of the official feed-generator repo for some guidelines on how to construct cursor strings.
And finally, the :posts
value should be an array of posts, returned as at://
URI strings only. The Bluesky server that makes the request for the feed will provide all the other data for the posts based on the URIs you return.
If you determine that the request is somehow invalid (e.g. the cursor doesn't match what you expect), you can also raise a BlueFactory::InvalidRequestError
error, which will return a JSON error message with status 400.
An example implementation could look like this:
require 'time'
class StarWarsFeed
def get_posts(params, current_user = nil)
limit = check_query_limit(params)
query = Post.select('uri, time').order('time DESC').limit(limit)
if params[:cursor].to_s != ""
time = Time.at(params[:cursor].to_i)
query = query.where("time < ?", time)
end
posts = query.to_a
last = posts.last
cursor = last && last.time.to_i.to_s
{ cursor: cursor, posts: posts.map(&:uri) }
end
def check_query_limit(params)
if params[:limit]
limit = params[:limit].to_i
(limit < 0) ? 0 : (limit > MAX_LIMIT ? MAX_LIMIT : limit)
else
DEFAULT_LIMIT
end
end
end
Starting the server
The server itself is run using the BlueFactory::Server
class, which is a subclass of Sinatra::Base
and is used as described in the Sinatra documentation (as a "modular application").
In development, you can launch it using:
BlueFactory::Server.run!
In production, you will probably want to create a config.ru
file that instead runs it from the Rack interface:
run BlueFactory::Server
Then, you would configure your preferred Ruby app server like Passenger, Unicorn or Puma to run the server using that config file and configure the main HTTP server (Nginx, Apache) to route requests on the given hostname to that app server.
As an example, an Nginx configuration for a site that runs the server via Passenger could look something like this:
server {
server_name feeds.example.com;
listen 443 ssl;
passenger_enabled on;
root /var/www/feeds/current/public;
ssl_certificate /etc/letsencrypt/live/example.com/fullchain.pem;
ssl_certificate_key /etc/letsencrypt/live/example.com/privkey.pem;
access_log /var/log/nginx/feeds-access.log combined buffer=16k flush=10s;
error_log /var/log/nginx/feeds-error.log;
}
Authentication
Feeds are authenticated using a technology called JSON Web Tokens. If a user is logged in, when they open, refresh or scroll down a feed in their app, requests are made to the feed service from the Bluesky network's IP address with user's authentication token in the Authorization
HTTP header. (This is not the same kind of token as the access token that you use to make API calls - it does not let you perform any actions on user's behalf.)
At the moment, Blue Factory handles authentication in a very simplified way - it extracts the user's DID from the authentication header, but it does not verify the signature. This means that anyone with some programming knowledge can trivially prepare a fake token and make requests to the getFeedSkeleton
endpoint as a different user.
As such, this authentication should not be used for anything critical. It may be used for things like logging, analytics, or as "security by obscurity" to just discourage others from accessing the feed in the app. You can also use this to build personalized feeds, as long as it's not a problem that the user DID may be fake.
To use this simple authentication, set the enable_unsafe_auth
option:
BlueFactory.set :enable_unsafe_auth, true
The user's DID extracted from the token is passed as a second argument to #get_posts
. You may, for example, return an empty list when the user is not authorized to use it:
class HiddenFeed
def get_posts(params, current_user)
if AUTHORIZED_USERS.include?(current_user)
# ...
else
{ posts: [] }
end
end
end
Alternatively, you can raise a BlueFactory::AuthorizationError
with an optional custom message. This will return a 401 status response to the Bluesky app, which will make it display the pink error banner in the app:
class HiddenFeed
def get_posts(params, current_user)
if AUTHORIZED_USERS.include?(current_user)
# ...
else
raise BlueFactory::AuthorizationError, "You shall not pass!"
end
end
end
Unauthenticated access
Please note that the current_user
may be nil - this will happen if the authentication header is not set at all. Since the bsky.app website is now open to the public and can be accessed without authentication, people can also access your feeds without being logged in.
If you want the feed to only be available to logged in users (even if it's a non-personalized feed), simply raise an AuthorizationError
if current_user
is nil:
class RestrictedFeed
def get_posts(params, current_user)
if current_user.nil?
raise BlueFactory::AuthorizationError, "Log in to see this feed"
end
# ...
end
end
Additional configuration & customizing
You can use the Sinatra API to do any additional configuration, like changing the server port, enabling/disabling logging and so on.
For example, you can change the port used in development with:
BlueFactory::Server.set :port, 7777
You can also add additional routes, e.g. to make a redirect or print something on the root URL:
BlueFactory::Server.get '/' do
redirect 'https://github.com/mackuba/blue_factory'
end
Publishing the feed
When your feed server is ready and deployed to the production server, you can use the included bluesky:publish
Rake task to upload the feed configuration to the Bluesky network. To do that, add this line to your Rakefile
:
require 'blue_factory/rake'
You also need to load your BlueFactory
configuration and your feed classes here, so it's recommended that you extract this configuration code to some kind of init file that can be included in the Rakefile
, config.ru
and elsewhere if needed.
To publish the feed, you will need to provide some additional info about the feed, like its public name, through a few more methods in the feed object (the same one that responds to #get_posts
):
-
display_name
(required) - the publicly visible name of your feed, e.g. "WWDC 23" (should be something short) -
description
(optional) - a longer (~1-2 lines) description of what the feed does, displayed on the feed page as the "bio" -
avatar_file
(optional) - path to an avatar image from the project's root (PNG or JPG)
When you're ready, run the rake task passing the feed key (you will be asked for the uploader account's password):
bundle exec rake bluesky:publish KEY=wwdc
Credits
Copyright © 2023 Kuba Suder (@mackuba.eu).
The code is available under the terms of the zlib license (permissive, similar to MIT).
Bug reports and pull requests are welcome 😎