RegexpPropertyValues
This small library lets you see which property values are supported by the regular expression engine of the Ruby version you are running and directly reads out their codepoint ranges from there.
That is, it determines all supported values for \p{value}
expressions and what they match.
Usage
Browse all property values (supported by any Ruby, ever)
require 'regexp_property_values'
PV = RegexpPropertyValues
PV.all # => [<Value name='Alpha'>, <Value name='Blank'>, ...]
Browse property values supported by the Ruby you are running
PV.all_for_current_ruby # => [<Value name='Alpha'>, <Value name='Blank'>, ...]
Inspect property values
PV['alpha'].supported_by_current_ruby? # => true
PV['foobar'].supported_by_current_ruby? # => false
PV['AHex'].matched_characters # => %w[0 1 2 3 4 5 6 7 8 9 A B C ...]
PV['AHex'].matched_codepoints # => [48, 49, 50, ...]
PV['AHex'].matched_ranges # => [48..57, 65..70, 97..102]
# Note: #matched_characters is slow for large properties and you
# may not want to use it in time-critical code. It also omits surrogates.
PV['foobar'].matched_ranges # => RegexpPropertyValues::Error
If character_set
is installed, you can also do this:
PV['AHex'].character_set # => #<CharacterSet: {48, 49...} (size: 22)>
Utility methods
# get a Hash of aliases for property names
PV.alias_hash # => { <Value name='M'> => <Value name='Mark'>, ... }
# download a list of possible properties for the running Ruby version
# (only used for .all and .alias_hash, not needed for prop lookup via .[])
PV.update