Development with The Reformation American English Alphabet

Below is a python transcription script. This takes in a text file in and then transcribes it based on defined equivalents. This script relies on an external CSV (also seen below as en-RAEA.csv) to give these defined equivalents. This script relies on Spacy to provide differentiation between words based on parts of speech. Our implementation is built to require the en_core_web_sm model.

transcribe.py (Version 0.32)
en-RAEA.csv

The transliteration dictionary (wordbook) to build the CSV file is provided down below. It additionally includes extra information regarding why words take the spellings they do. When words are spelt the same in classical english, but need to be differentiated in raea the entry may contain a plus sign. Thus: read+vb specifies that read is the word in its base form while read+vbd is it in its past tense form. Which could then be transcribed as: ríd & red. A list of possible parts of speech values can be found here. Do note that this document (and the csv file) will be updated frequently.

en-RAEA.ods


It is recommended to use the EurKey Keyboard Layout, unless one is familiar with creating custom layouts. However, EurKey doesn't have the ezh key, which means the letter will have to be manually transferred every time. The Sarcasm Mark is not supported by any font, and thus not a part of any keyboard layout.

For linux users here is a qwerty based layout for xkb laid out as such. Good guidance for installing such layouts can be seen either in the arch wiki or through searching online for distro and window manager specific guides.


The following graphs represent the letter and digraph frequencies of the RAEA across our current library corpus.

letter frequency in RAEA based on books digraph frequency in RAEA based on books. top 200 digraph frequency in RAEA based on books. top 41