Dictionary creation for T3, T7 & T9 encodings
freq9.pl
Description:
Perl program that produces a list of all possible frequency-ordered words corresponding to each unique string that would be used to type words using a T3, T7 or T9 encoding. Words appearing in are priviledged over others.
Usage:
Required parameters:
freq9.pl EMMA FILE t#
- EMMA: Any text or corpus. Word frequencies and other text statistics are calculated.
- FILE: Document containing all other words to be included in dictionary. Should be formatted as a two-column list of word frequencies and words, separated with an empty space.
- T#: Preferred encoding. Valid options: t3 t7 t9
Dictionaries:
T3 Dictionary
T7 Dictionary
T9 Dictionary
Files:
freq9.pl
emma-training contains all sections of Jane Austen's Emma not included in user dictation tasks
unigram contains 27,124 English-language words