- getLocale
- isPosPunctuation
Based on the regex compiled in #setPunctuationPosRegex(), determine whether a
given POS string is classified as punctuation.
- isUnpronounceable
- lexiconLookup
Look up a given text in the (standard) lexicon. The part-of-speech is used only
to disambiguate when the text has more than one entry.
- lexiconLookupPrimitive
- maybePronounceable
Determine whether token should be pronounceable, based on text and POS tag.
- phonemise
Phonemise the word text. This starts with a simple lexicon lookup, followed by
some heuristics, and
- readLexicon
Read a lexicon. Lines must have the format graphemestring | phonestring |
optional-parts-of-speech T
- setPh
- setPunctuationPosRegex
Compile a regex pattern used to determine whether tokens are processed as
punctuation or not, based
- setUnpronounceablePosRegex
Compile a regex pattern used to determine whether tokens are processed as
unpronounceable or not, b
- startup
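
The lexicon-related entries above (readLexicon, lexiconLookup, setPunctuationPosRegex, isPosPunctuation) can be illustrated with a minimal, self-contained sketch. It is not the actual implementation: the class name LexiconSketch, the helper addLine, the whitespace-separated layout of the part-of-speech field, and the sample transcriptions are all assumptions made for illustration. Only the line format graphemestring | phonestring | optional-parts-of-speech and the method names come from the summaries above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical stand-in for the phonemiser's lexicon handling.
class LexiconSketch {
    // One lexicon entry: a transcription plus its (possibly empty) POS field.
    static final class Entry {
        final String phones;
        final String pos;
        Entry(String phones, String pos) { this.phones = phones; this.pos = pos; }
    }

    private final Map<String, List<Entry>> entries = new HashMap<>();
    private Pattern punctuationPos; // stays null until a regex is compiled

    // Parse one line: graphemestring | phonestring | optional-parts-of-speech
    public void addLine(String line) {
        String[] fields = line.split("\\|", -1);
        if (fields.length < 2) {
            throw new IllegalArgumentException("Expected at least two fields: " + line);
        }
        String graphemes = fields[0].trim();
        String phones = fields[1].trim();
        String pos = fields.length > 2 ? fields[2].trim() : "";
        entries.computeIfAbsent(graphemes, k -> new ArrayList<>()).add(new Entry(phones, pos));
    }

    // Return a transcription, or null if the text is not in the lexicon.
    // The POS is consulted only when the text has several entries.
    public String lexiconLookup(String text, String pos) {
        List<Entry> candidates = entries.get(text);
        if (candidates == null) {
            return null;
        }
        if (candidates.size() > 1 && pos != null) {
            for (Entry e : candidates) {
                // Assumption: the POS field holds whitespace-separated tags.
                if (Arrays.asList(e.pos.split("\\s+")).contains(pos)) {
                    return e.phones;
                }
            }
        }
        return candidates.get(0).phones; // fall back to the first entry
    }

    // Compile the pattern once so later checks are cheap.
    public void setPunctuationPosRegex(String regex) {
        punctuationPos = Pattern.compile(regex);
    }

    // True only if a pattern has been compiled and the whole POS string matches it.
    public boolean isPosPunctuation(String pos) {
        return pos != null && punctuationPos != null && punctuationPos.matcher(pos).matches();
    }
}
```

In this sketch an ambiguous word resolves through its POS tag, and an unambiguous word ignores the tag entirely. An unknown word returns null, which is the point at which a real phonemiser would move on to its heuristics.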