Profiling a data set
The LatinParsedTokenSequence and the ParsedSequenceCollection both include methods for summarizing the contents of a data set. The summaries cover:
- tokens
- lexemes, and metrics for lexical ambiguity
- morphological forms, with metrics for morphological ambiguity and analytical coverage
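
To make the metrics in this list concrete, here is a minimal Python sketch that computes them over a small hand-built sample. It does not use the library's API. The `parsed` data, the `profile` and `share_ambiguous` functions, and the definitions are all assumptions made for illustration: ambiguity is taken as the share of analyzed tokens with more than one distinct lexeme or form, and coverage as the share of tokens with at least one analysis. The library's own definitions may differ.

```python
# Hypothetical sample: each token is paired with its (lexeme, form) analyses.
# An empty list means the parser produced no analysis for that token.
parsed = [
    ("arma", [("arma", "noun nom pl neut"), ("arma", "noun acc pl neut")]),
    ("virumque", []),
    ("cano", [("cano", "verb 1sg pres ind act"), ("canus", "adj dat sg masc")]),
    ("troiae", [("Troia", "noun gen sg fem")]),
]


def share_ambiguous(sets):
    """Share of sets holding more than one member (0.0 for no sets)."""
    return sum(len(s) > 1 for s in sets) / len(sets) if sets else 0.0


def profile(parsed):
    """Summarize tokens, lexemes, and forms (assumed metric definitions)."""
    analyzed = [analyses for _, analyses in parsed if analyses]
    # Distinct lexemes and distinct forms proposed for each analyzed token.
    lexeme_sets = [{lex for lex, _ in analyses} for analyses in analyzed]
    form_sets = [{form for _, form in analyses} for analyses in analyzed]
    return {
        "tokens": len(parsed),
        "distinct_tokens": len({token for token, _ in parsed}),
        "distinct_lexemes": len(set().union(*lexeme_sets)),
        "distinct_forms": len(set().union(*form_sets)),
        "lexical_ambiguity": share_ambiguous(lexeme_sets),
        "morphological_ambiguity": share_ambiguous(form_sets),
        "coverage": len(analyzed) / len(parsed) if parsed else 0.0,
    }


summary = profile(parsed)
for name, value in summary.items():
    print(f"{name}: {value}")
```

In this sample, "arma" is morphologically ambiguous but lexically unambiguous (two forms of one lexeme), while "cano" is ambiguous on both counts. Keeping the two measures separate is what lets a profile distinguish these cases.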