Version 7.0.0-pr6
The latincorpus
library
The foundational class hierarchy
The library works with a hierarchy of three central models: the parsed token, a sequence of parsed tokens (such as an n-gram, or an entire edition), and a collection of parsed token sequences (for example, a collection of sentences).
See more about these data models.
Analytical operations: matching and filtering
This hierarchy of classes makes it possible to match individual tokens, and apply filters to tokens organized in different ways.
You equally easily find examples of tokens analyzed as verbs in the imperfect tense and passive voice, find occurrences of all forms of the verb interficio
, or select sentences with verbs in the imperfect subjunctive.
See more about matching and filtering data.
Analytical operations: profiling a data set
Both the LatinParsedTokenSequence
and the ParsedSequenceCollection
include a number of methods to simplify summarizing, characterizing and comparing data sets.
See more about profiling data sets: TBA.