API Documentation
Types
CitableCorpusAnalysis.AnalyticalCorpus — TypeEssential components of an analytical corpus.
Its contents must be citable, in a defined orthography, and tokens defined by that orthography must be parseable.
Exported functions
CitableCorpusAnalysis.tacorpus — FunctionCreate a TextAnalysis.Corpus from a CitableTextCorpus.
tacorpus(c)
CitableCorpusAnalysis.talexicon — FunctionCreate a lexicon (a Dict of tokens to counts) from a CitableTextCorpus using the lexicon function of the TextAnalysis module.
talexicon(c)
CitableCorpusAnalysis.dtmatrix — FunctionCreates the document-term matrix for a CitableTextCorpus in dense matrix format.
dtmatrix(c)
Note that this relies on the dictionary created by TextAnalysis.lexicon().