API Documentation
Types
CitableCorpusAnalysis.AnalyticalCorpus
— TypeEssential components of an analytical corpus.
Its contents must be citable, in a defined orthography, and tokens defined by that orthography must be parseable.
Exported functions
CitableCorpusAnalysis.tacorpus
— FunctionCreate a TextAnalysis.Corpus
from a CitableTextCorpus
.
tacorpus(c)
CitableCorpusAnalysis.talexicon
— FunctionCreate a lexicon (a Dict
of tokens to counts) from a CitableTextCorpus
using the lexicon
function of the TextAnalysis
module.
talexicon(c)
CitableCorpusAnalysis.dtmatrix
— FunctionCreates the document-term matrix for a CitableTextCorpus
in dense matrix format.
dtmatrix(c)
Note that this relies on the dictionary created by TextAnalysis.lexicon()
.