Uses of Interface
cc.mallet.extract.Tokenization
Packages that use Tokenization
Package             Description
cc.mallet.extract   Unimplemented.
Uses of Tokenization in cc.mallet.extract
Classes in cc.mallet.extract that implement Tokenization
Modifier and Type   Class   Description
class StringTokenization
Methods in cc.mallet.extract that return Tokenization
Modifier and Type   Method   Description
Tokenization DocumentExtraction.getInput()
Methods in cc.mallet.extract with parameters of type Tokenization
Modifier and Type   Method   Description
LabeledSpans BIOTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
LabeledSpans ConfidenceTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
LabeledSpans DefaultTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
LabeledSpans HierarchicalTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
LabeledSpans TokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
    Converts the sequence of labels into a set of labeled spans.
protected Span BIOTokenizationFilter.createSpan(Tokenization input, int startTokenIdx, int endTokenIdx)
protected Span BIOTokenizationFilterWithTokenIndices.createSpan(Tokenization input, int startTokenIdx, int endTokenIdx)
Extraction CRFExtractor.extract(Tokenization spans)
Extraction Extractor.extract(Tokenization toks)
    Performs extraction from an object that has already been tokenized.
Constructors in cc.mallet.extract with parameters of type Tokenization
Constructor   Description
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, LabeledSpans predictedSpans, LabeledSpans trueSpans, java.lang.String background)
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, Sequence predicted, Sequence target, java.lang.String background)
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, Sequence predicted, Sequence target, java.lang.String background, TokenizationFilter filter)
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, Sequence predicted, java.lang.String background)
Extraction(Extractor extractor, LabelAlphabet dict, java.lang.String name, Tokenization input, Sequence output, java.lang.String background)
    Creates an extraction given a sequence output by some kind of per-sequence labeler, like an HMM or a CRF.
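
The constructLabeledSpans methods listed on this page are described as converting a sequence of labels into a set of labeled spans, with a background tag marking tokens that belong to no span. The sketch below illustrates that general idea for BIO-style tags. It is a standalone illustration using only the Java standard library, not MALLET's implementation: the names BioSpanSketch, LabeledSpan, and decode, the "B-"/"I-" prefix convention, and the exclusive end index are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for illustration only; not part of cc.mallet.extract. */
final class BioSpanSketch {

    /** A labeled run of tokens covering [startToken, endToken). */
    static final class LabeledSpan {
        final String label;
        final int startToken;
        final int endToken; // exclusive (an assumption of this sketch)

        LabeledSpan(String label, int startToken, int endToken) {
            this.label = label;
            this.startToken = startToken;
            this.endToken = endToken;
        }

        @Override
        public String toString() {
            return label + "[" + startToken + "," + endToken + ")";
        }
    }

    /**
     * Groups per-token tags into labeled spans.
     * Tokens tagged with the background tag belong to no span.
     * "B-X" always opens a new span; "I-X" (or a bare "X") extends an
     * open span with the same label and otherwise opens a new one.
     */
    static List<LabeledSpan> decode(String[] tags, String background) {
        List<LabeledSpan> spans = new ArrayList<>();
        String current = null; // label of the span being built, or null
        int start = -1;        // index of its first token

        for (int i = 0; i < tags.length; i++) {
            String tag = tags[i];
            String label = null;
            boolean begins = false;
            if (!tag.equals(background)) {
                begins = tag.startsWith("B-");
                label = (begins || tag.startsWith("I-")) ? tag.substring(2) : tag;
            }
            boolean continues = label != null && !begins && label.equals(current);
            if (!continues) {
                if (current != null) {       // close the open span
                    spans.add(new LabeledSpan(current, start, i));
                    current = null;
                }
                if (label != null) {         // open a new span at this token
                    current = label;
                    start = i;
                }
            }
        }
        if (current != null) {               // close a span that runs to the end
            spans.add(new LabeledSpan(current, start, tags.length));
        }
        return spans;
    }

    public static void main(String[] args) {
        String[] tags = {"O", "B-PER", "I-PER", "O", "B-LOC"};
        System.out.println(decode(tags, "O")); // prints [PER[1,3), LOC[4,5)]
    }
}
```

In the real API, the input would be a Tokenization and a Sequence of labels rather than a String array, and the result would be a LabeledSpans object; the sketch only shows the grouping step.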