Package cc.mallet.extract
Interface TokenizationFilter
All Known Implementing Classes:
BIOTokenizationFilter, BIOTokenizationFilterWithTokenIndices, ConfidenceTokenizationFilter, DefaultTokenizationFilter, HierarchicalTokenizationFilter
public interface TokenizationFilter
Created: Nov 12, 2004
Version: $Id: TokenizationFilter.java,v 1.1 2007/10/22 21:37:44 mccallum Exp $
Method Summary
Modifier and Type: LabeledSpans
Method: constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
Description: Converts the sequence of labels into a set of labeled spans.
Method Detail
constructLabeledSpans
LabeledSpans constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
Converts the sequence of labels into a set of labeled spans. Essentially, this converts the output of sequence labeling into an extraction output.
Parameters:
dict
document
backgroundTag
input
seq
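The conversion described above, from a per-token label sequence to labeled spans, can be illustrated with a minimal standalone sketch. It does not use the MALLET classes: `LabelSpanSketch`, `Span`, and `toSpans` are hypothetical names introduced only for illustration, labels are plain strings, and the merging policy (adjacent tokens with the same label form one span, background tokens are dropped) is an assumption. The actual implementing classes, such as BIOTokenizationFilter, may segment and retain spans differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for illustration only; NOT part of cc.mallet.extract.
final class LabelSpanSketch {

    /** A labeled run of tokens covering token indices [start, end). */
    static final class Span {
        final String label;
        final int start; // inclusive token index
        final int end;   // exclusive token index

        Span(String label, int start, int end) {
            this.label = label;
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return label + "[" + start + "," + end + ")";
        }
    }

    /**
     * Assumed policy: merge each maximal run of identical labels into one span
     * and skip runs whose label equals the background tag.
     */
    static List<Span> toSpans(List<String> labels, String backgroundTag) {
        List<Span> spans = new ArrayList<>();
        int start = 0;
        while (start < labels.size()) {
            String label = labels.get(start);
            int end = start + 1;
            // Extend the run while the next token carries the same label.
            while (end < labels.size() && labels.get(end).equals(label)) {
                end++;
            }
            if (!label.equals(backgroundTag)) {
                spans.add(new Span(label, start, end));
            }
            start = end;
        }
        return spans;
    }

    public static void main(String[] args) {
        // One label per token; "O" plays the role of backgroundTag.
        List<String> labels = List.of("O", "PER", "PER", "O", "LOC");
        System.out.println(toSpans(labels, "O")); // prints [PER[1,3), LOC[4,5)]
    }
}
```

In this sketch the string list stands in for `seq` and `"O"` for `backgroundTag`; the real method additionally receives the label alphabet, the source document, and the input Tokenization, which this simplified version omits.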