Package cc.mallet.pipe
Class SelectiveSGML2TokenSequence
- java.lang.Object
-
- cc.mallet.pipe.Pipe
-
- cc.mallet.pipe.SelectiveSGML2TokenSequence
-
- All Implemented Interfaces:
AlphabetCarrying
,java.io.Serializable
public class SelectiveSGML2TokenSequence extends Pipe implements java.io.Serializable
Similar toSGML2TokenSequence
, except that only the tags listed inallowedTags
are converted toLabel
s.- Author:
- Aron Culotta culotta@cs.umass.edu
- See Also:
- Serialized Form
-
-
Constructor Summary
Constructors Constructor Description SelectiveSGML2TokenSequence(CharSequenceLexer lexer, java.lang.String backgroundTag, java.util.Set allowed)
SelectiveSGML2TokenSequence(CharSequenceLexer lex, java.util.Set allowed)
SelectiveSGML2TokenSequence(java.lang.String regex, java.lang.String backgroundTag, java.util.Set allowed)
SelectiveSGML2TokenSequence(java.util.Set allowed)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Instance
pipe(Instance carrier)
Really this should be 'protected', but isn't for historical reasons.java.lang.String
toString()
-
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
-
-
-
-
Constructor Detail
-
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(CharSequenceLexer lexer, java.lang.String backgroundTag, java.util.Set allowed)
- Parameters:
lexer
- to tokenize inputbackgroundTag
- default tag when not in any other tagallowed
- set of tags (Strings) that will be converted to labels
-
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(java.lang.String regex, java.lang.String backgroundTag, java.util.Set allowed)
-
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(java.util.Set allowed)
-
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(CharSequenceLexer lex, java.util.Set allowed)
-
-