Package cc.mallet.pipe
Class CharSequenceRemoveUUEncodedBlocks
- java.lang.Object
-
- cc.mallet.pipe.Pipe
-
- cc.mallet.pipe.CharSequenceRemoveUUEncodedBlocks
-
- All Implemented Interfaces:
AlphabetCarrying
,java.io.Serializable
public class CharSequenceRemoveUUEncodedBlocks extends Pipe
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description static java.util.regex.Pattern
UU_ENCODED_LINE
Given a string, remove lines that begin with M and are 61 characters long.
-
Constructor Summary
Constructors Constructor Description CharSequenceRemoveUUEncodedBlocks()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Instance
pipe(Instance carrier)
Really this should be 'protected', but isn't for historical reasons.-
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
-
-
-
-
Field Detail
-
UU_ENCODED_LINE
public static final java.util.regex.Pattern UU_ENCODED_LINE
Given a string, remove lines that begin with M and are 61 characters long. Note that there are some UUEncoded blocks that do not match this. I have seen some that are 64 characters long, and have no regular prefix character, but this filter gets most of them in 20 Newsgroups.
-
-