Class TokenSequenceRemoveNonAlpha

  • All Implemented Interfaces:

    public class TokenSequenceRemoveNonAlpha
    extends Pipe
    Remove tokens that contain non-alphabetic characters. This class is used in conjunction with CharSequenceLexer.LEX_NON_WHITESPACE_CLASSES and FeatureSequenceWithBigrams, which in turn is used by TopicalNGrams.
    Author:
    Andrew McCallum
    See Also:
    Serialized Form
    • Constructor Detail

      • TokenSequenceRemoveNonAlpha

        public TokenSequenceRemoveNonAlpha​(boolean markDeletions)
      • TokenSequenceRemoveNonAlpha

        public TokenSequenceRemoveNonAlpha()
    • Method Detail

      • pipe

        public Instance pipe​(Instance carrier)
        Description copied from class: Pipe
        Really this should be 'protected', but isn't for historical reasons.
        Overrides:
        pipe in class Pipe
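
The filtering described for this class can be illustrated with a small standalone sketch. This is not the library's implementation: the class name NonAlphaFilterSketch and the method keepAlphabetic are hypothetical, tokens are modeled as plain strings rather than as an Instance carrying a token sequence, and "alphabetic" is assumed to mean that the whole token matches \p{Alpha}+ (ASCII letters under default java.util.regex settings). The markDeletions option is not modeled here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only; it does not use or reproduce the Pipe API.
final class NonAlphaFilterSketch {

    // Assumption: a token counts as alphabetic when every character is a letter.
    // With default flags, \p{Alpha} matches ASCII letters only.
    private static final Pattern ALPHA_ONLY = Pattern.compile("\\p{Alpha}+");

    // Returns a new list holding only the tokens made up entirely of letters,
    // in their original order. The input list is left unchanged.
    static List<String> keepAlphabetic(List<String> tokens) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (ALPHA_ONLY.matcher(token).matches()) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("topic", "model2", "n-gram", "Bigrams", "42");
        System.out.println(keepAlphabetic(tokens)); // prints [topic, Bigrams]
    }
}
```

In this sketch a token such as "n-gram" is dropped whole rather than split, which matches the "remove tokens" wording above. Whether the real class treats accented or other non-ASCII letters as alphabetic depends on the pattern it actually uses, which this page does not state.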