Package cc.mallet.fst

Class SimpleTagger.SimpleTaggerSentence2FeatureVectorSequence

  • All Implemented Interfaces:
    Enclosing class: SimpleTagger

    public static class SimpleTagger.SimpleTaggerSentence2FeatureVectorSequence
    extends Pipe
    Converts an external encoding of a sequence of elements with binary features to a FeatureVectorSequence. If target processing is on (training or labeled test data), it extracts element labels from the external encoding to create a target LabelSequence. Two external encodings are supported:
    1. A String containing lines of whitespace-separated tokens.
    2. A String[][].
    Both encodings represent rows of tokens. When target processing is on, the last token in each row is the label of the sequence element that the row represents, and all other tokens in the row are the names of features that are on for that element. When target processing is off, every token in the row is a feature name.
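    The row layout described above can be sketched in plain Java. This is an illustration only, not MALLET's implementation: the class RowEncodingSketch, its Row type, and the methods tokenize and parse are hypothetical names, and the sketch produces simple lists rather than a FeatureVectorSequence or LabelSequence. Skipping blank lines and empty rows is also an assumption of the sketch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper that mirrors the documented row layout without MALLET classes.
final class RowEncodingSketch {

    // One sequence element: the features that are on, plus an optional label.
    static final class Row {
        public final List<String> features;
        public final String label; // null when target processing is off

        Row(List<String> features, String label) {
            this.features = features;
            this.label = label;
        }
    }

    // Encoding 1 to encoding 2: split a String into lines, then split each
    // line into whitespace-separated tokens. Blank lines are skipped (assumption).
    static String[][] tokenize(String sentence) {
        List<String[]> rows = new ArrayList<>();
        for (String line : sentence.split("\\R")) {
            String trimmed = line.trim();
            if (!trimmed.isEmpty()) {
                rows.add(trimmed.split("\\s+"));
            }
        }
        return rows.toArray(new String[0][]);
    }

    // Interpret each row: with target processing on, the last token is the
    // label and the rest are feature names; otherwise every token is a feature.
    static List<Row> parse(String[][] tokens, boolean targetProcessing) {
        List<Row> rows = new ArrayList<>();
        for (String[] row : tokens) {
            if (row.length == 0) {
                continue; // empty rows are skipped (assumption)
            }
            int featureCount = targetProcessing ? row.length - 1 : row.length;
            List<String> features =
                new ArrayList<>(Arrays.asList(row).subList(0, featureCount));
            String label = targetProcessing ? row[row.length - 1] : null;
            rows.add(new Row(features, label));
        }
        return rows;
    }
}
```

    For instance, given the made-up input "CAPITALIZED WORD=Bill B-PER\nWORD=slept O", calling parse(tokenize(input), true) yields two rows whose labels are B-PER and O; calling it with false treats B-PER and O as ordinary feature names.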
    See Also:
    Serialized Form
    • Constructor Detail

      • SimpleTaggerSentence2FeatureVectorSequence

        public SimpleTaggerSentence2FeatureVectorSequence()
        Creates a new SimpleTaggerSentence2FeatureVectorSequence instance.
    • Method Detail

      • pipe

        public Instance pipe(Instance carrier)
        Description copied from class: Pipe
        Really this should be 'protected', but isn't for historical reasons.
        Overrides:
        pipe in class Pipe