Uses of Interface
cc.mallet.types.AlphabetCarrying
-
Packages that use AlphabetCarrying Package Description cc.mallet.classify Classes for training and classifying instances.cc.mallet.cluster.tui cc.mallet.extract.pipe cc.mallet.fst Transducers, including Conditional Random Fields (CRFs).cc.mallet.fst.tests Tests for Transducers, including Conditional Random Fields (CRFs).cc.mallet.pipe Classes for processing arbitrary data into instances.cc.mallet.pipe.tests JUnit tests for pipes.cc.mallet.pipe.tsf TokenSequenceFeature Pipes.cc.mallet.share.casutton.ner cc.mallet.share.mccallum.ner Named entity recognizer.cc.mallet.share.upenn.ner cc.mallet.share.weili.ner.enron cc.mallet.types Fundamental MALLET types, including FeatureVector, Instance, Label etc. -
-
Uses of AlphabetCarrying in cc.mallet.classify
Classes in cc.mallet.classify that implement AlphabetCarrying Modifier and Type Class Description class
AdaBoost
AdaBoost Robert E.class
AdaBoostM2
AdaBoostM2class
BaggingClassifier
class
BalancedWinnow
Classification methods of BalancedWinnow algorithm.class
C45
A C4.5 Decision Tree classifier.class
Classifier
Abstract parent of all Classifiers.class
ClassifierEnsemble
Classifer for an ensemble of classifers, combined with learned weights.class
ConfidencePredictingClassifier
class
DecisionTree
Decision Tree classifier.class
MaxEnt
Maximum Entropy (AKA Multivariate Logistic Regression) classifier.class
MCMaxEnt
Maximum Entropy classifier.class
MostFrequentClassAssignmentTrainer
A Classifier Trainer to be used with MostFrequentClassifier.class
MostFrequentClassifier
A Classifier that will return the most frequent class label based on a training set.class
NaiveBayes
A classifier that classifies instances according to the NaiveBayes method.class
NaiveBayesTrainer
Class used to generate a NaiveBayes classifier from a set of training data.class
PRAuxClassifier
Auxiliary model (q) for E-step/I-projection in PR training.class
RandomAssignmentTrainer
A Classifier Trainer to be used with RandomClassifier.class
RandomClassifier
A Classifier that will return a randomly selected class label.class
RankMaxEnt
Rank Maximum Entropy classifier.class
Winnow
Classification methods of Winnow2 algorithm.Methods in cc.mallet.classify with parameters of type AlphabetCarrying Modifier and Type Method Description boolean
Classifier. alphabetsMatch(AlphabetCarrying object)
boolean
MostFrequentClassAssignmentTrainer. alphabetsMatch(AlphabetCarrying object)
boolean
NaiveBayesTrainer. alphabetsMatch(AlphabetCarrying object)
boolean
RandomAssignmentTrainer. alphabetsMatch(AlphabetCarrying object)
-
Uses of AlphabetCarrying in cc.mallet.cluster.tui
Classes in cc.mallet.cluster.tui that implement AlphabetCarrying Modifier and Type Class Description static class
Clusterings2Clusterer.ClusteringPipe
-
Uses of AlphabetCarrying in cc.mallet.extract.pipe
Classes in cc.mallet.extract.pipe that implement AlphabetCarrying Modifier and Type Class Description class
TokenSequence2Tokenization
Heuristically converts a simple token sequence into a Tokenization that can be used with all the extract package goodies. -
Uses of AlphabetCarrying in cc.mallet.fst
Classes in cc.mallet.fst that implement AlphabetCarrying Modifier and Type Class Description static class
SimpleTagger.SimpleTaggerSentence2FeatureVectorSequence
Converts an external encoding of a sequence of elements with binary features to aFeatureVectorSequence
. -
Uses of AlphabetCarrying in cc.mallet.fst.tests
Classes in cc.mallet.fst.tests that implement AlphabetCarrying Modifier and Type Class Description static class
TestCRF.TestCRF2String
static class
TestCRF.TestCRFTokenSequenceRemoveSpaces
class
TestMEMM.TestMEMM2String
static class
TestMEMM.TestMEMMTokenSequenceRemoveSpaces
-
Uses of AlphabetCarrying in cc.mallet.pipe
Classes in cc.mallet.pipe that implement AlphabetCarrying Modifier and Type Class Description class
AddClassifierTokenPredictions
This pipe uses a Classifier to label each token (i.e., using 0-th order Markov assumption), then adds the predictions as features to each token.static class
AddClassifierTokenPredictions.TokenClassifiers
This inner class represents the trained token classifiers.class
Array2FeatureVector
Converts a Java array of numerical types to a FeatureVector, where the Alphabet is the data array index wrapped in an Integer object.class
AugmentableFeatureVectorAddConjunctions
Add specified conjunctions to each instance.class
AugmentableFeatureVectorLogScale
Given an AugmentableFeatureVector, set those values greater than or equal to 1 to log(value)+1.class
BranchingPipe
Deprecated.class
CharSequence2CharNGrams
Transform a character sequence into a token sequence of character N grams.class
CharSequence2TokenSequence
Pipe that tokenizes a character sequence.class
CharSequenceArray2TokenSequence
Transform an array of character Sequences into a token sequence.class
CharSequenceLowercase
Replace the data string or string buffer with a lowercased version.class
CharSequenceNoDiacritics
A string normalizer which performs the following steps: Unicode canonical decomposition (Form#NFD
) Removal of diacritical marks Unicode canonical composition (Form#NFC
)class
CharSequenceRemoveHTML
This pipe removes HTML from a CharSequence.class
CharSequenceRemoveUUEncodedBlocks
class
CharSequenceReplace
Given a string, repeatedly look for matches of the regex, and replace the entire match with the given replacement string.class
CharSequenceReplaceHtmlEntities
Be careful here: this pipe must be applied beforeCharSequenceLowercase
because it is case sensitive.class
CharSubsequence
Given a string, return only the portion of the string inside a regex parenthesized group.class
Classification2ConfidencePredictingFeatureVector
Pipe features from underlying classifier to the confidence prediction instance listclass
CountsToFeatureSequencePipe
class
Csv2Array
Converts a string of comma separated values to an array.class
Csv2FeatureVector
Converts a string of the form feature_1:val_1 feature_2:val_2 ...class
Directory2FileIterator
Convert a File object representing a directory into a FileIterator which iterates over files in the directory matching a pattern and which extracts a label from each file path to become the target field of the instance.class
FeatureCountPipe
Pruning low-count features can be a good way to save memory and computation.class
FeatureDocFreqPipe
Pruning low-count features can be a good way to save memory and computation.class
FeatureSequence2AugmentableFeatureVector
Convert the data field from a feature sequence to an augmentable feature vector.class
FeatureSequence2FeatureVector
Convert the data field from a feature sequence to a feature vector.class
FeatureSequenceConvolution
class
FeatureValueString2FeatureVector
class
FeatureVectorConjunctions
Include in the FeatureVector conjunctions of all its features.class
FeatureVectorSequence2FeatureVectors
Given instances with a FeatureVectorSequence in the data field, break up the sequence into the individual FeatureVectors, producing one FeatureVector per Instance.class
Filename2CharSequence
Given a filename contained in a string, read in contents of file into a CharSequence.class
FilterEmptyFeatureVectors
class
FixedVocabTokenizer
A simple unicode tokenizer that accepts sequences of letters as tokens.class
Input2CharSequence
Pipe that can read from various kinds of text sources (either URI, File, or Reader) into a CharSequenceclass
InstanceListTrimFeaturesByCount
Unimplemented.class
LineGroupString2TokenSequence
class
MakeAmpersandXMLFriendly
convert & to & in tokens of a token sequenceclass
NGramPreprocessor
This pipe changes text to lowercase, removes common XML entities (quot, apos, lt, gt), and replaces all punctuation except the - character with whitespace.class
Noop
A pipe that does nothing to the instance fields but which has side effects on the dictionary.class
Pipe
The abstract superclass of all Pipes, which transform one data type to another.class
PrintInput
Print the data field of each instance.class
PrintInputAndTarget
Print the data and target fields of each instance.class
PrintTokenSequenceFeatures
Print properties of the token sequence in the data field and the corresponding value of any token in a token sequence or feature in a featur sequence in the target field.class
SaveDataInSource
Set the source field of each instance to its data field.class
SelectiveSGML2TokenSequence
Similar toSGML2TokenSequence
, except that only the tags listed inallowedTags
are converted toLabel
s.class
SerialPipes
Convert an instance through a sequence of pipes.class
SGML2TokenSequence
Converts a string containing simple SGML tags into a dta TokenSequence of words, paired with a target TokenSequence containing the SGML tags in effect for each word.class
SimpleTaggerSentence2StringTokenization
This extendsSimpleTaggerSentence2TokenSequence
to use {Slink StringTokenizations} for use with the extract package.class
SimpleTaggerSentence2TokenSequence
Converts an external encoding of a sequence of elements with binary features to aTokenSequence
.class
SimpleTokenizer
A simple unicode tokenizer that accepts sequences of letters as tokens.class
SourceLocation2TokenSequence
Read from File or BufferedRead in the data field and produce a TokenSequence.class
StringAddNewLineDelimiter
Pipe that can adds special text between lines to explicitly represent line breaks.class
StringList2FeatureSequence
Convert a list of strings into a feature sequenceclass
SvmLight2FeatureVectorAndLabel
This Pipe converts a line in SVMLight format to a Mallet instance with FeatureVector data and Label target.class
Target2Double
Convert object in the target field into a floating-point numeric typeclass
Target2FeatureSequence
Convert a token sequence in the target field into a feature sequence in the target field.class
Target2Integer
Convert object in the target field into an integer numeric typeclass
Target2Label
Convert object in the target field into a label in the target field.class
Target2LabelSequence
convert a token sequence in the target field into a label sequence in the target field.class
TargetRememberLastLabel
For each position in the target, remember the last non-background label.class
TargetStringToFeatures
class
Token2FeatureVector
convert the property list on a token into a feature vectorclass
TokenSequence2FeatureSequence
Convert the token sequence in the data field each instance to a feature sequence.class
TokenSequence2FeatureSequenceWithBigrams
Convert the token sequence in the data field of each instance to a feature sequence that preserves bigram information.class
TokenSequence2FeatureVectorSequence
Convert the token sequence in the data field of each instance to a feature vector sequence.class
TokenSequence2PorterStems
class
TokenSequenceLowercase
Convert the text in each token in the token sequence in the data field to lower case.class
TokenSequenceMatchDataAndTarget
Run a regular expression over the text of each token; replace the text with the substring matching one regex group; create a target TokenSequence from the text matching another regex group.class
TokenSequenceNGrams
Convert the token sequence in the data field to a token sequence of ngrams.class
TokenSequenceParseFeatureString
Convert the string in each fieldToken.text
to a list of Strings (space delimited).class
TokenSequenceRemoveNonAlpha
Remove tokens that contain non-alphabetic characters.class
TokenSequenceRemoveStopPatterns
Remove tokens from the token sequence in the data field whose text matches any of a set of regular expressions.class
TokenSequenceRemoveStopwords
Remove tokens from the token sequence in the data field whose text is in the stopword list.class
ValueString2FeatureVector
Methods in cc.mallet.pipe with parameters of type AlphabetCarrying Modifier and Type Method Description boolean
Pipe. alphabetsMatch(AlphabetCarrying object)
-
Uses of AlphabetCarrying in cc.mallet.pipe.tests
Classes in cc.mallet.pipe.tests that implement AlphabetCarrying Modifier and Type Class Description static class
TestInstancePipe.Array2ArrayIterator
static class
TestSGML2TokenSequence.Array2ArrayIterator
-
Uses of AlphabetCarrying in cc.mallet.pipe.tsf
Classes in cc.mallet.pipe.tsf that implement AlphabetCarrying Modifier and Type Class Description class
CountMatches
class
CountMatchesAlignedWithOffsets
class
CountMatchesMatching
class
FeaturesInWindow
class
FeaturesOfFirstMention
class
LexiconMembership
class
OffsetConjunctions
class
OffsetFeatureConjunction
class
OffsetPropertyConjunctions
class
RegexMatches
class
SequencePrintingPipe
Created: Jul 6, 2005class
Target2BIOFormat
class
TokenFirstPosition
class
TokenText
class
TokenTextCharNGrams
class
TokenTextCharPrefix
class
TokenTextCharSuffix
class
TokenTextNGrams
class
TrieLexiconMembership
class
WordVectors
-
Uses of AlphabetCarrying in cc.mallet.share.casutton.ner
Classes in cc.mallet.share.casutton.ner that implement AlphabetCarrying Modifier and Type Class Description class
ConllNer2003Sentence2TokenSequence
Reads a data file in CoNLL 2003 format, and makes some simple transformations. -
Uses of AlphabetCarrying in cc.mallet.share.mccallum.ner
Classes in cc.mallet.share.mccallum.ner that implement AlphabetCarrying Modifier and Type Class Description class
ConllNer2003Sentence2TokenSequence
class
TokenSequenceDocHeader
-
Uses of AlphabetCarrying in cc.mallet.share.upenn.ner
Classes in cc.mallet.share.upenn.ner that implement AlphabetCarrying Modifier and Type Class Description class
FeatureWindow
Adds all features of tokens in the window to the center token.class
LengthBins
A feature approximating string length.class
ListMember
Checks membership in a lexicon in a text file.class
LongRegexMatches
Matches a regular expression which spans several tokens.class
NEPipes
-
Uses of AlphabetCarrying in cc.mallet.share.weili.ner.enron
Classes in cc.mallet.share.weili.ner.enron that implement AlphabetCarrying Modifier and Type Class Description class
EnronMessage2TokenSequence
-
Uses of AlphabetCarrying in cc.mallet.types
Subinterfaces of AlphabetCarrying in cc.mallet.types Modifier and Type Interface Description interface
Labeling
A distribution over possible labels for an instance.Classes in cc.mallet.types that implement AlphabetCarrying Modifier and Type Class Description class
AugmentableFeatureVector
class
BiNormalSeparation
Bi-Normal Separation is a feature weighting algorithm introduced in: An Extensive Empirical Study of Feature Selection Metrics for Text Classification, George Forman, Journal of Machine Learning Research, 3:1289--1305, 2003.class
ExpGain
class
FeatureCounts
class
FeatureSelection
class
FeatureSequence
An implementation ofSequence
that ensures that every Object in the sequence has the same class.class
FeatureSequenceWithBigrams
A FeatureSequence with a parallel record of bigrams, kept in a separate dictionaryclass
FeatureVector
A subset of anAlphabet
in which each element of the subset has an associated value.class
FeatureVectorSequence
class
GainRatio
List of features along with their thresholds sorted in descending order of the ratio of (1) information gained by splitting instances on the feature at its associated threshold value, to (2) the split information.class
GradientGain
class
InfoGain
class
Instance
A machine learning "example" to be used in training, testing or performance of various machine learning algorithms.class
InstanceList
A list of machine learning instances, typically used for training or testing of a machine learning algorithm.class
KLGain
class
Label
class
Labelings
A collection of labelings, either for a multi-label problem (all labels are part of the same label dictionary), or a factorized labeling, (each label is part of a different dictionary).class
Labels
Usually some distribution over possible labels for an instance.class
LabelSequence
class
LabelsSequence
A simpleSequence
implementation where all of the elements must be Labels.class
LabelVector
class
MultiInstanceList
An implementation of InstanceList that logically combines multiple instance lists so that they appear as one list without copying the original lists.class
Multinomial
A probability distribution over a set of features represented as aFeatureVector
.static class
Multinomial.Logged
A Multinomial in which the values associated with each feature index fi is Math.log(probability[fi]) instead of probability[fi].class
NullLabel
Object that carries a LabelAlphabet.class
PagedInstanceList
An InstanceList which avoids OutOfMemoryErrors by saving Instances to disk when there is not enough memory to create a new Instance.class
PartiallyRankedFeatureVector
class
RankedFeatureVector
class
ROCData
Tracks ROC data for instances inTrial
results.class
StringEditFeatureVectorSequence
Methods in cc.mallet.types with parameters of type AlphabetCarrying Modifier and Type Method Description static boolean
Alphabet. alphabetsMatch(AlphabetCarrying object1, AlphabetCarrying object2)
Convenience method that can often implement alphabetsMatch in classes that implement the AlphabetsCarrying interface.boolean
FeatureSequence. alphabetsMatch(AlphabetCarrying object)
boolean
FeatureVector. alphabetsMatch(AlphabetCarrying object)
boolean
Instance. alphabetsMatch(AlphabetCarrying object)
-