object --+
         |
        TaggerI
A processing interface for assigning a tag to each token in a list. Tags are case-sensitive strings that identify some property of each token, such as its part of speech or its sense.
Some taggers require specific types for their tokens. This is
generally indicated by the use of a sub-interface of
TaggerI. For example, featureset taggers, which are
subclassed from FeaturesetTaggerI, require that each token be a
featureset.
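
To illustrate the interface described above, here is a minimal, self-contained sketch that does not import NLTK. The class `ConstantTagger` is hypothetical, and the batch method name `batch_tag` is an assumption: this page only shows that such a method applies self.tag() to each sentence. Tokens are plain strings here; a featureset tagger would expect featuresets instead.

```python
class ConstantTagger:
    """Hypothetical tagger that assigns the same tag to every token."""

    def __init__(self, tag):
        self._tag = tag

    def tag(self, tokens):
        # Return one (token, tag) tuple per input token, in order.
        return [(token, self._tag) for token in tokens]

    def batch_tag(self, sentences):
        # Method name assumed for illustration; applies tag() to each sentence.
        return [self.tag(sent) for sent in sentences]


tagger = ConstantTagger("NN")
tagged = tagger.tag(["the", "cat", "sat"])
batch = tagger.batch_tag([["a", "dog"], ["ran"]])
```

Because tags are case-sensitive strings, "NN" and "nn" would count as two different tags in any comparison.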
Subclasses must define:

Determine the most appropriate tag sequence for the given token
sequence, and return a corresponding list of tagged tokens. A tagged
token is encoded as a tuple (token, tag).
Returns: list of (token, tag)

Apply self.tag() to each element of sentences. That is:
    >>> return [self.tag(sent) for sent in sentences]

Score the accuracy of the tagger against the gold standard. Strip the tags from the gold standard text, retag it using the tagger, then compute the accuracy score.
Returns: float
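
The scoring procedure described above (strip the gold tags, retag, compare) can be sketched as follows. This is an illustration under stated assumptions, not the library's implementation: the helper names `accuracy` and `tag_all_nn` are hypothetical, accuracy is taken to be the fraction of tokens whose predicted tag exactly matches the gold tag, and returning 0.0 for an empty gold standard is a choice made for this sketch.

```python
def accuracy(tag_fn, gold):
    """Fraction of tokens whose predicted tag equals the gold tag.

    tag_fn: callable mapping a list of tokens to a list of (token, tag).
    gold:   list of sentences, each a list of (token, tag) tuples.
    """
    correct = 0
    total = 0
    for sent in gold:
        tokens = [token for token, _ in sent]   # strip the gold tags
        predicted = tag_fn(tokens)              # retag with the tagger
        for (_, gold_tag), (_, pred_tag) in zip(sent, predicted):
            correct += gold_tag == pred_tag     # case-sensitive comparison
            total += 1
    # Assumption of this sketch: an empty gold standard scores 0.0.
    return correct / total if total else 0.0


def tag_all_nn(tokens):
    # Hypothetical stand-in for a tagger's tag() method.
    return [(token, "NN") for token in tokens]


gold = [
    [("the", "DT"), ("cat", "NN")],
    [("dogs", "NNS"), ("bark", "VBP"), ("food", "NN")],
]
score = accuracy(tag_all_nn, gold)
```

Two of the five gold tokens carry the tag "NN", so the stand-in tagger scores 0.4 on this toy data.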
Generated by Epydoc 3.0.1 on Sun Sep 20 21:27:47 2009 (http://epydoc.sourceforge.net)