A trainer for Brill taggers.
train(self, train_sents, max_rules=200, min_score=2)
Trains the Brill tagger on the corpus train_sents, producing at most
max_rules transformations, each of which reduces the net number of
errors in the corpus by at least min_score.
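The stopping behaviour described for train (at most max_rules transformations, each with a net error reduction of at least min_score) can be illustrated with a minimal sketch of a greedy loop. This is not the library's implementation: the function names, the single rule shape (from_tag, to_tag, prev_tag), and the toy data are all assumptions made for illustration.

```python
def apply_rule(rule, tags):
    """Retag from_tag as to_tag wherever the preceding (original) tag is prev_tag."""
    from_tag, to_tag, prev_tag = rule
    return [
        to_tag if i > 0 and t == from_tag and tags[i - 1] == prev_tag else t
        for i, t in enumerate(tags)
    ]


def net_score(rule, current, gold):
    """Number of errors the rule fixes minus the number it introduces."""
    score = 0
    for cur, ref in zip(current, gold):
        for old, new, right in zip(cur, apply_rule(rule, cur), ref):
            if old != new:
                if new == right:
                    score += 1
                elif old == right:
                    score -= 1
    return score


def candidate_rules(current, gold):
    """Propose one hypothetical rule for every currently mistagged token."""
    rules = set()
    for cur, ref in zip(current, gold):
        for i in range(1, len(cur)):
            if cur[i] != ref[i]:
                rules.add((cur[i], ref[i], cur[i - 1]))
    return rules


def train_sketch(initial, gold, max_rules=200, min_score=2):
    """Greedily pick the best-scoring rule until a limit is reached."""
    current = [list(sent) for sent in initial]
    learned = []
    while len(learned) < max_rules:
        rules = candidate_rules(current, gold)
        if not rules:
            break
        # sorted() makes tie-breaking deterministic
        best = max(sorted(rules), key=lambda r: net_score(r, current, gold))
        if net_score(best, current, gold) < min_score:
            break
        learned.append(best)
        current = [apply_rule(best, sent) for sent in current]
    return learned, current


# Toy data (assumed): gold tags and the output of some initial tagger.
gold = [["DT", "NN", "VB"], ["DT", "NN", "NN"], ["DT", "NN"]]
initial = [["DT", "VB", "VB"], ["DT", "VB", "NN"], ["DT", "VB"]]
learned, tagged = train_sketch(initial, gold)
print(learned)  # [('VB', 'NN', 'DT')]
```

In this toy corpus the single rule fixes three errors and introduces none, so it clears the default min_score of 2; raising min_score to 4 would leave the learned list empty.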
_find_rules(self, test_sents, train_sents)
Find all rules that correct at least one token's tag in test_sents.
_find_rules_at(self, test_sent, train_sent, i)
Returns (Set): the set of all rules (based on the templates) that
correct token i's tag in test_sent.
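As a rough illustration of what "all rules (based on the templates) that correct token i's tag" could mean, the sketch below instantiates each template at position i when the tag in test_sent differs from the tag in train_sent. The template representation (a name plus a relative offset), the rule tuples, and the (word, tag) sentence format are assumptions for this example, not the library's data structures.

```python
def find_rules_at_sketch(test_sent, train_sent, i, templates):
    """Return the set of hypothetical rules that would correct token i.

    test_sent and train_sent are lists of (word, tag) pairs; train_sent
    holds the reference tags. Each template is (name, offset), meaning
    "condition on the tag at position i + offset".
    """
    wrong, right = test_sent[i][1], train_sent[i][1]
    if wrong == right:
        return set()  # nothing to correct at this position
    tags = [tag for _, tag in test_sent]
    rules = set()
    for name, offset in templates:
        j = i + offset
        if 0 <= j < len(tags):
            # Rule: change `wrong` to `right` when the tag at `offset` is tags[j]
            rules.add((wrong, right, name, tags[j]))
    return rules


# Assumed templates and toy sentences.
templates = [("prev_tag", -1), ("next_tag", 1)]
test_sent = [("the", "DT"), ("race", "VB"), ("ended", "VBD")]
train_sent = [("the", "DT"), ("race", "NN"), ("ended", "VBD")]

rules = find_rules_at_sketch(test_sent, train_sent, 1, templates)
```

Here position 1 is mistagged, so each template yields one candidate rule; at a correctly tagged position the result is the empty set.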
_trace_rule(self, rule, score, fixscore, numchanges)