| Home | Trees | Indices | Help |
|
|---|
|
|
object --+
|
PunktParameters
Stores data used to perform sentence boundary detection with punkt.
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
abbrev_types A set of word types for known abbreviations. |
|||
|
collocations A set of word type tuples for known common collocations where the first word ends in a period. |
|||
|
sent_starters A set of word types for words that often appear at the beginning of sentences. |
|||
|
ortho_context A dictionary mapping word types to the set of orthographic contexts that word type appears in. |
|||
|
|||
|
|
|||
collocationsA set of word type tuples for known common collocations where the first word ends in a period. E.g., ('S.', 'Bach') is a common collocation in a text that discusses 'Johann S. Bach'. These count as negative evidence for sentence boundaries. |
ortho_contextA dictionary mapping word types to the set of orthographic contexts that word type appears in. Contexts are represented by adding orthographic context flags: ... |
| Home | Trees | Indices | Help |
|
|---|
| Generated by Epydoc 3.0.1 on Mon Apr 11 14:39:53 2011 | http://epydoc.sourceforge.net |