Package nltk :: Package tokenize :: Module punkt :: Class PunktWordTokenizer
[hide private]
[frames] | no frames]

type PunktWordTokenizer

source code

    object --+    
             |    
api.TokenizerI --+
                 |
                PunktWordTokenizer

Instance Methods [hide private]
 
__init__(self, lang_vars=PunktLanguageVars()) source code
 
tokenize(self, text)
Divide the given string into a list of substrings.
source code

Inherited from api.TokenizerI: batch_span_tokenize, batch_tokenize, span_tokenize

Method Details [hide private]

__init__(self, lang_vars=PunktLanguageVars())
(Constructor)

source code 
Overrides: object.__init__
(inherited documentation)

tokenize(self, text)

source code 

Divide the given string into a list of substrings.

Returns:
list of str
Overrides: api.TokenizerI.tokenize
(inherited documentation)