Package nltk :: Package corpus :: Package reader :: Module bracket_parse :: Class AlpinoCorpusReader
[hide private]
[frames] | no frames]

type AlpinoCorpusReader

source code

        object --+            
                 |            
  api.CorpusReader --+        
                     |        
api.SyntaxCorpusReader --+    
                         |    
  BracketParseCorpusReader --+
                             |
                            AlpinoCorpusReader

Reader for the Alpino Dutch Treebank.

Instance Methods [hide private]
 
__init__(self, root, encoding=None, tag_mapping_function=None) source code
 
_normalize(self, t) source code

Inherited from BracketParseCorpusReader (private): _parse, _read_block, _tag, _word

Inherited from api.SyntaxCorpusReader: parsed_sents, raw, sents, tagged_sents, tagged_words, words

Inherited from api.CorpusReader: __repr__, abspath, abspaths, encoding, fileids, open, readme

Inherited from api.CorpusReader (private): _get_root

    Block Readers
    Deprecated since 0.8

Inherited from api.SyntaxCorpusReader: parsed, read, tagged, tokenized

    Deprecated since 0.9.7

Inherited from api.CorpusReader: files

    Deprecated since 0.9.1

Inherited from api.CorpusReader: items

Inherited from api.CorpusReader (private): _get_items

Instance Variables [hide private]

Inherited from api.CorpusReader (private): _encoding, _fileids, _root

Properties [hide private]

Inherited from api.CorpusReader: root

Method Details [hide private]

__init__(self, root, encoding=None, tag_mapping_function=None)
(Constructor)

source code 
Parameters:
  • root - The root directory for this corpus.
  • fileids - A list or regexp specifying the fileids in this corpus.
  • comment_char - The character which can appear at the start of a line to indicate that the rest of the line is a comment.
  • detect_blocks - The method that is used to find blocks in the corpus; can be 'unindented_paren' (every unindented parenthesis starts a new parse) or 'sexpr' (brackets are matched).
Overrides: api.CorpusReader.__init__
(inherited documentation)

_normalize(self, t)

source code 
Overrides: BracketParseCorpusReader._normalize