|
|
__init__(self,
*args,
**kwargs)
Initialize this mapping based on keyword arguments, as follows: |
source code
|
|
|
|
|
|
|
|
|
|
| _resolve(self,
fileids,
categories,
textids=None) |
source code
|
|
|
|
|
|
|
textids(self,
fileids=None,
categories=None)
In the pl196x corpus each category is stored in single file and thus
both methods provide identical functionality. |
source code
|
|
list of str
|
words(self,
fileids=None,
categories=None,
textids=None)
Returns all of the words and punctuation symbols in the specified
file that were in text nodes -- ie, tags are ignored. |
source code
|
|
|
|
| sents(self,
fileids=None,
categories=None,
textids=None) |
source code
|
|
|
|
| paras(self,
fileids=None,
categories=None,
textids=None) |
source code
|
|
|
|
| tagged_words(self,
fileids=None,
categories=None,
textids=None) |
source code
|
|
|
|
| tagged_sents(self,
fileids=None,
categories=None,
textids=None) |
source code
|
|
|
|
| tagged_paras(self,
fileids=None,
categories=None,
textids=None) |
source code
|
|
|
|
|
|
|
|
|
Inherited from api.CategorizedCorpusReader:
categories,
fileids
Inherited from api.CorpusReader:
__repr__,
abspath,
abspaths,
encoding,
open,
readme
|
|
Inherited from xmldocs.XMLCorpusReader:
read
|
|
Inherited from api.CorpusReader:
files
|
|
Inherited from api.CorpusReader:
items
|