object --+
         |
        StringSource --+
                       |
                      ContiguousCharStringSource
A StringSource that specifies the source of strings whose characters are contiguous but do not necessarily have consecutive offsets. In particular, each character's end offset must be equal to the next character's start offset.
This property allows the source to be stored using a list of
len(source)+1 offsets (along with a docid).
This StringSource can be used to describe Unicode strings
that are indexed using byte offsets.
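To make the len(source)+1 offsets idea concrete, here is a minimal sketch that computes byte offsets for the characters of a Unicode string. It does not use this class: `byte_offsets` is a hypothetical helper written for illustration, and UTF-8 is an assumed encoding.

```python
def byte_offsets(text, start=0, encoding="utf-8"):
    """Return len(text)+1 byte offsets for the characters of text.

    Hypothetical helper for illustration only; it is not part of
    the documented API.
    """
    offsets = [start]
    for ch in text:
        # Contiguity: each character ends where the next one begins,
        # so one running list of boundaries is enough.
        offsets.append(offsets[-1] + len(ch.encode(encoding)))
    return offsets


text = "na\u00efve"  # the third character takes two bytes in UTF-8
offsets = byte_offsets(text)
print(offsets)  # [0, 1, 2, 4, 5, 6]
```

The offsets are contiguous (no gaps between characters) but not consecutive: the jump from 2 to 4 reflects the two-byte character.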
CONSTRUCTOR_CHECKS_OFFSETS = False
begin: The document offset where the string begins.
end: The document offset where the string ends.
Create a new
In both cases, the arguments must be specified as keyword arguments (not positional arguments).

Return the length of the string described by this

Return a StringSource describing the location where the specified
substring was found. In particular, if
begin
The document offset where the string begins (i.e., the offset of the first character in the string). source.begin is always equal to source.offsets[0].

end
The document offset where the string ends (for character offsets, one plus the offset of the last character; for byte offsets, one plus the offset of the last byte that encodes the last character). source.end is always equal to source.offsets[-1].
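The relationship between the offsets list and the begin and end properties can be sketched with a small stand-in class. `OffsetSource` below is hypothetical and is not the documented class; it only mirrors the stated invariants, assuming the offsets are kept in a sequence attribute named `offsets`.

```python
class OffsetSource:
    """Hypothetical stand-in that mirrors the documented invariants."""

    def __init__(self, docid, offsets):
        self.docid = docid
        self.offsets = tuple(offsets)

    @property
    def begin(self):
        # Offset of the first character in the string.
        return self.offsets[0]

    @property
    def end(self):
        # Boundary just past the last character (or its last byte).
        return self.offsets[-1]

    def __len__(self):
        # len(source)+1 offsets describe len(source) characters.
        return len(self.offsets) - 1


source = OffsetSource(docid="doc1", offsets=[0, 1, 2, 4, 5, 6])
print(source.begin, source.end, len(source))  # 0 6 5
```

With byte offsets, `source.end - source.begin` (6 here) can exceed `len(source)` (5 here), because some characters occupy more than one byte.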
| Generated by Epydoc 3.0.1 on Mon Apr 11 14:39:51 2011 | http://epydoc.sourceforge.net |