Package nltk :: Package stem :: Module snowball2 :: Class ItalianStemmer
[hide private]
[frames] | no frames]

type ItalianStemmer

source code

          object --+            
                   |            
        api.StemmerI --+        
                       |        
_LanguageSpecificStemmer --+    
                           |    
            _StandardStemmer --+
                               |
                              ItalianStemmer

The Italian Snowball stemmer.


Note: A detailed description of the Italian stemming algorithm can be found under http://snowball.tartarus.org/algorithms /italian/stemmer.html.

Instance Methods [hide private]
unicode
stem(self, word)
Stem an Italian word and return the stemmed form.
source code

Inherited from _StandardStemmer (private): _r1r2_standard, _rv_standard

Inherited from _LanguageSpecificStemmer: __init__, __repr__

Class Variables [hide private]
unicode __vowels = u'aeiouàèìòù'
The Italian vowels.
tuple __step0_suffixes = (u'gliela', u'gliele', u'glieli', u'glielo'...
Suffixes to be deleted in step 0 of the algorithm.
tuple __step1_suffixes = (u'atrice', u'atrici', u'azione', u'azioni'...
Suffixes to be deleted in step 1 of the algorithm.
tuple __step2_suffixes = (u'erebbero', u'irebbero', u'assero', u'ass...
Suffixes to be deleted in step 2 of the algorithm.
Method Details [hide private]

stem(self, word)

source code 

Stem an Italian word and return the stemmed form.

Parameters:
  • word (str, unicode) - The word that is stemmed.
Returns: unicode
The stemmed form.
Overrides: api.StemmerI.stem

Class Variable Details [hide private]

__step0_suffixes

Suffixes to be deleted in step 0 of the algorithm.
Type:
tuple
Value:
(u'gliela',
 u'gliele',
 u'glieli',
 u'glielo',
 u'gliene',
 u'sene',
 u'mela',
 u'mele',
...

__step1_suffixes

Suffixes to be deleted in step 1 of the algorithm.
Type:
tuple
Value:
(u'atrice',
 u'atrici',
 u'azione',
 u'azioni',
 u'uzione',
 u'uzioni',
 u'usione',
 u'usioni',
...

__step2_suffixes

Suffixes to be deleted in step 2 of the algorithm.
Type:
tuple
Value:
(u'erebbero',
 u'irebbero',
 u'assero',
 u'assimo',
 u'eranno',
 u'erebbe',
 u'eremmo',
 u'ereste',
...