Class hierarchy: object --> api.ClusterI --> util.VectorSpaceClusterer --> GAAClusterer
The Group Average Agglomerative (GAA) clusterer starts with each of the N vectors as a singleton cluster. It then iteratively merges the pair of clusters with the closest centroids, continuing until only one cluster remains. The order of merges gives rise to a dendrogram: a tree in which earlier merges sit lower than later merges. The membership for a given number of clusters c, 1 <= c <= N, can be found by cutting the dendrogram at depth c.
This clusterer uses only the cosine similarity metric, which allows the clustering process to be sped up.
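The merge loop described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the clusterer's actual implementation: the function names (`normalise`, `centroid`, `cosine`, `agglomerate`) are hypothetical, the pairwise search is brute force rather than the sped-up version the class description alludes to, and it assumes non-zero input vectors whose cluster centroids never collapse to the zero vector.

```python
import math
from itertools import combinations


def normalise(v):
    """Scale v to unit length (assumes v is not the zero vector)."""
    length = math.sqrt(sum(x * x for x in v))
    return [x / length for x in v]


def centroid(cluster):
    """Component-wise mean of the vectors in a cluster."""
    n = len(cluster)
    return [sum(col) / n for col in zip(*cluster)]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def agglomerate(vectors, num_clusters=1):
    """Merge the closest-centroid pair until num_clusters clusters remain.

    Returns (clusters, merges). Each cluster is a list of indices into
    `vectors`; `merges` lists the merged pairs, earliest merge first.
    """
    unit = [normalise(v) for v in vectors]
    clusters = [[i] for i in range(len(unit))]  # start with N singletons
    merges = []
    while len(clusters) > max(num_clusters, 1):
        # Brute-force search for the pair with the most similar centroids.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: cosine(
                centroid([unit[k] for k in clusters[p[0]]]),
                centroid([unit[k] for k in clusters[p[1]]]),
            ),
        )
        merges.append((clusters[i], clusters[j]))
        merged = clusters[i] + clusters[j]
        # combinations() guarantees i < j, so delete j first.
        del clusters[j]
        del clusters[i]
        clusters.append(merged)
    return clusters, merges
```

Stopping the loop when c clusters remain plays the role of cutting the dendrogram at depth c, and the `merges` list records the order in which the tree is built from the bottom up.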
Assigns the vectors to clusters, learning the clustering parameters from the data. Returns a cluster identifier for each vector.

Finds the clusters using the given set of vectors.

Returns the index of the appropriate cluster for the vector.

Returns the number of clusters.
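As an illustration of the classification step described above (returning the index of the appropriate cluster for a vector), the following sketch picks the cluster whose centroid has the highest cosine similarity to the input. The helper names `cosine` and `classify` and the explicit `centroids` argument are assumptions made for this example; they are not taken from the documented class.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify(vector, centroids):
    """Return the index of the centroid most similar to `vector`."""
    return max(range(len(centroids)),
               key=lambda i: cosine(vector, centroids[i]))


# Two hypothetical cluster centroids, one along each axis.
centroids = [[1.0, 0.0], [0.0, 1.0]]
nearest_to_x_axis = classify([3.0, 1.0], centroids)
nearest_to_y_axis = classify([1.0, 5.0], centroids)
```

Under this sketch, the number of clusters is simply `len(centroids)`.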
Generated by Epydoc 3.0.1 on Mon Apr 11 14:39:44 2011 (http://epydoc.sourceforge.net)