SOFTWARE/STATA

 

Cluster analysis

Hierarchical clustering

  • single linkage
  • complete linkage
  • average linkage
  • Ward's linkage (including Ward's method)
  • weighted average linkage
  • centroid linkage
  • median linkage

Nonhierarchical

  • kmeans
  • kmedians

Cluster on observations*

Cluster using any proximity matrix*

Dendograms

  • full trees
  • sub trees
  • upper-portion of trees
  • vertical or horizontal orientation*
  • branch counts*

Stopping rules

  • Calinksi & Harabasz pueudo-F index
  • Dua & Hart Je(2)/Je(1) index

Support tools

  • generate summary and grouping varaibles
  • attach notes to analyses

Similarity/dissimilarity measures for continuous data

  • L2/Euclidean
  • L1/absolute/cityblock/manhattan
  • L(#)
  • Canberra
  • correlation
  • angular
 

Similarity/dissimilarity measures for binary data

  • matching
  • Jaccard
  • Russell
  • Hamman
  • Dice
  • antidice
  • Sneath
  • Rogers
  • Ochiai
  • Yule
  • Anderberg
  • Kulczynski
  • Gower2
  • Pearson

Support tools

  • generate summary and grouping variables
  • attach notes to analyses

Result management utilities

  • dir
  • list
  • drop
  • use
  • rename

User extensible

  • ability to add new clustering methods and utilities
  • full set of tools to ease making additions

* New in Stata 9

© Copyright 2005 StataCorp LP 2005.


 
Copyright © 2005 TStat All rights reserved via Baden Powell, 8/I - 67039 - Sulmona (AQ) - Italia