SOFTWARE/STATA

 

Cluster analysis

Hierarchical clustering

  • single linkage
  • complete linkage
  • average linkage
  • Ward's linkage (including Ward's method)
  • weighted average linkage
  • centroid linkage
  • median linkage

Nonhierarchical

  • kmeans
  • kmedians

Cluster on observations

Cluster using any proximity matrix

Dendograms

  • full trees
  • sub trees
  • upper-portion of trees
  • vertical or horizontal orientation
  • branch counts

Stopping rules

  • Calinksi & Harabasz pueudo-F index
  • Dua & Hart Je(2)/Je(1) index

Support tools

  • generate summary and grouping varaibles
  • attach notes to analyses

Similarity/dissimilarity measures for continuous data

  • L2/Euclidean
  • L1/absolute/cityblock/manhattan
  • L(#)
  • Canberra
  • correlation
  • angular
 

Similarity/dissimilarity measures for binary data

  • matching
  • Jaccard
  • Russell
  • Hamman
  • Dice
  • antidice
  • Sneath
  • Rogers
  • Ochiai
  • Yule
  • Anderberg
  • Kulczynski
  • Gower2
  • Pearson
Gower measure for mixed binary and continuous data

Result management utilities

  • dir
  • list
  • drop
  • use
  • rename

User extensible

  • ability to add new clustering methods and utilities
  • full set of tools to ease making additions

* New in Stata 12

© Copyright 2005 StataCorp LP 1996-2011.


 
Copyright © 2011 TStat All rights reserved via Rettangolo, 12/14 - 67039 - Sulmona (AQ) - Italia