support & services / Education

Applied Clustering Techniques - CLUS92

The course looks at the theoretical and practical implications of a wide array of clustering techniques currently available in SAS. The techniques considered include cluster preprocessing, variable clustering, k-nearest-neighbor clustering, k-means clustering, hierarchical clustering, and fuzzy clustering.

2 days - Classroom

Learn how to

  • prepare and explore data for a cluster analysis
  • distinguish among many different clustering techniques, making informed choices about which to use
  • evaluate the results of a cluster analysis
  • determine the appropriate number of clusters to retain
  • profile and describe clustered observations
  • score observations into clusters.

Who should attend?

Intermediate or senior level statisticians, data analysts, and data miners

Prerequisites

Course Contents

Introduction to Clustering

  • identifying types of clustering
  • measuring similarity
  • assessing multivariate normality
  • using classification matrices

Preparation for Clustering

  • using variable clustering for variable selection
  • using graphical clustering aids
  • making elongated clusters more spherical
  • viewing the impact of input standardisation

Partitive Clustering

  • performing k-means clustering
  • outlining the advantages of nonparametric clustering

Hierarchical Clustering

  • using hierarchical clustering methods

Assessing Clustering Results

  • determining the number of clusters
  • profiling a cluster solution
  • scoring new observations

Canonical Discriminant Analysis (CDA) Plots

  • introducing canonical discriminant analysis

Fuzzy Clustering

  • performing fuzzy clustering using the FACTOR procedure
  • interpreting the PROC FACTOR output in terms of fuzzy clustering membership

Software Addressed

This course addresses the following software product(s): SAS/STAT.

Book Your Place Today