Class 4 (Module III)

23 Maio 2022, 09:30 Pedro Cristiano Santos Martins da Silva

This class was lectured on the  30 may, 2022

Clustering analysis: introduction, motivation and general definitions and concepts. 

Dissimilarity and distances for quantitative data (Minkowski distance, Canberra distance, generalized euclidean distances including Mahalanobis), binary data (simple matching and Jacard and  Legendre and Gower distance) and categorical data (chi-square distances). Generic hierarchical clustering algorithm and dendrogram. Single-linkage, complete-linkage, average and centroid methods: properties and examples. Hierarchical methods via Lance-Williams updating formula table.