Class 4 (Module III)
23 Maio 2022, 09:30 • Pedro Cristiano Santos Martins da Silva
This class was lectured on the 30 may, 2022
Clustering analysis: introduction, motivation and general definitions and concepts.
Dissimilarity and distances for quantitative data (Minkowski distance, Canberra distance, generalized euclidean distances including Mahalanobis), binary data (simple matching and Jacard and Legendre and Gower distance) and categorical data (chi-square distances). Generic hierarchical clustering algorithm and dendrogram. Single-linkage, complete-linkage, average and centroid methods: properties and examples. Hierarchical methods via Lance-Williams updating formula table.