Robust clustering and visualization of heterogeneous multivariate data
F. Scielzo Ortiz, A. Grané Chávez
In this work we propose two new robust metrics for multivariate heterogenous data and study their performance as auxiliary tools in clustering through k-medoids algorithm. Additionally, Multidimensional Scaling is used for clustering visualization. The new proposals performance is evaluated through a collection of synthetic and real datasets, with outlying contamination, as well as compared to classical metrics by means of adjusted accuracy and adjusted Rand index. A Python library with the new proposals has been developed.
Keywords: Clustering, k-medoids, mixed-type data, robust metrics
Scheduled
Posters II
November 9, 2023 11:40 AM
CC: coffee break Hall
Other papers in the same session
K. R. Bondugula, S. Mazuelas Franco, A. Pérez
V. López López, Y. Jimenez Agudelo, P. Llamocca Portella, C. Guevara Maldonado, D. Urgelés Puértolas, M. Espinosa Ruíz
J. Ollero Hinojosa
I. Rodriguez Moreno, I. Irigoien Garbizu, J. M. Martinez Otzeta, B. Sierra Araujo, C. Arenas Sola
J. M. Sánchez Santos, M. J. Rivas López, A. Lorenzo Hernández, M. Suárez Barrios
J. Gutiérrez Botella, C. Armero, M. Pata, T. Kneib, F. Gude-Sampedro
L. Aixalà-Perelló, X. Barber, A. López-Quílez
B. Sawik, A. Agustin
A. V. Garcia Luengo, I. Oña Casado
E. Gómez Déniz, E. Calderín-Ojeda, F. J. Vázquez-Polo