New methodological contributions in time series clustering

  1. Lafuente Rego, Borja Raúl
Supervised by:
  1. José Vilar (Supervisor)

University of defense: Universidade da Coruña

Date of defense: 6 July 2017

Examination committee:
  1. Ana María Colubi Cervero (Chair)
  2. Pedro César Álvarez Esteban (Secretary)
  3. María Brígida Ferraro (Committee member)

Type: Doctoral thesis

Teseo: 490117

Abstract

This thesis presents new procedures for the cluster analysis of time series. First, a two-stage procedure is proposed that compares the frequencies and magnitudes of the absolute maxima of the spectral densities. Assuming that the purpose of clustering is to group series according to their underlying dependence structures, the clustering behavior of a dissimilarity that compares estimated quantile autocovariance functions (QAF) is then studied in detail. A prediction-based resampling algorithm proposed by Dudoit and Fridlyand is adapted to select the optimal number of clusters. The asymptotic behavior of the sample quantile autocovariances is studied, and an algorithm is introduced to determine optimal combinations of lags and pairs of quantile levels for clustering. The proposed metric is used to perform both hard and soft partitioning-based clustering. First, a broad simulation study examines the behavior of the proposed metric in crisp clustering using hierarchical and PAM procedures. Then, a novel fuzzy C-medoids algorithm based on the QAF dissimilarity is proposed. Three robust versions of this fuzzy algorithm are also presented to deal with data containing outlier time series. Finally, other approaches to soft clustering are explored, namely probabilistic D-clustering and clustering based on mixture models.
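The abstract does not spell out the QAF dissimilarity, so the following is only a minimal sketch under a stated assumption: the quantile autocovariance at lag l and quantile levels (τ, τ') is taken to be the covariance of the indicators I(X_t ≤ q_τ) and I(X_{t+l} ≤ q_τ'), estimated with empirical quantiles, and two series are compared through the squared Euclidean distance between their vectors of estimates. The function names `qaf_features` and `qaf_dissimilarity`, and the default lags and quantile levels, are hypothetical choices for illustration and are not taken from the thesis.

```python
import numpy as np


def qaf_features(x, lags=(1, 2), taus=(0.1, 0.5, 0.9)):
    """Estimated quantile autocovariances for every lag and pair of levels.

    Assumed definition (not stated in the abstract):
    gamma_l(tau, tau') = P(X_t <= q_tau, X_{t+l} <= q_tau') - tau * tau'.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    taus_arr = np.asarray(taus, dtype=float)
    q = np.quantile(x, taus_arr)  # empirical quantiles of the series
    # ind[i, t] is 1.0 when x[t] <= q[i], else 0.0
    ind = (x[None, :] <= q[:, None]).astype(float)
    feats = []
    for lag in lags:
        # joint[i, j] estimates P(X_t <= q_i, X_{t+lag} <= q_j)
        joint = ind[:, : n - lag] @ ind[:, lag:].T / (n - lag)
        feats.append((joint - np.outer(taus_arr, taus_arr)).ravel())
    return np.concatenate(feats)


def qaf_dissimilarity(x, y, **kwargs):
    """Squared Euclidean distance between the two QAF feature vectors."""
    d = qaf_features(x, **kwargs) - qaf_features(y, **kwargs)
    return float(d @ d)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    white_noise = rng.standard_normal(500)
    # AR(1) series with coefficient 0.8, built from its own noise sequence
    eps = rng.standard_normal(500)
    ar1 = np.zeros(500)
    for t in range(1, 500):
        ar1[t] = 0.8 * ar1[t - 1] + eps[t]
    print(qaf_dissimilarity(white_noise, ar1))
```

A pairwise matrix of such dissimilarities could then be passed to any distance-based procedure, for example hierarchical clustering or PAM as mentioned in the abstract. The lags and quantile levels used here are arbitrary; the thesis describes an algorithm for choosing them, which this sketch does not attempt to reproduce.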