Loading…
Hierarchical clustering with spatial adjacency constraints in heavy-tailed areal data
Some natural phenomena with areal/lattice data structures include extreme values or outliers. In such situations, the assumption of Gaussianity for the random field may not be reasonable, and no Gaussian transformation can be found because they exhibit heavy tails. A non-Gaussian stable random field...
Saved in:
Published in: | Communications in statistics. Simulation and computation 2024-11, Vol.53 (11), p.5385-5405 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Some natural phenomena with areal/lattice data structures include extreme values or outliers. In such situations, the assumption of Gaussianity for the random field may not be reasonable, and no Gaussian transformation can be found because they exhibit heavy tails. A non-Gaussian stable random field, which is heavy-tailed, could be a more appropriate choice in these cases. This article introduces a sub-Gaussian α-stable (SGαS) random field for spatial analysis of multivariate areal data using a multivariate conditional autoregressive model. We, specifically, focus on the spatial clustering problem of such areal data. To address it, we develop methods that work based on adjacency constraints. To group the data, we offer an adjacency-constrained hierarchical agglomerative clustering (HAC) technique that considers both spatial and non-spatial attributes existing in the multivariate areal data. The proposed clustering algorithm is developed based on spatial adjacency constraint criteria incorporated into the HAC technique. We employed the developed algorithm to group Luxembourg communes based on simulated areal data from the SGαS distributions and French cities along the Gironde estuary based on the estuary areal dataset. We compare the results of the proposed method with the mentioned adjacency criteria, various dissimilarity, and linkages measures in clustering these two datasets. |
---|---|
ISSN: | 0361-0918 1532-4141 |
DOI: | 10.1080/03610918.2023.2184761 |