Logo image
Fed-SC: One-Shot Federated Subspace Clustering over High-Dimensional Data
Conference proceeding

Fed-SC: One-Shot Federated Subspace Clustering over High-Dimensional Data

Songjie Xie, Youlong Wu, Kewen Liao, Lu Chen, Chengfei Liu, Haifeng Shen, MingJian Tang and Lu Sun
2023 IEEE 39th International Conference on Data Engineering (ICDE)
IEEE International Conference on Data Engineering (ICDE) , 39th (Anaheim, CA, USA, 03/04/2023–07/04/2024)
2023

Metrics

31 Record Views

Abstract

subspace clustering federated clustering satistical heterogeneity high-dimensional data Database systems Computer systems
Recent work has explored federated clustering and developed an efficient k-means based method. However, it is well known that k-means clustering underperforms in high-dimensional space due to the so-called "curse of dimensionality". In addition, high-dimensional data (e.g., generated from healthcare, medical, and biological sectors) are pervasive in the big data era, which poses critical challenges to federated clustering in terms of, but not limited to, clustering effectiveness and communication efficiency. To fill this significant gap in federated clustering, we propose a one-shot federated subspace clustering scheme Fed-SC that can achieve remarkable clustering effectiveness on high-dimensional data while keeping communication cost low using only one round of communication for each local device. We further establish theoretical guarantees on the clustering effectiveness of one-shot Fed-SC and exploit the benefits of statistical heterogeneity across distributed data. Extensive experiments on synthetic and real-world datasets demonstrate significant effectiveness gains of Fed-SC compared with both subspace clustering and one-shot federated clustering methods.

Details

Logo image