Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Data clustering
13.050
Zitationen
3
Autoren
1999
Jahr
Abstract
Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). The clustering problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult problem combinatorially, and differences in assumptions and contexts in different communities has made the transfer of useful generic concepts and methodologies slow to occur. This paper presents an overview of pattern clustering methods from a statistical pattern recognition perspective, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners. We present a taxonomy of clustering techniques, and identify cross-cutting themes and recent advances. We also describe some important applications of clustering algorithms such as image segmentation, object recognition, and information retrieval.
Ähnliche Arbeiten
Visualizing Data using t-SNE
2008 · 35.663 Zit.
Silhouettes: A graphical aid to the interpretation and validation of cluster analysis
1987 · 19.967 Zit.
A density-based algorithm for discovering clusters in large spatial Databases with Noise
1996 · 19.115 Zit.
Algorithm AS 136: A K-Means Clustering Algorithm
1979 · 14.202 Zit.
Data clustering: 50 years beyond K-means
2009 · 8.940 Zit.