Cookies and Tracking help us to give you a better experience on our website
StochastikKolloquium
Adaptive Density-Based Clustering
8.11.2017, 11:15 - 12:15
Speaker:Prof. Dr. Ingo Steinwart, Universität Stuttgart
Location:Institut für Informatik, Goldschmidtstrasse 7SR 5.101Gras Geo Map
Organizer:Institut für Mathematische Stochastik
Details:
A central task in nonparametric statistics is cluster analysis, where the goal is to find clusters in unlabeled data. One widely accepted definition of clusters has its roots in a paper by Carmichael et al., where clusters are described to be densely populated areas in the input space that are separated by less populated areas.
The non-parametric mathematical translation of this idea usually assumes that the data is generated by some unknown probability measure that has a density with respect to the Lebesgue measure. Given a threshold level, the clusters are then defined to be the connected components of the density level set. Here, the choice of the threshold, which is left to the user, is a notoriously difficult problem, typically only addressed by heuristics.
In this talk, we show how a simple algorithm based on a density estimator can find the smallest level for which there are more than one connected component in the level set. For some classical density estimators we further establish rates of convergence and present a simple approach for selecting the width parameter. It turns out that in many cases this approach is adaptive, i.e. it achieves the previously established rates of convergence without knowing specifics about the distribution. Finally, we discuss some practical aspects of the algorithm.
Search for keywords:
Type:Colloquium
Language:English
Category:Research
Host:Dozenten des Instituts für Mathematische Stochastik
Contact:0551-39172100stochastik@uni-goettingen.de
Additional information:Download PDF attachment
Export to your calendar (e.g., Outlook or iCal):
Download
EN DE