
Building Data Science Solutions with Anaconda
By :

In addition to anomaly detection, there is another class of problem that takes an unsupervised approach to trying to group entities together in order to understand more about the dataset. Clustering is the process of finding elements of a dataset that contain enough similar attributes that you can determine clear distinctions from among the individual points.
There are many applications of this technique, and we'll go over the following few examples now:
To achieve this, we can use a few different algorithms such as the following:
While there are many more, you can be sure that these have shown promising results across various datasets and are a great place to start.
Let's look at DBscan first.
Density-Based Spatial Clustering of Applications with Noise (or DBScan for...