# SKMeans

Learn features from data

SKMeans is an object that learns *features* from data. It is very similar to KMeans, with two distinctions.

Instead of taking the Euclidean distance from each point to the center of each cluster as KMeans does, SKMeans takes the angular distance between normalised vectors in the high-dimensional space. In other words, it compares points and centers by their cosine similarity.
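To make the distance measure concrete, here is a minimal sketch in plain Python of assigning a point to a cluster by cosine similarity. The helper names (`normalise`, `cosine_similarity`, `assign`) and the sample values are illustrative assumptions, not part of the SKMeans interface.

```python
import math

def normalise(v):
    """Scale v to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def cosine_similarity(a, b):
    """Dot product of the unit-length versions of a and b."""
    return sum(x * y for x, y in zip(normalise(a), normalise(b)))

def assign(point, centroids):
    """Index of the centroid with the highest cosine similarity to point."""
    sims = [cosine_similarity(point, c) for c in centroids]
    return max(range(len(sims)), key=sims.__getitem__)

# Hypothetical 2D example: two centroids pointing along the axes.
centroids = [[1.0, 0.0], [0.0, 1.0]]
point = [10.0, 1.0]
nearest = assign(point, centroids)  # picks the centroid along the first axis
```

Because every vector is normalised first, only the direction of a point matters: scaling `point` by any positive factor leaves the assignment unchanged.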

Once the clusters are found, the distances are encoded with an ‘alpha’ function, which promotes a sparser representation in which smaller similarities are penalised.
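The text does not spell out the exact form of the encoding. One common choice in feature learning with spherical k-means is a soft threshold, where `alpha` is subtracted from each similarity and negative results are clipped to zero. The sketch below assumes that form; the function name `encode` and the sample values are hypothetical.

```python
def encode(similarities, alpha):
    """Assumed soft-threshold encoding: max(0, similarity - alpha)."""
    return [max(0.0, s - alpha) for s in similarities]

# Hypothetical cosine similarities of one point to four cluster centers.
similarities = [0.9, 0.3, 0.1, -0.4]
features = encode(similarities, alpha=0.25)
# Similarities at or below alpha become exactly zero, so only the
# closest clusters contribute to the encoded feature vector.
```

Under this assumption, raising `alpha` zeroes out more entries and makes the representation sparser.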

In practice, SKMeans is mostly used to learn features in a higher-dimensional space than the original data, on the assumption that this helps untangle clusters that lie close together.

## Related Resources

Spherical KMeans @Machine Learning Catalogue

A terse definition of the algorithm.

https://machinelearningcatalogue.com/algorithm/alg_spherical-k-means.html