An example Power Iteration Clustering http://www.icml2010.org/papers/387.pdf app.
Takes an input of K concentric circles and the number of points in the innermost circle.
The output should be K clusters - each cluster containing precisely the points associated
with each of the input circles.
Run with
./bin/run-example mllib.PowerIterationClusteringExample [options]
Where options include:
k: Number of circles/clusters
n: Number of sampled points on innermost circle.. There are proportionally more points
within the outer/larger circles
maxIterations: Number of Power Iterations
An example Power Iteration Clustering http://www.icml2010.org/papers/387.pdf app. Takes an input of K concentric circles and the number of points in the innermost circle. The output should be K clusters - each cluster containing precisely the points associated with each of the input circles.
Run with
Here is a sample run and output:
./bin/run-example mllib.PowerIterationClusteringExample -k 2 --n 10 --maxIterations 15
Cluster assignments: 1 -> [0,1,2,3,4,5,6,7,8,9], 0 -> [10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29]
If you use it as a template to create your own app, please use
spark-submit
to submit your app.