K-Means-X-MapReduce

A project implementing K-Means clustering algorithm in Map-Reduce using sythetic data as a sample.

PointsGenerator.py

A python file that generated random points in the form of x,y following skewed distribution towards 0.

exec:

$python3   PointsGenerator.py    [-n NUM]     out

positional arguments: 
  out                   output file
  
optional arguments:
  -n NUM, --number      number of points to generate  (default: 1.000.000)

*this program must run with python3

KMeans.jar

$bin/hadoop    jar     KMeans.jar     KMeans    input_dir     output_dir

output_dir must not exist, it will be generated by the program.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
KMeans.jar		KMeans.jar
KMeans.java		KMeans.java
PointsGenerator.py		PointsGenerator.py
Project #1 - Hadoop.pdf		Project #1 - Hadoop.pdf
README.md		README.md
points.zip		points.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

K-Means-X-MapReduce

PointsGenerator.py

KMeans.jar

About

Uh oh!

Releases

Packages

Languages

chiotisn/K-Means-X-MapReduce

Folders and files

Latest commit

History

Repository files navigation

K-Means-X-MapReduce

PointsGenerator.py

KMeans.jar

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages