Plotting Results

Young edited this page Apr 29, 2014 · 12 revisions

Pre-requisites

The parser requires numpy, while the plot script requires both numpy and matplotlib. Installation instructions for these can be found online, but here is one way to install them on Ubuntu 12.04:

  1. Ensure Python is at least 2.7
  2. Perform:

sudo aptitude install libfreetype6-dev libpng12-dev gfortran
sudo pip install virtualenv
sudo pip install ipython
sudo pip install numpy matplotlib

  3. You may also need to perform "sudo easy_install -U distribute"
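
To confirm the pre-requisites are in place before parsing or plotting, a small check along the following lines may help. This is only a sketch and not part of the project: the helper names `python_ok` and `missing_modules` are made up for illustration.

```python
import importlib
import sys


def python_ok(minimum=(2, 7)):
    """Return True if the running interpreter is at least `minimum`."""
    return tuple(sys.version_info[:2]) >= tuple(minimum)


def missing_modules(names):
    """Return the module names from `names` that fail to import."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # The parser needs numpy; the plot script needs numpy and matplotlib.
    print("Python >= 2.7: %s" % python_ok())
    print("Missing modules: %s" % missing_modules(["numpy", "matplotlib"]))
```

An empty list of missing modules suggests both the parser and the plot script should be able to import what they need.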

Parsing the logs

Here we describe the basics of parsing and plotting data from the collected log files.

Let "~/" denote the root directory containing the project/code.

First, ensure the logs are structured in the following way: ~/results/[system]/[# machines]

For example, ~/results/giraph/32 or ~/results/graphlab/128.
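
The expected layout can be checked before parsing with a short script such as the one below. It is a sketch under assumptions, not a project tool: `list_runs` is a hypothetical helper, and the demo builds a throwaway directory tree rather than touching real logs.

```python
import os
import tempfile


def list_runs(results_root):
    """Return sorted (system, machines) pairs found under results_root.

    Only directories of the form <system>/<number> are reported, so other
    folders (for example the plots directory) are ignored.
    """
    runs = []
    for system in os.listdir(results_root):
        system_dir = os.path.join(results_root, system)
        if not os.path.isdir(system_dir):
            continue
        for entry in os.listdir(system_dir):
            if entry.isdigit() and os.path.isdir(os.path.join(system_dir, entry)):
                runs.append((system, int(entry)))
    return sorted(runs)


# Demo on a temporary tree that mimics ~/results/[system]/[# machines].
demo_root = tempfile.mkdtemp()
for sub in ("giraph/32", "graphlab/128", "plots"):
    os.makedirs(os.path.join(demo_root, *sub.split("/")))

demo_runs = list_runs(demo_root)
print(demo_runs)
```

Pointing `list_runs` at the real results directory should list every system and machine count that the parser is expected to pick up.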

Next, to parse all of the data, perform:

cd ~/results/plots/
./gen-all.sh

This will generate the data for the worker machines (data_time.py, data_mem.py, data_net.py) and the data for the master (data_mem_master.py, data_net_master.py). These data files are used by the plot script.
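
One way to verify that the parse step produced everything the plot script needs is sketched below. It assumes the five data files are written into a single directory (which directory is not stated here), and `missing_data_files` is a hypothetical helper. The demo uses a temporary directory with only the worker files present.

```python
import os
import tempfile

# File names as listed above: worker data first, then master data.
EXPECTED_FILES = [
    "data_time.py",
    "data_mem.py",
    "data_net.py",
    "data_mem_master.py",
    "data_net_master.py",
]


def missing_data_files(directory, expected=EXPECTED_FILES):
    """Return the expected data files that are not present in `directory`."""
    return [name for name in expected
            if not os.path.isfile(os.path.join(directory, name))]


# Demo: create only the three worker files, then report what is missing.
demo_dir = tempfile.mkdtemp()
for name in EXPECTED_FILES[:3]:
    open(os.path.join(demo_dir, name), "w").close()

demo_missing = missing_data_files(demo_dir)
print(demo_missing)
```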

Plotting the data

Use plot.py for regular plotting or plot-with-cuts.py for plots with a cut y-axis. Run either script with --help to see the available options.
