Skip to content

Plotting Results

Young edited this page Apr 29, 2014 · 12 revisions

Pre-requisites

The parser requires numpy, while the plot scripts require numpy and matplotlib. Instructions for installing these can be found online, but here's one way to install it for Ubuntu 12.04:

  1. Ensure Python is at least 2.7 and pip is installed.

  2. Perform:

    sudo aptitude install libfreetype6-dev libpng12-dev gfortran
    sudo pip install virtualenv
    sudo pip install ipython
    sudo pip install numpy matplotlib
    
  3. You may also need to perform sudo easy_install -U distribute.

Parsing the logs

Here we describe the basics to parsing and plotting data from the collected log files.

Ensure the logs are structured on your local machine as: */results/<system>/<# of machines>. For example, */results/giraph/32 or */results/graphlab/128.

To parse all available data, perform:

cd ~/results/plots/
./gen-all.sh

This will generate the data for the worker machines (data_time.py, data_mem.py, data_net.py) and the data for the master (data_mem_master.py, data_net_master.py). These data files are used by the plot script.

Plotting the data

Simply use */results/plots/plot.py for regular plotting or */results/plots/plot-with-cuts.py for plots with a cut y-axis. Use --help to get the available options.

Clone this wiki locally