Data Analytics

You can get whole master's degrees in Data Analytics, so we can't cover even a tiny fraction of it in one class. Instead, we're going to look at a couple of interesting tools and areas and try them out.

Network Analysis

For this, you should install Gephi, an open-source tool for visualizing and exploring networks.

We're going to use the Flickr API to build an adjacency list and visualize it.
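As a rough sketch of the adjacency-list step, here is one way to turn (user, contact) pairs into an adjacency list and write it out as a semicolon-separated file. The pairs below are made up for illustration; in class they would come from the Flickr API, which is not called here. The function names are hypothetical, and the one-row-per-node layout is an assumption about what Gephi's CSV import accepts.

```python
import csv
import io
from collections import defaultdict


def build_adjacency(edges):
    """Turn (user, contact) pairs into {user: sorted list of contacts}.

    Edges are treated as directed (user -> contact), which is an
    assumption about how the contact data is structured.
    """
    adjacency = defaultdict(set)
    for user, contact in edges:
        if user == contact:
            continue  # ignore self-loops
        adjacency[user].add(contact)
    return {user: sorted(contacts) for user, contacts in adjacency.items()}


def write_adjacency_csv(adjacency, out):
    """Write one row per node: the node first, then its neighbors."""
    writer = csv.writer(out, delimiter=";", lineterminator="\n")
    for node in sorted(adjacency):
        writer.writerow([node] + adjacency[node])


# Hypothetical contact pairs standing in for real API results.
sample_edges = [
    ("alice", "bob"),
    ("alice", "carol"),
    ("bob", "carol"),
    ("carol", "dave"),
    ("alice", "bob"),  # duplicates collapse because neighbors are a set
]

adjacency = build_adjacency(sample_edges)
buffer = io.StringIO()
write_adjacency_csv(adjacency, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To get a file you can open in Gephi, pass an open file handle instead of the StringIO buffer. Each row reads as "this node links to everything after it", so the first row here is alice;bob;carol.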

Here's a sample network.

Weka

Weka is an awesome tool for doing machine learning. It lets you take data and either classify items into categories (e.g. take text and classify the author as male or female) or do regression (predict a numeric score for something).

We're going to try an example where we first transform this data into ARFF format (Attribute-Relation File Format, the file format Weka reads) and then classify it:
http://archive.ics.uci.edu/ml/datasets/Housing
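The ARFF conversion can be sketched roughly like this. An ARFF file is a header (an @relation line plus one @attribute line per column) followed by an @data section of comma-separated rows. This sketch assumes the housing file is whitespace-separated with 14 numeric columns per row, named as in the data set's description; the function name and the two sample rows are only there for illustration, so check them against the file you actually download.

```python
# Column names assumed from the data set's description page.
COLUMNS = ["CRIM", "ZN", "INDUS", "CHAS", "NOX", "RM", "AGE",
           "DIS", "RAD", "TAX", "PTRATIO", "B", "LSTAT", "MEDV"]


def to_arff(lines, relation="housing"):
    """Convert whitespace-separated numeric rows into ARFF text."""
    out = [f"@relation {relation}", ""]
    # Every column is declared numeric here to keep the sketch simple.
    out += [f"@attribute {name} numeric" for name in COLUMNS]
    out += ["", "@data"]
    for line in lines:
        values = line.split()
        if not values:
            continue  # skip blank lines
        if len(values) != len(COLUMNS):
            raise ValueError(f"expected {len(COLUMNS)} values, got {len(values)}")
        for value in values:
            float(value)  # raises ValueError if a field is not numeric
        out.append(",".join(values))
    return "\n".join(out) + "\n"


# Sample rows in the assumed layout; read the real file's lines instead.
sample_rows = [
    " 0.00632  18.00   2.310  0  0.5380  6.5750  65.20  4.0900   1  296.0  15.30 396.90   4.98  24.00",
    " 0.02731   0.00   7.070  0  0.4690  6.4210  78.90  4.9671   2  242.0  17.80 396.90   9.14  21.60",
]

arff_text = to_arff(sample_rows)
print(arff_text)
```

To convert the real file, something like to_arff(open("housing.data")) should work, with the result saved under a .arff name. Because MEDV is numeric, Weka would treat predicting it as regression unless that column is first binned into categories.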