Instructor Amol Deshpande
Tuesday 10-11am, Thursday 2-3pm, or by appointment;
, Office Hours: Monday 11-12pm, TA Office
, Office Hours: Thursday 9-10am, TA Office
We will use Piazza
for class discussion/clarifications etc.
Github Repository for the Class
We will use github
for disseminating assignments/labs, datasets, and lecture notes.
There is no required textbook for the course, however the following two books will be useful for you to consult.
Both the books are available for free online at Safari Books Online
if you are on the UMD network
The course will be heavily assignment-based, with bi-weekly assignments focused on learning how to use different tools. See Schedule for more details.
You should be familiar with Java, be comfortable with using Unix/Linux, and also be comfortable with downloading and installing
/packages from the Web (we will only use widely-used packages that have extensive documentation). Although we will use other
languages like Python in some cases, sufficient guidance will be provided and prior familiarity with those languages is not expected.
We will primarily use Python for this class. Sufficient guidance will be provided to help you learn Python along the way, and it is not
expected that you are familiar with Python. Java may still be needed for some assignments.
Some examples of tools that we will learn to use include: (1) Amazon EC2 Cloud Computing, (2) Data Wrangler (a data cleaning tool),
(3) PostgreSQL (a relational database), (4) Hadoop/Map-Reduce, (5) MongoDB/HBase/Cassandra.
There will also be in-class exams and a final. Details to be announced later.