Logistics

When & Where

  • CISC 2120
  • Tuesday, Thursday 3:30pm - 4:45pm

Instructor

Abhinav Shrivastava

  • 3209 AV Williams
  • abhinav@cs.umd.edu
  • Office hours: email for appointments (‘CMSC828I’ in the subject line)

Teaching Assistant

Alex Hanson

  • 4420 AV Williams
  • hanson@cs.umd.edu
  • Office hours: email for appointments (‘CMSC828I’ in the subject line)

Overview

Summary

A graduate-level course in computer vision, with an emphasis on high-level recognition tasks. We will read an eclectic mix of classic and contemporary papers on a wide-range of topics. The course structure will combine lectures, student presentations, in-class discussions, and a course project. The goal of this course is to:

  1. Become knowledgeable about how a particular sub-topic evolved, the state-of-the-art, and the challenges that need to be addressed.
  2. Develop skills to critically read, analyze, write-up, and present a body of research.
  3. Learn to draw connections between different works and engage in productive discussions.

Prerequisites

While there are no formal prerequisites for this course, familiarity with introductory courses in computer vision (CMSC426 or similar) and machine learning (CMSC422 or similar) is assumed. If you have not taken courses covering this material, consult with the instructor. Note that a basic knowledge of linear algebra, probability, and calculus is required.

Awards

At the end of the course, we will have prizes (and bonus credits) for:

  • Best paper/sub-topic review (5x)
  • Best Project
  • Best Presentation

Class Presentation

Every student will give one presentation in the course. One week before the presentation date, schedule a meeting with the instructor for discussion. Two days before the presentation, present an initial draft to the instructor.

Paper/Sub-topic Review or Assignments

Paper reviews and assignments, their format, and submission will be announced in the class.

Late Days

You get 5 late days (to be used in 24-hour blocks) that can be used throughout the course. Late days may be used for paper reviews and project proposal submission. Late days may not be used towards the presentations, exams, or final project report (see ‘Accommodations and Policies’ below for exceptions).

Project Report Format and Deadline

Final project reports (5 pages in NeurIPS final format) are due on Dec 15, 2018 at Midnight. Submit via email with subject “[CMSC828i] Final project report” to the instructor.

Grading

  • Participation: 10%
  • Presentation: 10%
  • Paper/Sub-topic Reviews: 20%
  • Research Project: 30%
  • Exams (1 mid-term, 1 final): 30%

Schedule

Presenter(s) Date Lecture topics* Papers Slides/Notes
Abhinav Aug 28 Introduction to the class 01-Intro.pdf
Abhinav Aug 30 Introduction to data 02-Data-01.pdf
Abhinav Sep 4 Introduction to Data-driven methods in vision 03-Data-02.pdf
Introduction to ConvNet Architecures 03-ConvNets.pdf
Alex Sep 6 Debugging Deep Networks + Adversarial Examples 04-Viz.pdf
Abhinav Sep 11 Introduction to ConvNet Architecures (cont.) 03-ConvNets.pdf
Assignment 0
Abhinav Sep 13 Introduction to Data-driven methods (cont.) 06-Data-03.pdf
Abhinav Sep 18 Object Detection (Part I) Background: Single-stage object detection: 07-ObjDet-01.pdf
Abhinav Sep 20 Object Detection (Part I) (cont.)
Image Segmentation
(+ other pixel labeling tasks)
Background: Semantic Segmentation: Object Proposals: 08-Seg+ObjDet-2.pdf
Abhinav Sep 25 Image Segmentation (cont.)
Object Detection (Part II) Multi-stage object detection: Multi-stage object detection & instance segmentation: Architectures: Analysis and diagnosis: Training: 09-ObjDet-3.pdf
Abhinav Sep 27 Object Detection (Part II) (cont.)
Kamal Gupta Human Pose Estimation Background: Readings: Review 1 (Action)

10-Pose-Kamal.pdf
Abhinav
Jack Wang
Oct 2 Action Recognition Background: Video Representation Action Classification Temporal Action Detection Spatio-temporal Action Detection Review 2
(Fine-graned and Attributes)

11-Videos.pdf

11-Videos-Jack.pdf
Abhinav
Yu Shen
Luyu Yang
Oct 4 Attributes and Fine-grained Recognition Attributes (background and readings): Fine-grained Recognition (background and readings): 12-Attributes-and-FG.pdf

12-FG-Yu.pdf

12-Attributes-Luyu.pdf
Abhinav
Jane Tsai
Kuan-Ho Lao
Oct 9 3D Scene Understanding - Primitives and Reasoning Background: Primitives (required reading) Reasoning (required reading) Optional readings: 13-3D-1.pdf

13-3D-Jane.pdf

13-3D-KuanHo.pdf
Invited Talk
Debadeepta Dey
Oct 11 Reinforcement Learning Primer;
Neural Architecture Search
RL Primer:
  • Reinforcement Learning: An Introduction; (website, pdf) (skim the first two chapters)
NAS Papers: Bonus papers:
14-NAS-Dey.pdf

14-POMDP-Geoff.pdf
Abhinav
Haoying
Jun Wang
Oct 16 Objects + 3D Required readings: Additional and background readings: 15-3D-2.pdf

15-3D-Haoying.pdf

15-3D-Jun.pdf
Abhinav
Lillian
Anurag
Oct 18 Context Reasoning Required readings: Additional readings: 16-context.pdf

16-context-Lillian.pdf

16-context-Anurag.pdf
Oct 23 Mid-term presentations
Oct 25 Mid-term exam
Abhinav
Ankan
Saketh
Oct 30 Weakly, Semi Supervised Learning Semi-supervised Learning (required readings): Weakly-supervised Learning (required readings): Additional readings: 17-Weakly-Saketh.pdf

17-SSL-Ankan.pdf
Abhinav
Anshul
Ketul
Nov 1 Learning from the Web 18-SSL+Web.pdf

18-web-Anshul.pdf

18-web-Ketul.pdf
Abhinav
Sahil
Nov 6 Visual data mining and discovery Required readings: Additional readings: 19-midlevel+mining.pdf

19-mining-Sahil.pdf
Abhinav
Yixuan
Nov 8 Generative Models GANs: VAEs: Additional recommended readings: 20-26-gen+unsup+self.pdf

20-gen-Yixuan.pdf

21-gen-Bo.pdf

21-gen-Yue.pdf

21-gen-Hao.pdf
Abhinav
Bo He
Yue Jiang
Hao Chen
Nov 13
Nov 15 No Class CVPR deadline
Abhinav Nov 20 Misc. (see above)
Nov 22 Thanksgiving Break
Abhinav Nov 27 Misc. (see above)
Ruoxi
Sweta
Nov 29 Self-Supervised Learning Context from images: Context from videos: Additional readings: follow the references of the listed papers. 24-self-Sweta.pdf

24-self-Ruoxi.pdf
Mingfei
Kyungjun
Few-shot (low-shot), Zero-shot Recognition Few-shot (low-shot) Recognition readings: Zero-shot Recognition (using KG) readings: Additional readings: 24-fsl-Kyungjun.pdf

24-zsl-Mingfei.pdf
Abhinav
Shi
Mozhi
Pedro
Dec 4 Vision + Language/text Easier tasks: Tasks: Sampling of methods: Review 3 (group)

25-text-Mozhi.pdf

25-text-Pedro.pdf

25-text-Shi.pdf
Abhinav
Jack
Uttaran
Rohan
Dec 6 Embodied visual perception (vision + action) Primary readings: Additional readings: 26-action-Uttaran.pdf

26-action-Rohan.pdf

26-action-Jack.pdf
Dec 13 Final Presentations 12:00 - 4:00 PM
Location: CSIC 2120
Format: 10 min per group (+2 minutes for questions)
Dec 18 Final exam Time: 10:30am-12:30pm
Location: CSIC 2120

*Note that the topic schedule is subject to change.

Additional topics we can cover if the schedule changes:

  • Intuitive physics
  • Hyperparameter optimization and architecture search (AutoML, HyperOpt, TPE, SMAC, RoBO, etc.)
  • Information theoretic interpretation of deep networks
  • Training with synthetic data
  • Neuroscience study example
  • ?

Accommodations and Policies

Academic Integrity

Note that academic dishonesty includes not only cheating, fabrication, and plagiarism, but also includes helping other students commit acts of academic dishonesty by allowing them to obtain copies of your work. In short, all submitted work must be your own. Cases of academic dishonesty will be pursued to the fullest extent possible as stipulated by the Office of Student Conduct. It is very important for you to be aware of the consequences of cheating, fabrication, facilitation, and plagiarism. For more information on the Code of Academic Integrity or the Student Honor Council, please visit http://www.shc.umd.edu.

Excused Absence and Academic Accommodations

Any student who needs to be excused for an absence from a single lecture, recitation, or lab due to a medically necessitated absence shall:

  1. Make a reasonable attempt to inform the instructor of his/her illness prior to the class.
  2. Upon returning to the class, present their instructor with a self-signed note attesting to the date of their illness. Each note must contain an acknowledgment by the student that the information provided is true and correct. Providing false information to University officials is prohibited under Part 9(i) of the Code of Student Conduct (V-1.00(B) University of Maryland Code of Student Conduct) and may result in disciplinary action.
  3. This self-documentation may not be used for the Major Scheduled Grading Events as defined below.

Any student who needs to be excused for a Major Scheduled Grading Event, must provide written documentation of the illness from the Health Center or from an outside health care provider. This documentation must verify dates of treatment and indicate the time frame that the student was unable to meet academic responsibilities. No diagnostic information shall be given. The Major Scheduled Grading Events for this course include midterm and final exam. For class presentations, the instructor will help the student swap their presentation slot with other students.

It is also the student's responsibility to inform the instructor of any intended absences from exams and class presentations for religious observances in advance. Notice should be provided as soon as possible, but no later than the Monday prior to the the midterm exam, the class presentation date, and the final exam.

Any student eligible for and requesting reasonable academic accommodations due to a disability is requested to provide a letter of accommodation from the Office of Disability Support Services within the first three weeks of the semester.

Other Accommodations and Policies

You can find the university’s course policies here.