Jay Pujara's Webpage


Jay Pujara
Ph.D. Candidate
Computer Science Department
University of Maryland, College Park

Contact Information:

E-Mail:
<my first name> @ cs.umd.edu

Mailing Address:
3228 AV Williams Building
University of Maryland
College Park, MD 20742

Career Information:
CV
Career Website




|   About Me  |   Research Interests  |   Education  |
|   Publications  |   Work Experience  |   Course Work  |

About Me

I'm Jay and I'm a PhD student in Computer Science at the University of Maryland. Currently, I'm doing research in the field of machine learning with my advisor Lise Getoor and the LINQS group. From 2006-2010 I worked on spam detection at Yahoo! in Sunnyvale, CA. I completed my undergraduate education as well as a research Masters program at Carnegie Mellon University, graduating in 2005.

Research Interests

My research focuses on scalable machine learning to address scenarios where billions of predictions are necessary in a limited amount of time and large, noisy corpora of training data are available. Particular topics that I'm actively pursuing are Efficient Prediction Using Classifier Cascades, Scalable Entity Resolution, and Reducing Label Cost. Check out my CV for more elaborate descriptions about each of these topics.

Education

University of Maryland, College Park, 2010-
School of Computer Science
Ph.D. Candidate

Carnegie Mellon University, 2004-2005
School of Computer Science
M.S. Computer Science
Thesis: Understanding Feature Selection in Functional Magnetic Resonance Imaging

Carnegie Mellon University, 2000-2004
School of Computer Science
B.S. Computer Science
Minors in Logic and Computation, Mathematical Science, and Robotics
Thesis: Machine Learning Classification of fMRI data
University Honors and College Honors

Carnegie Mellon University, 2001-2004
Carnegie Institute of Technology
B.S. Electrical and Computer Engineering
University Honors

Carnegie Mellon University, 2001-2004
School of Humanities and Social Sciences
B.S. Cognitive Science
University Honors


Publications

See also: LINQS Publications

Using Classifier Cascades for Scalable E-Mail Classification. Jay Pujara, Hal Daume III, and Lise Getoor. CEAS 2011. [winner of Best Paper award]

Reducing Label Cost by Combining Feature Labels and Crowdsourcing. Jay Pujara, Ben London, and Lise Getoor. ICML 2011 workshop on Combining Learning Strategies to Reduce Label Cost. [selected for contributed talk]

Coarse-to-Fine, Cost-Sensitive Classification of E-Mail. Jay Pujara and Lise Getoor. NIPS 2010 Workshop on Coarse-to-Fine Processing. [selected for spotlight talk]


Patents

Real-time Ad-Hoc Spam Filtering of E-Mail. Jay Pujara, Patent 8,069,128; awarded 2011.

Employing pixel density to detect a spam image. Ke Wei, Hao Zheng, Jay Pujara, Patent 7,882,177; awarded 2011.

Identifying IP addresses for spammers. Jaesik Choi, Jay Pujara, Vishwanath Ramarao, Ke Wei, Patent 7,849,146; awarded 2010.


Work Experience

Location Position Dates
Yahoo! Inc, Sunnyvale, CA Senior Engineer, Yahoo! Mail Fall 2006 - Fall 2010
Oracle Corp, Redwood Shores, CA Member of Technical Staff, Business Intelligence Fall 2005 - Fall 2006
Carnegie Mellon University, Pittsburgh, PA Graduate Research Assistant Summer 2004
University of Pittsburgh, Pittsburgh, PA Research Programmer, Learning R&D Center Summer 2003
InternalDrive Corporation, Stanford, CA Camp Instructor, Game Programming and C++ Summer 2002
Carnegie Mellon University, Pittsburgh, PA Research Programmer, Robotics Institute Summer 2001
West Virginia State Legislature, Charleston, WV Web Designer and Developer Spring 2000


Graduate-level Course Work

Spring 2011 at University of Maryland
CMSC734: Information Visualization with Ben Shneiderman
CMSC858P: Computational methods for high-throughput analysis of biological systems with Hector Corrada Bravo

Fall 2010 at University of Maryland
CMSC723: Computational Linguistics with Hal Daume
CMSC858F: Algorithmic Game Theory with Mohammad Hajiaghayi

Spring 2005 at Carnegie Mellon University
15-721: Database System Design and Implementation with Anastasia Ailamaki
85-714: Cognitive Neuropsychology with Marlene Behrmann

Fall 2004 at Carnegie Mellon University
15-744: Computer Networks with Srinivasan Seshan
15-781: Machine Learning with Andrew Moore