Ahmed Taha Home Page

I have moved my website hosting to github io starting Mar 2024.

Ahmed Taha

PhD in Computer Vision/Machine Learning

News

[03/2024] I most this website hosting to github io
[01/2024] Article: The Equations Behind DALL-E
[10/2023] Press: MICCAI 2023 Magazine has featured our M&M Paper for lesions detection in mammograms.
[08/2023] Paper: One Paper accepted in MICCAI 2023
[08/2023] Video: How Fully Sharded Data Parallel (FSDP) works?
[05/2023] Article: High Resolution Images and Efficient Transformers
[03/2023] Article: Masked Autoencoders Are Scalable Vision Learners
[02/2023] Article: Rethinking Attention with Performers - Part II & Final
[10/2022] Article: Rethinking Attention with Performers - Part I
[08/2022] Paper: One Paper accepted in MICCAI 2022
[07/2022] Github: Official PyTorch implementation of Deep is a Luxury We Don't Have
[05/2022] Article: Understanding the Effective Receptive Field in Deep Convolutional Neural Networks
[04/2022] Article: Understanding Transfer Learning for Medical Imaging
Full News list

Research Interest:	Feature Embedding, Metric Learning, Deep Networks, Machine Learning, Image segmentation, Texture classification, Patch matching.

Technical Skills:	Python, C/C++, JAVA, OpenCV, MATLAB, mex files, and CUDA TensorFlow, PyTorch, Keras, OpenCV, SimpleITK, CAFFE

Education:	Computer Science PhD (+MS) GPA: 4.0/4.0 - University of Maryland Masters of Business Administration (Marketing Major) GPA 3.83/4.0 - Arab Academy for Science and Technology Computer Science BS GPA 3.81/4.0 - Alexandria University- Faculty of Engineering

Publications:

(click on image to expand)

	M&M: Tackling False Positives in Mammography with a Multi-view and Multi-instance Learning Sparse Detector, MICCAI 2023 (acceptance rate 32%) Yen Nhi Truong Vu, Dan Guo, Ahmed Taha, Jason Su, Thomas Paul Matthews Project Page MICCAI Featured
	Deep is a Luxury We Don't Have, MICCAI 2022 (acceptance rate 31%) Ahmed Taha, Yen Nhi Truong Vu, Brent Mombourquette, Thomas Matthews, Jason Su, Sadanand Singh Project Page
	SVMax: A Feature Embedding Regularizer, arXiv 2021 Ahmed Taha, Alex Hanson, Abhinav Shrivastava, Larry Davis Project Page
	Knowledge Evolution in Neural Networks, CVPR Oral 2021 (acceptance rate 24% -- Oral 4%) Ahmed Taha, Abhinav Shrivastava, Larry Davis Project Page
	A Generic Visualization Approach for Convolutional Neural Networks, ECCV 2020 (acceptance rate 27%) Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis Project Page
	Boosting Standard Classification Architectures Through a Ranking Regularizer, WACV 2020 (acceptance rate 34.5%) Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis Project Page
	Unsupervised Data Uncertainty Learning in Visual Retrieval Systems, arXiv 2019 Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis arXiv Link
	Exploring Uncertainty in Conditional Multi-Modal Retrieval Systems, arXiv 2019 Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Larry Davis arXiv Link
	Segmentation of Renal Structures for Image-Guided Surgery, MICCAI 2018 (acceptance rate 34.9%) Junning Li, Pechin Lo, Ahmed Taha, Hang Wu, Tao Zhao
	Kid-Net: Convolution Networks for Kidney Vessels Segmentation from CT-Volumes, MICCAI 2018 (acceptance rate 34.9%) Ahmed Taha, Pechin Lo, Junning Li, Tao Zhao
	Two Stream Self-Supervised Learning for Action Recognition CVPRW 2018 Ahmed Taha, Moustafa Meshry, Xitong Yang, Yi-Ting Chen, Larry Davis (DeepVision)- Extended Abstract Github Code
	Texture Synthesis with Recurrent Variational Auto-Encoder, ARXIV 2017 Rohan Chandra, Sachin Grover, Kyungjun Lee, Moustafa Meshry, Ahmed Taha Github Code
	Seeded Laplacian: An Interactive Image Segmentation Approach using Eigenfunctions, ICIP 2015 Ahmed Taha, Marwan Torki Github Code
	Multi-Modality Feature Transform: An Interactive Image Segmentation Approach, BMVC 2015 Moustafa Meshry, Ahmed Taha, Marwan Torki Code

Internship:

Summer 2019: Student Associate at Honda Research Institute (HRI-US)
Summer 2018: Research Assistant in University Of Maryland, sponsored by Honda Research Institute
Summer 2017: Medical Image Analysis/Machine Learning Intern at Intuitive Surgical Inc
Summer 2016: Emerging Graphics Group Intern at Adobe Systems Inc

Research Experience:

[Summer 2018] Video Retrieval System [Honda Research Institute (HRI-US) internship] [Sample Video]: Propose a descriptive markup language to describe participants' (e.g., cars and pedestrian) movements at road intersections. Using the proposed language, a deterministic polynomial-time algorithm is utilized to quantitatively compute an interpretable similarity metric between different driving intersection scenarios. This enables us to develop a video retrieval system for both stop-sign and traffic-light controlled intersections. We investigate multiple approaches to automatically transform trimmed autonomous navigation ego-videos at intersections into the proposed markup language.
[Spring 2018-Summer2019] Autonomous Navigation Recognition & Retrieval: Sponsored by Honda Research Institute, I explore self-supervised approaches for ego-motion action recognition. I studied also video embedding, triplet loss retrieval and uncertainty estimation. This work led to a CVPRW2018 publication and pending anonymous submission.
[Summer 2017] Semantic Segmentation [MICCAI Poster]: During Intuitive Surgical internship, I applied machine learning techniques to segment key anatomical structures from volumetric CT-images, in a fully automatic and semi-automatic fashion. We propose a convolution neural network developed using Keras, Tensorflow and Python libraries. Based on that work, two MICCAI 2018 papers are published.
[Summer 2016] Patch Matching [Adobe Research Intern Expo Poster][Sample Video, Video 2]: During Adobe summer internship, I worked on a new selection/segmentation tool based on patch matching. The intuition is much similar to "Pseudo-polar based estimation of large translations rotations and scalings in images" but instead of comparing images, we compare patches. The selection results are remarkable, yet it suffered large computational time. Thus, it is not ready for industry use yet. While at Adobe, I intensively learned about Coherency Sensitive Hashing, The Generalized PatchMatch Correspondence Algorithm and of course Fourier transform. After the internship, I did additional evaluation against PatchMatchGraph: Building a Graph of Dense Patch Correspondences for Label Transfer. That's why I became expert with Darwin Framework.

[Winter 2016] Texture Classification: I worked on a new texture classification approach classifying textures suffering a dark shadow. The results achieved was not good enough for a top conference. Yet, I reviewed the literature in great details. I implemented both local binary pattern (LBP) and various filter banks like Leung-Malik, Schmid, and maximum response. I got familiar with compressed sensing for texture classification. I ran my evaluation experiments using the following texture datasets: CUReT, UIUC and DTD.

[2015] Image Segmentation: Developed an approached for solving interactive image segmentation problem. The approach supports different user annotation forms like scribble, complete and incomplete trimaps, tight contour and bounding box. Qualitative and quantitative results are compared against Grabcut, Geodesic Star Convexity and MILCut.

Selected Awards:

WACV Doctoral Consortium Plus Travel Award 2020.
Graduate School's Outstanding Teaching Assistant Award for AY 2019-20 (Awarded to 2%).
Gifted unrestricted 2500$ from Adobe Systems, Inc.
University of Maryland Graduate School Deans Fellowship, 2015 and 2016
Team has been chosen as one of the Young Innovators Awards(YIA) Program winners for the academic year 2008/2009.
Awarded four successive times in college for the Excellent grade

Teaching Experience:

(click to see course description)

CMSC216 (Introduction to Computer Systems - Using C)

CMSC132 (Object-Oriented Programming II - Using JAVA)

CMSC420 (Data Structures - Using JAVA)

CMSC426 (Computer Vision - Using PYTHON)

Postscript
After earning my Bsc, I spent some time developing mobile apps for iOS. I co-founded Inova, a software development company.