(Résumé in pdf format available here)


Academic Degrees

Ph.D. Computer Science, Université Joseph Fourier, Grenoble, 1993, Summa Cum Laude (“avec les félicitations du jury).

M.S. Naval Architecture and Offshore Engineering, U.C. Berkeley, 1980.

M.S. (Diplome d'Etudes Approfondies, a.k.a. DEA) Applied Mathematics, Université Claude Bernard, Lyon, France,1973.

M.S. ME / Fluid Dynamics, Ecole Centrale de Lyon, France, 1972.


I am currently a Senior Research Scientist at Johns Hopkins University's Applied Physics Laboratory. My research is on object recognition, face recognition using lidar scans and regular face galleries, scene understanding for robotic navigation, and aerial image analysis.

For the past three years I have taught a course on real-time computer vision using OpenCV with Johns Hopkins University’s Engineering for Professionals program.

From 2005 to 2008 I was Program Director for Computer Vision Research at the National Science Foundation (NSF), on leave from a position as associate research professor at the Laboratory for Language and Media Processing (LAMP), a division of the University of Maryland Institute for Advanced Computer Studies (UMIACS) .

My research at University of Maryland focused on object recognition, video surveillance, video retrieval, and document image analysis. My contributions in video analysis include spatio-temporal segmentation of video using hierarchical mean shift, retrieval of near duplicates using spatio-temporal descriptors, and smart fast-forward of videos by polygon simplification of video trajectories.

I edited a book with Azriel Rosenfeld and David Doermann, called VIDEO MINING, which is still a good snapshot for navigation and search in large video databases.

At University of Maryland I have taught a graduate level computer vision class with Ramani Duraiswami, Larry Davis and Cornelia Fermuller. It is available online: CMSC 828D: Fundamentals of Computer Vision ,

Prior to my involvement with LAMP, I was with CVL (Computer Vision Laboratory) at the Center for Automation Research (CfAR) at University of Maryland, College Park , where I developed camera-based 3D pointers and techniques for pose calculation, such as POSIT and SoftPOSIT. I was also active in visual road following and motion detection research for the DARPA-funded UGV and ALV autonomous land navigation projects.

Recent Publications

Philippe Burlina, Chad Sprouse, Daniel DeMenthon, Ryan Mukherjee and Ted Abraham, "Towards Mitral Valve Closure Prediction using 3D Echocardiography", IEEE Transactions on Medical Imaging, 2013.

Yong Xu, Yuhui Quan, Zhuming Zhang, Morimichi Nishigaki, Hui Ji, Cornelia Fermuller, Daniel DeMenthon, "Contour-based Recognition", CVPR 2012, Providence, RI, June 2012.

Morimichi Nishigaki, Cornelia Fermuller, Daniel DeMenthon, "The Image Torque Operator: A New Tool for Mid-level Vision", CVPR 2012, Providence, RI, June 2012.

Anne Jorstad, Daniel DeMenthon, I-Jeng Wang, Philippe Burlina, “Distributed Consensus on Camera Pose”, IEEE Transactions on Image Processing, Vol.19, No. 8, August 2010.

Philippe Burlina, Chad Sprouse, Daniel DeMenthon, Anne Jorstad, Ray Juang, Francisco Contijoch, Ted Abraham, D. Yuh, E. McVeigh, “Patient Specific Modeling and Analysis of the Mitral Valve using 3D-TEE”, Proc. First International Conference on Information Processing for Computer Assisted Surgical Intervention, 2010.

Jian Liang, Daniel DeMenthon and David Doermann, " Mosaicing of Camera-captured Document Images", Computer Vision and Image Understanding (CVIU), December 2008.

Jian Liang, Daniel DeMenthon and David Doermann, "Geometric Rectification of Camera-captured Document Images", PAMI, vol. 30, no. 4, pp. 591-605, April 2008.

Zhe Lin, Larry S. Davis, David Doermann, Daniel DeMenthon, "Simultaneous Appearance Modeling and Segmentation for Matching People under Occlusion", Asian Conference on Computer Vision (ACCV 2007), Tokyo, Japan, Nov. 18--22, 2007.

Zhe Lin, Larry S. Davis, David Doermann, Daniel DeMenthon, "An Interactive Approach to Pose-Assisted and Appearance-based Segmentation of Humans", ICCV Workshop on Interactive Computer Vision (ICV 2007), Rio de Janeiro, Brazil, October 2007.

Zhe Lin, Larry S. Davis, David Doermann, Daniel DeMenthon, "Hierarchical Part-Template Matching for Human Detection and Segmentation", International Conference on Computer Vision (ICCV 2007), Rio de Janeiro, Brazil, October 2007.

Daniel DeMenthon and David Doermann, "Video Retrieval of Near-Duplicates using k-Nearest Neighbor Retrieval of Spatio-Temporal Descriptors". Multimedia Tools and Applications (MTAP), pp. 229-253, vol. 30, no. 3, Sept. 2006.

Jian Liang, Daniel DeMenthon and David Doermann, "Camera-Based Document Image Mosaicing", ICPR 2006, vol. 2, pp. 476-479, 2006.

Ming Luo, Daniel DeMenthon, Xiaodong Yu and and David Doermann, "SOFTCBIR: Object Searching in Videos Combining Keypoint Matching and Graduated Assignment", University of Maryland Technical Report, LAMP-TR-132, CAR-TR-1013, CS-TR-4804, UMIACS-TR-2006-25, May 2006.


"Device using a Camera and Light Polarization for the Remote Displacement of a Cursor on a Display", U.S. Patent 7,542,072 B2, D. DeMenthon, June 2, 2009.

"Computer Vision System for Accurate Monitoring of Object Pose'', U.S. Patent 5,388,059, D. DeMenthon, 1995.

"Three Dimensional Pointing Device Monitored by Computer Vision'', U.S. Patent 5,297,061, D. DeMenthon and Y. Fujii, 1994.

"Computer Vision System for Position Monitoring in Three Dimensions using Non-Coplanar Light Sources Attached to a Monitored Object", U.S. Patent 5,227,985, D. DeMenthon, 1993.

"Computer-Controlled Peripheral Shaping System", U.S. Patent 4,675,825, D. DeMenthon, 1987.

"Wave Energy Converter", U.S. Patent 4,412,417, D. DeMenthon, 1983.

Updated August 2013