Henry Tang

Researcher
Printing and Content Delivery Lab
Palo Alto

Biography

Henry Hao Tang is a researcher in the Printing and Content Delivery Lab, focused on extracting information from multimedia content and interactions for the purpose of knowledge discovery, representation, and rendering.

Henry completed the Ph.D. degree in electrical and computer engineering at the University of Illinois at Urbana-Champaign in 2010 and was advised by Professor Thomas S. Huang. He holds the M.S. degree in electrical and computer engineering from Rutgers, The State University of New Jersey, and the M.E. and B.E. degrees, both in electrical engineering, from the University of Science and Technology of China.

Prior to joining HP Labs, he was employed by Microsoft Research as a Research Intern (Summer 2007 & Spring 2010), IBM T.J. Watson Research Center as a Research Intern (Summer 2008), Fuji Xerox Palo Alto Laboratory (FXPAL) as a Research Intern (Summer 2009),  and Google as a Software Engineering Intern (Summer 2010).

 

Research interests

  • Multimedia
  • Computer Vision
  • Pattern Recognition, Machine Learning, and Data Mining
  • Augmented Reality and Multimodal Human-Computer Interaction
  • Tele-presence and Tele-immersion

Awards

  • Science and Technology Progress Award of Anhui Province, 1st Prize, China, 1999
  • National Science and Technology Progress Award, 2nd Prize, China, 2002
  • International Champion in CLEAR Evaluation on Multimodal Person ID Task, 2006
  • International Champion in CLEAR Evaluation on Acoustic Event Detection and Classification Task, 2007
  • IBM Best Student Paper Award, International Conference on Pattern Recognition, 2008
  • Emerging Leader in Multimedia, IBM T.J. Watson Research Center, 2009
  • The Excellence Award of the 12th China Patent Award, 2010

Publications

Below listed are my recent publications (2008-present). For a full list, please check out my CV This is a Non-HP site.

Journal Articles

  • Hao Tang, Zicheng Liu, "Computational Audio-Visual Scene Analysis,'' IEEE COMSOC MMTC E-Letter, Vol. 6, No. 1, pp. 21-23, January 2011
  • Hao Tang, Mark Hasegawa-Johnson, Thomas Huang, "A Novel Vector Representation of Stochastic Signals Based on Adapted Ergodic HMMs,'' IEEE Signal Processing Letters, Vol. 17, No. 8, pp. 715-718, August 2010
  • Xi Zhou, Xiaodan Zhuang, Hao Tang, Mark Hasegawa-Johnson, Thomas Huang, "Novel Gaussianized Vector Representation for Improved Natural Scene Categorization,'' Pattern Recognition Letters, No. 8, pp. 702-708, June 2010
  • Thomas Huang, Mark Hasegawa-Johnson, Stephen Chu, Zhihong Zeng, Hao Tang, "Sensitive Talking Heads,'' IEEE Signal Processing Magazine, 26(4):67-72, July 2009
  • Hao Tang, Yun Fu, Jilin Tu, Mark Hasegawa-Johnson, Thomas S. Huang, "Humanoid Audio-Visual Avatar with Emotive Text-To-Speech Synthesis,'' IEEE Transactions on Multimedia, Volume: 10, Issue: 6, pp. 969-981, October, 2008
Book Chapters
  • Yun Fu, Hao Tang, Jilin Tu, Hai Tao, Thomas S. Huang, "Human-Centered Face Computing in Multimedia Interaction and Communication,'' C.W. Chen, Z. Li, and S. Liang (Eds.), Intelligent Multimedia Communication: Techniques and Applications, Springer-Verlag, 2010
Peer-Reviewed Conference Papers
  • Hao Tang, Vivek Kwatra, Mehmet Emre Sargin, Ullas Gargi, "Detecting Highlights in Sports Videos: Cricket as a Test Case," 2011 IEEE International Conference on Multimedia & Expo (ICME) – Workshop on Visual Content Identification and Search (VCIDS 2011) , Barcelona, Spain, July, 2011
  • Vuong Le, Hao Tang, Thomas Huang, "Expression Recognition from 3D Dynamic Faces using Robust Spatio-temporal Shape Features,'' The Ninth IEEE International Conference on Automatic Face and Gesture Recognition, Special Session on 3D Facial Behaviour Analysis and Understanding, Santa Barbara, CA, March 2011
  • Chunyuan Liao, Hao Tang, Qiong Liu, Patrick Chiu, Francine Chen, "FACT: Fine-grained Cross-media Interaction with Documents via a Portable Hybrid Paper-Laptop Interface,'' ACM Multimedia 2010, Firenze, Italy, October, 2010
  • Wenming Zheng, Hao Tang, Zhouchen Lin, Thomas Huang, "Emotion Recognition from Arbitrary View Facial Images,'' 2010 European Conference on Computer Vision (ECCV'10), Crete, Greece, September, 2010
  • Vuong Le, Hao Tang, Liangliang Cao, Thomas Huang, "Accurate and Efficient Reconstruction of 3D Faces from Stereo Images,'' 2010 International Conference on Image Processing (ICIP'10), September, Hong Kong, 2010
  • Kai-Hsiang Lin, Hao Tang, Thomas Huang, "Robust License Plate Detection Using Image Saliency,'' 2010 International Conference on Image Processing (ICIP'10), September, Hong Kong, 2010
  • Hao Tang, Mark Hasegawa-Johnson, Thomas Huang, "Non-frontal View Facial Expression Recognition Based on Ergodic Hidden Markov Model Supervectors,'' 2010 International Conference on Multimedia \& Expo (ICME'10), Singapore, July 2010
  • Hao Tang, Mark Hasegawa-Johnson, Thomas Huang, "Toward Robust Learning of the Gaussian Mixture State Emission Densities for Hidden Markov Models,'' 2010 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'10), Dallas, Texas, March, 2010
  • Wenming Zheng, Hao Tang, Zhouchen Lin, Thomas Huang, "A Novel Approach to Expression Recognition from Non-frontal Face Images,'' 2009 International Conference on Computer Vision (ICCV'09), Kyoto, Japan, September, 2009
  • Hao Tang, Stephen Chu and Thomas Huang, "Spherical Discriminant Analysis in Semi-supervised Speaker Clustering,'' North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Conference, Boulder, CO, June, 2009
  • Stephen M. Chu, Hao Tang, Thomas S. Huang, "Locality Preserving Speaker Clustering,'' 2009 International Conference on Multimedia & Expo (ICME'09), New York, NY, June, 2009
  • Hao Tang, Stephen M. Chu, Mark Hasegawa-Johnson, Thomas S. Huang, "Emotion Recognition from Speech via Boosted Gaussian Mixture Models,'' 2009 International Conference on Multimedia & Expo (ICME'09), New York, NY, June, 2009
  • Stephen M. Chu, Hao Tang, Thomas S. Huang, "Fishervoice and Semi-supervised Speaker Clustering,'' 2009 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09), Taipei, Taiwan, April, 2009
  • Hao Tang, Stephen M. Chu, Thomas S. Huang, "Generative Model-based Speaker Clustering via Mixture of von Mises-Fisher Distributions,'' 2009 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09), Taipei, Taiwan, April, 2009
  • Hao Tang and Thomas S. Huang, "Boosting Gaussian Mixture Models via Discriminant Analysis,'' 2008 IEEE International Conference on Pattern Recognition (ICPR'08), Tempa, FL, December, 2008
  • Xi Zhou, Xiaodan Zhuang, Hao Tang, Mark Hasegawa-Johnson, Thomas Huang, "A Novel Gaussianized Vector Representation for Natural Scene Categorization,'' 2008 IEEE International Conference on Pattern Recognition (ICPR'08), Tempa, FL, December, 2008
  • Hao Tang and Thomas S. Huang, "MPEG4 Performance-Driven Avatar via Robust Facial Motion Tracking,'' 2008 IEEE International Conference on Image Processing (ICIP'08), San Diego, CA, October, 2008
  • Jianchao Yang, Hao Tang, Yi Ma, Thomas Huang, "Face Hallucination via Sparse Coding,'' 2008 IEEE International Conference on Image Processing (ICIP'08), San Diego, CA, October, 2008
  • Hao Tang, Xi Zhou, Matthias Odisio, Mark Hasegawa-Johnson, and Thomas S. Huang, "Two-Stage Prosody Prediction for Emotional Text-to-Speech Synthesis,'' INTERSPEECH 2008, Brisbane, Australia, September, 2008
  • Hao Tang and Thomas S. Huang, "3D Facial Expression Recognition Based on Properties of Line Segments Connecting Facial Feature Points,'' 2008 IEEE International Conference on Automatic Face and Gesture Recognition (FG'08), Amsterdam, The Neitherlands, September, 2008
  • Hao Tang and Thomas S. Huang, "3D Facial Expression Recognition Based on Automatically Selected Features,'' 2008 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08) (Workshop on 3D Face Processing), Anchorage, Alaska, June, 2008
  • Hao Tang, Yuxiao Hu, Yun Fu, Mark Hasegawa-Johnson, and Thomas S. Huang, "Real-Time Conversion from a Single 2D Face Image to a 3D Text-Driven Emotive Audio-Visual Avatar,'' 2008 IEEE International Conference on Multimedia & Expo (ICME'08), Hannover, Germany, June 2008
  • Yuxiao Hu, Hao Tang, and Thomas S. Huang, "Camera and Microphone Array for 3D Audiovisual Face Data Collection,'' 2008 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'08), Las Vegas, Nevada, March, 2008
  • Hao Tang, Zhixiong Chen, and Thomas S. Huang, "Comparison of Algorithms for Speaker Identification under Adverse Far-Field Recording Conditions with Extremely Short Utterances,'' Proc. 2008 IEEE International Conference On Networking, Sensing and Control (ICNSC'08), pp. 796 - 801, Sanya, China, April, 2008

Professional activities

  • Reviewer, IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Reviewer, IEEE Transactions on Audio, Speech and Language Processing
  • Reviewer, IEEE Transactions on Circuits and Systems for Video Technology
  • Reviewer, IEEE Transactions on Image Processing
  • Reviewer, IEEE Transactions on Multimedia
  • Reviewer, ACM Transactions on Multimedia Computing, Communications and Applications
  • Reviewer, International Journal of Image and Graphics
  • Reviewer, Neurocomputing
  • Reviewer, IEEE Signal Processing Letters
  • Reviewer, Pattern Recognition
  • Reviewer, SSCI 2011