Tan & iSocioBot

 

         





 

  Zheng-Hua Tan                          

   Professor of Machine Learning and Speech Processing, PhD, IEEE Senior Member

   Signal and Information Processing, Department of Electronic Systems, Aalborg University 

   Fredrik Bajers Vej 7, 9220 Aalborg, Denmark

   Tel: +45 9940 8686, E-mail: zt@es.aau.dk, Skype ID: zhenghuatan, LinkedIn, Google Scholar, ResearchGate.


BIOGRAPHICAL INFORMATION  

He was a Visiting Scientist/Professor at the Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology (MIT), Cambridge, USA, an Associate Professor at the Department of Electronic Engineering at Shanghai Jiao Tong University (SJTU), China, and a postdoctoral fellow at the AI Spoken Language Lab in the Department of Computer Science at KAIST, Korea. He is now a Professor in the Department of Electronic Systems and a co-head of the Centre for Acoustic Signal Processing Research at Aalborg University, Denmark. He received the Ph.D. degree in Electronic Engineering from Shanghai Jiao Tong University in 1999, and the B.S. and M.S. degrees in Electrical Engineering from Hunan University in 1990 and 1996, respectively.

His research interests include machine learning, deep learning, pattern recognition, speech recognition, speaker recognition, noise-robust speech processing (speech enhancement and separation, robust features, voice activity detection), multimodal (audio-visual) signal processing, social robotics (he built a multimodal interactive social robot called iSocioBot), and recommender systems. He has worked on these topics for the last two decades. He has about 180 publications in IEEE/ACM-TASLP, IEEE-TNNLS, IEEE-TKDE, IEEE-TMM, IEEE-TAC, IEEE-TCE, IEEE-TSG, IEEE-J-STSP, IEEE-SPL, IEEE INTELL SYST, Neurocomputing, CSL, SpeechComm, ICASSP, INTERSPEECH, and other venues. He edited the book Automatic Speech Recognition on Mobile Devices and over Communication Networks (Springer-Verlag, 2008).

He is an elected Member of the IEEE Signal Processing Society Machine Learning for Signal Processing Technical Committee (MLSP TC) for the term 2018-2020, and the General Chair of the 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP2018). He has served as an Editorial Board Member/Associate Editor for Computer Speech and Language, Digital Signal Processing, and Computers and Electrical Engineering. He was a Lead Guest Editor of the IEEE Journal of Selected Topics in Signal Processing and a Guest Editor of Neurocomputing. He has served as a General Chair, Program Co-chair, Area and Session Chair, and Tutorial Speaker at many international conferences. He is a Senior Member of the IEEE and a Member of ISCA.

He has received major grants from the European Commission Horizon 2020 programme, the Danish Council for Independent Research, Innovation Fund Denmark, and the Danish Strategic Research Council, as well as various industrial grants, in the areas of machine learning/deep learning and speech and multimodal signal processing, with applications to intelligent and interactive machines.

Following are the links to his Activities, Projects, Teaching, Publications, and some Source code & demo.

  _____________________________________________________________________________


PROFESSIONAL ACTIVITIES AND COLLABORATION

.    Member of the IEEE Signal Processing Society Machine Learning for Signal Processing Technical Committee (MLSP TC).

.    General Chair of 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP2018), September 17-20, 2018, Aalborg, Denmark. Welcome to Aalborg!

.    Invited talk at MIT, “Deep Learning for Speech and Multimodal Signal Processing”, Cambridge, USA, 2017.

.    Co-chair of Oral Session “Far-field Speech Recognition” at Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.

.    Talk, "Multimodal biometrics, anti-spoofing and de-identification", Training School 2017 - De-identification for privacy protection in multimedia content, Gran Canaria, Spain, Feb 13-16, 2017.

.    Technical co-chair for The IEEE Spoken Language Technology Workshop 2016 (SLT 2016), San Diego, USA, Dec 13-16, 2016.

.    Chair of the International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark. SPLINE2016 IEEE Xplore Proceedings.

.    Invited talks at Tsinghua University, Beijing University of Posts and Telecommunications, Beijing Institute of Technology, and Technical University of Denmark, 2016.

.    Area chair in Speech and Language Processing, The 23rd European Signal Processing Conference (EUSIPCO 2015), Nice, France.

.    Chair of Oral Sessions “Face Recognition I” and "Face Recognition II" at IEEE ICIP 2015, 27-30 September 2015, Quebec City, Canada.

.   Editorial Board Member/Associate Editor, Computer Speech and Language (CSL) (since 2009), Digital Signal Processing (DSP), Computers and Electrical Engineering (CAEE), International Journal of Data Mining, Modelling and Management (IJDMMM).

.   Guest Editor, Machine Learning for Big Data Processing in Mobile Internet, Special Issue of Springer Wireless Personal Communications.

.   Guest Editor, Machine Learning for Non-Gaussian Data Processing, Special Issue of Elsevier Neurocomputing (Impact Factor: 2.083).

.    Lead guest editor, Speech Processing for Natural Interaction with Intelligent Environments, Special Issue of IEEE Journal of Selected Topics in Signal Processing (J-STSP, Impact Factor: 3.629). Call-for-Papers

.    Guest editor, New Trends in Signal Processing and Biomedical Engineering, Special Issue of Elsevier Computers and Electrical Engineering (CAEE).

.    Chair of the 3rd AAU Workshop on Robotics (AAUROB2014), Aalborg, Denmark.

.    Best Paper Award (with Jesper Jensen) at the 4th IEEE International Conference on Network Infrastructure and Digital Content (IEEE IC-NIDC2014), Beijing, China.

.    Area Chair in Multimedia Signal Processing, The 21st European Signal Processing Conference (EUSIPCO 2013), Marrakech, Morocco.

.    Invited talks at Hunan University and Hunan Normal University, “Multimodal Sensing and Machine Intelligence”, Changsha, China, 2013.

.    Invited talk at BBN Technologies, "Speech Denoising and Voice Activity Detection", Cambridge, USA, 2012.

.    Invited talk at MIT, "Variable Frame Rate Analysis and Denoising for Speech Recognition", Cambridge, USA, 2012.

.    Invited talk at BUPT National 111 Base, “Multimodal Sensing for Identification and Interaction in the IoT”, Beijing, China, 2012. News page.

.    B&O Innovation Camp Shanghai 2012 (November) and 2013 (July) at Shanghai Jiao Tong University. SJTU News 2012, News 2013. AAU Matchmaking News. Exhibition at B&O News. Camp homepage.

.    Invited talk at University of Eastern Finland, Speech and Audio Processing Seminar, Joensuu, Finland, 2011.

.    Program Co-Chair of The 3rd International Congress on Image and Signal Processing (CISP 2010), Yantai, China, 16-18 October 2010. 

.    Organising Committee Member and Area Chair in Multimedia Signal Processing, The 18th European Signal Processing Conference (EUSIPCO 2010), Aalborg, Denmark, Aug. 23 - 28, 2010.

.    Co-organiser, Special Session "Person Tracking for Assistive Working and Living Environment" at EUSIPCO 2010, Denmark. 

.    Tutorial, "Internet of Things: Opportunities and Challenges", at the 13th International Symposium on Wireless Personal Multimedia Communications (WPMC 2010), 11-14 October, 2010, Recife, Brazil.

.    Tutorial, "Speech Recognition on Mobile Devices: Distributed and Embedded Solutions" at Interspeech 2008, Brisbane, Australia, 22-26 Sept. 2008.

.    Co-chair, Special Session “Speech and Audio Processing in Intelligent Environments” at Interspeech 2007, Antwerp, Belgium.

.    Chair, Special Session on "Speech recognition in ubiquitous networking and context-aware computing" (Webpage, PDF) at Interspeech 2005, Lisbon, Portugal, Sept. 2005.

.    Organising Committee Member, ITRW and COST278 Final Workshop - Applied Spoken Language Interaction in Distributed Environments (ASIDE2005), Aalborg, Denmark, Nov. 2005.



Collaboration (set up ad hoc; in fact, everything comes down to good collaboration, including life)

.    Hosted a number of visitors (professors and PhD students including CSC), and visited a number of labs.

.    Conducted intensive, enjoyable (and productive) free-style joint research with scientists and engineers both in Denmark and worldwide.

 

PROJECTS (SELECTED, RECENT)

.    Centre for Acoustic Signal Processing Research (CASPR). 2016-2021. AAU news

.    Automated audiovisual inference of the intention of multiple users in the home. The Innovation Fund Denmark and Bang & Olufsen A/S. 2016-2019.

.    OCTAVE - Objective Control for TAlker VErification. Project funded by European Commission under the EU Horizon 2020 program. 2015-2017.

.    Speech Enhancement for Hearing Aid Applications using Machine Learning Techniques. Project funded by Oticon Foundation. 2015-2018.

.   Durable Interaction with Socially Intelligent Robots (iSocioBot, or SocioBot). Project funded by The Danish Council for Independent Research, Technology and Production Sciences. 2013-2017. News in Ingeniøren, NordJyske, Nibeavis. Article 1 and article 2 in BiTE (part of BT). Our robots appeared at the People’s Meeting (Folkemødet) in Bornholm, June 2015, the official opening of the Day of Research 2014 in Denmark, ‘Safe 7’ in Nibe 2014, and the Culture Night 2014 in Copenhagen at the Ministry of Higher Education and Science.

.    CoSound – A Cognitive Systems Approach to Enriched and Actionable Information from Audio Streams. Project funded by Danish Strategic Research Council. 2012-2016.

.    COST Action IC1206 – De-identification for privacy protection in multimedia content. Project funded by European Commission, EU website for this Action. 2013-2017.

.    A Robust Audio-based Hybrid Recommendations Framework for Interactive TV. Project funded by Bang & Olufsen A/S and The Danish Council for Technology and Innovation. 2012-2015.



TEACHING

PhD courses

.     Machine Learning Spring 2018, Spring 2017, Spring 2016, Spring 2015, Spring 2013, Spring 2011, Fall 2009, Fall 2007. PhD moodle page.

.     Deep Learning Spring 2018, Spring 2017, Fall 2015 (with Dong Yu, Microsoft Research). PhD moodle page.

.     Signal Processing for Hearing Assistive Devices (PhD course and Winter School) Fall 2017.

.     Advanced Technologies for Green Wireless Communication Networks (data fusion) Spring 2015.

.     Energy Efficient Technologies for Green Wireless Sensor Networks (energy-efficient data fusion) Spring 2014.

.     Sensors and RFID Networks Fall 2012.

.     Internet of Things Fall 2009.

.     Advanced Speech Processing Fall 2015, Spring 2007.


Master/undergraduate courses

.     Machine Learning (MSc course) Fall 2018, Fall 2017, Fall 2016, Fall 2015, Fall 2014, Fall 2013, Fall 2012, Fall 2011. All courses: moodle.aau.dk, sict.moodle; calendar: sict/calmoodle

.     Platforms and Methods for Multi-Modal System Architectures (MSc course) Fall 2016, Fall 2015, Fall 2014, Fall 2012.

.     B&O Innovation Camp, 2017, 2016, 2015, 2013, 2012.

.     Research in Vision, Graphics and Interactive Systems, 2017.

.     Extraction and Representation of Features (6th semester course) Spring 2016, Spring 2015, Spring 2014, Spring 2013, Spring 2011.

.     Programming Social Robots for Human Interaction (ICT Summer School) Summer 2015.

.     Multimedia Information and Signal Processing (ICT Summer School), 2013, 2012, 2011, 2010.

.     Multi-Modal User Interaction (Master course) Fall 2010, MMUI Fall 2008

.     Digital Signal Processing (Master Course) Fall 2010, Fall 2009, Fall 2008, Fall 2007, Fall 2006

.     Readings in Vision, Graphics and Interactive Systems (Master course) Fall 2010, Fall 2009, Fall 2008

.    Internet of Things (Life-long Learning), 2010

.    Instrumentation and Data Acquisition (4th semester Course) Spring 2010, Spring 2009

.     Multi-Modal Interaction Design and Perception (Master course) Fall 2009  

.     Readings in Advanced Intellimedia (Master Course), Fall 2007

.     Speech Processing (Master Course) Spring 2006

.     Speech Communication (Master Course) Spring 2006, Spring 2005.

.     Signals and Systems 信号与系统 @ Shanghai Jiao Tong University, 1996, 1997, 1998, 1999.

.     Introduction to Artificial Intelligence (AI) 人工智能导论 @ Shanghai Jiao Tong University, 1999.


Project supervision

.     About 100 Postdoc, PhD, Master and Bachelor student projects, of which 15 are PhD projects. (Info about student projects and exams. Curricula, studieordninger in Danish)

 

RECENT PUBLICATIONS:

Copyright Notice: The copyright of each paper belongs to the respective publisher. Electronic copy is provided for personal research and reference only. Here is a list of journals and conferences with submission deadlines.  

 


Journal papers:

  1. Sven Ewan Shepstone, Zheng-Hua Tan and Miklas Strøm Kristoffersen, “Using Closed-set Speaker Identification Score Confidence to Enhance Audio-based Collaborative Filtering for Multiple Users,” accepted by IEEE Transactions on Consumer Electronics, 2018.
  2. Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Rainer Martin, and Jun Guo, "Spoofing Detection in Automatic Speaker Verification Systems Using DNN Classifiers and Dynamic Acoustic Features," accepted by IEEE Transactions on Neural Networks and Learning Systems, 2017.
  3. Renhua Peng, Zheng-Hua Tan, Xiaodong Li, and Chengshi Zheng. "A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments," accepted by Speech Communication, 2017. (Elsevier).
  4. Md Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa Gonzalez Hautamaki, Tomi Kinnunen, Zheng-Hua Tan, Robert Parts, Martti Pitkanen, “Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones,” accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing.
  5. Zheng-Hua Tan, Nicolai Bæk Thomsen, Xiaodong Duan, Evgenios Vlachos, Sven Ewan Shepstone, Morten H. Rasmussen and Jesper Lisby Højvang, "iSocioBot - A Multimodal Interactive Social Robot," accepted by International Journal of Social Robotics. (Springer). PDF from Springer Nature Sharing.
  6. Jen-Tzung Chien, Chao-Hsi Lee and Zheng-Hua Tan, "Latent Dirichlet Mixture Model," accepted by Neurocomputing, 2017.
  7. Morten Kolbæk, Dong Yu, Zheng-Hua Tan and Jesper Jensen, "Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks", IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 25, no. 10, October 2017, pp. 1901-1913. PDF from IEEEXplore.
  8. Achintya Sarkar and Zheng-Hua Tan, "Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker Verification," Computer Speech & Language, vol. 47, January 2018, pp. 259-271. PDF from Elsevier.
  9. Stefanos Astaras, Aristodemos Pnevmatikakis and Zheng-Hua Tan, "Visual Detection of Events of Interest from Urban Activity," Wireless Personal Communications, vol. 97, no. 2, November 2017, pp. 1877–1888.
  10. Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang, and Jun Guo, "Decorrelation of Neutral Vector: Theory and Applications," accepted by IEEE Transactions on Neural Networks and Learning Systems. PDF from IEEEXplore.
  11. Swati Prasad, Zheng-Hua Tan and Ramjee Prasad, "Frame Selection for Robust Speaker Identification: A Hybrid Approach," Wireless Personal Communications, 2017. (Springer). PDF from Springer.
  12. Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt Jensen, “Audio-based Granularity-adapted Emotion Classification,” accepted by IEEE Transactions on Affective Computing. PDF from IEEEXplore.
  13. Hong Yu, Zheng-Hua Tan, Yiming Zhang, Zhanyu Ma, and Jun Guo, “DNN Filter Bank Cepstral Coefficients for Spoofing Detection,” accepted by IEEE Access. PDF from IEEEXplore.
  14. Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, and Jesper Jensen, “Informed Sound Source Localization using Relative Transfer Functions for Hearing Aid Applications,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 25, no. 3, pp. 611-623, March 2017. PDF from IEEEXplore.
  15. Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen, "Speech Intelligibility Potential of General and Specialized Deep Neural Network based Speech Enhancement Systems," IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 25, no. 1, pp. 153-167, January 2017. PDF from IEEEXplore.
  16. Zhanyu Ma, Hong Yu, Zheng-Hua Tan, and Jun Guo, “Text-Independent Speaker Identification Using the Histogram Transform Model”, IEEE ACCESS, vol. 4, pp. 9733 - 9739, January 2017. PDF from IEEEXplore.
  17. Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and Jesper Jensen, “Predicting the Intelligibility of Noisy and Non-Linearly Processed Binaural Speech,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 24, no. 11, November 2016. PDF from IEEEXplore.
  18. Elizabeth Ann Jochum, Evgenios Vlachos, Sally Grindsted Nielsen, Anja Christoffersen, Ibrahim Hameed and Zheng-Hua Tan, "Using Theatre to Study Interaction with Care Robots," International Journal of Social Robotics, vol. 8, no. 4, pp. 457-470, August 2016. (Springer). PDF from Springer.
  19. Swati Prasad, Zheng-Hua Tan and Ramjee Prasad, “Multiple Frame Rates for Feature Extraction and Reliable Frame Selection at the Decision for Speaker Identification Under Voice Disguise,” Conasense, vol. 1,  no. 1, pp. 29-44, January 2016.
  20. Sven Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan and Søren Holdt Jensen, “Total Variability Modeling using Source-specific Priors,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 24, no. 3, pp. 504-517, March 2016.  PDF from IEEEXplore.
  21. Zhanyu Ma, Zheng-Hua Tan, and Jun Guo, “Feature Selection for Neutral Vector in EEG Signal Classification”, Neurocomputing, vol. 174, pp. 937-945, January 2016. PDF from Elsevier.
  22. Nikolaos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan and Ramjee Prasad, “Improved Gaussian Mixture Models for Adaptive Foreground Segmentation,” Wireless Personal Communications, vol. 87, no. 3, pp. 629-643, April 2016. (Springer).  PDF from Springer.
  23. Ioannis T. Christou, Emmanouil Amolochitis and Zheng-Hua Tan, "AMORE: Design & Implementation of a Commercial-Strength Parallel Hybrid Movie Recommendation Engine", Knowledge and Information Systems, vol. 47, no. 3, pp. 671-696, June 2016. PDF from Springer. PDF © Springer
  24. Yonggang Qi, Jun Guo, Yi-Zhe Song, Tao Xiang, Honggang Zhang and Zheng-Hua Tan, “Im2Sketch: Sketch Generation by Unconflicted Perceptual Grouping,” Neurocomputing, vol. 165, pp. 338-349, 2015. PDF from Elsevier.
  25. Konstantinos Kouzelis, Zheng-Hua Tan, Birgitte Bak-Jensen, Jayakrishnan R. Pillai and Ewen Ritchie, “Estimation of Residential Heat Pump Consumption for Flexibility Market Applications,” IEEE Transactions on Smart Grid, vol. 6, no. 4, pp. 1852-1864, July 2015. PDF from IEEEXplore
  26. Jesper Jensen and Zheng-Hua Tan, “Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 23, no. 1, pp. 186-197, Jan 2015. PDF from IEEEXplore. PDF © IEEE
  27. Zheng-Hua Tan and Ivan Kraljevski, "Joint variable frame rate and length analysis for speech recognition under adverse conditions," Computers and Electrical Engineering, vol. 40, no. 7, pp. 2139-2149, Oct 2014. PDF from Elsevier. PDF © Elsevier
  28. Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt Jensen, "Using Audio-derived Affective Offset to Enhance TV Recommendation," IEEE Transactions on Multimedia, vol. 16, no. 7, pp. 1999-2010, Nov 2014. PDF from IEEEXplore. PDF © IEEE
  29. Nikolaos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan and Ramjee Prasad, "Combination of Multiple Measurement Cues for Visual Face Tracking," Wireless Personal Communications, vol. 78, no. 3, pp. 1789-1810, July 2014. PDF from Springer.
  30. Emanouil Amolochitis, Ioannis T. Christou and Zheng-Hua Tan, "Implementing a Commercial-Strength Parallel Hybrid Movie Recommendation Engine," IEEE Intelligent Systems (AI Innovation and Industry track), vol. 29, no. 2, pp. 92-96, Mar-Apr 2014. PDF from IEEEXplore.
  31. Zhanyu Ma, Arne Leijon, Zheng-Hua Tan and Sheng Gao, “Predictive Distribution of the Dirichlet Mixture Model by the Local Variational Inference Method,” The Journal of Signal Processing Systems, vol. 74, no. 3, pp. 359-374, March 2014. PDF from Springer.

  32. Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt Jensen, "Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content," IEEE Transactions on Consumer Electronics, vol. 59, no. 3, pp. 721-729, August 2013. PDF from IEEEXplore. PDF © IEEE
  33. Emanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan and Ramjee Prasad, “A Heuristic Hierarchical Scheme for Academic Search and Retrieval,” Information Processing and Management, vol. 49, no. 6, pp. 1326–1343, November 2013. PDF from Elsevier.
  34. Theodoros Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan and Ramjee Prasad, "Exploring super-gaussianity towards robust information-theoretical time delay estimation," The Journal of the Acoustical Society of America (JASA), vol. 133, no. 3, pp. 1515-1524, 2013. PDF from JASA.

  35. Pejman Mowlaee, Rahim Saeidi, Mads Græsbøll Christensen, Zheng-Hua Tan, Tomi Kinnunen, Pasi Franti, and Søren Holdt Jensen, “A Joint Approach for Single-Channel Speaker Identification and Speech Separation,” IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 9, pp. 2586-2601, Nov. 2012. PDF from IEEEXplore.

  36. Hongbing Cheng, Chunming Rong, Zhenghua Tan and Qingkai Zeng, "Identity based Encryption and Biometric Authentication Scheme for Secure Data Access in Cloud Computing," Chinese Journal of Electronics (English edition), vol. 21, no. 2, pp. 254-259, April 2012. PDF from CJE.

  37. Theodoros Petsatodis, Christos Boukis, Fotios Talantzis, Zheng-Hua Tan and Ramjee Prasad, “Convex Combination of Multiple Statistical Models with Application to VAD,” IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 8, pp. 2314 - 2327, November 2011. PDF from IEEEXplore.

  38. Hristijan Petreski, Sofia Tsekeridou, Eri Giannaka, Neeli Prasad, Ramjee Prasad and Zheng-Hua Tan, "Technology-enabled social learning: a review," International Journal of Knowledge and Learning, vol. 7, nos. 3/4, pp. 253-270, 2011.

  39. Zheng-Hua Tan and Børge Lindberg, “Low-Complexity Variable Frame Rate Analysis for Speech Recognition and Voice Activity Detection,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 798 – 807, October 2010. PDF from IEEEXplore. PDF © IEEE. Source code is available here.

  40. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Robust Speech Recognition by Non-Local Means De-Noising Processing,” IEEE Signal Processing Letters, 2008. PDF from IEEEXplore. 

  41. Haitian Xu, Paul Dalsgaard, Zheng-Hua Tan and Børge Lindberg, “Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation,” IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2431 – 2443, Nov. 2007. PDF from IEEEXplore.

  42. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "Exploiting Temporal Correlation of Speech for Error-Robust and Bandwidth-Flexible Distributed Speech Recognition," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 4, pp. 1391-1403, May 2007. PDF from IEEEXplore. Draft PDF © IEEE

  43. Zheng-Hua Tan, "Fuzzy Metagraphs and Its Combination with Indexing Approach in Rule-Based Systems," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 6, pp. 829-841, June 2006. PDF from IEEEXplore. PDF © IEEE

  44. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Exploitation of spectral variance to improve robustness in speech recognition,” IEE Electronics Letters, vol. 42, no. 5, pp. 312-314, March 2006. PDF © IEE

  45. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "Automatic Speech Recognition over Error-Prone Wireless Networks," Speech Communication, vol. 47, no. 1-2, pp. 220-242, Sep.- Oct. 2005. PDF from Sciencedirect. Draft PDF © Elsevier

  46. Zheng-Hua Tan, "Hybrid evolutionary approach for designing neural networks for classification," IEE Electronics Letters, vol. 40, no. 15, pp. 955-957, July 2004. PDF from IEEEXplore © IEE. Draft PDF.

  47. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "Partial Splicing Packet Loss Concealment for Distributed Speech Recognition," IEE Electronics Letters, vol. 39, no. 22, pp. 1619-1620, October 2003. PDF © IEE

Journal special issues:

  1. Zhanyu Ma, Jen-Tzung Chien, Zheng-Hua Tan, Yi-Zhe Song, Jalil Taghia and Ming Xiao, "Recent Advances in Machine Learning for Non-Gaussian Data Processing," Neurocomputing, 2017.
  2. Weichuan Yu, Zheng-Hua Tan and Yi Wang, “Guest Editors’ Introduction to the Special Issue on New Trends in Signal Processing and Biomedical Engineering,” Elsevier Computers and Electrical Engineering, vol. 38, no. 1, pp. 1-81, January 2012.
  3. Zheng-Hua Tan, Reinhold Haeb-Umbach, Sadaoki Furui, James R. Glass and Maurizio Omologo, “Introduction to the Issue on Speech Processing for Natural Interaction with Intelligent Environments,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 769 – 910, October 2010. PDF from IEEEXplore.

  Books and book chapters:

  1.  Zheng-Hua Tan, Najim Dehak, Jan Larsen and Zhanyu Ma (eds.), Proceedings of the First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE 2016), IEEE Press, 2016.
  2.  Zheng-Hua Tan, Shaoping Bai, Thomas Bak, Matthias Rehm and Elizabeth Ann Jochum (eds.), Proceedings of the 3rd AAU Workshop on Robotics, AAU Press, 2015. 
  3. Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen and Søren Holdt Jensen, "Utilising Tree-Based Ensemble Learning For Speaker Segmentation," Full paper published in Springer LNCS: Proceedings of the 10th International Conference on Artificial Intelligence Applications and Innovations (AIAI 2014), Island of Rhodes, Greece, September 19-21, 2014.
  4. Nicolai B. Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen, “Improving Robustness against Environmental Sounds for Directing Attention of Social Robots,” Springer LNAI, vol. 8757: Proceedings of the 2nd Workshop on Multimodal Analyses Enabling Artificial Agents in Human-Machine Interaction, September 14, 2014, Singapore. PDF 
  5.  Zheng-Hua Tan and Børge Lindberg, "Speech Recognition on Mobile Devices," X. Jiang, M. Ma and C. Chen (eds.), Mobile Multimedia Processing: Fundamentals, Methods, and Applications, Springer LNCS, vol. 5960, 2010.
  6.  Zheng-Hua Tan, Yi Wan, Tao Xiang, and Yibin Song (eds.), Proceedings of the 3rd International Congress on Image and Signal Processing (CISP 2010), IEEE Press, Yantai, China, October 2010. (ISBN: 978-1-4244-6515-6)
  7.  Zheng-Hua Tan and Børge Lindberg (eds.), Automatic speech recognition on mobile devices and over communication networks, Springer-Verlag, London, Feb. 2008, Approx. 415 p. 115 illus., Hardcover, ISBN: 978-1-84800-142-8. 
  8.  Zheng-Hua Tan, “Audio and speech processing for data mining”, John Wang (Ed.), Encyclopedia of Data Warehousing and Mining - 2nd Edition, IGI Global, 2008, pp. 98-103. googlebooks  
  9. Zheng-Hua Tan and Imre Varga, “Networked, distributed and embedded speech recognition: an overview”, Z.-H. Tan, and B. Lindberg (eds.), Automatic speech recognition on mobile devices and over communication networks, Springer-Verlag, London, Feb. 2008, pp. 1-23.
  10. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Ralf Mattethat and Børge Lindberg, “A Configurable Distributed Speech Recognition System”, H. Abut, J.H.L. Hansen, K. Takeda (Editors), Digital Signal Processing for In-Vehicle and Mobile Systems 2, Springer Science, New York, NY, 2006.
  11. Paul Dalsgaard, Borge Lindberg, Henrik Benner and Zheng-Hua Tan,  Book Abstract of Eurospeech'01 Proceedings, Kommunik Grafiske Løsninger, September 2001.
  12. Guangrui Hu, Changqing Xu, Zheng-Hua Tan and Xinbao Gong, Problem Handbook of Signals and Systems, Science Press of China, 1999.

Conference papers:

  1. Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen, “Monaural Speech Enhancement Using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure,” The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), 15-20 April 2018, Calgary, Alberta, Canada.
  2.  Achintya Kr. Sarkar and Zheng-Hua Tan, “Time-Contrastive Learning Based DNN Bottleneck Features for Text-Dependent Speaker Verification,” NIPS 2017 Time Series Workshop, Long Beach, CA, USA, Dec. 8, 2017.
  3. Xiaodong Duan, Nicolai B. Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren H. Jensen, “Weighted Score Based Fast Converging CO-training with Application to Audio-Visual Person Identification,” The 29th IEEE International Conference on Tools with Artificial Intelligence (ICTAI2017), Boston, Massachusetts, USA, Nov. 6-8, 2017.
  4. Morten Kolbæk, Dong Yu, Zheng-Hua Tan and Jesper Jensen, "Joint Separation and Denoising of Noisy Multi-Talker Speech Using Recurrent Neural Networks and Permutation Invariant Training," the IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP), Tokyo, Japan, 25-28 September 2017. PDF. Best student paper award. AAU News
  5. Hong Yu, Zheng-Hua Tan, Zhanyu Ma and Jun Guo, "Adversarial Network Bottleneck Features for Noise Robust Speaker Verification," Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.
  6. Daniel Michelsanti and Zheng-Hua Tan, "Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification," Interspeech 2017, Stockholm, Sweden, 20-24 August 2017. PDF
  7. Achintya Sarkar, Md Sahidullah, Zheng-Hua Tan and Tomi Kinnunen, "Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data," Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.
  8. Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and Jesper Jensen, "On the use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure," Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.
  9. K. A. Lee, et al., "The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016," Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.
  10. Dong Yu, Morten Kolbæk, Zheng-Hua Tan, and Jesper Jensen, “Permutation Invariant Training of Deep Models for Speaker-independent Multi-talker Speech Separation,” The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, USA, 5-9 March 2017.
  11. Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, and Jesper Jensen, “A Non-intrusive Short-time Objective Intelligibility Measure,” The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, USA, 5-9 March 2017.
  12. Tomi Kinnunen, Md Sahidullah, Mauro Falcone, Luca Costantini, Rosa Gonzalez Hautamäki, Dennis Thomsen, Achintya Sarkar, Zheng-Hua Tan, Hector Delgado, Massimiliano Todisco, Nicholas Evans, Ville Hautamäki, and Kong Aik Lee, "RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research," The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, USA, 5-9 March 2017.
  13. Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen, "Speech Enhancement Using Long Short-Term Memory Based Recurrent Neural Networks for Noise Robust Speaker Verification," 2016 IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, California, USA, December 13-16, 2016.
  14. Héctor Delgado, Massimiliano Todisco, Md Sahidullah, Achintya K Sarkar, Nicholas Evans, Tomi Kinnunen and Zheng-Hua Tan, "Further Optimisations of Constant Q Cepstral Processing for Integrated Utterance and Text-Dependent Speaker Verification," 2016 IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, California, USA, December 13-16, 2016.
  15. Jen-Tzung Chien, Chao-Hsi Lee and Zheng-Hua Tan, “Dirichlet Mixture Allocation,” The 26th IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Salerno, Italy, 13-16 September 2016.
  16. Nicolai Thomsen, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen, "Speaker-dependent Dictionary-based Speech Enhancement for Text-Dependent Speaker Verification," Interspeech 2016, San Francisco, USA, 8 - 12 September 2016.
  17. Achintya Kumar Sarkar and Zheng-Hua Tan, "Text Dependent Speaker Verification Using Unsupervised HMM-UBM and Temporal GMM-UBM," Interspeech 2016, San Francisco, USA, 8 - 12 September 2016.
  18. Tomi Kinnunen, Md Sahidullah, Ivan Kukanov, Héctor Delgado, Massimiliano Todisco, Achintya Sarkar, Nicolai Thomsen, Ville Hautamaki, Nicholas Evans and Zheng-Hua Tan, "Utterance Verification for Text-Dependent Speaker Recognition: a Comparative Assessment Using the RedDots Corpus," Interspeech 2016, San Francisco, USA, 8 - 12 September 2016.
  19. Md Sahidullah, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamaki, Robert Parts and Martti Pitkanen, "Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech," Interspeech 2016, San Francisco, USA, 8 - 12 September 2016.
  20. Md Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu, Tomi Kinnunen, Nicholas Evans and Zheng-Hua Tan, "Integrated Spoofing Countermeasures and Automatic Speaker Verification: an Evaluation on ASVspoof 2015," Interspeech 2016, San Francisco, USA, 8 - 12 September 2016.
  21. Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Thomsen, Md Sahidullah and Zheng-Hua Tan, "HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors," Interspeech 2016, San Francisco, USA, 8 - 12 September 2016.
  22. Hengwei Lin, Josep M. Guerrero, Juan C. Vásquez, Zheng-Hua Tan, Chengxi Liu, and Chenxi Jia, "Adaptive Overcurrent Protection for Microgrids in Extensive Distribution Systems," the 42nd IEEE Industrial Electronics Conference (IEEE IECON2016), Florence, Italy, October 24-27, 2016.
  23. Nicolai B. Thomsen, Xiaodong Duan, Zheng-Hua Tan, Børge Lindberg, and Søren Holdt Jensen, “Improving the Convergence of CO-training for Audio-Visual Person Identification,” The International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
  24. Mohamed Abou-Zleikha, Mads Græsbøll Christensen, Zheng-Hua Tan, and Søren Holdt Jensen, “Projecting Emotional Speech into Arousal-valence Space Using Pairwise Preference Learning,” The International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
  25. Hong Yu, Achintya Sarkar, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Zhan-Yu Ma, and Jun Guo, “Investigating the Effect of Multi-conditional Training and Speech Enhancement Methods on Spoofing Detection,” The International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
  26. Stefanos Astaras, Aristodemos Pnevmatikakis and Zheng-Hua Tan, “Background Subtraction for Patterns of Activities in Cities,” The International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
  27. Mojtaba Farmani, Richard Heusdens, Michael Syskind Pedersen, Zheng-Hua Tan and Jesper Jensen, “Concurrent Localization of Sound Sources and Dual-Microphone Sub-Arrays Using TOFs,” The 19th International Conference on Information Fusion (FUSION 2016), Heidelberg, July 5-8, 2016.
  28. Zongji Sun, Li Meng, Aladdin Ariyaeeinia, Xiaodong Duan, and Zheng-Hua Tan, “Privacy Protection Performance of De-identified Face Images with and without Background,” The 39th International ICT Convention MIPRO 2016, May 30 - June 03, 2016, Opatija, Croatia.
  29. Ibrahim A. Hameed, Zheng-Hua Tan, Nicolai B. Thomsen and Xiaodong Duan, “User Acceptance of Social Robots,” The 9th International Conference on Advances in Computer-Human Interactions (ACHI 2016), Venice, Italy, April 24-28, 2016. Best Paper Award.
  30. Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and Jesper Jensen, “A Method for Predicting the Intelligibility of Noisy and Non-linearly Enhanced Binaural Speech,” The 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, 20-25 March 2016.
  31. Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and Jesper Jensen, “Informed Direction of Arrival Estimation Using a Spherical-head Model for Hearing Aid Applications,” The 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, 20-25 March 2016.
  32. Xiaodong Duan and Zheng-Hua Tan, "Neighbors Based Discriminative Feature Difference Learning for Kinship Verification,” The 11th International Symposium on Visual Computing, December 14-16, 2015, Las Vegas, Nevada, USA.
  33. Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and Jesper Jensen, “Informed TDoA-based Direction of Arrival Estimation for Hearing Aid Applications,” The 3rd IEEE Global Conference on Signal and Information Processing (GlobalSIP 2015), Orlando, Florida, USA, December 14-16, 2015.
  34. Sally Grindsted Nielsen, Anja Christoffersen, Elizabeth Jochum and Zheng-Hua Tan, "Robot Future: Using Theatre to Influence Acceptance of Care Robots," The New Friend 2015 Conference, Almere, The Netherlands, October 22-23, 2015. Best Paper Award Runner-up.
  35. Nicolai Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen, “A Heuristic Approach for a Social Robot to Navigate to a Person Based on Audio and Range Information,” 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, September 28 - October 02, 2015.
  36. Ivan Kraljevski, Zheng-Hua Tan and Maria Paola Bissiri, “Comparison of Forced-Alignment Speech Recognition and Humans for Generating Reference VAD,” Interspeech 2015, Dresden, Germany, September 6-10, 2015.
  37. Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and Jesper Jensen, “A Binaural Short Time Objective Intelligibility Measure for Noisy and Enhanced Speech,” Interspeech 2015, Dresden, Germany, September 6-10, 2015.
  38. Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen and Søren Holdt Jensen, “Discriminative Approach for Voice Selection in Speaker De-identification System,” The 23rd European Signal Processing Conference (EUSIPCO 2015), Nice, France, August 31 – September 4, 2015.
  39.  Xiaodong Duan and Zheng-Hua Tan, "Local Feature Learning for Face Recognition under Varying Poses," IEEE International Conference on Image Processing (ICIP 2015), 27-30 September 2015, Quebec City, Canada.
  40. Xiaodong Duan and Zheng-Hua Tan, "A Feature Subtraction Method for Image Based Kinship Verification under Uncontrolled Environments," IEEE International Conference on Image Processing (ICIP 2015), 27-30 September 2015, Quebec City, Canada.
  41. Clara Schaarup, Gunnar Hartvigsen, Lars Bo Larsen, Zheng-Hua Tan, Eirik Årsand, and Ole Hejlesen, “Assessing the potential use of eye-tracking triangulation for evaluating the usability of an online diabetes exercise system,” The 15th World Congress on Health and Biomedical Informatics (MEDINFO 2015: eHealth-enabled Health), pp. 84-88, August 19-23, 2015, Sao Paulo, Brazil.
  42.  Rasmus Lyngby Kristensen, Zheng-Hua Tan, Zhanyu Ma and Jun Guo, "Binary Pattern Flavored Feature Extractors for Facial Expression Recognition: An Overview," CIS-MIPRO 2015, 25-29 May 2015, Opatija, Croatia. PDF
  43. Sven Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan and Søren Holdt Jensen, “Source-Specific Informative Prior for I-Vector Extraction,” The 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015), April 19 – 24, 2015, Brisbane, Australia. The Ganesh N. Ramaswamy Memorial Student Grant and Award.
  44. Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and Jesper Jensen, “Maximum Likelihood Approach to "Informed" Sound Source Localization for Hearing Aid Applications,” The 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015), April 19 – 24, 2015, Brisbane, Australia.
  45. Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and Jesper Jensen, “On the Influence of Microphone Array Geometry on HRTF-Based Sound Source Localization,” The 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015), April 19 – 24, 2015, Brisbane, Australia.
  46. Zheng-Hua Tan, Nicolai Bæk Thomsen and Xiaodong Duan, "Designing and Implementing an Interactive Social Robot from Off-the-shelf Components," The 3rd IFToMM Symposium on Mechanism Design for Robotics (MEDER2015), June 2-4, 2015, Aalborg, Denmark. PDF
  47. Jesper Jensen and Zheng-Hua Tan, “A Theoretically Consistent Method for Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features,” The 4th IEEE International Conference on Network Infrastructure and Digital Content (IEEE IC-NIDC2014), Beijing, China, September 19-21, 2014. Best Paper Award.
  48. Yonggang Qi, Honggang Zhang, Yi-Zhe Song and Zheng-Hua Tan, "A Patch-based Sparse Representation for Sketch Recognition," The 4th IEEE International Conference on Network Infrastructure and Digital Content (IEEE IC-NIDC2014), Beijing, China, September 19-21, 2014.
  49. Ivan Kraljevski and Zheng-Hua Tan, “Variable Frame Rate and Length Analysis for Data Compression in Distributed Speech Recognition,” The 4th IEEE International Conference on Network Infrastructure and Digital Content (IEEE IC-NIDC2014), Beijing, China, September 19-21, 2014.
  50. Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen, “Learning Direction of Attention for a Social Robot in Noisy Environments,” The 3rd AAU Workshop on Robotics (AAUROB2014), Aalborg, Denmark.
  51. Mohamed Abou-Zleikha, Zheng-Hua Tan, Søren Holdt Jensen and Mads Græsbøll Christensen, “Cluster-Based Adaptation Using Density Forest for HMM Phone Recognition,” EUSIPCO 2014 - the 22nd European Signal Processing Conference, September 1-5, 2014, Lisbon, Portugal.
  52. Hristijan Petreski, Sofia Tsekeridou, Neeli R. Prasad and Zheng-Hua Tan, “Methodology for Dynamic Learning Resources Discovery and Retrieval from Social Media,” full paper at EDULEARN 2014 – the 6th annual International Conference on Education and New Learning Technologies, Barcelona, July 7-9, 2014.
  53. Mohamed Abou-Zleikha, Zheng-Hua Tan, Søren Holdt Jensen and Mads Græsbøll Christensen, “Non-linguistic Vocal Events Detection and Localisation Using Online Random Forest,” MIPRO 2014 – the 37th International Convention, Special Session on BiForD – Biometrics & Forensics & De-identification and Privacy Protection, 26-30 May 2014, Opatija, Croatia.
  54. Yonggang Qi, Jun Guo, Yi Li, Honggang Zhang, Tao Xiang, Yi-Zhe Song and Zheng-Hua Tan, “Perceptual Grouping via Untangling Gestalt Principles,” The 2013 IEEE Visual Communications and Image Processing conference (VCIP), Kuching, Sarawak, Malaysia, November 17-20, 2013.
  55. Swati Prasad, Zheng-Hua Tan and Ramjee Prasad, “Multistyle Training and Fusion for Speaker Identification of Disguised Voice,” The First International Conference on Communications, Connectivity, Convergence, Content and Cooperation (IC5), Mumbai, Maharashtra, India, December 16-19, 2013.
  56. Hristijan Petreski, Sofia Tsekeridou, Neeli R. Prasad and Zheng-Hua Tan, "Multi-dimensional technology-enabled social learning approach,” The 7th International Conference on Open and Distance Learning (ICODL 2013), Athens, Greece, November 8-10, 2013.

  57. Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt Jensen, “Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition,” Interspeech 2013, Lyon, France, August 25-29, 2013.   PDF

  58. Morten Højfeldt Rasmussen and Zheng-Hua Tan, “Fusing Eye-gaze and Speech Recognition for Tracking in an Automatic Reading Tutor – A Step in the Right Direction?” Speech and Language Technology for Education (SLaTE 2013), Grenoble, France - August 30-31 & September 1st, 2013. PDF

  59. O. Plchot, S. Matsoukas, P. Matejka, N. Dehak, J. Ma, S. Cumani, O. Glembek, H. Hermansky, S.H. Mallidi, N. Mesgarani, R. Schwartz, M. Soufifar, Z.-H. Tan, S. Thomas, B. Zhang and X. Zhou, “Developing a Speaker Identification System for the DARPA RATS project,” ICASSP 2013 - the 38th International Conference on Acoustics, Speech, and Signal Processing, Vancouver, Canada, May 26 - 31, 2013. PDF

  60. Swati Prasad, Zheng-Hua Tan and Ramjee Prasad, “Multi-Frame Rate Based Multiple-Model Training for Robust Speaker Identification of Disguised Voice,” The 16th International Symposium on Wireless Personal Multimedia Communications (WPMC 2013), Atlantic City, New Jersey, USA, June 24-27, 2013.

  61. Zhanyu Ma, Zheng-Hua Tan and Swati Prasad, "EEG Signal Classification With Super-Dirichlet Mixture Model," IEEE Statistical Signal Processing Workshop, Ann Arbor, USA, Aug 5-8, 2012.

  62. Emanouil Amolochitis, Ioannis T. Christou and Zheng-Hua Tan, "PUBSEARCH: A Hierarchical Heuristic Scheme for Ranking Academic Search Results," International Conference on Pattern Recognition Applications and Methods (ICPRAM 2012), Vilamoura, Algarve, Portugal, 6-8 February, 2012.

  63. Menelaos Bakopoulos, Sofia Tsekeridou, Eri Giannaka, Zheng-Hua Tan and Ramjee Prasad, “Mobile Video Annotation For Enhanced Rich Media Communication During Emergency Handling,” The 4th International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2011), 26-29 October, 2011, Barcelona, Spain.

  64. Swati Prasad, Zheng-Hua Tan, Ramjee Prasad, Alvaro Fuentes Cabrera, Ying Gu and Kim Dremstrup, “Feature Selection Strategy for Classification of Single-Trial EEG Elicited by Motor Imagery,” The 14th International Symposium on Wireless Personal Multimedia Communications (WPMC 2011), Brest, France, 3-7 October 2011.

  65. P. Mowlaee, R. Saeidi, Z.-H. Tan, M. G. Christensen, T. Kinnunen, P. Fränti, and S. H. Jensen, "Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge," Interspeech 2011, Florence, Italy, 27-31 August 2011.

  66. Theodoros Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan and Ramjee Prasad, "Multi-Sensor Voice Activity Detection based on Multiple Observation Hypothesis Testing," Interspeech 2011, Florence, Italy, 27-31 August 2011.

  67. Morten Højfeldt Rasmussen, Jack Mostow, Zheng-Hua Tan, Børge Lindberg and Yuanpeng Li, "Evaluating Tracking Accuracy of an Automatic Reading Tutor," Speech and Language Technology for Education (SLaTE 2011), Venice, Italy, 24 - 26 August 2011. PDF

  68. Morten Højfeldt Rasmussen, Børge Lindberg and Zheng-Hua Tan, "Combining Acoustic and Language Model Miscue Detection Methods for Dyslexic Read Speech," Speech and Language Technology for Education (SLaTE 2011), Venice, Italy, 24 - 26 August 2011. PDF

  69. Menelaos Bakopoulos, Sofia Tsekeridou, Eri Giannaka, Zheng-Hua Tan, and Ramjee Prasad, "Command & Control: Information Merging, Selective Visualization and Decision Support for Emergency Handling," The 8th International Conference on Information Systems for Crisis Response and Management, Lisbon, Portugal, May 8-11, 2011.

  70. Pejman Mowlaee, Mads Græsbøll Christensen, Zheng-Hua Tan, Søren Holdt Jensen, "A MAP Criterion for Detecting the Number of Speakers at Frame Level in Model-based Single-Channel Speech Separation," The 44th Annual Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, California, USA, November 2010.

  71. Zheng-Hua Tan, "Machine Perception for Identification and Interaction in the Internet of Things," invited paper at The 13th International Symposium on Wireless Personal Multimedia Communications (WPMC 2010), October, 2010, Recife, Brazil.   PDF

  72. R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M. G. Christensen, S. H. Jensen, and P. Fränti, “Improving Monaural Speaker Identification by Double-Talk Detection,” Interspeech 2010, Makuhari, Japan, 26-30 Sep. 2010.

  73. Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti, "Signal-to-Signal Ratio Independent Speaker Identification for Co-Channel Speech Signals," The 20th International Conference on Pattern Recognition (ICPR), Istanbul, Turkey, August 2010.

  74. M. Andersen, R. S. Andersen, N. Katsarakis, A. Pnevmatikakis and Z.-H. Tan, "Three-Dimensional Adaptive Sensing of People in a Multi-Camera Setup," invited paper at EUSIPCO 2010 – the 18th European Signal Processing Conference, Aalborg, Denmark, August 2010. Video demo.

  75. Francesco Santoro, Sergio Pedro, Zheng-Hua Tan and Thomas B. Moeslund, "Crowd Analysis by Using Optical Flow and Density Based Clustering," EUSIPCO 2010 – the 18th European Signal Processing Conference, Aalborg, Denmark, August 2010. Video demo.

  76. Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Pasi Fränti, Søren Holdt Jensen, "Joint Single-Channel Speech Separation and Speaker Identification," ICASSP 2010 - the 35th International Conference on Acoustics, Speech, and Signal Processing, Dallas, Texas, USA, March 2010.

  77. Zheng-Hua Tan and Børge Lindberg, “High-Accuracy, Low-Complexity Voice Activity Detection Based on A Posteriori SNR Weighted Energy,” Interspeech 2009, Brighton, U.K., September 2009.

  78. Morten Højfeldt Rasmussen, Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen, "A System for Detecting Miscues in Dyslexic Read Speech,” Interspeech 2009, Brighton, U.K., September 2009.

  79. Zheng-Hua Tan and Borge Lindberg, “A Posteriori SNR Weighted Energy Based Variable Frame Rate Analysis for Speech Recognition,” Interspeech 2008, Brisbane, Australia, September 2008.  PDF

  80. Zheng-Hua Tan and Borge Lindberg, “An Efficient Frame Selection Approach to Variable Frame Rate Analysis for Noise Robust Speech Recognition,” Acoustics 2008 (the 155th ASA meeting), Paris, France, June 2008.

  81. Zheng-Hua Tan and Borge Lindberg, "A Variable Frame Rate Method for Distributed Speech Recognition over Wireless Networks,” The 10th International Symposium on Wireless Personal Multimedia Communications, Jaipur, India, December 2007. 

  82. Zheng-Hua Tan, “Variable Frame Rate Analysis for Automatic Speech Recognition,” SPIE Multimedia Systems and Applications X, Boston, MA, USA, September 2007. 

  84. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "Robust Speech Recognition over Mobile Networks Using Combined Weighted Viterbi Decoding and Subvector Based Error Concealment," Interspeech 2006, Pittsburgh PA, USA, September 2006. PDF

  85. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Robust Speech Recognition from Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework,” ICASSP 2006 - the 31st International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France, May 2006.  PDF © IEEE

  86. Tom Brøndsted, Lars Bo Larsen, Børge Lindberg, Morten Rasmussen, Zheng-Hua Tan, Haitian Xu, “Distributed Speech Recognition for Information Retrieval on Mobile Devices,” Workshop on Speech in Mobile and Pervasive Environments, Espoo, Finland, September 2006.

  87. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "Adaptive Multi-Frame-Rate Scheme for Distributed Speech Recognition Based on a Half Frame-Rate Front-End”, IEEE MMSP 2005 – the 7th international workshop on multimedia signal processing, Shanghai, China, November 2005. PDF

  88. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Combined Spectral Subtraction and Cepstral Normalisation for Robust Speech Recognition”, ASIDE 2005 - COST278 and ISCA Tutorial and Research Workshop (ITRW) on Applied Spoken Language Interaction in Distributed Environments, Aalborg, Denmark, November 2005. 

  89. Tom Brøndsted, Henrik L. Larsen, Lars B. Larsen, Børge Lindberg, Daniel Ortiz-Arroyo, Zheng-Hua Tan, Haitian Xu, “Mobile Information Access with Spoken Query Answering”, ASIDE 2005 - COST278 and ISCA Tutorial and Research Workshop (ITRW) on Applied Spoken Language Interaction in Distributed Environments, Aalborg, Denmark, November 2005. PDF

  90. Zheng-Hua Tan, Paul Dalsgaard, Borge Lindberg and Haitian Xu, “Robust Speech Recognition in Ubiquitous Networking and Context-Aware Computing”, Interspeech 2005, Lisbon, Portugal, September 2005. PDF

  91. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Robust Speech Recognition Based on Noise and SNR Classification - a Multiple-Model Framework”, Interspeech 2005, Lisbon, Portugal, September 2005.  PDF 

  92. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Ralf Mattethat and Børge Lindberg, “A Configurable Distributed Speech Recognition System”, Biennial on DSP for in-Vehicle and Mobile Systems, Sesimbra, Portugal, September 2005.  PDF

  93. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, “On the Integration of Speech Recognition into Personal Networks,” invited paper at ICSLP 2004 - the 8th International Conference on Spoken Language Processing, Jeju Island, Korea, October 2004. PDF

  94. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Spectral Subtraction with Full-Wave Rectification and Likelihood Controlled Instantaneous Noise Estimation for Robust Speech Recognition,” ICSLP 2004 - the 8th International Conference on Spoken Language Processing, Jeju Island, Korea, October 2004. PDF

  95. Zheng-Hua Tan, Borge Lindberg and Paul Dalsgaard, "A Comparative Study of Feature-Domain Error Concealment Techniques for Distributed Speech Recognition", Robust 2004 - COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction, Norwich, UK, August 2004.  PDF

  96. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "A Subvector-Based Error Concealment Algorithm for Speech Recognition over Mobile Networks," Proceedings ICASSP 2004 - the 29th International Conference on Acoustics, Speech, and Signal Processing, Montreal, Quebec, Canada, May 2004.  PDF

  97. Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "OOV-Detection And Channel Error Protection For Distributed Speech Recognition Over Wireless Networks," Proceedings ICASSP 2003 - the 28th International Conference on Acoustics, Speech, and Signal Processing, pp. I-336-339, Hong Kong, P. R. China, April 2003. PDF

  98. Zheng-Hua Tan and Paul Dalsgaard, "Channel Error Protection Scheme for Distributed Speech Recognition," Proceedings ICSLP 2002 - the 7th International Conference on Spoken Language Processing, pp. 2225-2228, Denver, USA, September 2002.  PDF

  99. Zheng-Hua Tan, Borge Lindberg and Paul Dalsgaard, "Experiments on A Channel Error Protection Scheme for Distributed Speech Recognition," Proceedings NORSIG 2002 – the 5th Nordic Signal Processing Symposium, Norway, October, 2002. PDF

  100. Xiaolin Ren, Guangrui Hu and Zhenghua Tan, "Controlling Chaos in a Chaotic Neuron," IEEE IECON'99, pp.652-655, San Jose, California, USA, 1999.

            Journal papers in Chinese:

  1. Zheng-Hua Tan et al., "Modified Miller-Matrix Encoding Method and Its Application in Evolutionary Artificial Neural Networks," Journal of Shanghai Jiao Tong University, 2001.

  2. Zheng-Hua Tan et al., "Study on An Evolutionary Artificial Neural Network," Nature Magazine, 2000.

  3. Zheng-Hua Tan et al., "Designing Artificial Neural Networks Through Evolutionary Programming," Computer Engineering and Applications, 1999, Vol. 35, No. 10.

  4. Zheng-Hua Tan et al., "Fuzzy Metagraphs and Its Feature Analysis," Computer Research & Development, 2000, Vol. 37, No. 3, pp. 272-277.

  5. Zheng-Hua Tan et al., "Fuzzy Metagraphs: A New Method of Constructing Fuzzy Knowledge Base," Control and Decision, 2000, Vol. 15, No. 4, pp. 406-410.

  6. Zheng-Hua Tan et al., "Fuzzy Metagraph and Its Applications in Aerocraft Fault Diagnosis," Journal of Shanghai Jiao Tong University, 1999, 33(9), pp.1103-1106.

  7. Zheng-Hua Tan et al., "Uncertain Knowledge Management in Expert Systems Using Fuzzy Metagraphs," Journal of Shanghai Jiao Tong University (English edition), 2000, Vol. 5, No. 2, pp. 6-9.

  8. Zheng-Hua Tan et al., "The Application of Computational Intelligence in Fault Diagnosis Expert Systems," Computer Engineering and Applications, 1999, Vol. 35, No. 6, pp.7-10.

  9. Chen Wei, Hu Guangrui and Zheng-Hua Tan, "Knowledge Association in Expert System for Fault Diagnosis of Certain Spacecraft," Journal of Shanghai Jiao Tong University, 2000, Vol. 34, No. 2, pp.241-243.

  10. Zheng-Hua Tan et al., "Portable Electrometer Based on PIC Series Singlechip," Chinese Journal of Scientific Instrument, 2000, Vol. 21, No. 1, pp.78-79, 82.

  11. Zheng-Hua Tan et al., "The Development of Intelligent Inverter Supply and the Generation of SPWM Wave by Software," Journal of Shanghai Jiao Tong University, 2000, Vol. 34, No. 2, pp.273-275.

  12. Zheng-Hua Tan et al., "Static Measure and Its Programming," Aerospace Measure Technology, 2000, No.1.

  13. Ren Xiaolin, Hu Guangrui and Zheng-Hua Tan, "Controlling Chaos in a Chaotic Neuron," Nature Magazine, 1999, No. 5, pp.308-309.

  14. Ren Xiaolin, Hu Guangrui and Zheng-Hua Tan, "Controlling Chaos in Chaotic Neuron by Constant Pulses Method," Journal of Shanghai Jiao Tong University, 2000, Vol. 34, No. 2, pp.269-272.

  15. Ren Xiaolin, Hu Guangrui and Zheng-Hua Tan, "Synchronization of chaotic neural networks and applications in secure communications," Journal of Shanghai Jiao Tong University, 2000, Vol. 34, No. 6, pp. 744-747.

  16. Zheng-Hua Tan et al., "Investigation on Some Problems of GAL Programming Using ABEL Software," Journal of Electrical Engineering Education, 1995, 17(Supplemental Issue), 83-85.

  17. Zheng-Hua Tan et al., "Research on GAL On-line Programming," Journal of Hunan University, 1995, Vol. 22, Special No. 5, 8-12.

     Patent:

TECHNICAL REVIEWING


    
  

  

       
