Dr. Thilo Stadelmann

Dr. Thilo Stadelmann

Dr. Thilo Stadelmann
ZHAW School of Engineering
Obere Kirchgasse 2 / Steinberggasse 12/14
8400 Winterthur

+41 (0) 58 934 72 08
thilo.stadelmann@zhaw.ch

Personal profile

Management role

  • Deputy Head of Research/Focus Area, Information Engineering research group
  • Head, ZHAW Data Science Laboratory

Position at the ZHAW

Senior lecturer (Associate Professor equivalent) for Information Engineeering

http://www.zhaw.ch/~stdm

Professional development teaching

Expertise and research interests

Data science, machine learning, artificial intelligence, information engineering, data mining, pattern recognition, speaker diarization, audio mining, video mining, multimedia retrieval, signal processing, biometrics, data management, smart software.
Current research interests: Deep Learning, Reinforcement Learning

Educational background

Certificate of Advanced Studies in didactics of higher education, 2015, Zurich University of Teacher Education
PhD studies, Dr. rer. nat (PhD equivalent), 2010, University of Marburg
Computer science studies, Dipl. Inform. (FH) (MSc equivalent), 2004, Giessen University of Applied Sciences

Professional milestones

2016-present: Board of the Swiss Alliance for Data-Intensive Services
2015-present: Deputy head of Information Engineering research group, ZHAW
2015-2017: Co-organizer Zurich Machine Learning & Data Science Meetup
2014-present: Vice president of SGAICO, Swiss Group of Artificial Intelligence and Cognitive Science
2013-present: Founder & head of ZHAW Datalab, Zurich University of Applied Sciences
2013-present: Lecturer Information Engineering (50% research, 50% teaching), ZHAW InIT
2012-2013: Director of internal IT, TWT GmbH Science & Innovation
2011-2013: Head of smart software team, TWT GmbH Science & Innovation
2010-2011: Software architect and project leader, TWT GmbH Science & Innovation
2004-2010: Research assistant in the area of audio- and video mining, University of Marburg
1998-2010: several sideline jobs in software development and data mining

Membership of networks

Projects

Project team leader

Project team member

Publications

Peer-reviewed articles/chapters

; ; ; ; ().

Fully Convolutional Neural Networks for Newspaper Article Segmentation

.

In: Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). Kyoto, Japan: CPS. Peer reviewed.

; ; ; ().

Learning Embeddings for Speaker Clustering Based on Voice Equality

.

In: Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017). Roppongi, Tokyo, Japan: IEEE. Peer reviewed.

; ; ; ().

Speaker Identification and Clustering using Convolutional Neural Networks

.

In: Proceedings of IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016). Salerno: IEEE. Peer reviewed.

; ; ().

AI in Switzerland

.

AI Magazine, 36, II. 102-105. Peer reviewed.

; ; ().

Data Scientist als Beruf

.

Big Data – Grundlagen, Systeme und Nutzungspotenziale, Springer Verlag., Edition HMD 59-81. Peer reviewed.

; ; ().

Toward Automatic Data Curation for Open Data

.

ERCIM News, 100. 32-33. Peer reviewed.

; ().

Data Science für Lehre, Forschung und Praxis

.

Praxis der Wirtschaftsinformatik Peer reviewed.

; ; ; ; ; ; ().

Applied Data Science in Europe

: Challenges for Academia in Keeping Up with a Highly Demanded Topic.

European Computer Science Summit Peer reviewed.

Non-peer-reviewed articles/chapters

; ; ; ; ().

PANOPTES

: Automated Article Segmentation of Newspaper Pages for "Real Time Print Media Monitoring“.

Proceedings of SGAICO Annual Assembly and Workshop 2015

Publications before appointment at the ZHAW

Thilo Stadelmann, Sven Johr, Michael Ditze, Florian Dittman, and Viktor Fässler. "FABELHAFT - Fahrerablenkung: Entwicklung eines Meta-Fahrerassistenzsystems durch Echtzeit-Audioklassifikation". In Proceedings of 28. VDI-VW Gemeinschaftstagung Fahrerassistenzsysteme und Integrierte Sicherheit, Wolfsburg, Germany, October 10.-11., 2012. VDI Wissensforum.

Thilo Stadelmann. "Voice Modeling Methods for Automatic Speaker Recognition". Dissertation, Philipps-Universität Marburg. Available online, 2010. URL archiv.ub.uni-marburg.de/diss/z2010/0465/view.html.

Christian Beecks, Thilo Stadelmann, Bernd Freisleben, and Thomas Seidl. "Visual Speaker Model Exploration", In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'2010), pages 727-728, Singapore, July 19-23, 2010, IEEE.

Thilo Stadelmann, Yinghui Wang, Matthew Smith, Ralph Ewerth, and Bernd Freisleben. "Rethinking Algorithm Development and Design in Speech Processing". In Proceedings of the 20th International Conference on Pattern Recognition (ICPR'10), pages 4476-4479, Istanbul, Turkey, August 2010a. IAPR.

Thilo Stadelmann and Bernd Freisleben. "On the MixMax Model and Cepstral Features for Noise-Robust Voice Recognition". Technical Report, Marburg University, July 2010.

Thilo Stadelmann and Bernd Freisleben. Dimension-Decoupled Gaussian Mixture Model for Short Utterance Speaker Recognition. In Proceedings of the 20th International Conference on Pattern Recognition (ICPR'10), pages 1602-1605, Istanbul, Turkey, August 2010a. IAPR.

Markus Mühling, Ralph Ewerth, Thilo Stadelmann, Bing Shi, and Bernd Freisleben. "University of Marburg at TRECVID 2009: High-Level Feature Extraction". In Proceedings of TREC Video Retrieval Evaluation Workshop (TRECVid'09). Available online, 2009. URL www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.htm.

Ernst Juhnke, Dominik Seiler, Thilo Stadelmann, Tim Dörnemann, and Bernd Freisleben. "LCDL: An Extensible Framework for Wrapping Legacy Code". In Proceedings of International Workshop on @WAS Emerging Research Projects, Applications and Services (ERPAS'09), pages 638-642, Kuala Lumpur, Malaysia, December 2009.

Dominik Seiler, Ralph Ewerth, Steffen Heinzl, Thilo Stadelmann, Markus Mühling, Bernd Freisleben, and Manfred Grauer. "Eine Service-Orientierte Grid-Infrastruktur zur Unterstützung Medienwissenschaftlicher Filmanalyse". In Proceedings of the Workshop on Gemeinschaften in Neuen Medien (GeNeMe'09), pages 79-89, Dresden, Germany, September 2009.

Thilo Stadelmann and Bernd Freisleben. "Unfolding Speaker Clustering Potential: A Biomimetic Approach". In Proceedings of the ACM International Conference on Multimedia (ACMMM'09), pages 185-194, Beijing, China, October 2009. ACM.

Thilo Stadelmann, Steffen Heinzl, Markus Unterberger, and Bernd Freisleben. "WebVoice: A Toolkit for Perceptual Insights into Speech Processing". In Proceedingsof the 2nd International Congress on Image and Signal Processing (CISP'09), pages 4358-4362, Tianjin, China, October 2009.

Steffen Heinzl, Markus Mathes, Thilo Stadelmann, Dominik Seiler, Marcel Diegelmann, Helmut Dohmann, and Bernd Freisleben. "The Web Service Browser: Automatic Client Generation and Efficient Data Transfer for Web Services". In Proceedings of the 7th IEEE International Conference on Web Services (ICWS'09), pages 743-750, Los Angeles, CA, USA, July 2009a. IEEE Press.

Steffen Heinzl, Dominik Seiler, Ernst Juhnke, Thilo Stadelmann, Ralph Ewerth, Manfred Grauer, and Bernd Freisleben. "A Scalable Service-Oriented Architecture for Multimedia Analysis, Synthesis, and Consumption". International Journal of Web and Grid Services, 5(3):219-260, 2009b. Inderscience Publishers.

Markus Mühling, Ralph Ewerth, Thilo Stadelmann, Bing Shi, and Bernd Freisleben. "University of Marburg at TRECVID 2008: High-Level Feature Extraction". In Proceedings of TREC Video Retrieval Evaluation Workshop (TRECVid'08). Available online, 2008. URL www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.htm.

Markus Mühling, Ralph Ewerth, Thilo Stadelmann, Bing Shi, Christian Zöfel, and Bernd Freisleben. "University of Marburg at TRECVID 2007: Shot Boundary Detection and High-Level Feature Extraction". In Proceedings of TREC Video Retrieval Evaluation Workshop (TRECVid'07). Available online, 2007a. URL www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.htm.

Ralph Ewerth, Markus Mühling, Thilo Stadelmann, Julinda Gllavata, Manfred Grauer, and Bernd Freisleben. "Videana: A Software Toolkit for Scientific Film Studies". In Proceedings of the International Workshop on Digital Tools in Film Studies, pages 1-16, Siegen, Germany, 2007. Transcript Verlag.

Markus Mühling, Ralph Ewerth, Thilo Stadelmann, Bernd Freisleben, Rene Weber, and Klaus Mathiak. "Semantic Video Analysis for Psychological Research on Violence in Computer Games". In Proceedings of the ACM International Conference on Image and Video Retrieval (CIVR'07), pages 611-618, Amsterdam, The Netherlands, July 2007b. ACM.

Ralph Ewerth, Markus Mühling, Thilo Stadelmann, Ermir Qeli, Björn Agel, Dominik Seiler, and Bernd Freisleben. "University of Marburg at TRECVID 2006: Shot Boundary Detection and Rushes Task Results". In Proceedings of TREC Video Retrieval Evaluation Workshop (TRECVid'06). Available online, 2006. URL www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.htm.

Thilo Stadelmann and Bernd Freisleben. "Fast and Robust Speaker Clustering Using the Earth Mover's Distance and MixMax Models". In Proceedings of the 31st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'06), volume 1, pages 989-992, Toulouse, France, April 2006. IEEE.

Ralph Ewerth, Christian Behringer, Tobias Kopp, Michael Niebergall, Thilo Stadelmann, and Bernd Freisleben. "University of Marburg at TRECVID 2005: Shot Boundary Detection and Camera Motion Estimation Results". In Proceedings of TREC Video Retrieval Evaluation Workshop (TRECVid'05). Available online, 2005. URL www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.htm.