Prof. Dr. Hans-Peter Hutter
ZHAW
School of Engineering
Institut für Informatik
Steinberggasse 13
8400 Winterthur
Work at ZHAW
Position
Professor of Computer Science specializing in deep-learning-based speech recognition and voice-based user interfaces, human-computer interaction, and UX design
Areas of Work and Research
- Automatic speech recognition, spoken dialogue systems, language modeling
- Mobile usability, user-centered design
- Speaker verification
- Software engineering
- Service engineering
Teaching
Professor of Computer Science specializing in deep-learning-based speech recognition and voice-based user interfaces, human-computer interaction, and UX design
Professional Experience
- Professor of Computer Science
ZHAW
08 / 2002 - present
- Head of the Human-Centered Computing research group at InIT/ZHAW
ZHAW
04 / 2005 - 2025
- Founder and head of the Institute of Applied Information Technology InIT/ZHAW
ZHAW
04 / 2005 - 02 / 2010
- Lecturer in Computer Science
Technikum Winterthur / ZHW
07 / 1998 - 07 / 2002
Education and Continuing Education
Education
- Dr. sc. techn. ETH / Computer Engineering
ETH Zürich
04 / 1988 - 03 / 1996
- Dipl. El.-Ing. ETH / Electrical Engineering
ETH Zürich
10 / 1980 - 07 / 1986
Projects
- Accessible graphics: semantic graphic description in PDFs for visually impaired people / Team member / completed
- Monitoring of AV media / Project lead / completed
- Accessible Scientific PDFs for All / Team member / completed
- Accessible pedestrian paths / Team member / completed
- Voice Swap / Project lead / completed
- Accessible tourism region Lake Constance / Project lead / completed
- WACS – A service platform for avalanche services / Team member / completed
- Multimodal interaction concept for Swisscom TV Box 3.0 / Team member / completed
- KWS Key-Word-Spider: system supporting the segmentation, content analysis, and coding of audiovisual media contributions / Project lead / completed
- Living Lab AAL Internationale Bodenseehochschule / Project lead / completed
- Deep-learning-based speech recognizer with limited training data / Project lead / completed
- FRIMI App / Project lead / completed
- MOSSEW – From the emergency-call watch to a personal safety system / Team member / completed
- Age-appropriate design of mobile applications / Team member / completed
- Electronic Crew Flight Pad / Team member / completed
- Platform for the exchange of services with developing countries / Project lead / completed
- Talkalyzer / Team member / completed
- System for integrated, data-based strategic asset controlling of real estate / Project lead / completed
- Challenge Earth Activity Platform / Project lead / completed
- Pipeline for 3D animation film and visual effects studios / Team member / completed
- AgeWeb – Age-appropriate website design / Team member / completed
- WLAN Fingerprinting / Project lead / completed
Publications
Articles in scientific journals, peer-reviewed
- Yan, C., Hutter, H.-P., Schmitt-Koopmann, F. M., & Darvishy, A. (2025). Chart accessibility : a review of current alt text generation. IEEE Access, 13, 94040–94056. https://doi.org/10.1109/access.2025.3571626
- Farahani, A. M., Adibi, P., Ehsani, M. S., Hutter, H.-P., & Darvishy, A. (2025). Chart question answering with multimodal graph representation learning and zero-shot classification. Expert Systems with Applications, 270(126508). https://doi.org/10.1016/j.eswa.2025.126508
- Bamdad, M., Hutter, H.-P., & Darvishy, A. (2024). InCrowd-VI : a realistic visual–inertial dataset for evaluating simultaneous localization and mapping in indoor pedestrian-rich spaces for human navigation. Sensors, 24(24), 8164. https://doi.org/10.3390/s24248164
- Schmitt-Koopmann, F., Huang, E. M., Hutter, H.-P., Stadelmann, T., & Darvishy, A. (2024). MathNet : a data-centric approach for printed mathematical expression recognition. IEEE Access, 12, 76963–76974. https://doi.org/10.1109/ACCESS.2024.3404834
- Farahani, A. M., Adibi, P., Ehsani, M. S., Hutter, H.-P., & Darvishy, A. (2023). Automatic chart understanding : a review. IEEE Access, 11, 76202–76221. https://doi.org/10.1109/ACCESS.2023.3298050
- Mirkazemy, A., Adibi, P., Ehsani, S. M. S., Darvishy, A., & Hutter, H.-P. (2023). Mathematical expression recognition using a new deep neural model. Neural Networks, 167, 865–874. https://doi.org/10.1016/j.neunet.2023.08.045
- Schmitt-Koopmann, F. M., Huang, E. M., Hutter, H.-P., Stadelmann, T., & Darvishy, A. (2022). FormulaNet : a benchmark dataset for mathematical formula detection. IEEE Access, 10, 91588–91596. https://doi.org/10.1109/ACCESS.2022.3202639
Books, peer-reviewed
- Darvishy, A., Hutter, H.-P., & Seifert, A. (2022). Age-appropriate digital channels. Springer. https://doi.org/10.1007/978-3-658-38446-3
- Darvishy, A., Hutter, H.-P., & Seifert, A. (2021). Altersgerechte digitale Kanäle. Springer. https://doi.org/10.1007/978-3-658-35501-2
- Hutter, H.-P. (1997). Comparison of classic and hybrid HMM approaches to speech recognition over telephone lines [Doctoral dissertation, ETH Zurich]. https://doi.org/10.3929/ethz-a-001687036
Book chapters, peer-reviewed
- Sorano, J., Heitz, C., Hutter, H.-P., Fernández, R., Hierro, J. J., Vogel, J., Edmonds, A., & Bohnert, T. M. (2013). Internet of services : telecommunication services evolution. In Evolution of telecommunication services (pp. 283–325). Springer. https://doi.org/10.1007/978-3-642-41569-2_14
Written conference contributions, peer-reviewed
- Bamdad, M., Hutter, H.-P., & Darvishy, A. (2025). Deep learning-powered visual SLAM aimed at assisting visually impaired navigation [Conference paper]. Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP, 758–765. https://doi.org/10.5220/0013338200003912
- D’Intino, F., & Hutter, H.-P. (2025). Advancing STT for low-resource real-world speech [Conference paper]. In H. Degen & S. Ntoa (Eds.), Artificial Intelligence in HCI (pp. 290–309). Springer. https://doi.org/10.1007/978-3-031-93429-2_20
- Schmitt-Koopmann, F. M., Huang, E. M., Hutter, H.-P., & Darvishy, A. (2025). Towards more accessible scientific PDFs for people with visual impairments : step-by-step PDF remediation to improve tag accuracy [Conference paper]. In N. Yamashita, V. Evers, K. Yatani, X. Ding, B. Lee, M. Chetty, & P. Toups-Dugas (Eds.), CHI ’25: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (p. 48). Association for Computing Machinery. https://doi.org/10.1145/3706598.3713084
- Witzel, L., Chhong, S., Hutter, H.-P., & Darvishy, A. (2023). On-site and remote crowdsourcing of accessibility data for people with mobility impairments : a case study in Zurich’s District 1 [Conference paper]. In T. Ahram & R. Taiar (Eds.), Human Interaction and Emerging Technologies (IHIET-AI 2023): Artificial Intelligence and Future Applications (Vol. 70, pp. 242–253). AHFE International. https://doi.org/10.54941/ahfe1002949
- Darvishy, A., Heeb, Z., Beljulji, E., & Hutter, H.-P. (2023). A new conversational interaction concept for document creation and editing on mobile devices for visually impaired users [Conference paper]. In M. Zallio (Ed.), Human Factors in Accessibility and Assistive Technology (Vol. 87, pp. 33–40). AHFE International. https://doi.org/10.54941/ahfe1003651
- Darvishy, A., Hutter, H.-P., & Mosimann, R. (2022). Towards personalized accessible routing for people with mobility impairments [Conference paper]. In K. Miesenberger, G. Kouroupetroglou, K. Mavrou, R. Manduchi, M. Covarrubias Rodriguez, & P. Peňáz (Eds.), Computers Helping People with Special Needs (pp. 215–220). Springer. https://doi.org/10.1007/978-3-031-08648-9_25
- Darvishy, A., Roth, S., & Hutter, H.-P. (2021). Sprachassistent für Hotelinformationen [Conference paper]. In G. Kempter & W. Ritter (eds.), Grenzüberschreitende Reallabore für Assistenztechnik : Beiträge zum Usability Day XIX (pp. 102–108). Pabst Science Publishers.
- Hutter, H.-P., Roth, S., & Darvishy, A. (2021). Service für barrierefreie Ferien im Bodenseeraum [Conference paper]. In G. Kempter & W. Ritter (eds.), Grenzüberschreitende Reallabore für Assistenztechnik : Beiträge zum Usability Day XIX (pp. 90–101). Pabst Science Publishers.
- Ulasik, M. A., Hürlimann, M., Dubel, B., Kaufmann, Y., Rudolf, S., Deriu, J. M., Mlynchyk, K., Hutter, H.-P., & Cieliebak, M. (2021). ZHAW-CAI : ensemble method for Swiss German speech to Standard German text [Conference paper]. In F. Benites de Azevedo e Souza, D. Tuggener, M. Hürlimann, M. Cieliebak, & M. Vogel (Eds.), Proceedings of the Swiss Text Analytics Conference 2021. CEUR Workshop Proceedings. https://doi.org/10.21256/zhaw-23889
- Hutter, H.-P., Darvishy, A., Roth, S., Gäumann, S., Kaspar, H., Thimm, T., Gaiduk, M., Evans, S., & Rosenberg, M. (2020). Service design for accessible tourism [Conference paper]. In M. Antona & C. Stephanidis (Eds.), Universal Access in Human-Computer Interaction. Applications and Practice. Springer. https://doi.org/10.1007/978-3-030-49108-6_29
- Darvishy, A., Hutter, H.-P., Grossenbacher, M., & Merz, D. (2020). Touch explorer : exploring digital maps for visually impaired people [Conference paper]. In K. Miesenberger, R. Manduchi, M. Covarrubias Rodriguez, & P. Peňáz (Eds.), Computers Helping People with Special Needs (pp. 427–434). Springer. https://doi.org/10.1007/978-3-030-58796-3_50
- Jembu Rajkumar, A., Lazar, J., Jordan, J. B., Darvishy, A., & Hutter, H.-P. (2020). PDF accessibility of research papers : what tools are needed for assessment and remediation? [Conference paper]. Proceedings of the 53rd Hawaii International Conference on System Sciences, 4185–4194. https://doi.org/10.24251/HICSS.2020.512
- Darvishy, A., Hutter, H.-P., & Frei, J. (2019). Making mobile map applications accessible for visually impaired people [Conference paper]. In T. Ahram, R. Taiar, S. Colson, & A. Choplin (Eds.), Human Interaction and Emerging Technologies : Proceedings of the 1st International Conference on Human Interaction and Emerging Technologies (IHIET 2019), August 22-24, 2019, Nice, France (pp. 396–400). Springer. https://doi.org/10.1007/978-3-030-25629-6_61
- Hutter, H.-P., & Darvishy, A. (2019). Barrierefreier Tourismus im Bodenseeraum [Conference poster]. In P. Friedrich & D. Fuchs (eds.), 6. Ambient Medicine® Forum „Assistive Technik für selbstbestimmtes Wohnen“ (Band 6) : 19. - 20. Februar 2019, Tagungsband (pp. 165–170). Cuvillier.
- Hutter, H.-P., & Darvishy, A. (2019). Accessible tourism around Lake Constance [Conference paper]. In K. Promberger, F. Piazolo, & G. Kempter (Eds.), Innovative solutions for an ageing society : proceedings of SMARTER LIVES meets uDay 19 (pp. 93–97). Pabst Science Publishers.
- Darvishy, A., & Hutter, H.-P. (2018). Recommendations for age-appropriate mobile application design [Conference paper]. Advances in Design for Inclusion, 241–253. https://doi.org/10.1007/978-3-319-60597-5_22
- Hutter, H.-P., & Darvishy, A. (2017). Barrierefreier Tourismusraum Bodensee [Conference poster]. In G. Kempter & I. Hämmerle (eds.), Beiträge zum Usability Day XV : Umgebungsunterstütztes Leben, 22. Juni 2017 (pp. 91–94). Pabst Science Publishers. https://doi.org/10.21256/zhaw-2782
- Hutter, H.-P., & Ahlenstorf, A. (2017). New mobile service development process [Conference paper]. In A. Marcus & W. Wang (Eds.), Design, User Experience, and Usability: Designing Pleasurable Experiences : 6th International Conference, DUXU 2017, Held as Part of HCI International 2017, Vancouver, BC, Canada, July 9-14, 2017, Proceedings, Part II (pp. 221–232). Springer. https://doi.org/10.1007/978-3-319-58637-3_17
- Darvishy, A., & Hutter, H.-P. (2017). Recommendations for age-appropriate mobile application design [Conference paper]. Harnessing the Power of Technology to Improve Lives, 676–686. https://doi.org/10.3233/978-1-61499-798-6-676
- Kokemor, I., & Hutter, H.-P. (2016). Aspect-oriented approach for user interaction logging of iOS applications [Conference paper]. In A. Marcus (Ed.), Design, user experience, and usability: technological contexts : 5th international conference, DUXU 2016, held as part of HCI international 2016, Toronto, Canada, July 17–22, 2016, proceedings, part III (pp. 45–56). Springer. https://doi.org/10.1007/978-3-319-40406-6
- Darvishy, A., Nevill, M., & Hutter, H.-P. (2016). Automatic paragraph detection for accessible PDF documents [Conference paper]. In K. Miesenberger, C. Bühler, & P. Peňáz (Eds.), Computers Helping People with Special Needs (pp. 367–372). Springer. https://doi.org/10.1007/978-3-319-41264-1_50
- Hutter, H.-P., Ahlenstorf, A., Klammer, J., Van den Anker, F., & Wiedmer, A. (2016). Service platform for the exchange of services with developing countries [Conference paper]. Tech4Dev 2016 : UNESCO Chair in Technologies for Development Int. Conference 2016, 127.
- Doblies, L., Stolz, D., Darvishy, A., & Hutter, H.-P. (2014). PAVE : a web application to identify and correct accessibility problems in PDF documents [Conference paper]. In K. Miesenberger, D. Fels, D. Archambault, P. Peňáz, & W. Zagler (Eds.), Computers Helping People with Special Needs : ICCHP 2014 Proceedings, Part I (pp. 185–192). Springer. https://doi.org/10.1007/978-3-319-08596-8
- Darvishy, A., & Hutter, H.-P. (2013). Comparison of the effectiveness of different accessibility plugins based on important accessibility criteria [Conference paper]. Universal Access in Human-Computer Interaction. Applications and Services for Quality of Life : 7th International Conference, UAHCI 2013, Held as Part of HCI International 2013, Las Vegas, NV, USA, July 21-26, 2013, Proceedings, Part III, 305–310. https://doi.org/10.1007/978-3-642-39194-1_36
- Darvishy, A., Hutter, H.-P., Horvath, A., & Dorigo, M. (2010). A flexible software architecture concept for the creation of accessible PDF documents [Conference paper]. Computers Helping People with Special Needs, 47–52. https://doi.org/10.1007/978-3-642-14097-6_8
- Hutter, H.-P., Jung, U., & Müggler, T. (2008). Augmented mobile tagging. Proceedings of the 10th International Conference on Human Computer Interaction with Mobile Devices and Services.
- Darvishy, A., Hutter, H.-P., Früh, P., Horvath, A., & Berner, D. (2008). Personal mobile assistant for air passengers with disabilities (PMA) [Conference paper]. Computers Helping People with Special Needs, 1129–1134. https://doi.org/10.1007/978-3-540-70540-6_169
- Hutter, H.-P., & Tönz, R. (2005). New concept for designing multimodal user interfaces. Aside 2005 : Applied Spoken Language Interaction in Distributed Environments : 10th and 11th November 2005, Aalborg University, Denmark.
- Hutter, H.-P. (1995). Comparison of a new hybrid connectionist-SCHMM approach with other hybrid approaches for speech recognition [Conference paper]. 1995 International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 3311–3314. https://doi.org/10.1109/ICASSP.1995.479693
Other publications
- Hutter, H.-P., & Jekat, S. J. (2010). Evaluation von sprachverstehenden und -generierenden Systemen. In K.-U. Carstensen, C. Ebert, C. Ebert, S. J. Jekat, R. Klabunde, & H. Langer (eds.), Computerlinguistik und Sprachtechnologie : eine Einführung (pp. 659–678). Spektrum Akademischer Verlag. https://doi.org/10.1007/978-3-8274-2224-8
- Darvishy, A., Hutter, H.-P., Früh, P. T., Horvath, A., & Berner, D. (2008). Personal mobile assistant for air passengers with disabilities (PMA). ZHAW Zürcher Hochschule für Angewandte Wissenschaften. https://doi.org/10.21256/zhaw-108
- Hutter, H.-P. (2006). Mobile Anwendungen mit GeoTags. WinLink Fachtagung, Winterthur, 2006.
- Hutter, H.-P. (2004). Grundlagen der Spracherkennung mit Hidden-Markov-Modellen. Design & Elektronik, 2004.
Publications prior to employment at ZHAW
- F. Bimbot, M. Blomberg, L. Boves, D. Genoud, H.-P. Hutter, C. Jaboulet, J. Koolwaaij, J. Lindberg, and J.-B. Pierrot, “An overview of the CAVE project research activities in speaker verification”, Speech Communication, vol. 31, no. 2-3, pp. 158–180, 2000.
- F. Bimbot, H.-P. Hutter, C. Jaboulet, J. Koolwaaij, J. Lindberg, and J.-B. Pierrot, “An overview of the CAVE project research activities in speaker verification”, in RLA2C, pp. 215–220, April 1998.
- J. Lindberg, J. Koolwaaij, H.-P. Hutter, D. Genoud, M. Blomberg, J.-B. Pierrot, and F. Bimbot, “Techniques for a priori decision threshold estimation in speaker verification”, in RLA2C, pp. 89–92, April 1998.
- J.-B. Pierrot, J. Lindberg, J. Koolwaaij, H.-P. Hutter, D. Genoud, M. Blomberg, and F. Bimbot, “A comparison of a priori threshold setting procedures for speaker verification in the CAVE project”, in IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 125–128, 1998.
- F. Bimbot, H.-P. Hutter, C. Jaboulet, J. Koolwaaij, J. Lindberg, and J.-B. Pierrot, “Speaker verification in the telephone network: research activities in the CAVE project”, in Proceedings EUROSPEECH ’97, vol. 2, pp. 971–974, 1997.
- D. James, H.-P. Hutter, and F. Bimbot, “CAVE speaker verification project – experiments on the YOHO and SESP corpora”, in Proceedings 1st Intl. Conference on Audio- and Video-based Biometric Person Authentication (J. Bigün, G. Chollet, and B. Borgefors, eds.), pp. 385–394, Berlin, Heidelberg: Springer Verlag, 1997.
- D. James, H.-P. Hutter, and F. Bimbot, “Cave - speaker verification in banking and telecommunications”, in Proceedings of the Ubilab Conference ’96, Zurich, pp. 153–162, 1
- H.-P. Hutter, Comparison of Classic and Hybrid HMM Approaches to Speech Recognition over Telephone Lines. TIK-Schriftenreihe No. 15, Diss. ETH Zürich No. 11662, ETH Zürich, 1996.
- H.-P. Hutter, “Comparison of a new hybrid connectionist-SCHMM approach with other hybrid approaches for speech recognition,” in IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 5, pp. 3311–3314, May 1995.
- H.-P. Hutter and B. Pfister, “Neuartiger hybrider SKHMM/KNN-Ansatz für die Spracherkennung,” in Elektronische Sprachsignalverarbeitung (K. Fellbaum, ed.), Studientexte zur Sprachkommunikation, Heft 11, pp. 90–97, Technische Universität Berlin, October 1994.