Computer Vision, Perception and Cognition Group

“AI is THE key technology of the digital transformation across all industries and sectors, and it has a strong impact on our societies. That is why our research makes important contributions to robust and trustworthy AI methods, and we teach their safe implementation and application with enthusiasm.”
Expertise

- Pattern recognition with deep learning
- Machine perception, computer vision and speaker recognition
- Development of neural systems
The Computer Vision, Perception and Cognition Group conducts research on pattern recognition and works on a wide range of tasks involving image, audio, or general signal data. We focus on the methodology of deep neural networks and reinforcement learning, inspired by biological learning. Each of our tasks has its own learning objective (e.g., detection, classification, clustering, segmentation, novelty detection, control) and its own use case (e.g., predictive maintenance, speaker recognition for multimedia indexing, document analysis, optical music recognition, computer vision for industrial quality control, automated machine learning, deep reinforcement learning for automated game playing or building control systems). These in turn shed light on different aspects of the learning process. We use these insights to create more general AI systems based on neural architectures.
Services
- Insight: keynotes, trainings
- AI consulting: workshops, expert support, advice, technology assessment
- Research and development: small to large collaborative projects, third-party-funded research, student projects, field-tested prototypes
Team
Head of Research Group
Projects
-
AC3T – AI powered CBCT for improved Combination Cancer Therapy
The project enables a novel adaptive cancer therapy that combines tumor treating fields with radiation therapy. It is made possible by significantly improved static (3D) and time-resolved (4D) low-dose cone beam computed tomography (CBCT) images, reconstructed with algorithms based on artificial intelligence. ...
-
AUTODIDACT – Automated Video Data Annotation to Empower the ICU Cockpit Platform for Clinical Decision Support
Monitoring diverse sensor signals of patients in intensive care can be key to detecting potentially fatal emergencies. But to perform the monitoring automatically, the monitoring system has to know what is currently happening to the patient: if the patient is, for example, currently being moved by medical staff, ...
-
Good practices for responsible development of AI-based applications in healthcare
This project will identify proven methods, practices and standards that support responsible research and development of AI systems for health. They will be tested in use cases from medical imaging and neurotechnology, publicly released and published as a guideline of recommended best practices. ...
-
Pilot study machine learning for injection molding processes
As part of a technical deep dive, researchers from CAI and InES are jointly investigating the opportunities for capturing process knowledge about injection molding in neural networks and transferring it to new application scenarios. The groups of Prof. Stadelmann (Computer Vision, Perception & Cognition, ZHAW CAI and Prof. ...
-
Accessible Scientific PDFs for All
PDF is the most popular document format to provide and distribute information on the internet. It was developed by Adobe in 1993 but has been an open format since 2008. It was estimated in 2015 that more than 2.5 trillion PDF documents exist on the internet, covering all aspects of life and research, and their number ...
-
Synthetic data generation of CoVID-19 CT/X-rays images for enabling fast triage of healthy vs. unhealthy patients
The automatic analysis of X-ray/CT images through artificial intelligence models can be useful to automate the clinical scanning procedure. Nonetheless, the limited access to real COVID patient data creates the need to synthesize image samples. The goal of this project is to use existing CT/X-ray image datasets ...
-
Standardized Data and Modeling for AI-based CoVID-19 Diagnosis Support on CT Scans (SDMCT)
Hospitals and research institutes are intensively investigating applications of AI in medical imaging. However, the models and datasets developed can hardly be merged, and research results are not reproducible on different datasets because different CT scanners are used. Radiologists told us that “unifying data is crucial for ...
-
DIR3CT: Deep Image Reconstruction through X-Ray Projection-based 3D Learning of Computed Tomography Volumes
Project DIR3CT aims at improving the quality of CBCT images by using deep learning (DL) to learn the 3D reconstruction from X-ray images end-to-end. This enables a novel CBCT product to be used during radiation therapy and will allow the use of these images for adaptive treatment.
-
TAILOR – Foundations of Trustworthy AI - Integrating Reasoning, Learning and Optimization
The main ambition of TAILOR is to build the capacity to provide the scientific foundations for Trustworthy AI in Europe by developing a network of research excellence centers with a technical focus on combining research excellence in the areas of learning, optimisation and reasoning. The current scientific ...
-
RealScore – Scanning of Real-World Sheet Music for a Digital Music Stand
ScorePad’s sheet music scanning service works for high-quality input; to scale up the business, it should work equally well for smartphone pictures, used sheets, etc. Project RealScore enhances the successful predecessor project by making deep learning adapt to unseen data through unsupervised learning. ...
Publications
-
Stadelmann, Thilo; Glinski-Haefeli, Sebastian; Gerber, Patrick; Dürr, Oliver, 2018. Capturing suprasegmental features of a voice with RNNs for improved speaker clustering [Paper]. In: Proceedings of the 8th IAPR TC3 Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR). 8th IAPR TC3 Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR), Siena, Italy, 19-21 September 2018. IAPR. pp. 333-345. Lecture Notes in Computer Science; 11081. Available at: https://doi.org/10.1007/978-3-319-99978-4_26
-
Stadelmann, Thilo; Amirian, Mohammadreza; Arabaci, Ismail; Arnold, Marek; Duivesteijn, Gilbert François; Elezi, Ismail; Geiger, Melanie; Lörwald, Stefan; Meier, Benjamin Bruno; Rombach, Katharina; Tuggener, Lukas, 2018. Deep learning in the wild [Paper]. In: Proceedings of the 8th IAPR TC3 Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR). 8th IAPR TC3 Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR), Siena, Italy, 19-21 September 2018. IAPR. Available at: https://doi.org/10.21256/zhaw-3872
-
Tuggener, Lukas; Elezi, Ismail; Schmidhuber, Jürgen; Stadelmann, Thilo, 2018. Deep watershed detector for music object recognition [Paper]. In: Proceedings of the 19th International Society for Music Information Retrieval Conference. 19th International Society for Music Information Retrieval Conference, Paris, 23-27 September 2018. Paris: Society for Music Information Retrieval. Available at: https://doi.org/10.21256/zhaw-3760
-
Tuggener, Lukas; Elezi, Ismail; Schmidhuber, Jürgen; Pelillo, Marcello; Stadelmann, Thilo, 2018. DeepScores: a dataset for segmentation, detection and classification of tiny objects [Paper]. In: Proceedings of the 24th International Conference on Pattern Recognition. 24th International Conference on Pattern Recognition (ICPR 2018), Beijing, China, 20-28 August 2018. Beijing: IAPR. pp. 1-6. Available at: https://doi.org/10.21256/zhaw-4255
-
Meier, Benjamin Bruno; Elezi, Ismail; Amirian, Mohammadreza; Dürr, Oliver; Stadelmann, Thilo, 2018. Learning neural models for end-to-end clustering [Paper]. In: Proceedings of the 8th IAPR TC3 Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR). 8th IAPR TC3 Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR), Siena, Italy, 19-21 September 2018. IAPR. Available at: https://doi.org/10.21256/zhaw-3850