Natural Language Processing Group
“We combine foundational research with industrial applications to build new and innovative products and services, while at the same time exploring the necessary ethical and social boundaries.”
Fields of expertise
- Text analytics
- Dialogue systems
- Speech processing
The NLP research team develops technologies for the analysis, understanding and generation of speech and text. We combine methods from linguistics, natural language processing (NLP) and artificial intelligence to enable natural language communication between humans and machines. In our research, we work on topics such as text classification (e.g. sentiment analysis), chatbots/dialogue systems, text summarization, speech-to-text, speaker diarization and natural language generation. The group particularly focuses on Swiss German speech and text processing.
Services
- Insight: keynotes, trainings
- AI consultancy: workshops, expert support, advice, technology assessment
- Research and development: small to large-scale collaborative projects, third party-funded research, student projects, commercially applicable prototypes
Team
Head of Research Group
Projects
-
NLP Community Building - ComBi
SwissNLP would like to take concerted action to better network Swiss players from industry, science and administration in the field of Natural Language Processing (NLP). For this reason various activities are to be carried out until the end of 2025 such as expert group meetings, applied conferences, data ...
-
AI4CP: AI for self-organizing Content Platform
-
Towards a Voice-Based Chatbot for Language Learners (ChaLL)
We take first steps towards developing ChaLL, a voice-based chatbot that provides language learners with opportunities to practice speaking in both focused and unfocused task-based conversations and receive feedback, free from the time constraints and pressures of the traditional classroom setting. ...
-
PRISM: Predicting Radicalization Events in Social Media User Timelines
The PRISM project focuses on detecting radicalization events in Social Media networks. Overall, we are interested in unveiling the mechanics that lead to the event of extremist ideology being transferred and incorporated into a social media user’s world view. Specifically, the proposed project aims to identify ...
-
DOSSMA – Detection of Suspicious Social Media Activities
The DOSSMA project will investigate suspicious and malicious behaviour on social media platforms. In a first phase, we will compile an extensive survey report on the areas that are currently being researched, including the respective state-of-the-art, existing solutions and initiatives. This report will serve as a ...
Publications
-
von Däniken, Pius; Hürlimann, Manuela; Cieliebak, Mark,
2020.
Overview of the GermEval 2020 shared task on Swiss German language identification [paper].
In:
Ebling, Sarah; Tuggener, Don; Hürlimann, Manuela; Cieliebak, Mark; Volk, Martin, eds.,
Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS).
5th SwissText & 16th KONVENS Joint Conference, Zurich (online), 24-25 June 2020.
CEUR Workshop Proceedings.
Available from: https://doi.org/10.21256/zhaw-21549
-
Büchi, Matthias; Ulasik, Malgorzata Anna; Hürlimann, Manuela; Benites de Azevedo e Souza, Fernando; von Däniken, Pius; Cieliebak, Mark,
2020.
ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text [paper].
In:
Ebling, Sarah; Tuggener, Don; Hürlimann, Manuela; Cieliebak, Mark; Volk, Martin, eds.,
Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS).
5th SwissText & 16th KONVENS Joint Conference, Zurich (online), 24-25 June 2020.
CEUR Workshop Proceedings.
Available from: https://doi.org/10.21256/zhaw-21550
-
Tuggener, Don; von Däniken, Pius; Peetz, Thomas; Cieliebak, Mark,
2020.
LEDGAR : a large-scale multi-label corpus for text classification of legal provisions in contracts [paper].
In:
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020).
12th Language Resources and Evaluation Conference (LREC), Marseille, France, 11-16 May 2020.
European Language Resources Association.
pp. 1235-1241.
Available from: https://doi.org/10.21256/zhaw-20087
-
Hürlimann, Manuela; Cieliebak, Mark; Vogel, Manfred,
2020.
ZHAW Zürcher Hochschule für Angewandte Wissenschaften.
Available from: https://doi.org/10.21256/zhaw-21636
-
Benites de Azevedo e Souza, Fernando; Duivesteijn, Gilbert François; von Däniken, Pius; Cieliebak, Mark,
2020.
TRANSLIT : a large-scale name transliteration resource [paper].
In:
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020).
12th Language Resources and Evaluation Conference (LREC), Marseille, France, 11-16 May 2020.
European Language Resources Association.
pp. 3265-3271.
Available from: https://doi.org/10.21256/zhaw-20082