Dr. Jan Milan Deriu
ZHAW
School of Engineering
Centre for Artificial Intelligence
Technikumstrasse 71
8400 Winterthur
Projects
- A Digital Speech and Language Sample Application for Clinical Diagnostics and Monitoring in Swiss German / Deputy project leader / ongoing
- Critical Science Without Borders: LLMs for Translation of Scientific Knowledge in Multilingual Contexts / Project leader / ongoing
- Unified Model for Evaluation of Text Generation Systems / Deputy project leader / ongoing
- Holistic Analysis of Organised Misinformation Activity in Social Networks / Project leader / completed
- End-to-End Low-Resource Speech Translation for Swiss German Dialects / Deputy project leader / completed
- Pre-Study on Generation of Hockey News / Deputy project leader / completed
- Call-E – Virtual Call Agent / Team member / completed
- LIHLITH – Learning to Interact with Humans by Lifelong Interaction with Humans / Team member / completed
- DeepText: Intelligent Text Analysis with Deep Learning / Deputy project leader / completed
Publications
Articles in scientific journal, peer-reviewed
- Sager, P. J. et al. (2026) 'The cooperative network architecture : learning structured networks as representation of sensory patterns', Neural Computation, 38(4), pp. 538–572. doi: 10.1162/neco.a.1505.
- Zhang, Y. et al. (2024) 'ScienceBenchmark : a complex real-world benchmark for evaluating natural language to SQL systems', Proceedings of the VLDB Endowment, 17(4), pp. 685–698. doi: 10.14778/3636218.3636225.
- Deriu, J. M. et al. (2020) 'Survey on evaluation methods for dialogue systems', Artificial Intelligence Review, 54(1), pp. 755–810. doi: 10.1007/s10462-020-09866-x.
Written conference contributions, peer-reviewed
- Giedemann, P. et al. (2025) 'ViClaim : a multilingual multilabel dataset for automatic claim detection in videos', in Christodoulopoulos, C. et al. (eds) Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, pp. 397–413. doi: 10.18653/v1/2025.emnlp-main.21.
- Stucki, S., Deriu, J. and Cieliebak, M. (2025) 'Voice adaptation for Swiss German', in Proceedings Interspeech 2025. International Speech Communication Association, pp. 4143–4147. doi: 10.21437/interspeech.2025-432.
- von Däniken, P., Deriu, J. M. and Cieliebak, M. (2025) 'A measure of the system dependence of automated metrics', in Che, W. et al. (eds) Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, pp. 87–99. doi: 10.18653/v1/2025.acl-short.8.
- Michot, J. et al. (2024) 'Error-preserving automatic speech recognition of young English learners' language', in Ku, L.-W., Martins, A., and Srikumar, V. (eds) Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, pp. 6444–6454. doi: 10.18653/v1/2024.acl-long.348.
- von Däniken, P. et al. (2024) 'Favi-Score : a measure for favoritism in automated preference ratings for generative AI evaluation', in Ku, L.-W., Martins, A., and Srikumar, V. (eds) Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, pp. 4437–4454. doi: 10.18653/v1/2024.acl-long.243.
- von Däniken, P. et al. (2024) 'Improving quantification with minimal in-domain annotations : beyond classify and count', in Proceedings of the International AAAI Conference on Web and Social Media. AAAI Press, pp. 1585–1598. doi: 10.1609/icwsm.v18i1.31411.
- Deriu, J. et al. (2023) 'Correction of errors in preference ratings from automated metrics for text generation', in Rogers, A., Boyd-Graber, J., and Okazaki, N. (eds) Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, pp. 6456–6474. doi: 10.18653/v1/2023.findings-acl.404.
- Plüss, M. et al. (2023) 'STT4SG-350 : a speech corpus for all Swiss German dialect regions', in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, pp. 1763–1772. doi: 10.18653/v1/2023.acl-short.150.
- Peñas, A. et al. (2023) 'Holistic analysis of organised misinformation activity in social networks', in Ceolin, D., Caselli, T., and Tulin, M. (eds) Disinformation in Open Online Media. Cham: Springer, pp. 132–143. doi: 10.1007/978-3-031-47896-3_10.
- von Däniken, P., Deriu, J. M. and Cieliebak, M. (2023) 'ZHAW-CAI at CheckThat! 2023 : ensembling using kernel averaging', in Aliannejadi, M. et al. (eds) Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023). CEUR Workshop Proceedings, pp. 534–545. doi: 10.21256/zhaw-29046.
- Luley, P.-P. et al. (2023) 'From concept to implementation : the data-centric development process for AI in industry', in 2023 10th IEEE Swiss Conference on Data Science (SDS). IEEE, pp. 73–76. doi: 10.1109/SDS57534.2023.00017.
- Bollinger, T., Deriu, J. M. and Vogel, M. (2023) 'Text-to-speech pipeline for Swiss German : a comparison', in 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. arXiv. doi: 10.48550/arXiv.2305.19750.
- von Däniken, P. et al. (2022) 'Improving NL-to-Query systems through re-ranking of semantic hypothesis', in Abbas, M. and Freihat, A. A. (eds) Proceedings of the 5th International Conference on Natural Language and Speech Processing (ICNLSP 2022). Association for Computational Linguistics, pp. 57–67. doi: 10.21256/zhaw-26147.
- Plüss, M. et al. (2022) 'SDS-200 : a Swiss German speech to Standard German text corpus', in Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022). European Language Resources Association, pp. 3250–3256. doi: 10.21256/zhaw-26131.
- Deriu, J. M. et al. (2022) 'Probing the robustness of trained metrics for conversational dialogue systems', in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 750–761. doi: 10.18653/v1/2022.acl-short.85.
- Ulasik, M. A. et al. (2021) 'ZHAW-CAI : ensemble method for Swiss German speech to Standard German text', in Benites de Azevedo e Souza, F. et al. (eds) Proceedings of the Swiss Text Analytics Conference 2021. CEUR Workshop Proceedings. doi: 10.21256/zhaw-23889.
- Tuggener, D. et al. (2021) 'Are we summarizing the right way? : a survey of dialogue summarization data sets', in Proceedings of the Third Workshop on New Frontiers in Summarization. Association for Computational Linguistics, pp. 107–118. doi: 10.21256/zhaw-23506.
- Campos, J. A. et al. (2020) 'DoQA : accessing domain-specific FAQs via conversational QA', in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 7302–7314. doi: 10.18653/v1/2020.acl-main.652.
- Deriu, J. M. et al. (2020) 'Spot The Bot : a robust and efficient framework for the evaluation of conversational dialogue systems', in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, pp. 3971–3984. doi: 10.18653/v1/2020.emnlp-main.326.
- Deriu, J. M. et al. (2020) 'A methodology for creating question answering corpora using inverse data annotation', in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 897–911. doi: 10.18653/v1/2020.acl-main.84.
- Cieliebak, M., Galibert, O. and Deriu, J. M. (2019) 'Towards understanding lifelong learning for dialogue systems', in IWSDS 2019 Proceedings. IWSDS.
- Sileo, D. et al. (2019) 'Matching words and knowledge graph entities with meta-embeddings', in Proceedings of CAp2019. PFIA, pp. 34–39.
- Deriu, J. M. and Cieliebak, M. (2019) 'Towards a metric for automated conversational dialogue system evaluation and improvement', in 12th International Conference on Natural Language Generation (INLG 2019), Tokyo, Japan, October 29 - November 1, 2019. Available at: https://www.inlg2019.com/assets/papers/132_Paper.pdf.
- Deriu, J. M. and Cieliebak, M. (2018) 'Syntactic manipulation for generating more diverse and interesting texts', in Proceedings of the 11th International Conference on Natural Language Generation. Association for Computational Linguistics, pp. 22–34. doi: 10.18653/v1/W18-6503.
- Grubenmann, R. et al. (2018) 'SB-CH : a Swiss German corpus with sentiment annotations', in Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018. European Language Resources Association.
- Benites de Azevedo e Souza, F. et al. (2018) 'Twist Bytes : German dialect identification with data mining optimization', in Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018). VarDial, pp. 218–227. doi: 10.21256/zhaw-4850.
- von Grünigen, D. et al. (2017) 'Potential and limitations of cross-domain sentiment classification', in Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media. Stroudsburg: Association for Computational Linguistics, pp. 17–24. doi: 10.18653/v1/W17-1103.
- Müller, S. et al. (2017) 'TopicThunder at SemEval-2017 Task 4 : sentiment classification using a convolutional neural network with distant supervision', in Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017). Association for Computational Linguistics, pp. 766–771. doi: 10.21256/zhaw-1529.
- Cieliebak, M. et al. (2017) 'A Twitter corpus and benchmark resources for German sentiment analysis', in 5th International Workshop on Natural Language Processing for Social Media, Boston MA, USA, 11 December 2017. Association for Computational Linguistics, pp. 45–51. doi: 10.18653/v1/W17-1106.
- Graf, H. D. et al. (2017) 'Four different ways to build a chatbot about movies', in SwissText 2017: 2nd Swiss Text Analytics Conference, Winterthur, 9 June 2017.
- Deriu, J. M. and Cieliebak, M. (2016) 'Sentiment analysis using convolutional neural networks with multi-task training and distant supervision on Italian tweets', in Basili, R. and Montemagni, S. (eds) Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016). Italian Journal of Computational Linguistics. doi: 10.21256/zhaw-1527.
Other publications
- Paonessa, C. et al. (2023) Dialect transfer for Swiss German speech translation. arXiv. doi: 10.48550/arXiv.2310.09088.
- von Däniken, P. et al. (2022) 'On the effectiveness of automated metrics for text generation systems', in Findings of the Association for Computational Linguistics: EMNLP 2022. Association for Computational Linguistics, pp. 1503–1522. doi: 10.21256/zhaw-27042.
- Venzin, V. et al. (2019) 'Fact-aware abstractive text summarization using a pointer-generator network', in 4th Swiss Text Analytics Conference (SwissText 2019), Winterthur, June 18-19 2019. Swisstext. doi: 10.21256/zhaw-18988.
- Deriu, J. M. et al. (eds) (2019) Survey on evaluation methods for dialogue. ZHAW Zürcher Hochschule für Angewandte Wissenschaften. doi: 10.21256/zhaw-18985.
- Deriu, J. M. and Cieliebak, M. (2017) 'SwissAlps at SemEval-2017 Task 3 : attention-based convolutional neural network for community question answering', in Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017). Association for Computational Linguistics, pp. 334–338. doi: 10.18653/v1/S17-2054.
- Deriu, J. M. et al. (2017) 'Leveraging large amounts of weakly supervised data for multi-language sentiment classification', in Proceedings of the 26th International Conference on World Wide Web. Association for Computing Machinery, pp. 1045–1052. doi: 10.1145/3038912.3052611.
- Deriu, J. M. and Cieliebak, M. (2017) 'End-to-end trainable system for enhancing diversity in natural language generation', in End-to-End Natural Language Generation Challenge (E2E NLG), 2017. ZHAW Zürcher Hochschule für Angewandte Wissenschaften. doi: 10.21256/zhaw-4889.