Dr. Jan Milan Deriu
Dr. Jan Milan Deriu
ZHAW
School of Engineering
Centre for Artificial Intelligence
Technikumstrasse 71
8400 Winterthur
Netzwerk
ORCID digital identifier
Social Media
Projekte
- Critical Science Without Borders: LLMs for Translation of Scientific Knowledge in Multilingual Contexts / Projektleiter:in / laufend
- Unified Model for Evaluation of Text Generation Systems / Stellv. Projektleiter:in / laufend
- Holistic Analysis of Organised Misinformation Activity in Social Networks / Projektleiter:in / abgeschlossen
- End-to-End Low-Resource Speech Translation for Swiss German Dialects / Stellv. Projektleiter:in / abgeschlossen
- Pre-Study on Generation of Hockey News / Stellv. Projektleiter:in / abgeschlossen
- Call-E – Virtual Call Agent / Teammitglied / abgeschlossen
- LIHLITH – Learning to Interact with Humans by Lifelong Interaction with Humans / Teammitglied / abgeschlossen
- DeepText: Intelligente Textanalyse mit Deep Learning / Stellv. Projektleiter:in / abgeschlossen
Publikationen
Beiträge in wissenschaftlicher Zeitschrift, peer-reviewed
- Sager, P. J. et al. (2026) 'The cooperative network architecture : learning structured networks as representation of sensory patterns', Neural Computation, 38(4), pp. 538–572. doi: 10.1162/neco.a.1505.
- Zhang, Y. et al. (2024) 'ScienceBenchmark : a complex real-world benchmark for evaluating natural language to SQL systems', Proceedings of the VLDB Endowment, 17(4), pp. 685–698. doi: 10.14778/3636218.3636225.
- Deriu, J. M. et al. (2020) 'Survey on evaluation methods for dialogue systems', Artificial Intelligence Review, 54(1), pp. 755–810. doi: 10.1007/s10462-020-09866-x.
Schriftliche Konferenzbeiträge, peer-reviewed
- Giedemann, P. et al. (2025) 'ViClaim : a multilingual multilabel dataset for automatic claim detection in videos', in Christodoulopoulos, C. et al. (eds) Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, pp. 397–413. doi: 10.18653/v1/2025.emnlp-main.21.
- Stucki, S., Deriu, J. and Cieliebak, M. (2025) 'Voice adaptation for Swiss German', in Proceedings Interspeech 2025. International Speech Communication Association, pp. 4143–4147. doi: 10.21437/interspeech.2025-432.
- von Däniken, P., Deriu, J. M. and Cieliebak, M. (2025) 'A measure of the system dependence of automated metrics', in Che, W. et al. (eds) Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, pp. 87–99. doi: 10.18653/v1/2025.acl-short.8.
- Michot, J. et al. (2024) 'Error-preserving automatic speech recognition of young English learners' language', in Ku, L.-W., Martins, A., and Srikumar, V. (eds) Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, pp. 6444–6454. doi: 10.18653/v1/2024.acl-long.348.
- von Däniken, P. et al. (2024) 'Favi-Score : a measure for favoritism in automated preference ratings for generative AI evaluation', in Ku, L.-W., Martins, A., and Srikumar, V. (eds) Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, pp. 4437–4454. doi: 10.18653/v1/2024.acl-long.243.
- von Däniken, P. et al. (2024) 'Improving quantification with minimal in-domain annotations : beyond classify and count', in Proceedings of the International AAAI Conference on Web and Social Media. AAAI Press, pp. 1585–1598. doi: 10.1609/icwsm.v18i1.31411.
- Peñas, A. et al. (2023) 'Holistic analysis of organised misinformation activity in social networks', in Ceolin, D., Caselli, T., and Tulin, M. (eds) Disinformation in Open Online Media. Cham: Springer, pp. 132–143. doi: 10.1007/978-3-031-47896-3_10.
- von Däniken, P., Deriu, J. M. and Cieliebak, M. (2023) 'ZHAW-CAI at CheckThat! 2023 : ensembling using kernel averaging', in Aliannejadi, M. et al. (eds) Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023). CEUR Workshop Proceedings, pp. 534–545. doi: 10.21256/zhaw-29046.
- Plüss, M. et al. (2023) 'STT4SG-350 : a speech corpus for all Swiss German dialect regions', in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, pp. 1763–1772. doi: 10.18653/v1/2023.acl-short.150.
- Deriu, J. et al. (2023) 'Correction of errors in preference ratings from automated metrics for text generation', in Rogers, A., Boyd-Graber, R., and Okazaki, N. (eds) Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, pp. 6456–6474. doi: 10.18653/v1/2023.findings-acl.404.
- Bollinger, T., Deriu, J. M. and Vogel, M. (2023) 'Text-to-speech pipeline for Swiss German : a comparison', in 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. arXiv. doi: 10.48550/arXiv.2305.19750.
- Luley, P.-P. et al. (2023) 'From concept to implementation : the data-centric development process for AI in industry', in 2023 10th IEEE Swiss Conference on Data Science (SDS). IEEE, pp. 73–76. doi: 10.1109/SDS57534.2023.00017.
- von Däniken, P. et al. (2022) 'Improving NL-to-Query systems through re-ranking of semantic hypothesis', in Abbas, M. and Freihat, A. A. (eds) Proceedings of the 5th International Conference on Natural Language and Speech Processing (ICNLSP 2022). Association for Computational Linguistics, pp. 57–67. doi: 10.21256/zhaw-26147.
- Plüss, M. et al. (2022) 'SDS-200 : a Swiss German speech to Standard German text corpus', in Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022). European Language Resources Association, pp. 3250–3256. doi: 10.21256/zhaw-26131.
- Deriu, J. M. et al. (2022) 'Probing the robustness of trained metrics for conversational dialogue systems', in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 750–761. doi: 10.18653/v1/2022.acl-short.85.
- Ulasik, M. A. et al. (2021) 'ZHAW-CAI : ensemble method for Swiss German speech to Standard German text', in Benites de Azevedo e Souza, F. et al. (eds) Proceedings of the Swiss Text Analytics Conference 2021. CEUR Workshop Proceedings. doi: 10.21256/zhaw-23889.
- Tuggener, D. et al. (2021) 'Are we summarizing the right way? : a survey of dialogue summarization data sets', in Proceedings of the Third Workshop on New Frontiers in Summarization. Association for Computational Linguistics, pp. 107–118. doi: 10.21256/zhaw-23506.
- Campos, J. A. et al. (2020) 'DoQA : accessing domain-specific FAQs via conversational QA', in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 7302–7314. doi: 10.18653/v1/2020.acl-main.652.
- Deriu, J. M. et al. (2020) 'Spot The Bot : a robust and efficient framework for the evaluation of conversational dialogue systems', in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, pp. 3971–3984. doi: 10.18653/v1/2020.emnlp-main.326.
- Deriu, J. M. et al. (2020) 'A methodology for creating question answering corpora using inverse data annotation', in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 897–911. doi: 10.18653/v1/2020.acl-main.84.
- Cieliebak, M., Galibert, O. and Deriu, J. M. (2019) 'Towards understanding lifelong learning for dialogue systems', in IWSDS 2019 Proceedings. IWSDS.
- Deriu, J. M. and Cieliebak, M. (2019) 'Towards a metric for automated conversational dialogue system evaluation and improvement', in 2th International Conference on Natural Language Generation (INLG 2019), Tokyo, Japan, October 29 - November 1, 2019. Available at: https://www.inlg2019.com/assets/papers/132_Paper.pdf.
- Sileo, D. et al. (2019) 'Matching words and knowledge graph entities with meta-embeddings', in Proceedings of CAp2019. PFIA, pp. 34–39.
- Grubenmann, R. et al. (2018) 'SB-CH : a Swiss German corpus with sentiment annotations', in Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018. European Language Resources Association.
- Deriu, J. M. and Cieliebak, M. (2018) 'Syntactic manipulation for generating more diverse and interesting texts', in Proceedings of the 11th International Conference on Natural Language Generation. Association for Computational Linguistics, pp. 22–34. doi: 10.18653/v1/W18-6503.
- Benites de Azevedo e Souza, F. et al. (2018) 'Twist Bytes : German dialect identification with data mining optimization', in Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018). VarDial, pp. 218–227. doi: 10.21256/zhaw-4850.
- Müller, S. et al. (2017) 'TopicThunder at SemEval-2017 Task 4 : sentiment classification using a convolutional neural network with distant supervision', in Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017). Association for Computational Linguistics, pp. 766–771. doi: 10.21256/zhaw-1529.
- Graf, H. D. et al. (2017) 'Four different ways to build a chatbot about movies', in SwissText 2017: 2nd Swiss Text Analytics Conference, Winterthur, 9. Juni 2017.
- von Grünigen, D. et al. (2017) 'Potential and limitations of cross-domain sentiment classification', in Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media. Stroudsburg: Association for Computational Linguistics, pp. 17–24. doi: 10.18653/v1/W17-1103.
- Cieliebak, M. et al. (2017) 'A Twitter corpus and benchmark resources for german sentiment analysis', in 5th International Workshop on Natural Language Processing for Social Media, Boston MA, USA, 11 December 2017. Association for Computational Linguistics, pp. 45–51. doi: 10.18653/v1/W17-1106.
- Deriu, J. M. and Cieliebak, M. (2016) 'Sentiment analysis using convolutional neural networks with multi-task training and distant supervision on italian tweets', in Basili, R. and Montemagni, S. (eds) Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016). Italian Journal of Computational Linguistics. doi: 10.21256/zhaw-1527.
Weitere Publikationen
- Paonessa, C. et al. (2023) Dialect transfer for Swiss German speech translation. arXiv. doi: 10.48550/arXiv.2310.09088.
- von Däniken, P. et al. (2022) 'On the effectiveness of automated metrics for text generation systems', in Findings of the Association for Computational Linguistics: EMNLP 2022. Association for Computational Linguistics, pp. 1503–1522. doi: 10.21256/zhaw-27042.
- Venzin, V. et al. (2019) 'Fact-aware abstractive text summarization using a pointer-generator network', in 4th Swiss Text Analytics Conference (SwissText 2019), Winterthur, June 18-19 2019. Swisstext. doi: 10.21256/zhaw-18988.
- Deriu, J. M. et al. (eds) (2019) Survey on evaluation methods for dialogue. ZHAW Zürcher Hochschule für Angewandte Wissenschaften. doi: 10.21256/zhaw-18985.
- Deriu, J. M. and Cieliebak, M. (2017) 'End-to-end trainable system for enhancing diversity in natural language generation', in End-to-End Natural Language Generation Challenge (E2E NLG), 2017. ZHAW Zürcher Hochschule für Angewandte Wissenschaften. doi: 10.21256/zhaw-4889.
- Deriu, J. M. and Cieliebak, M. (2017) 'SwissAlps at SemEval-2017 Task 3 : attention-based convolutional neural network for community question answering', in Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017). Association for Computational Linguistics, pp. 334–338. doi: 10.18653/v1/S17-2054.
- Deriu, J. M. et al. (2017) 'Leveraging large amounts of weakly supervised data for multi-language sentiment classification', in Proceedings of the 26th International Conference on World Wide Web. Association for Computing Machinery, pp. 1045–1052. doi: 10.1145/3038912.3052611.