- How to leverage information?
- How to find new topics and trends?
- How to derive insight from heterogeneous/unstructured data and information?
- How to allow a «natural» access to data?
- How can software link data automatically?
These are but a few of the questions that the Information Engineering (IE) group is working to answer. While the “data and information flood” is often discussed negatively, we see a great opportunity to leverage data and information using the right approaches, both at search time and during analysis.
The group transfers insights derived from research and development into teaching for students of the computer science curricula. It offers modules such as “Information Engineering 1 (Information Retrieval)”, “Information Engineering 2 (Data Warehousing & Big Data)” and “Databases”. The group is active in both national research projects and international ones, including projects of the EU framework programs.
The Information Engineering group develops solutions for a changing, data-driven world. It performs research at the intersection of databases (DB), information retrieval (IR), data engineering (DE), natural language processing (NLP) and machine learning (ML).
The group covers two main research lines:
We solve challenging problems across a range of datasets from very small (nano data) to very large (big data), where the nature of the problems changes drastically from one scale to the next:
- Information retrieval for small document collections
- Machine learning for query optimization
- Artificial intelligence for data integration and cleaning
- Quantum databases and quantum machine learning
As we strive for "intelligent" solutions to data-driven problems, classical information systems need to process data at a different level, interpreting it to extract the information that matters. Both structured and unstructured data must be processed not on a mechanical but on a semantic level, for example by using natural language processing and understanding. Data is ultimately connected through graph structures or made accessible via semantic search.
- Natural language interfaces for databases
- Semantic search on entities
- Knowledge graph construction
- Question answering over knowledge graphs
- Stream analytics and event detection
- Information retrieval evaluation
Good practices for responsible development of AI-based applications in healthcare
This project will identify proven methods, practices and standards that support responsible research and development of AI systems for health. They will be tested in use cases from medical imaging and neurotechnology, publicly released and published as a guideline of recommended best practices. ...
GraphQueryML – Using Machine Learning to Optimize Queries in Graph Databases (SNSF/DFG)
Optimizing the brain of databases with machine learning: Query optimization is one of the hardest problems of database systems research. A query optimizer can be considered as the “brain” of the system that makes sure that queries are executed efficiently. Even after several decades of research, many sub-problems ...
Accessible Scientific PDFs for All
PDF is the most popular document format to provide and distribute information on the internet. It was developed by Adobe in 1993 but has been an open format since 2008. It was estimated in 2015 that more than 2.5 trillion PDF documents exist on the internet, covering all aspects of life and research, and their number ...
Proceedings of the 37th ICDE.
International Conference on Data Engineering (ICDE), Chania, Greece, 19-22 April 2021.
Available from: https://doi.org/10.21256/zhaw-22000
1st TAILOR Workshop on Trustworthy AI at ECAI 2020, Santiago de Compostela, Spain, 29-30 August 2020.
Available from: https://doi.org/10.21256/zhaw-22061
Journal of Big Data.
Available from: https://doi.org/10.1186/s40537-020-00383-w
Aydarkhanov, Ruslan; Ušćumlić, Marija; Chavarriaga, Ricardo; Gheorghe, Lucian; Millán, José del R.
Journal of Neural Engineering.
18(2), pp. 026010.
Available from: https://doi.org/10.1088/1741-2552/abdfb2
Jao, Ping-Keng; Chavarriaga, Ricardo; Dell'Agnola, Fabio; Arza, Adriana; Atienza, David; Millan, Jose del R.
IEEE Transactions on Human-Machine Systems.
51(2), pp. 99-108.
Available from: https://doi.org/10.1109/THMS.2020.3038339