Research Centre for Bioinformatics
About us
The Research Centre for Bioinformatics focuses on the theoretical and computational aspects of modelling the molecular biology processes, genome evolution and adaptive change, as well as biomedical data representation and integration. The goal is to bring basic research and new bioinformatics methods to real-world applications, ranging, for example, from biotechnology and forensics to biomedical research and environmental applications. The research area is represented by the several research groups, each focusing on certain methods or application domains.
Computational Genomics
The research group develops computational methods for comparative and evolutionary genomics, including modelling of stochastic processes in molecular evolution. Many research projects focus on the analysis of protein-coding genes and gene families, selection, adaptation, phylodynamics and evolution, including host-pathogen interactions; applications in medical genomics, epidemiology, metagenomics and forensics. Our research includes studies of genomic repeat sequences and indel evolution with applications in cancer genomics and biotechnology, as well as studies of dynamics and evolution of viruses and other pathogens.
Group leader: Prof. Dr. Maria Anisimova
Biomedical String Analysis
The research group is specialized in the analysis of strings (i.e. finite sequences of symbols). The research projects and applications focus on genomic data and biomedical natural language. The group develops new computational science methods and applies existing methods. This includes: mathematical modeling, computational statistics, algorithm design, discrete mathematics, machine and deep learning, natural language processing, semantic web technologies.
Group leader: Dr. Manuel Gil | Learn more about the research group Biomedical String Analysis
Applied Mathematical Biology
The research group develops and applies mathematical models and methods to address open research questions in biology. Many methods use standard calculus, differential equations, machine learning and dynamical systems theory to describe and predict biological phenomena, such as for example, the relationship between codon bias and gene expression via the concept of translational efficiency, applied to codon optimization problems. Further interests lie in the exploration of cancer-immune system interactions and their predictive power for cancer immunotherapies as well as the population genetics of the early infection-phase of partially-recombining viruses.
Group leader: Dr. Victor Garcia
Teaching Activities
The focus includes teaching at BSc, MSc and PhD level in computational sciences with a focus on computational genomics, bioinformatics, mathematical modelling, biostatistics, programming and algorithms for molecular biology.
Projects
-
AI for colorectal cancer: towards the improved CMS classification and interpretability
Access to large complex biomedical data today allows scientists to take full advantage of AI-driven approaches in a variety of applications with high societal impact. One such application is precision medicine, which is gradually becoming reality for some cancers. Unfortunately, for colorectal…
completed, 11/2022 - 11/2023
-
Digital Tools for Codon Optimization
Exploring biotech's potential for negative emissions technologies
completed, 01/2021 - 12/2022
-
Dynamik Knowledge Platform
completed, 10/2020 - 12/2021
-
Trans-omic approach to colorectal cancer: an integrative computational and clinical perspective
Colorectal Cancer (CRC) is an important cause of cancer-related mortality world-wide. The Consensus Molecular Subtypes represent the first comprehensive molecular classification with clinical implications, but many aspects are still missing. We use a transomic approach to improve the stratification,…
ongoing, 10/2020 - 10/2025
-
Data mining in neurological medicine
Pattern recognition algorithms
completed, 09/2019 - 04/2020
-
Computational literature-based natural product drug discovery
Natural products such as medicinal plants and extract mixtures have successfully supported the discovery of pharmaceuticals. Medically relevant products and their properties are often found through systematic analysis of the literature. In 1980s Swanson found hidden links between pieces of knowledge…
completed, 08/2019 - 07/2021
-
Repeat protein Function, Refinement, Annotation and Classification of Topologies (REFRACT)
REFRACT is an international consortium aiming to extend our knowledge on the mechanism of tandem repeat protein (TRP) function and evolution, establishing a common classification and best practices. Starting from available state of the art computational tools and databases, it aims to drive a new…
completed, 04/2019 - 04/2025
-
The effect of programmed ribosomal frameshifting on codon usage bias
The discovery of synonymous codon usage bias (CUB) –the unequal use of codons that code for the same amino-acid– has strengthened the notion that synonymous mutations can alter the fitness of organisms. Synonymous or silent mutations are mutations in DNA that do not alter the encoded amino acid…
completed, 03/2019 - 03/2023
-
Frequentist estimation of the evolutionary history of sequences with substitutions and indels
High throughput sequencing technologies have permitted a wide range of scientists to observe an astonishing molecular diversity across all domains of life. Since all observed molecular sequences area result of a long evolutionary history, most informative inferences can be made only when analysing…
completed, 05/2018 - 01/2023
-
Positive selection detection of genome from Ralstonia bacterium
completed, 09/2017 - 04/2018
-
Enabling Complex, Semantic Queries to Bioinformatics Databases through Intuitive Searching over Data (Bio-SODA)
One of the major promises of Big Data lies in the simultaneous mining of multiple sources of data. This is particularly important in life sciences, where different and complementary data are scattered across multiple resources. To overcome this issue, the use of RDF/semantic web technology is…
completed, 04/2017 - 03/2021
-
Exploring the silent fitness landscape
Since Darwin, natural selection has been recognized as one of major biological forcesshaping genetic patterns in molecular data. Detecting selection on proteins has become an indispensible part of genome studies. Remarkably selection can act not only on proteins, but also on synonymous codons…
completed, 01/2017 - 12/2018
-
Discovering evolutionary innovations by assessing variation and natural selection in protein tandem repeats
Tandem repeats (TRs) are abundant in proteomes across all kingdoms of life. Having an impressive variety of sizes, structures and functions, TRs often offer enhanced binding properties and are associated with disease and immunity related functions. While mechanisms generating protein TRs are poorly…
completed, 10/2016 - 02/2020
-
Fast joint estimation of alignment and phylogeny from genomics sequences in a frequentist framework
The availability of large molecular data demands accurate and fast bioinformatics methods to analyze these data. Molecular sequences of common origin are used to infer phylogenetic trees, which help to test various biological hypotheses or to support subsequent analyses. Phylogeny inference relies…
completed, 02/2015 - 09/2018
Publications
-
Verbiest, Max; Lundström, Oxana; Xia, Feifei; Baudis, Michael; Bilgin Sonay, Tugçe; Anisimova, Maria,
2024.
Short tandem repeat mutations regulate gene expression in colorectal cancer.
Scientific Reports.
14(1), pp. 3331.
Available from: https://doi.org/10.1038/s41598-024-53739-0
-
Apsley, Abner T.; Domico, Emma R.; Verbiest, Max A.; Brogan, Carly A.; Buck, Evan R.; Burich, Andrew J.; Cardone, Kathleen M.; Stone, Wesley J.; Anisimova, Maria; Vandenbergh, David J.,
2023.
A novel hypervariable variable number tandem repeat in the dopamine transporter gene (SLC6A3).
Life Science Alliance.
6(4), pp. e202201677.
Available from: https://doi.org/10.26508/lsa.202201677
-
Verbiest, Max; Maksimov, Mikhail; Jin, Ye; Anisimova, Maria; Gymrek, Melissa; Bilgin Sonay, Tugce,
2022.
Journal of Evolutionary Biology.
36(2), pp. 321-336.
Available from: https://doi.org/10.1111/jeb.14106
-
Jowkar, Gholamhossein; Pecerska, Julija; Maiolo, Massimo; Gil, Manuel; Anisimova, Maria,
2022.
Systematic Biology.
72(2), pp. 307-318.
Available from: https://doi.org/10.1093/sysbio/syac050
-
Sima, Ana Claudia; Mendes de Farias, Tarcisio; Anisimova, Maria; Dessimoz, Christophe; Robinson-Rechavi, Marc; Zbinden, Erich; Stockinger, Kurt,
2022.
Distributed and Parallel Databases.
40(2), pp. 409-440.
Available from: https://doi.org/10.1007/s10619-022-07414-w