TIOZ Howest

Howest Logo

Tetra: Metatec

Metagenomics or metagenome analysis is the study of all DNA present from an isolated sample and involves the simultaneous identification of multiple (micro-)organisms for numerous applications. DNA sequencing technologies have made it possible to obtain the complete DNA of uncultivated microorganisms. However, the identification of organisms from mixed populations using traditional short-read sequencing technologies remains a challenge. The latest nanopore sequencing technology consists of real-time, long read sequencing that can overcome limitations of these methods. In this project we will develop new metagenomics applications using nanopore sequencing with various field partners. Furthermore, automation of sample preparation and artificial intelligence (AI) for data analysis will also be investigated.

Cover image

Quick facts

  • /

    Tetra project

  • /

    2 years (2022-2024)

  • /

    Project of BiKC (Bioinformatics Knowledge Center)

  • /

    Collaboration with Cyber 3.0 (AI part)

Why is this project so relevant?

(Micro)organisms are found everywhere in nature and thrive over an immensely wide range of conditions. For example, the majority of MOs are traditionally difficult or impossible to grow. Metagenomics based on high-throughput sequencing (HTS) enables the study of all (micro-)organisms, regardless of whether they can be cultivated or not, by performing genomic analysis on their extracted DNA. It helps researchers to analyze specific organisms from many samples, such as clinical and environmental samples. This provides knowledge about the species present and makes it possible to extract information about the functionality and properties of certain communities

Metagenomics has been extensively used in the last decade for numerous applications across various domains, such as in clinical microbiology, in industry and for ecological applications. It can be divided into two well-defined methods: whole genome and targeted metagenomics.

The first method involves processing the entire DNA of a sample. This provides a complete picture of all organisms in the sample and offers the opportunity to investigate, for example, the types and functions of the organisms. However, with conventional short read sequencing techniques, it is expensive to sequence all the DNA and involves challenging data analysis due to the size and complexity of the data. For example, databases containing reference DNA with which the DNA must be compared (in order to identify the DNA to be examined) can be immense, which is a limiting factor because many research facilities are not able to store and process such a large amount of data.

Targeted metagenomics, on the other hand, focuses on certain pieces of DNA in the sample (e.g. 16S rRNA gene). This can be done by specifically copying the pieces of DNA of interest to determine, for example, what type of organism they belong to. By amplifying this region before DNA determination, the amount of data to sequence and analyze is greatly reduced. However, the disadvantage is that an amplification step can introduce bias into the obtained data and only information is obtained about the amplified regions.

Metagenomics is therefore on the rise, but plenty of research is still required per application.

What will we do?

During this project, new technologies will be used to perform whole genome metagenomics to answer specific questions from the field, namely Oxford Nanopore Technologies (ONT, third generation DNA technology), automation of sample preparation by means of a pipetting robot (OT- 2) and implementation of AI for the analysis of the complex data.

ONT is a recent DNA determination technology that is increasingly used for metagenomics applications. It has the advantage that DNA codes of 10,000 letters and more can be read in real-time, as opposed to several hundred letters with second-generation DNA technology. Longer stretches of DNA code or long reads facilitate the analysis of the data. In addition, the MinION, ONT's smallest device, is a portable device that enables the identification of MOs on site in real time at a relatively low cost per analysis. However, there are still no standardized methods for performing metagenomics within specific applications, which requires optimization and validation. Expertise has already been built up around this state-of-the-art DNA technology during the PWO project NanoDeTech. During this previous project, the focus was on “pure” samples, where only one organism was present per sample (for example a pure culture of a bacterium). However, with this project we wish to build up expertise in the processing and analysis of more complex samples. These samples require a completely different approach in, among other things, the preparation steps and data analysis.

In addition, automation of sample preparation will be examined. Automation leads to faster sample preparation, as a higher number of samples can be processed simultaneously and human errors are excluded. This leads to higher reproducibility and standardized workflow. Automation therefore leads to a larger platform for doing ONT sequencing for multiple applications. The OT-2 open-source pipetting robot will be financed by our own resources, with this PWO project supporting the project employee to program and operate this OT-2 robot.

Finally, the implementation of AI for the analysis of complex data will also be examined. Applications have already been described in the literature in which AI was applied for more accurate and faster detection of pathogens in clinical samples.

Researchers

  • /

    Kyra Van Den Eynde, AI/CS Researcher, AI Lead

Want to know more about our team?

Visit the team page