Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

summary

This theme addresses the challenges and strategies involved in applying machine learning (ML) techniques to multiomics data from distributed biobanks. It focuses on ensuring that these models are robust, generalizable, and can be effectively translated across diverse datasets and populations.

Key areas include:

  1. Multimodal data integration: The module pursues the integration of diverse omics data types, such as genomics, proteomics, and metabolomics, which provide a comprehensive understanding of biological systems.
  2. Generalisability and transportability: This module examines methods to ensure that machine learning models are not only fitting well to the specific dataset but are also generalizable to other biobanks. This includes strategies to improve the model’s adaptability and performance across different populations and settings.
  3. Model validation: The module highlights the importance of rigorous validation processes to assess the performance and reliability of ML models when applied to distributed datasets, including cross-validation, external validation, and the use of benchmark datasets for effective model evaluation.
  4. Applications in personalized medicine: By enhancing the generalisability and transportability of ML models, the module suggests significant potential improvements in personalized medicine, allowing for more accurate prediction tailored to individual patient profiles across varied demographic groups.

This theme emphasizes the development of ML models that can be validated and employed reliably across different biobanks, enhancing their utility in personalized medicine.

 

references

Theme leads

David Clifton

Patrick Schwab

Paul Newcombe