Přehled

Doctoral study program: Biomedical Sciences
Study plan: Molecular Medicine

Supervisor: Mgr. Vojtěch Bystrý, Ph.D.

Annotation

This Ph.D. project aims to revolutionize clinical oncology by integrating intelligent, AI-driven agents and multimodal generative artificial intelligence. The research will focus on developing and deploying advanced computational frameworks capable of processing, analyzing, and interpreting diverse biomedical datasets—including imaging (MRI, CT, PET), electronic health records, genetic, genomic, transcriptomic, proteomic, and epigenomic data.

The Ph.D. candidate will collaborate on the projects working towards the modular AI agents capable of securely operating within Trusted Research Environments (TREs). These agents will interface with multimodal foundational models, such as dnaBERT, epi-GPT, DeepSNP, and scGPT, deployed within these secure spaces. Agents will translate clinical queries into executable analytical workflows, orchestrate local data processing, and provide aggregated, interpretable outputs to researchers, ensuring compliance with ethical and privacy regulations.

The project will specifically explore the latent-space representations produced by foundational models to unify multimodal health data streams, thereby generating predictive and actionable insights to enhance patient stratification and clinical decision-making. Real-world use cases in hematology, cardiovascular diseases, triple-negative breast cancer, and prostate cancer will be leveraged for validation, particularly within projects like ACGT2, emphasizing long-read sequencing data integration.

Ultimately, this research will produce pioneering methodologies for multimodal generative AI-based predictive modeling, contributing to both clinical oncology and bioinformatics literature, and driving advancements in precision medicine.

Recommended literature

  • Hao, M., Gong, J., Zeng, X., Liu, C., Guo, Y., Cheng, X., Wang, T., Ma, J., Song, L., & Zhang, X. (2023). “Large Scale Foundation Model on Single-cell Transcriptomics.” bioRxiv. https://doi.org/10.1101/2023.05.29.542705
  • Wang, S., et al. (2023). “scGPT: leveraging GPT-like architecture for single-cell RNA-seq analysis.” Nature Methods.
  • Wang, S., et al. (2020). “dnaBERT: Pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome research.” Nature Communications.

Research area: Bioinformatics

Keywords: Generative AI, Multimodal Data Integration, Intelligent Agents, Trusted Research Environments, Cancer Diagnosis, Bioinformatics, Foundational Models, Precision Medicine

Funding of the PhD candidate:

CEITEC Bioinformatics Core Facility budget

EOSC related projects

ELIXIR CZ projects

Requirements for candidate

Candidates should possess a solid background in bioinformatics, data science, machine learning, and computational biology, with demonstrated experience in multimodal health data integration and AI model development. Familiarity with transformer architectures (GPT models), TRE-based secure environments, and multiomics data analysis is highly desirable.

Information about the supervisor

https://www.ceitec.eu/vojtech-bystry-ph-d/u92264

Information about the application process: https://www.ceitec.eu/ls-mm-phd/

Application webpage: https://www.ceitec.eu/integrating-ai-driven-intelligent-agents-and-multimodal-generative-models-for-enhanced-cancer-diagnosis-and-treatment/t11585