Yijiang Liu ☕️
Yijiang Liu

PhD Student (2025-)

About Me

I’m currently a PhD student at NTU School of Chemistry, Chemical Engineering and Biotechnology (CCEB). My research focuses on bioinformatics, with a particular emphasis on the integrative analysis of multi-omics data to investigate health-related issues. In my free time, I like playing badminton and travelling.

Interests
  • Badminton
  • Cats
  • Bioinformatics
Education
  • MSc Biomedical Data Science (2023-2024)

    Nanyang Technological University

  • BSc Chemistry (2018-2022)

    Sun Yat-sen University

📚 My Research
My research interests include bioinformatics, biostatistics and machine learning. Inspired by the rapid development of artificial intelligence, I am particularly interested in combining biology and data science to provide solutions to health-related problems.
Publications
Application of LLMs/Transformer-Based Models for Metabolite Annotation in Metabolomics

Application of LLMs/Transformer-Based Models for Metabolite Annotation in Metabolomics

Liquid Chromatography-Mass Spectrometry (LC-MS) untargeted metabolomics has become a cornerstone of modern biomedical research, enabling the analysis of complex metabolite profiles in biological systems. However, metabolite annotation, a key step in LC-MS untargeted metabolomics, remains a major challenge due to the limited coverage of existing reference libraries and the vast diversity of natural metabolites. Recent advancements in large language models (LLMs) powered by Transformer architecture have shown significant promise in addressing challenges in data-intensive fields, including metabolomics. LLMs, which when fine-tuned with domain-specific datasets such as mass spectrometry (MS) spectra and chemical property databases, together with other Transformer-based models, excel at capturing complex relationships and processing large-scale data and significantly enhance metabolite annotation. Various metabolomics tasks include retention time prediction, chemical property prediction, and theoretical MS2 spectra generation. For example, methods such as LipiDetective and MS2Mol have shown the potential of machine learning in lipid species prediction and de novo molecular structure annotation directly from MS2 spectra. These tools leverage transformer principles and their integration with LLM frameworks could further expand their utility in metabolomics. Moreover, the ability of LLMs to integrate multi-modal datasets—spanning genomics, transcriptomics, and metabolomics—positions them as powerful tools for systems-level biological analysis. This review highlights the application and future perspectives of Transformer-based LLMs for metabolite annotation of LC-MS metabolomics incorporating with multiomics. Such transformative potential paves the way for enhanced annotation accuracy, expanded metabolite coverage, and deeper insights into metabolic processes, ultimately driving advancements in precision medicine and systems biology.

Recent & Upcoming Talks
Recent News

✅ Manage your projects

Easily manage your projects - create ideation mind maps, Gantt charts, todo lists, and more!