Research
I'm interested in medical imaging analysis, clinical decision support systems, and the application
of artificial intelligence in medicine and healthcare. My recent work also explores epidemiological
insights derived from social media data. Much of my research focuses on building AI systems that can
assist clinical diagnosis, treatment planning, and public health surveillance. Selected projects and
publications are highlighted below.
|
|
TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation
Expert Systems with Applications, 2026
project page
/
paper
In this work, we present TOM, an open-source tongue image segmentation method based on multi-teacher knowledge distillation and task-specific diffusion-based data augmentation. The method improves segmentation robustness while reducing model size, with the lightweight student model retaining high segmentation accuracy after substantial parameter compression.
|
|
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
Advances in Neural Information Processing Systems, 2026
project page
/
paper
In this paper, we introduce TCM-Ladder, the first unified multimodal question-answering benchmark for Traditional Chinese Medicine, consisting of over 52,000 questions across text, image, and video formats.
|
|
Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine
arXiv preprint arXiv:2510.17402, 2025
project page
/
paper
In this work, we introduce Ladder-base, a TCM-focused language model trained with Group Relative Policy Optimization on the textual subset of TCM-Ladder. The model is designed to improve reasoning and factual consistency for Traditional Chinese Medicine question answering.
|
|
BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine
arXiv preprint arXiv:2510.17415, 2025
project page
/
paper
In this study, we develop BenCao, a ChatGPT-based multimodal assistant for Traditional Chinese Medicine that integrates structured knowledge, diagnostic data, expert feedback, and external tools for tongue-image analysis.
|
|
Multimodal Knowledge Expansion Widget Powered by Plant Protein Phosphorylation Database and ChatGPT
Frontiers in Bioinformatics, 2025
project page
/
paper
This work presents ChatGPT-P3DB, a multimodal question-answering widget that connects ChatGPT with the Plant Protein Phosphorylation Database and uses multimodal LLMs to extract regulatory pathways from scientific figures.
|
|
Real-Time Oil Spill Concentration Assessment Through Fluorescence Imaging and Deep Learning
Journal of Hazardous Materials, 2025
project page
/
paper
We present a real-time oil spill assessment system that integrates fluorescence imaging, deep learning, a mobile app, and a data management platform for rapid field assessment.
|
|
Images of Two Standard Crude Oils Collected Using a Fluorescent Camera Device to Train and Optimize a Machine Learning Model for Real-Time Oil Spill Concentration Assessment
U.S. Geological Survey Data Release, 2025
data release
/
record
This data release provides 1,530 fluorescence images of two crude oil types across concentrations from 0 to 500 mg/L, along with metadata describing oil type and concentration.
|
|
Leveraging Large Language Models for Infectious Disease Surveillance-Using a Web Service for Monitoring COVID-19 Patterns From Self-Reporting Tweets: Content Analysis
Journal of Medical Internet Research, 2025
project page
/
paper
In this work, we developed a real-time COVID-19 surveillance system based on self-reported cases from Twitter, using large language models to automatically detect infections, symptoms, recoveries, and reinfections.
|
|
TOM: A Universal Tongue Optimization Model for Medical Tongue Image Segmentation
MIC Conference, 2024
project page
/
paper
In this paper, we propose a universal tongue optimization model for tongue segmentation and developed an online tongue segmentation tool based on TOM.
|
|
An Online Tool for Understanding and Monitoring COVID-19 Trends and Spread Based on Self-Reporting Tweets
IEEE International Conference on Medical Artificial Intelligence, 2023
project page
/
paper
This work presents Covlab, an online platform for monitoring COVID-19 trends and geographic spread using self-reported tweets.
|
|
Assessing Environmental Oil Spill Based on Fluorescence Images of Water Samples and Deep Learning
Journal of Environmental Informatics, 2023
project page
/
paper
We developed a portable device and deep learning model that estimate oil concentration in water using fluorescence images captured by an iPhone.
|
|
Digital tongue image analyses for health assessment
Medical Review, 2021
project page
/
paper
In this work, we reviewed recent advances in computerized tongue diagnosis for health assessment, covering tongue image acquisition, segmentation, feature extraction, color correction, and intelligent diagnosis systems.
|
|