Publications

* denotes equal contribution; ^ denotes corresponding authorship.

2026

  1. Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation
    Chongcong Jiang, Tianxingjian Ding, Chuhan Song, Jiachen Tu, Ziyang Yan, Yihua Shao, Zhenyi Wang, Yuzhang Shang, Tianyu Han, and Yu Tian
    Technical report (Arxiv), 2026
  2. VLA-Thinker: Boosting Vision-Language-Action Models through Thinking-with-Image Reasoning
    Chaoyang Wang, Wenrui Bao, Sicheng Gao, Bingxin Xu, Yu Tian, Yogesh S. Rawat, Yunhao Ge, and Yuzhang Shang
    Technical report (Arxiv), 2026
  3. CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space
    Tianxingjian Ding, Yuanhao Zou, Chen Chen, Mubarak Shah, and Yu Tian
    Technical report (Arxiv), 2026
  4. AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
    Yuyuan Liu, Yuanhong Chen, Chong Wang, Junlin Han, Junde Wu, Can Peng, Jingkun Chen, Yu Tian^, and Gustavo Carneiro
    IEEE Conference on Computer Vision and Pattern Recognition Findings (CVPR Findings), 2026

2025

  1. Preprint
    agent_survey.jpg
    Agentic large-language-model systems in medicine: A systematic review and taxonomy
    Abdul Mohaimen Al Radi, Xu Cao, Fanyang Yu, Yuyuan Liu, Fengbei Liu, Chong Wang, Yuanhong Chen, Jintai Chen, Hu Wang, Yanda Meng, Zhenyi Wang, Chen Chen, Mubarak Shah, Tianyu Han, Christos Davatzikos, MacLean Nasrallah, and Yu Tian
    Technical report (Preprint), 2025
  2. FairFedMed: Benchmarking Group Fairness in Federated Medical Imaging with FairLoRA
    Minghan Li*, Congcong Wen*Yu Tian*, Min Shi, Yan Luo, Hao Huang, Yi Fang, and Mengyu Wang
    IEEE Transactions on Medical Imaging (TMI), 2025
  3. Fourier Transform Multiple Instance Learning for Whole Slide Image Classification
    Anthony Bilic, Guangyu Sun, Ming Li, Md Sanzid Bin Hossain, Yu Tian, Wei Zhang, Laura Brattain, Dexter Hadley, and Chen Chen
    Journal of Medical Imaging (JMI), 2025
  4. Fairness-Aware vCDR-Controlled Generation for Glaucoma Diagnosis
    Ziheng Wang, Shuran Yang, Wen Chen, Zhen Zhang, Mengyu Wang, Feixiang Zhou, Yu Tian, Meng Wang, Yitian Zhao, Yalin Zheng, and  others
    In International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
  5. Equitable deep learning for diabetic retinopathy detection using multi-dimensional retinal imaging with fair adaptive scaling: a retrospective study
    Min Shi, Muhammad Muneeb Afzal, Hao Huang, Congcong Wen, Yan Luo, Muhammad Osama Khan, Yu Tian, Leo Kim, Tobias Elze, Yi Fang, and  others
    Translational Vision Science & Technology (TVST Cover Article), 2025
  6. FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation
    Yan Luo, Muhammad Osama Khan, Congcong Wen, Muhammad Muneeb Afzal, Titus Fidelis Wuermeling, Min Shi, Yu Tian, Yi Fang, and Mengyu Wang
    Science Advances (Science Adv), 2025
  7. Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis
    Chengzhi Liu, Zile Huang, Zhe Chen, Feilong Tang, Yu Tian, Zhongxing Xu, Zihong Luo, Yalin Zheng, and Yanda Meng
    The 39th Annual AAAI Conference on Artificial Intelligence (AAAI Oral), 2025
  8. Equitable Artificial Intelligence for Glaucoma Screening with Fair Identity Normalization
    Min Shi*, Yan Luo*Yu Tian, Lucy Q Shen, Tobias Elze, Nazlee Zebardast, Mohammad Eslami, Saber Kazeminasab, Michael V Boland, David S Friedman, and  others
    npj Digital Medicine (npj Digit Med), 2025
  9. Association Between Cup-to-Disc Ratio and Structural and Functional Damage Parameters in Glaucoma: Insights From Multiparametric Modeling
    Aliah McCalla, Mengyu Wang, Mohammad Eslami, Saber Kazeminasab, Yan Luo, Hannah Rana, Sajib Saha, Min Shi, Yu Tian, Nazlee Zebardast, and  others
    Translational Vision Science & Technology (TVST), 2025
  10. An Artificial Intelligence Method for Phenotyping of OCT-Derived Thickness Maps Using Unsupervised and Self-supervised Deep Learning
    Saber Kazeminasab, Sayuri Sekimitsu, Mojtaba Fazli, Mohammad Eslami, Min Shi, Yu Tian, Yan Luo, Mengyu Wang, Tobias Elze, and Nazlee Zebardast
    Journal of Imaging Informatics in Medicine (JIIM), 2025
  11. FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling
    Yan Luo*Yu Tian*, Min Shi*, Tobias Elze, and Mengyu Wang
    arXiv preprint , 2025

2024

  1. FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification
    Yu Tian, Congcong Wen, Min Shi, Muhammad Muneeb Afzal, Hao Huang, Muhammad Osama Khan, Yan Luo, Yi Fang, and Mengyu Wang
    European Conference on Computer Vision (ECCV), 2024
  2. FairCLIP: Harnessing Fairness in Vision-and-Language Learning
    Yan Luo, Min Shi, Osama Khan, Muhammad Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, and  others
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  3. Anomaly Heterogeneity Learning for Open-set Supervised Anomaly Detection
    Jiawen Zhu, Choubo Ding, Yu Tian, and Guansong Pang
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  4. Impact of Demographics on Regional Visual Field Loss and Deterioration in Glaucoma
    Yueyin Pang, Melody Tang, Min Shi, Yu Tian, Yan Luo, Tobias Elze, Louis R Pasquale, Nazlee Zebardast, Michael V Boland, David S Friedman, and  others
    Translational Vision Science & Technology (TVST), 2024
  5. Transformer-Based Deep Learning Prediction of 10-Degree Humphrey Visual Field Tests From 24-Degree Data
    Min Shi, Anagha Lokhande, Yu Tian, Yan Luo, Mohammad Eslami, Saber Kazeminasab, Tobias Elze, Lucy Q Shen, Louis R Pasquale, Sarah R Wellik, and  others
    Translational Vision Science & Technology (TVST), 2024
  6. Harvard Glaucoma Fairness: A Retinal Nerve Disease Dataset for Fairness Learning and Fair Identity Normalization
    Yan Luo*Yu Tian*, Min Shi*, Tobias Elze, and Mengyu Wang
    IEEE Transactions on Medical Imaging (TMI), 2024
  7. FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling
    Yu Tian, Yan Luo, Min Shi, Ava Kouhana, Tobias Elze, and Mengyu Wang
    International Conference on Learning Representations (ICLR), 2024
  8. AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection
    Qihang Zhou, Guansong Pang, Yu Tian, Shibo He, and Jiming Chen
    International Conference on Learning Representations (ICLR), 2024
  9. Semantic Role Labeling Guided Out-of-distribution Detection
    Jinan Zou*, Maihao Guo*Yu Tian*, Yuhao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad, and Javen Qinfeng Shi
    International Conference on Computational Linguistics (COLING), 2024
  10. Translation Consistent Semi-supervised Segmentation for 3D Medical Images
    Yuyuan Liu, Yu Tian, Chong Wang, Yuanhong Chen, Fengbei Liu, Vasileios Belagiannis, and Gustavo Carneiro
    IEEE Transactions on Medical Imaging (TMI), 2024
  11. BRAIxDet: Learning to Detect Malignant Breast Lesion with Incomplete Annotations
    Yuanhong Chen, Yuyuan Liu, Chong Wang, Michael Elliott, Chun Fung Kwok, Yu Tian, Fengbei Liu, Helen Frazer, Davis J McCarthy, Gustavo Carneiro, and  others
    Medical Image Analysis (MedIA), 2024
  12. RNFLT2Vec: Artifact-Corrected Representation Learning for Retinal Nerve Fiber Layer Thickness Maps
    Min Shi, Yu Tian, Yan Luo, Tobias Elze, and Mengyu Wang
    Medical Image Analysis (MedIA), 2024
  13. Detecting, Localising and Classifying Polyps from Colonoscopy Videos using Deep Learning
    Yu Tian, Leonardo Zorron Cheng Tao Pu, Yuyuan Liu, Gabriel Maicas, Johan W Verjans, Alastair D Burt, Seon Ho Shin, Rajvinder Singh, and Gustavo Carneiro
    Deep Learning for Medical Image Analysis (Book Chapter) , 2024
  14. WorldGPT: a Sora-inspired video AI agent as Rich world models from text and image inputs
    Deshun Yang*, Luhui Hu*Yu Tian*, Zihao Li, Chris Kelly, Bang Yang, Cindy Yang, and Yuexian Zou
    arXiv preprint arXiv:2403.07944 , 2024
  15. Generalized Robot Learning Framework
    Jiahuan Yan, Zhouyang Hong, Yu Zhao, Yu Tian, Yunxin Liu, Travis Davies, and Luhui Hu
    arXiv preprint arXiv:2409.12061 , 2024

2023

  1. Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning
    Yan Luo*, Min Shi*Yu Tian*, Tobias Elze, and Mengyu Wang
    In IEEE/CVF international conference on computer vision (ICCV), 2023
  2. BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification
    Yuanhong Chen*, Fengbei Liu*, Hu Wang, Chong Wang, Yu Tian, Yuyuan Liu, and Gustavo Carneiro
    In IEEE/CVF international conference on computer vision (ICCV), 2023
  3. Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation
    Yuyuan Liu*, Choubo Ding*Yu Tian, Guansong Pang, Vasileios Belagiannis, Ian Reid, and Gustavo Carneiro
    In IEEE/CVF international conference on computer vision (ICCV), 2023
  4. Learning Support and Trivial Prototypes for Interpretable Image Classification
    Chong Wang, Yuyuan Liu, Yuanhong Chen, Fengbei Liu, Yu Tian, Davis J McCarthy, Helen Frazer, and Gustavo Carneiro
    In IEEE/CVF international conference on computer vision (ICCV), 2023
  5. Artifact Correction in Retinal Nerve Fiber Layer Thickness Maps Using Deep Learning and Its Clinical Utility in Glaucoma
    Min Shi, Jessica A Sun, Anagha Lokhande, Yu Tian, Yan Luo, Tobias Elze, Lucy Q Shen, and Mengyu Wang
    Translational Vision Science & Technology (TVST), 2023
  6. Unsupervised Anomaly Detection in Medical Images with a Memory-augmented Multi-level Cross-attentional Masked Autoencoder
    Yu Tian, Guansong Pang, Yuyuan Liu, Chong Wang, Yuanhong Chen, Fengbei Liu, Rajvinder Singh, Johan W Verjans, Mengyu Wang, and Gustavo Carneiro
    International Workshop on Machine Learning in Medical Imaging (MICCAI-MLMI), 2023
  7. Self-supervised Pseudo Multi-class Pre-training for Unsupervised Anomaly Detection and Segmentation in Medical Images
    Yu Tian*, Fengbei Liu*, Guansong Pang, Yuanhong Chen, Yuyuan Liu, Johan W Verjans, Rajvinder Singh, and Gustavo Carneiro
    Medical Image Analysis (MedIA), 2023
  8. Artifact-Tolerant Clustering-Guided Contrastive Embedding Learning for Ophthalmic Images in Glaucoma
    Min Shi, Anagha Lokhande, Mojtaba S Fazli, Vishal Sharma, Yu Tian, Yan Luo, Louis R Pasquale, Tobias Elze, Michael V Boland, Nazlee Zebardast, and  others
    IEEE Journal of Biomedical and Health Informatics (JBHI), 2023
  9. UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework
    Chris Kelly, Luhui Hu, Cindy Yang, Yu Tian, Deshun Yang, Bang Yang, Zaoshan Huang, Zihao Li, and Yuexian Zou
    arXiv preprint , 2023
  10. Asymmetric Co-teaching with Multi-view Consensus for Noisy Label Learning
    Fengbei Liu, Yuanhong Chen, Chong Wang, Yu Tain, and Gustavo Carneiro
    arXiv preprint , 2023

2022

  1. Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes
    Yu Tian*, Yuyuan Liu*, Guansong Pang, Fengbei Liu, Yuanhong Chen, and Gustavo Carneiro
    European Conference on Computer Vision (ECCV Oral), 2022
  2. Contrastive Transformer-based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection
    Yu Tian, Guansong Pang, Fengbei Liu, Yuyuan Liu, Chong Wang, Yuanhong Chen, Johan W Verjans, and Gustavo Carneiro
    International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
  3. NVUM: Non-Volatile Unbiased Memory for Robust Medical Image Classification
    Fengbei Liu, Yuanhong Chen, Yu Tian, Yuyuan Liu, Chong Wang, Vasileios Belagiannis, and Gustavo Carneiro
    International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
  4. Multi-view Local Co-occurrence and Global Consistency Learning Improve Mammogram Classification Generalisation
    Yuanhong Chen, Wang Hu, Chong Wang, Yu Tian, Fengbei Liu, Yuyuan Liu, Michael Elliott, Davis McCarthy, Helen Frazer, and Gustavo Carneiro
    International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
  5. Knowledge Distillation to Ensemble Global and Interpretable Prototype-based Mammogram Classification Models
    Chong Wang, Yuanhong Chen, Yuyuan Liu, Yu Tian, Fengbei Liu, Davis McCarthy, Michael Elliott, Helen Frazer, and Gustavo Carneiro
    International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
  6. ACPL: Anti-curriculum Pseudo-labelling for Semi-supervised Medical Image Classification
    Fengbei Liu*Yu Tian*, Yuanhong Chen, Yuyuan Liu, Vasileios Belagiannis, and Gustavo Carneiro
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  7. Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation
    Yuyuan Liu, Yu Tian, Yuanhong Chen, Fengbei Liu, Vasileios Belagiannis, and Gustavo Carneiro
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  8. Deep One-Class Classification via Interpolated Gaussian Descriptor
    Yuanhong Chen*Yu Tian*^, Guansong Pang, and Gustavo Carneiro
    In Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI Oral), 2022

2021

  1. Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning
    Yu Tian, Guansong Pang, Yuanhong Chen, Rajvinder Singh, Johan W Verjans, and Gustavo Carneiro
    In IEEE/CVF international conference on computer vision (ICCV), 2021
  2. Constrained Contrastive Distribution Learning for Unsupervised Anomaly Detection and Localisation in Medical Images
    Yu Tian, Guansong Pang, Fengbei Liu, Yuanhong Chen, Seon Ho Shin, Johan W Verjans, Rajvinder Singh, and Gustavo Carneiro
    In International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2021
  3. Self-supervised Mean Teacher for Semi-supervised Chest X-ray Classification
    Fengbei Liu*Yu Tian*, Filipe R Cordeiro, Vasileios Belagiannis, Ian Reid, and Gustavo Carneiro
    In International Workshop on Machine Learning in Medical Imaging (MICCAI), 2021

2020

  1. Few-shot anomaly detection for polyp frames from colonoscopy
    Yu Tian, Gabriel Maicas, Leonardo Zorron Cheng Tao Pu, Rajvinder Singh, Johan W Verjans, and Gustavo Carneiro
    In International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2020
  2. Photoshopping colonoscopy video frames
    Yuyuan Liu*Yu Tian*^, Gabriel Maicas, Leonardo Zorron Cheng Tao Pu, Rajvinder Singh, Johan W Verjans, and Gustavo Carneiro
    In IEEE 17th International Symposium on Biomedical Imaging (ISBI), 2020
  3. GIE
    GIE.png
    Computer-aided diagnosis for characterization of colorectal lesions: comprehensive software that includes differentiation of serrated lesions
    Leonardo Zorron Cheng Tao Pu, Gabriel Maicas, Yu Tian, Takeshi Yamamura, Masanao Nakamura, Hiroto Suzuki, Gurfarmaan Singh, Khizar Rana, Yoshiki Hirooka, Alastair D Burt, and  others
    Gastrointestinal endoscopy (GIE), 2020

2019

  1. One-stage five-class polyp detection and classification
    Yu Tian, Leonardo ZCT Pu, Rajvinder Singh, Alastair D Burt, and Gustavo Carneiro
    In IEEE 16th International Symposium on Biomedical Imaging (ISBI), 2019