Computer Engineering and Applications ›› 2024, Vol. 60 ›› Issue (17): 1-16.DOI: 10.3778/j.issn.1002-8331.2401-0136

• Research Hotspots and Reviews • Previous Articles     Next Articles

Review of Application of Visual Foundation Model SAM in Medical Image Segmentation

SUN Xing, CAI Xiaohong, LI Ming, ZHANG Shuai, MA Jingang   

  1. School of Intelligence and Information Engineering, Shandong University of Traditional Chinese Medicine, Jinan 250355, China
  • Online:2024-09-01 Published:2024-08-30

视觉大模型SAM在医学图像分割中的应用综述

孙兴,蔡肖红,李明,张帅,马金刚   

  1. 山东中医药大学 智能与信息工程学院,济南 250355

Abstract: With the continuous development of foundation models technology, visual foundation model represented by the segment anything model (SAM) has made significant breakthroughs in the field of image segmentation. SAM, driven by prompts, accomplishes a series of downstream segmentation tasks, aiming to address all image segmentation issues comprehensively. Therefore, the application of SAM in medical image segmentation is of great significance, as its generalization performance can adapt to various medical images, providing healthcare professionals with a more comprehensive understanding of anatomical structures and pathological information. This paper introduces commonly used datasets for image segmentation, provides detailed explanations of SAM’s network architecture and generalization capabilities. It focuses on a thorough analysis of SAM’s application in five major categories of medical images: whole-slide imaging, magnetic resonance imaging, computed tomography, ultrasound, and multimodal images. The review summarizes the strengths and weaknesses of SAM, along with corresponding improvement methods. Combining current challenges in the field of medical image segmentation, the paper discusses and anticipates future directions for SAM’s development.

Key words: visual foundation model, segment anything model (SAM), medical images, image segmentation

摘要: 随着大模型技术的不断发展,以分割一切模型(segment anything model,SAM)为代表的视觉大模型在图像分割领域取得重要突破。SAM通过提示驱动完成一系列下游分割任务,旨在统一解决所有的图像分割问题。因此,将SAM应用于医学图像分割具有重要意义,其泛化性能够适应多种医学图像,为医生提供更全面的解剖结构和病变信息。介绍了图像分割常用的数据集;对SAM的网络结构和泛化性进行细致阐述;重点对SAM应用在全切片成像、磁共振成像、计算机断层扫描、超声和多模态图像的五大类医学图像进行梳理分析,总结优缺点和相应的改进方法;结合当前医学图像分割领域中存在的实际问题,讨论并展望了SAM未来的发展方向。

关键词: 视觉大模型, 分割一切模型(SAM), 医学图像, 图像分割