Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR AI

Date Submitted: Feb 4, 2025
Open Peer Review Period: Feb 4, 2025 - Mar 18, 2025
Date Accepted: Mar 27, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Using Segment Anything Model 2 for Zero-Shot 3D Segmentation of Abdominal Organs in Computed Tomography Scans to Adapt Video Tracking Capabilities for 3D Medical Imaging: Algorithm Development and Validation

Yamagishi Y, Hanaoka S, Kikuchi T, Nakao T, Nakamura Y, Nomura Y, Miki S, Yoshikawa T, Abe O

Using Segment Anything Model 2 for Zero-Shot 3D Segmentation of Abdominal Organs in Computed Tomography Scans to Adapt Video Tracking Capabilities for 3D Medical Imaging: Algorithm Development and Validation

JMIR AI 2025;4:e72109

DOI: 10.2196/72109

PMID: 40611505

PMCID: 12231515

Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging: Algorithm Development and Validation

  • Yosuke Yamagishi; 
  • Shouhei Hanaoka; 
  • Tomohiro Kikuchi; 
  • Takahiro Nakao; 
  • Yuta Nakamura; 
  • Yukihiro Nomura; 
  • Soichiro Miki; 
  • Takeharu Yoshikawa; 
  • Osamu Abe

ABSTRACT

Background:

Medical image segmentation is crucial for diagnosis and treatment planning in radiology, but traditionally requires extensive manual effort and specialized training data. The Segment Anything Model 2 (SAM 2), with its novel video tracking capabilities, presents a potential solution for automated 3D medical image segmentation without the need for domain-specific training. However, its effectiveness in medical applications, particularly in abdominal CT imaging, remains unexplored.

Objective:

To evaluate the zero-shot performance of SAM 2 in 3D segmentation of abdominal organs in CT scans, and to investigate the effects of prompt settings on segmentation results.

Methods:

In this retrospective study, we used a subset of the TotalSegmentator CT dataset from eight institutions to assess SAM 2's ability to segment eight abdominal organs. Segmentation was initiated from three different z-coordinate levels (caudal, mid, and cranial levels) of each organ. Performance was measured using the Dice similarity coefficient (DSC). We also analyzed the impact of "negative prompts," which explicitly exclude certain regions from the segmentation process, on accuracy.

Results:

123 patients (mean age, 60.7 ± 15.5 years; 63 men, 60 women) were evaluated. As a zero-shot approach, larger organs with clear boundaries demonstrated high segmentation performance, with mean DSCs as follows: liver 0.821 ± 0.192, right kidney 0.862 ± 0.212, left kidney 0.870 ± 0.154, and spleen 0.891 ± 0.131. Smaller organs showed lower performance: gallbladder 0.531 ± 0.291, pancreas 0.361 ± 0.197, and adrenal glands, right 0.203 ± 0.222, left 0.308 ± 0.234. The initial slice for segmentation and the use of negative prompts significantly influenced the results. By removing negative prompts from the input, the DSCs significantly decreased for six organs.

Conclusions:

SAM 2 demonstrated promising zero-shot performance in segmenting certain abdominal organs in CT scans, particularly larger organs. Performance was significantly influenced by input negative prompts and initial slice selection, highlighting the importance of optimizing these factors.


 Citation

Please cite as:

Yamagishi Y, Hanaoka S, Kikuchi T, Nakao T, Nakamura Y, Nomura Y, Miki S, Yoshikawa T, Abe O

Using Segment Anything Model 2 for Zero-Shot 3D Segmentation of Abdominal Organs in Computed Tomography Scans to Adapt Video Tracking Capabilities for 3D Medical Imaging: Algorithm Development and Validation

JMIR AI 2025;4:e72109

DOI: 10.2196/72109

PMID: 40611505

PMCID: 12231515

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.