Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: JMIR AI

Date Submitted: Mar 12, 2026
Open Peer Review Period: Mar 20, 2026 - May 15, 2026
(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

Warning: This is a unreviewed preprint (What is a preprint?). Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected/withdrawn (a note "no longer under consideration" will appear above).

Peer review me: Readers with interest and expertise are encouraged to sign up as peer-reviewer, if the paper is within an open peer-review period (in this case, a "Peer Review Me" button to sign up as reviewer is displayed above). All preprints currently open for review are listed here. Outside of the formal open peer-review period we encourage you to tweet about the preprint.

Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).

Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encourage to cite the VoR instead of this preprint.

Settings: If you are the author, you can login and change the preprint display settings, but the preprint URL/DOI is supposed to be stable and citable, so it should not be removed once posted.

Submit: To post your own preprint, simply submit to any JMIR journal, and choose the appropriate settings to expose your submitted version as preprint.

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Annotation Budget–Dependent Adaptation of SAM3 for Thyroid Ultrasound Segmentation: LoRA Versus Full Fine-Tuning

  • Karima Bahmane; 
  • ibrahim alkhalil chaouki

ABSTRACT

Background:

Thyroid nodule segmentation in ultrasound imaging is clinically important but typically requires large volumes of expert annotations. Foundation models offer a promising approach to reducing labeled data requirements, yet the relationship between annotation budget and segmentation performance across adaptation strategies remains insufficiently characterized.

Objective:

Thyroid nodule segmentation in ultrasound imaging is clinically important but typically requires large volumes of expert annotations. Foundation models offer a promising approach to reducing labeled data requirements, yet the relationship between annotation budget and segmentation performance across adaptation strategies remains insufficiently characterized.

Methods:

We evaluated four SAM3-based adaptation strategies on TN5000 (5,000 biopsy-confirmed thyroid ultrasound images): full fine-tuning (MedSAM3), zero-shot inference (SAM3-Zero), and Low-Rank Adaptation at ranks 8 and 16 (SAM3-LoRA8, SAM3-LoRA16). All models were initialized via Masked Autoencoder pretraining on the unlabeled corpus and evaluated under four annotation regimes (10%, 25%, 50%, and 100% of labeled data) using Dice, IoU, AUC, sensitivity, specificity, and F1-score, with 95% bootstrap confidence intervals and Bonferroni-corrected significance testing. External validation was conducted on the independent DDTI dataset.

Results:

At full annotation budget, MedSAM3 achieved the highest internal performance (Dice = 0.815, AUC = 0.989), significantly outperforming the zero-shot baseline (Dice = 0.097, p < 0.001 across Wilcoxon, DeLong, and McNemar tests). At 25% label availability, SAM3-LoRA8 matched or exceeded MedSAM3 (Dice = 0.736 vs 0.726) while updating only 1.85% of model parameters, identifying 25% as the annotation-efficiency crossover threshold. No statistically significant difference was observed between LoRA ranks 8 and 16 on per-image Dice (Wilcoxon p = 0.704), indicating that increasing rank beyond 8 yields no measurable performance benefit for this task. External validation on the DDTI dataset preserved model ranking (MedSAM3 AUC = 0.911); a dissociation between AUC and Dice under domain shift was observed across all fine-tuned models, with implications for cross-site deployment.

Conclusions:

Full fine-tuning with domain-adapted initialization provides the best segmentation performance at large annotation budgets. LoRA adaptation with rank 8 offers a more annotation-efficient and parameter-efficient alternative, matching full fine-tuning at 25% label availability and requiring approximately half the training time. These findings provide a concrete decision framework for selecting adaptation strategies in annotation-constrained clinical deployments of thyroid ultrasound segmentation.


 Citation

Please cite as:

Bahmane K, chaouki ia

Annotation Budget–Dependent Adaptation of SAM3 for Thyroid Ultrasound Segmentation: LoRA Versus Full Fine-Tuning

JMIR Preprints. 12/03/2026:95221

DOI: 10.2196/preprints.95221

URL: https://preprints.jmir.org/preprint/95221

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.