A Comprehensive Survey on Segment Anything Model for Vision and Beyond

The First Comprehensive SAM Survey: A Comprehensive Survey on Segment Anything Model for Vision and Beyond. Chunhui Zhang, Li Liu, Yawen Cui, Guanjie Huang, Weilin Lin, Yiqian Yang, Yuehong Hu. [paper] [homepage][中文解读]

Abstract: Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence similar to that of a human being. This is in contrast to narrow or specialized AI, which is designed to perform specific tasks with a high degree of efficiency. Therefore, it is urgent to design a general class of models, which we term foundation models, trained on broad data that can be adapted to various downstream tasks. The recently proposed segment anything model (SAM) has made significant progress in breaking the boundaries of segmentation, greatly promoting the development of foundation models for computer vision. To fully comprehend SAM, we conduct a survey study. As the first to comprehensively review the progress of segmenting anything task for vision and beyond based on the foundation model of SAM, this work focuses on its applications to various tasks and data types by discussing its historical development, recent progress, and profound impact on broad applications. We first introduce the background and terminology for foundation models including SAM, as well as state-of-the-art methods contemporaneous with SAM that are significant for segmenting anything task. Then, we analyze and summarize the advantages and limitations of SAM across various image processing applications, including software scenes, real-world scenes, and complex scenes. Importantly, many insights are drawn to guide future research to develop more versatile foundation models and improve the architecture of SAM. We also summarize massive other amazing applications of SAM in vision and beyond. Finally, we maintain a continuously updated paper list and an open-source project summary for foundation model SAM at here.

Awesome Segment Anything Models: A curated list of awesome segment anything models in computer vision and beyond. This repository supplements our survey paper. We intend to continuously update it.

If you like our project, please give us a star ⭐ on GitHub for latest update.

We strongly encourage authors of relevant works to make a pull request and add their paper's information [here].

💥SAM 3.1: ''SAM 3.1 Object Multiplex'' was released.

💥SAM Audio: ''SAM Audio: Segment Anything in Audio'' was released.

💥SAM 3D: ''SAM 3D: 3Dfy Anything in Images'' was released.

💥SAM 3: ''SAM 3: Segment Anything with Concepts'' was released.

💥SAM 2: ''Segment Anything in Images and Videos'' was released.

💥SAM: ''Segment Anything'' was released.

💥SAM & SAM2 for videos: The first survey on Segment Anything for Videos: A Systematic Survey was online.

🔥 Highlights

- 2026.03.27: SAM 3.1 Object Multiplex was released.
- 2025.12.15: SAM Audio was released.
- 2025.11.19: SAM 3 and SAM 3D were released.
- 2025.10.11: SAM 3 arrives! Officially announced and set to launch.
- 2025.04.22: SAM 2 won the ICLR 2025 Best Paper Honorable Mention.
- 2024.07.31: The first survey on SAM & SAM2 for Videos was online.
- 2024.07.29: The SAM 2 was released.
- 2023.07.14: "Segment Anything" was accepted by ICCV 2023 (Best Paper Honorable Mention).
- 2023.05.16: An initial version of this Awesome-Segment-Anything project.
- 2023.05.14: The first comprehensive SAM survey was online.
- 2023.04.05: The paper of "Segment Anything" was online.

Citation

If you find our work useful in your research, please consider citing:

@article{zhang2023comprehensive,
  title={A Comprehensive Survey on Segment Anything Model for Vision and Beyond},
  author={Zhang, Chunhui and Liu, Li and Cui, Yawen and Huang, Guanjie and Lin, Weilin and Yang, Yiqian and Hu, Yuehong},
  journal={arXiv preprint arXiv:2305.08196},
  year={2023}
}

@article{zhang2024segment,
  title={Segment Anything for Videos: A Systematic Survey},
  author={Zhang, Chunhui and Cui, Yawen and Lin, Weilin and Huang, Guanjie and Rong, Yan and Liu, Li and Shan, Shiguang},
  journal={arXiv preprint arXiv:2408.08315},
  year={2024}
}

Survey

The First Comprehensive SAM Survey: Chunhui Zhang, Li Liu, Yawen Cui, Guanjie Huang, Weilin Lin, Yiqian Yang, Yuehong Hu.
"A Comprehensive Survey on Segment Anything Model for Vision and Beyond." ArXiv (2024). [paper] [homepage] [中文解读] [2023.05]
The First Survey on SAM & SAM2 for Videos: Chunhui Zhang, Yawen Cui, Weilin Lin, Guanjie Huang, Yan Rong, Li Liu, Shiguang Shan.
"Segment Anything for Videos: A Systematic Survey." ArXiv (2024). [ArXiv] [ChinaXiv] [ResearchGate] [Project] [中文解读] [2024.07]
SAM4MIS: Yichi Zhang, Rushi Jiao.
"Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey." CBM (2024). [paper] [project] [2023.05]
Yichi Zhang, Zhenrong Shen.
"Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey." ArXiv (2024). [paper] [code] [2024.08]
Tianfei Zhou, Fei Zhang, Boyu Chang, Wenguan Wang, Ye Yuan, Ender Konukoglu, Daniel Cremers.
"Image Segmentation in Foundation Model Era: A Survey." ArXiv (2024). [paper] [2024.08]
Chaoning Zhang, Fachrina Dewi Puspitasari, Sheng Zheng, Chenghao Li, Yu Qiao, Taegoo Kang, Xinru Shan, Chenshuang Zhang, Caiyan Qin, Francois Rameau, Lik-Hang Lee, Sung-Ho Bae, Choong Seon Hong.
"A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering." ArXiv (2024). [paper] [2023.05]
Xiaorui Sun, Jun Liu, Heng Tao Shen, Xiaofeng Zhu, Ping Hu.
"On Efficient Variants of Segment Anything Model: A Survey." IJCV (2025). [paper] [2024.10]
Mudassar Ali and Tong Wu and Haoji Hu and Qiong Luo and Dong Xu and Weizeng Zheng and Neng Jin and Chen Yang and Jincao Yao.
"A review of the Segment Anything Model (SAM) for medical image analysis: Accomplishments and perspectives." Computerized Medical Imaging and Graphics (2024). [paper] [2024.12]
Zhang Jiaxing, Tang Hao.
"SAM2 for Image and Video Segmentation: A Comprehensive Survey." ArXiv (2025). [paper] [2025.03]
Kang Wang.
"A survey on SAM-based methods for medical image segmentation." IS-AII (2025). [paper] [2025.07]
Guoping Xu, Jayaram K. Udupa, Yajun Yu, Hua-Chieh Shao, Songlin Zhao, Wei Liu, You Zhang.
"Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future." ArXiv (2025). [paper] [2025.07]
WanSAM4RS-Tracker: Zhipeng Wan and Sheng Wang and Wei Han and Yuewei Wang and Xiaohui Huang and Xiaohan Zhang and Xiaodao Chen and Yunliang Chen.
"A systematic survey and meta-analysis of the segment anything model in remote sensing image processing: Challenges, advances, applications, and opportunities." ISPRS Journal of Photogrammetry and Remote Sensing (2025). [paper] [project] [2025.09]
Yang, Yizai and Cheng, Lechao and Wang, Yaxiong and Hui, Tianrui and Li, Wenjing and Zhong, Zhun.
"A Survey for Point Prompt of Segment Anything Model." MMAsia Workshops (2025). [paper] [2025.12]

Paper List

Seminal Papers

SAM: Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick.
"Segment Anything." ICCV (2023) Best Paper Honorable Mention. [paper] [homepage] [code] [Zhihu] [Reddit] [2023.04]
SAM 2: Nikhila Ravi∗,†, Valentin Gabeur∗, Yuan-Ting Hu∗, Ronghang Hu∗, Chaitanya Ryali∗, Tengyu Ma∗, Haitham Khedr∗, Roman Rädle∗ Chloe Rolland, Laura Gustafson, Eric Mintun, Junting Pan, Kalyan Vasudev Alwala, Nicolas Carion, Chao-Yuan Wu, Ross Girshick, Piotr Dollár†, Christoph Feichtenhofer∗,†.
"SAM 2: Segment Anything in Images and Videos." ICLR (2025) Best Paper Honorable Mention. [paper] [demo]] [code] [project]] [dataset] [blog] [2024.07]
SAM 3: Nicolas Carion*, Laura Gustafson*, Yuan-Ting Hu*, Shoubhik Debnath*, Ronghang Hu*, Didac Suris*, Chaitanya Ryali*, Kalyan Vasudev Alwala*, Haitham Khedr*, Andrew Huang, Jie Lei, Tengyu Ma, Baishan Guo, Arpit Kalla, Markus Marks, Joseph Greer, Meng Wang, Peize Sun, Roman Rädle, Triantafyllos Afouras, Effrosyni Mavroudi, Katherine Xu°, Tsung-Han Wu°, Yu Zhou°, Liliane Momeni°, Rishi Hazra°, Shuangrui Ding°, Sagar Vaze°, Francois Porcher°, Feng Li°, Siyuan Li°, Aishwarya Kamath°, Ho Kei Cheng°, Piotr Dollar†, Nikhila Ravi†, Kate Saenko†, Pengchuan Zhang†, Christoph Feichtenhofer†.
"SAM 3: Segment Anything with Concepts." ICLR (2026). [paper] [arXiv] [code] [homepage] [中文解读] [2025.10]
SAM 3D: SAM 3D Team, Xingyu Chen, Fu-Jen Chu, Pierre Gleize, Kevin J Liang, Alexander Sax, Hao Tang Weiyao Wang, Michelle Guo, Thibaut Hardin, Xiang Li, Aohan Lin, Jiawei Liu, Ziqi Ma, Anushka Sagar, Bowen Song, Xiaodong Wang, Jianing Yang, Bowen Zhang, Piotr Dollár, Georgia Gkioxari, MattFeiszli, Jitendra Malik.
"SAM 3D: 3Dfy Anything in Images." ArXiv (2025). [paper] [code] [project] [demo] [blog] [中文解读] [2025.11]
SAM 3D Body: Xitong Yang⋆, Devansh Kukreja⋆, Don Pinkus⋆, Anushka Sagar, Taosha Fan, Jinhyung Park◦, Soyong Shin◦, Jinkun Cao, Jiawei Liu, Nicolas Ugrinovic, Matt Feiszli†, Jitendra Malik†, Piotr Dollar†, Kris Kitani†.
"SAM 3D Body: Robust Full-Body Human Mesh Recovery." ArXiv (2025). [paper] [code] [project] [2025.11]
SAM Audio: Bowen Shi∗, Andros Tjandra∗, John Hoffman∗, Helin Wang∗, Yi-Chiao Wu∗, Luya Gao∗, Julius Richter†,Matt Le†, Apoorv Vyas†, Sanyuan Chen†, Christoph Feichtenhofer‡, Piotr Dollár‡, Wei-Ning Hsu‡, Ann Lee‡.
"SAM Audio: Segment Anything in Audio." ArXiv (2025). [paper] [code] [project] [demo] [2025.12]
GPT-4V: OpenAI.
"GPT-4V(ision) System Card." ArXiv (2023). [paper] [homepage] [2023.09]
Gemini: Gemini Team, Google.
"Gemini: A Family of Highly Capable Multimodal Models." ArXiv (2023). [paper] [homepage] [blog] [2023.12]
SEEM: Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, Jianfeng Gao, Yong Jae Lee.
"Segment Everything Everywhere All at Once." NeurIPS (2023). [paper] [code] [2023.04]
SegGPT: Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang.
"SegGPT: Segmenting Everything In Context." ICCV (2023). [paper] [code] [2023.04]
Grounding DINO: Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Hao Zhang, Jie Yang, Chunyuan Li, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang.
"Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection." ArXiv (2023). [paper] [code] [2023.04]
ImageBind: Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra.
"ImageBind: One Embedding Space To Bind Them All." CVPR (2023). [paper] [homepage] [code] [2023.05]
LanguageBind: Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, HongFa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Wancai Zhang, Zhifeng Li, Wei Liu, Li Yuan.
"LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment." ArXiv (2023). [paper] [code]
Meta-Transformer: Yiyuan Zhang, Kaixiong Gong, Kaipeng Zhang, Hongsheng Li, Yu Qiao, Wanli Ouyang, Xiangyu Yue.
"Meta-Transformer: A Unified Framework for Multimodal Learning." ArXiv (2023). [paper] [homepage] [code] [中文解读] [2023.07]
OpenSeeD: Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang.
"A Simple Framework for Open-Vocabulary Segmentation and Detection." ICCV (2023). [paper] [code] [2023.03]
RAM: Youcai Zhang, Xinyu Huang, Jinyu Ma, Zhaoyang Li, Zhaochuan Luo, Yanchun Xie, Yuzhuo Qin, Tong Luo, Yaqian Li, Shilong Liu, Yandong Guo, Lei Zhang.
"Recognize Anything: A Strong Image Tagging Model." ArXiv (2023). [paper] [homepage] [code] [2023.06]
PACGen: Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee.
"Generate Anything Anywhere in Any Scene." ArXiv (2023). [paper] [homepage] [code] [2023.06]
ASM: Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao.
"The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World." ArXiv (2023). [paper] [homepage] [demo] [2023.08]
OneFormer: Jitesh Jain, Jiachen Li, MangTik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi.
"OneFormer: One Transformer to Rule Universal Image Segmentation." CVPR (2023). [paper] [homepage] [code] [2022.11]
OVSeg: Feng Liang, Bichen Wu, Xiaoliang Dai, Kunpeng Li, Yinan Zhao, Hang Zhang, Peizhao Zhang, Peter Vajda, Diana Marculescu.
"Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP." CVPR (2023). [paper] [homepage] [code] [2022.10]
WAM: Tom Sander, Pierre Fernandez, Alain Durmus, Teddy Furon, Matthijs Douze.
"Watermark Anything with Localized Messages." ArXiv (2024). [paper] [code] [2024.11]
Sa2VA: Haobo Yuan, Xiangtai Li, Tao Zhang, Zilong Huang, Shilin Xu, Shunping Ji, Yunhai Tong, Lu Qi, Jiashi Feng, Ming-Hsuan Yang.
"Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos." ArXiv (2025). [paper] [code] [project] [hugging face] [2025.01]
SAMTok: Yikang Zhou, Tao Zhang, Dengxian Gong, Yuanzheng Wu, Ye Tian, Haochen Wang, Haobo Yuan, Jiacong Wang, Lu Qi, Hao Fei, Anran Wang, Zhuochen Wang, Yujing Wang, Cheng Chen, Shunping Ji, Xiangtai Li.
"SAMTok: Representing Any Mask with Two Words." ArXiv (2026). [paper] [code] [project] [hugging face] [demo] [2026.01]
DAM: Long Lian, Yifan Ding, Yunhao Ge, Sifei Liu, Hanzi Mao, Boyi Li, Marco Pavone, Ming-Yu Liu, Trevor Darrell, Adam Yala, Yin Cui.
"Describe Anything: Detailed Localized Image and Video Captioning." ArXiv (2025). [paper] [code] [project] [huggingface] [2025.04]
DINOv2: Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski.
"DINOv2: Learning Robust Visual Features without Supervision." TMLR (2024). [paper] [code] [project] [2023.04]
DINOv3: Oriane Siméoni, Huy V. Vo, Maximilian Seitzer, Federico Baldassarre, Maxime Oquab, Cijo Jose, Vasil Khalidov, Marc Szafraniec, Seungeun Yi, Michaël Ramamonjisoa, Francisco Massa, Daniel Haziza, Luca Wehrstedt, Jianyuan Wang, Timothée Darcet, Théo Moutakanni, Leonel Sentana, Claire Roberts, Andrea Vedaldi, Jamie Tolan, John Brandt, Camille Couprie, Julien Mairal, Hervé Jégou, Patrick Labatut, Piotr Bojanowski.
"DINOv3." ArXiv (2025). [paper] [code] [2025.08]
Rex-Omni: Qing Jiang, Junan Huo, Xingyu Chen, Yuda Xiong, Zhaoyang Zeng, Yihao Chen, Tianhe Ren, Junzhi Yu, Lei Zhang.
"Detect Anything via Next Point Prediction." ArXiv (2025). [paper] [project] [code] [2025.10]
Mamba-3: Anonymous authors.
"Mamba-3: Improved Sequence Modeling using State Space Principles." ICLR (2026). [paper] [2025.11]
Depth Anything 3: Haotong Lin, Sili Chen, Junhao Liew, Donny Y. Chen, Zhenyu Li, Guang Shi, Jiashi Feng, Bingyi Kang.
"Depth Anything 3: Recovering the Visual Space from Any Views." ICLR (2026). [paper] [code] [2025.11]

Follow-up Papers

The latest papers within a week are marked with a 💥

2026

IndoorCrowd: Sebastian-Ion Nae, Radu Moldoveanu, Alexandra Stefania Ghita, Adina Magda Florea.
"IndoorCrowd: A Multi-Scene Dataset for Human Detection, Segmentation, and Tracking with an Automated Annotation Pipeline." CVPR Workshop (2026). [paper] [code] [2026.04]
GRAZE: Syed Ahsan Masud Zaidi, Lior Shamir, William Hsu, Scott Dietrich, Talha Zaidi.
"GRAZE: Grounded Refinement and Motion-Aware Zero-Shot Event Localization ." CVPR Workshop (2026). [paper] [code] [2026.04]
DPMO: Hongru Chen, Jiyang Huang, Jia Wan, Antoni B. Chan.
"Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation." ArXiv (2026). [paper] [2026.04]
Derek Austin.
"Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars." ArXiv (2026). [paper] [2026.04]
Xusheng He, Canyang Wu, Jinrong Zhang, Weili Guan, Jianlong Wu, Liqiang Nie.
"The 1st Winner for 5th PVUW MeViS-Text Challenge: Strong MLLMs Meet SAM3 for Referring Video Object Segmentation." CVPR Workshop (2026). [paper] [code] [2026.04]
TEP: Jinrong Zhang, Canyang Wu, Xusheng He, Weili Guan, Jianlong Wu, Liqiang Nie.
"Advancing Complex Video Object Segmentation via Tracking-Enhanced Prompt: The 1st Winner for 5th PVUW MOSE Challenge." CVPR Workshop (2026). [paper] [2026.04]
AdaLoRA-QAT: Prantik Deb, Srimanth Dhondy, N. Ramakrishna, Anu Kapoor, Raju S. Bapi, Tapabrata Chakraborti.
"AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation." ISBI (2026). [paper] [code] [2026.04]
TF-SSD: Zhijin He, Shuo Jin, Siyue Yu, Shuwei Wu, Bingfeng Zhang, Li Yu, Jimin Xiao.
"TF-SSD: A Strong Pipeline via Synergic Mask Filter for Training-free Co-salient Object Detection." CVPR (2026). [paper] [code] [2026.04]
PC-SAM: Chengcheng Lv, Rushi Li, Mincheng Wu, Xiufang Shi, Zhenyu Wen, Shibo He.
"PC-SAM: Patch-Constrained Fine-Grained Interactive Road Segmentation in High-Resolution Remote Sensing Images." ArXiv (2026). [paper] [code] [2026.04]
LunarRockSAM: Wang, Yinan and Ye, Hongxia and Fa, Wenzhe.
"LunarRockSAM: A Domain-Adapted SAM with Bright-Spots Prompting and Conditional Screening for Lunar Rock Extraction." IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2026). [paper] [2026.03]
DBM-SAM: Wei Gao, Teng Li, Cunang Jiang, Sicheng Wang, Yu Dai.
"DBM-SAM: Dual-branch multiscale adaptation of SAM for medical ultrasound segmentation." Displays (2026). [paper] [2026.03]
SAM2-RoadNet: Feng, Ruyue, Ziyou Guo, Xiao Du, and Tieru Wu.
"SAM2-RoadNet: Topology-Aware Multi-Scale Road Extraction from High-Resolution Remote Sensing Images." Remote Sensing (2026). [paper] [2026.03]
IDRG-mSAM: Wang, Leiquan and Meng, Yu and Luo, Chunbo and Xu, Mingming and Wu, Chunlei and Li, Zhongwei.
"SAM-Based Multi-Scale Fine-Tuning with Inter-layer Difference Guidance for Remote Sensing Change Detection." IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2026). [paper] [2026.03]
SAM-ColonPolypGen: Shasha Zhang and Yuang Cai and Yijun Chen and Xiang Cai and Peng Li.
"SAM-ColonPolypGen: Enhancing automated colon polyp report generation via reinforcement learning and prompt chaining." Biomedical Signal Processing and Control (2026). [paper] [2026.03]
SemiBUVS: Long Chen and Qingqing Zheng and Yingying Chen and Faqin Lv and Qiong Wang.
"SAM-Guided Semi-Supervised Breast Lesion Segmentation in Ultrasound Videos with A New Dataset." Expert Systems with Applications (2026). [paper] [code] [2026.03]
Mask-CDKD: Daoyu Shu and Zhan Zhang and Xiao Huang and Ru Wang and Nan Jia and Xinzhe Fu and Bingnan Yang and Fang Wan and Jianzhong Lu and Jianya Gong.
"Mask-CDKD: A source-free and label-free cross-domain knowledge distillation framework from SAM for satellite onboard VHR land-cover mapping." ISPRS Journal of Photogrammetry and Remote Sensing (2026). [paper] [2026.03]
SAM2-WaveUNet: Shuzhou Lv and Shubin Zhang and Xiaoshuang Huang and Dong An and Jincun Liu and Yan Meng and Yaoguang Wei.
"SAM2-WaveUNet: A Frequency-Enhanced Segmentation Network for Fine-Grained Marine Organism Delineation." Expert Systems with Applications (2026). [paper] [2026.03]
VLP-SAM: Sakurai, Kosuke, Ryotaro Shimizu, and Masayuki Goto.
"Vision and Language Reference for a Segment Anything Model for Few-Shot Segmentation." Journal of Imaging(2026). [paper] [2026.03]
AutoPrompt-SAM3D: Cheng, W., Tang, J., Wang, T. et al.
"AutoPrompt-SAM3D: integrated generation and selection for SAM2-based 3D medical segmentation." BMC Bioinformatics (2026). [paper] [2026.03]
Shata, Dina, Simon Denman, Sara Omrani, Robin Drogemuller, Hend Ali, and Ayman Wagdy.
"Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation." Buildings (2026). [paper] [2026.03]
HATSAM: Tang, T., Rao, Z., Wang, Y. et al.
"HATSAM: hierarchical adaptation strategy for segment anything model in medical imaging." SIViP (2026). [paper] [2026.03]
SaSaSaSa2VA: Dengxian Gong, Quanzhu Niu, Shihao Chen, Yuanzheng Wu, Yikang Zhou, Tao Zhang, Haobo Yuan, Lu Qi, Shunping Ji.
"SaSaSaSa2VA: 2nd Place of the 5th PVUW MeViS-Text Track." ArXiv (2026). [paper] [2026.03]
Aviraj Bevli, Sofian Chaybouti, Yasser Dahou, Hakim Hacid, Ngoc Dung Huynh, Phuc H. Le Khac, Sanath Narayan, Wamiq Reyaz Para, Ankit Singh.
"Falcon Perception." ArXiv (2026). [paper] [code] [2026.03]
FT-FSOD: Xuanlong Yu, Youyang Sha, Longfei Liu, Xi Shen, Di Yang.
"A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps." CVPR (2026). [paper] [code] [2026.03]
LIT: Xinyu Yang, Haozheng Yu, Yihong Sun, Bharath Hariharan, Jennifer J. Sun.
"Live Interactive Training for Video Segmentation." CVPR (2026). [paper] [code] [2026.03]
Xinyao Zhang, Chang Liu, Xiao Liang, Minghui Zheng, Sara Behdad.
"Evaluating Large and Lightweight Vision Models for Irregular Component Segmentation in E-Waste Disassembly." MSEC (2026). [paper] [2026.03]
Syn4Seg: Guohuan Xie, Xin He, Dingying Fan, Le Zhang, Ming-Ming Cheng, Yun Liu.
"Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation." ArXiv (2026). [paper] [2026.03]
IP-SAM: Huiyao Zhang, Jin Bai, Rui Guo, JianWen Tan, HongFei Wang, Ye Li.
"IP-SAM: Prompt-Space Conditioning for Prompt-Absent Camouflaged Object Detection." ArXiv (2026). [paper] [2026.03]
OpenDPR: Qi Guo, Jue Wang, Yinhe Liu, Yanfei Zhong.
"OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery." CVPR (2026). [paper] [code] [2026.03]
Industrial3D: Chao Yin, Hongzhe Yue, Qing Han, Difeng Hu, Zhenyu Liang, Fangzhou Lin, Bing Sun, Boyu Wang, Mingkai Li, Wei Yao, Jack C. P. Cheng.
"Industrial3D: A Terrestrial LiDAR Point Cloud Dataset and CrossParadigm Benchmark for Industrial Infrastructure." ArXiv (2026). [paper] [code] [2026.03]
CFR-SAM: Jingze Su, Tianle Zhu, Jiaxin Cai, Zhiyi Wang, Qi Li, Xiao Zhang, Tong Tong, Shu Wang, Wenxi Liu.
"Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement." ArXiv (2026). [paper] [2026.03]
RAP: Zhihao Mao, Bangpu Chen.
"RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation." IJCNN (2026). [paper] [2026.03]
Samik Some, Vinay P. Namboodiri.
"Can Unsupervised Segmentation Reduce Annotation Costs for Video Semantic Segmentation?." ICVGIP (2026). [paper] [2026.03]
M. Fazri Nizar.
"Domain-Guided YOLO26 with Composite BCE-Dice-Lovász Loss for Multi-Class Fetal Head Ultrasound Segmentation." ArXiv (2026). [paper] [2026.03]
Mask-CDKD: Daoyu Shu and Zhan Zhang and Xiao Huang and Ru Wang and Nan Jia and Xinzhe Fu and Bingnan Yang and Fang Wan and Jianzhong Lu and Jianya Gong.
"Mask-CDKD: A source-free and label-free cross-domain knowledge distillation framework from SAM for satellite onboard VHR land-cover mapping." ISPRS Journal of Photogrammetry and Remote Sensing (2026). [paper] [code] [2026.03]
Colon-Bench: Abdullah Hamdi, Changchun Yang, Xin Gao.
"Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos." ArXiv (2026). [paper] [code] [2026.03]
Nitin Kulkarni, Akhil Devarashetti, Charlie Cluss, Livio Forte, Philip Schneider, Chunming Qiao, Alina Vereshchaka.
"Drive-Through 3D Vehicle Exterior Reconstruction via Dynamic-Scene SfM and Distortion-Aware Gaussian Splatting." ArXiv (2026). [paper] [2026.03]
Guoping Xu, Jayaram K. Udupa, Yubing Tong, Xin Long, Ying Zhang, Jie Deng, Weiguo Lu, You Zhang.
"Adapting Segment Anything Model 3 for Concept-Driven Lesion Segmentation inMedical Images: An Experimental Study." ArXiv (2026). [paper] [code] [2026.03]
Sieradzki, Alexander, Kamil Koszela, Szymon Koszykowski, Jakub Bednarek, and Jarosław Kurek.
"Zero-Shot Vertebral Instance Segmentation on DICOM Spine Radiographs Using Promptable Segment Anything Models." Journal of Clinical Medicine (2026). [paper] [2026.03]
SemiBUVS: Long Chen and Qingqing Zheng and Yingying Chen and Faqin Lv and Qiong Wang.
"SAM-Guided Semi-Supervised Breast Lesion Segmentation in Ultrasound Videos with A New Dataset." ESWA (2026). [paper] [code] [2026.03]
GridVAD: Mohamed Eltahir, Ahmed O. Ibrahim, Obada Siralkhatim, Tabarak Abdallah, Sondos Mohamed.
"GridVAD: Open-Set Video Anomaly Detection via Spatial Reasoning over Stratified Frame Grids." ArXiv (2026). [paper] [code] [2026.03]
XAI-SAM: Abu Noman Md Sakib, Merjulah Roby, Zijie Zhang, Satish Muluk, Mark K. Eskandari, Ender A. Finol.
"Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driven Analysis." CVPR (2026). [paper] [2026.03]
ET-SAM: Xike Zhang, Maoyuan Ye, Juhua Liu, Bo Du.
"ET-SAM: Efficient Point Prompt Prediction in SAM for Unified Scene Text Detection and Layout Analysis." ArXiv (2026). [paper] [2026.03]
UW-VOS: Hongshen Zhao, Jingkang Tai, Yuhang Wu, Wenkang Zhang, Xi Lan, Shangyan Wang, Tianyu Zhang, Wankou Yang.
"UW-VOS: A Large-Scale Dataset for Underwater Video Object Segmentation." ArXiv (2026). [paper] [2026.03]
Mingqi Gao, Sijie Li, Jungong Han.
"Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track." ArXiv (2026). [paper] [2026.03]
AgentRVOS: Woojeong Jin, Jaeho Lee, Heeseong Shin, Seungho Jang, Junhwan Heo, Seungryong Kim.
"AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation." ArXiv (2026). [paper] [code] [2026.03]
FCL-COD: Jingchen Ni, Quan Zhang, Dan Jiang, Keyu Lv, Ke Zhang, Chun Yuan.
"FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning." CVPR (2026). [paper] [2026.03]
Miquel Lopez Escoriza, Pau Amargant Alvarez.
"Automatic Segmentation of 3D CT scans with SAM2 using a zero-shot approach." ArXiv (2026). [paper] [2026.03]
VIRST-Audio: Jihwan Hong, Jaeyoung Do.
"3rd Place of MeViS-Audio Track of the 5th PVUW: VIRST-Audio." CVPR workshop (2026). [paper] [code] [2026.03]
FoB: Yuntian Bo, Yazhou Zhu, Piotr Koniusz, Haofeng Zhang.
"Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting." CVPR (2026). [paper] [code] [2026.03]
CataractSAM-2: Mohammad Eslami, Dhanvinkumar Ganeshkumar, Saber Kazeminasab, Michael G. Morley, Michael V. Boland, Michael M. Lin, John B. Miller, David S. Friedman, Nazlee Zebardast, Lucia Sobrin, Tobias Elze.
"CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation." ArXiv (2026). [paper] [2026.03]
Lei Huang, Kai-Li Wang, Zhang Chen, Zhen-Huang, Saidjafar Murodzoda, Xin Chen, Jing Chen, Chun-Hao Chen, Yu Xia, Yu-Tong Yang, Jia-Cheng Li, Dilshod Nematov, Ilhan Yavuz, Zhao-Kui Wang.
"SAM Molecular Stacking with Heterogeneous Orientationfor High-Performance Perovskite Photovoltaics." ArXiv (2026). [paper] [2026.03]
Thomas Mendelson, Joshua Francois, Galit Lahav, Tammy Riklin-Raviv.
"Boundary-Aware Instance Segmentation in Microscopy Imaging." ISBI (2026). [paper] [2026.03]
Muhammad Hassan Maqsood, Yanming Zhu, Alfred Lam, Getamesay Dagnaw, Xuefei Yin, Alan Wee-Chung Liew.
"Prompt-Free Lightweight SAM Adaptation for Histopathology Nuclei Segmentation with Strong Cross-Dataset Generalization." ISBI (2026). [paper] [2026.03]
Carolin Teuber, Anwai Archit, Tobias Boothe, Peter Ditte, Jochen Rink, Constantin Pape.
"Evaluating Vision Foundation Models for Pixel and Object Classification in Microscopy." ArXiv (2026). [paper] [2026.03]
Distillation-SAM: Tang, Jiyang and Han, Hu and Shan, Shiguang and Chen, Xilin.
"Distillation-SAM: Knowledge Distillation Based Auto-prompt Embedding Learning for Surgical Image Segmentation." TMI (2026). [paper] [code] [2026.03]
EventVCOD: Zhang, H., Lyu, Y., Liu, H., Song, J., Yuan, D., & Yang, Y.
"Towards Explainable Video Camouflaged Object Detection: SAM2 with Eventstream-Inspired Data." AAAI (2026). [paper] [code] [2026.03]
GoalVLM: MoniJesu James, Amir Atef Habel, Aleksey Fedoseev, Dzmitry Tsetserokou.
"GoalVLM: VLM-driven Object Goal Navigation for Multi-Agent System." ArXiv (2026). [paper] [2026.03]
Perceptio: Yuchen Li, Amanmeet Garg, Shalini Chaudhuri, Rui Zhao, Garin Kessler.
"Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation." ArXiv (2026). [paper] [2026.03]
SCISSR: Haonan Ping, Jian Jiang, Cheng Yuan, Qizhen Sun, Lv Wu, Yutong Ban.
"SCISSR: Scribble-Conditioned Interactive Surgical Segmentation and Refinement." ArXiv (2026). [paper] [2026.03]
LoGSAM: Mohammad Robaitul Islam Bhuiyan, Sheethal Bhat, Melika Qahqaie, Tri-Thien Nguyen, Paula Andrea Pérez Toro, Tomas Arias Vergara, Andreas Maier.
"LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation." ArXiv (2026). [paper] [code] [2026.03]
Anwai Archit, Constantin Pape.
"Revisiting foundation models for cell instance segmentation." MIDL (2026). [paper] [2026.03]
Diederick C. Niehorster, Marcus Nyström.
"Eye image segmentation using visual and concept prompts with Segment Anything Model 3 (SAM3)." ArXiv (2026). [paper] [2026.03]
Paulo Vitor Santana Silva, Arthur Ricardo Sousa Vitória, Diogo Fernandes Costa Silva, Arlindo Rodrigues Galvão Filho.
"Attention Guidance through Video Script: A Case Study of Object Focusing on 360° VR Video Tours." SVR (2026). [paper] [2026.03]
EDP-SAM: Jiyang Huang, Hongru Cheng, Wei Lin, Jia Wan, Antoni B. Chan.
"Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting." ArXiv (2026). [paper] [2026.03]
MessyKitchens: Junaid Ahmed Ansari, Ran Ding, Fabio Pizzati, Ivan Laptev.
"MessyKitchens: Contact-rich object-level 3D scene reconstruction." ArXiv (2026). [paper] [code] [2026.03]
SAMSEM: Christian Gehrmann, Jonas Ricker, Simon Damm, Deruo Cheng, Julian Speith, Yiqiong Shi, Asja Fischer, Christof Paar.
"SAMSEM -- A Generic and Scalable Approach for IC Metal Line Segmentation." ArXiv (2026). [paper] [2026.03]
BADSEG: Guangsheng Zhang, Huan Tian, Leo Zhang, Tianqing Zhu, Ming Ding, Wanlei Zhou, Bo Liu.
"Poisoning the Pixels: Revisiting Backdoor Attacks on Semantic Segmentation." ArXiv (2026). [paper] [2026.03]
Shuai Guo, Ao Guo, Junchao Zhao, Qi Chen, Yuxiang Qi, Zechuan Li, Dong Chen, Tianjia Shao, Mingliang Xu.
"Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting." ArXiv (2026). [paper] [2026.03]
Fast-SAM-3D-Body: Timing Yang, Sicheng He, Hongyi Jing, Jiawei Yang, Zhijian Liu, Chuhang Zou, Yue Wang.
"Fast SAM 3D Body: Accelerating SAM 3D Body for Real-Time Full-Body Human Mesh Recovery." ArXiv (2026). [paper] [code] [2026.03]
EviATTA: Jiayi Chen, Yasmeen George, Winston Chong, Jianfei Cai.
"EviATTA: Evidential Active Test-Time Adaptation for Medical Segment Anything Models." ArXiv (2026). [paper] [2026.03]
StAR: Seokju Yun, Dongheon Lee, Noori Bae, Jaesung Jun, Chanseul Cho, Youngmin Ro.
"StAR: Segment Anything Reasoner." ArXiv (2026). [paper] [code] [2026.03]
SAIF: Ke Wu, Shiqi Chen, Yiheng Zhong, Hengxian Liu, Yingxue Su, Yifang Wang, Junhao Jin, Guangyu Ren.
"SAIF: A Stability-Aware Inference Framework for Medical Image Segmentation with Segment Anything Model." ArXiv (2026). [paper] [code] [2026.03]
Colony Grounded SAM2: Daan Korporaal, Patrick de Kruijf, Ralph H. G. M. Litjens, Bas H. M. van der Velden.
"Colony Grounded SAM2: Zero-shot detection and segmentation of bacterial colonies using foundation models." ArXiv (2026). [paper] [2026.03]
Elodie Germani, Krystel Nyangoh-Timoh, Pierre Jannin, John S H Baxter.
"Disentangling Prompt Dependence to Evaluate Segmentation Reliability in Gynecological MRI." ArXiv (2026). [paper] [2026.03]
Tomislav Medic, Liangliang Nan.
"In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels." ArXiv (2026). [paper] [2026.03]
SPARROW: Mohamad Alansari, Naufal Suryanto, Divya Velayudhan, Sajid Javed, Naoufel Werghi, Muzammal Naseer.
"SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs." CVPR (2026). [paper] [project] [code] [2026.03]
SAP: Lutao Jiang, Zidong Cao, Weikai Chen, Xu Zheng, Yuanhuiyi Lyu, Zhenyang Li, Zeyu HU, Yingda Yin, Keyang Luo, Runze Zhang, Kai Yan, Shengju Qian, Haidi Fan, Yifan Peng, Xin Wang, Hui Xiong, Ying-Cong Chen.
"SAP: Segment Any 4K Panorama." ArXiv (2026). [paper] [code] [2026.03]
HFP-SAM: Pingping Zhang, Tianyu Yan, Yuhao Wang, Yang Liu, Tongdan Tang, Yili Ma, Long Lv, Feng Tian, Weibing Sun, and Huchuan Lu.
"HFP-SAM: Hierarchical Frequency Prompted SAM for Efficient Marine Animal Segmentation." TIP (2026). [paper] [code] [2026.03]
SAM FTI-FDet: Guodong Sun, Qihang Liang, Xingyu Pan, Moyun Liu, Yang Zhang.
"Prompt-Driven Lightweight Foundation Model for Instance Segmentation-Based Fault Detection in Freight Trains." ArXiv (2026). [paper] [code] [2026.03]
GoalSwarm: MoniJesu Wonders James, Amir Atef Habel, Aleksey Fedoseev, Dzmitry Tsetserokou.
"GoalSwarm: Multi-UAV Semantic Coordination for Open-Vocabulary Object Navigation." ArXiv (2026). [paper] [2026.03]
AutoSAM: Li, Jiayuan and Wang, Zhen and Sun, Xiao and Xu, Nan and You, Zhuhong and Huang, Deshuang.
"AutoSAM: Auto-Prompting Mamba-Based Vision Foundation Model for Multimodal Remote Sensing Semantic Segmentation." TGRS (2026). [paper] [code] [2026.03]
SAMCM-SR: Junchao Wang et al.
"SAMCM-SR: Applying SAM3 Under Data-Scarce Conditions for Cross-Modal Segmentation of Power Equipment Infrared Images with Super-Resolution Enhancement." Appl. Sci. (2026). [paper] [2026.03]
BloodCellSAM2: Zhening Qiu.
"Research and Analysis of Fine-tuning Techniques for Cell Image Segmentation Model Based on SAM2." ArXiv (2026). [paper] [2026.03]
PolySAM-Lite: Umar Hasan, Muhammad Ali Nayeem.
"PolySAM-Lite: Parameter-efficient adaptation of the Segment Anything Model for colorectal polyp segmentation." ArXiv (2026). [paper] [2026.03]
RT-SAM: Khor, Hee Guan and Yang, Xin and Sun, Yihua and Huang, Sijuan and Wang, Yingni and Wang, Jie and Wang, Shaobin and Bai, Lu and Ma, Longfei and Liao, Hongen.
"RT-SAM: Visual-Prompt Fusion and Uncertainty Enhancement for Nasopharyngeal Carcinoma Radiotherapy Target Delineation." JBHI (2026). [paper] [2026.03]
CPOVIS,: Zheng, Rongkun and Qi, Lu and Chen, Xi and Wang, Yi and Wang, Kun and Qiao, Yu and Zhao, Hengshuang.
"Causal Prompts for Open-vocabulary Video Instance Segmentation." TPAMI (2026). [paper] [2026.03]
USCount-Net: Yu Wang et al.
"Low-Annotation Apple Flower Counting: A Color-SAM Enhanced and Uncertainty-Guided Semi-Supervised Framework." Plant Phenomics (2026). [paper] [2026.03]
Snehalraj Chugh, Dharmendra Singh Chaudhary, Subash Sigdel, Shubham Thapa, Lalit BC, Nishan Ghimire, Bipendra Basnyat, Nirmalya Roy.
"Segment Anything but Farms: Comparing Segmentation Paradigms for Rural UAV Captured Ultra-High-Resolution Imagery." WACVW (2026). [paper] [2026.03]
Linzhu Li et al.
"Classification of Densely Packed Sand Particles Using a Digital Camera and the Segment Anything Model (SAM)." Geo-Congress (2026). [paper] [2026.03]
Zhipeng Chen et al.
"Occlusion-Aware Visual Object Tracking with SAM2-Based Segmentation via Temporal Convolutional Networks and a Dual-Memory Bank." ArXiv (2026). [paper] [2026.03]
Minghui Xu et al.
"Automated flame boundary segmentation from droplet combustion images using SAM2 with auto-prompt selection and RANSAC fitting." ArXiv (2026). [paper] [2026.03]
OAMOT: Guo, Wen and Wang, Tuo and Gao, Junyu and Zhang, Tianzhu and Xu, Changsheng.
"Occlusion-Aware Multi-Object Tracking via Joint Diffusion Motion Prediction and Appearance Purification." TCSVT (2026). [paper] [code] [2026.03]
SegTS: Jinsong Li et al.
"SegTS: Subseries-driven temporo-spatial learning with Segment Anything Model for crop segmentation in satellite image time series." Computers and Electronics in Agriculture (2026). [paper] [2026.03]
Yunhao Hu, Penglin Zou, rongguo yan, Xiyun Zeng and Qi Wang.
"Exploration and Performance Analysis of Deep Learning Applications in Spermatic Vein Ultrasound Segmentation." ArXiv (2026). [paper] [2026.03]
Txai Sibley et al.
"Evaluating and enhancing Segment Anything Model transferability for microstructural image analysis in nuclear materials." Computational Materials Science (2026). [paper] [2026.03]
DIT-SAM: Yuhan Ying et al.
"DIT-SAM: Enhancing segment anything model for automatic medical image segmentation via dual-interactive tuning." Biomedical Signal Processing and Control (2026). [paper] [code] [2026.03]
SA-SAM: Zhuowen Deng, Fangce Li, Shenglin Shan, Jianchang Feng.
"SA-SAM: a scale-adaptative method for wildfire scene segmentation." MLAIA (2026). [paper] [2026.03]
Amirreza Fateh, et al.
"Adapting SAM with a triple-prompt strategy for one-shot semantic segmentation." Neurocomputing (2026). [paper] [2026.03]
PicoSAM3: Pietro Bonazzi, Nicola Farronato, Stefan Zihlmann, Haotong Qin, Michele Magno.
"PicoSAM3: Real-Time In-Sensor Region-of-Interest Segmentation." ArXiv (2026). [paper] [2026.03]
DART: Mehmet Kerem Turkcan.
"Detect Anything in Real Time: From Single-Prompt Segmentation to Multi-Class Detection." ArXiv (2026). [paper] [code] [2026.03]
BALD-SAM: Prithwijit Chowdhury, Mohit Prabhushankar, Ghassan AlRegib.
"BALD-SAM: Disagreement-based Active Prompting in Interactive Segmentation." ArXiv (2026). [paper] [2026.03]
OilSAM2: Shuaiyu Chen, Ming Yin, Peng Ren, Chunbo Luo, Zeyu Fu.
"OilSAM2: Memory-Augmented SAM2 for Scalable SAR Oil Spill Detection." ArXiv (2026). [paper] [code] [2026.03]
SAMONAI: Muhammad Alberb, Jianan Chen, Hossam El-rewaidy, Paul Karanicolas, Arun Seth, Yutaka Amemiya, Anne Martel, Helen Cheung.
"An Automated Radiomics Framework for Postoperative Survival Prediction in Colorectal Liver Metastases using Preoperative MRI." ArXiv (2026). [paper] [2026.03]
Cybo-Waiter: Peng Ren, Haoyang Ge, Chuan Qi, Cong Huang, Hong Li, Jiang Zhao, Pei Chi, Kai Chen.
"Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation." ArXiv (2026). [paper] [2026.03]
Caroline Magg, Maaike A. ter Wee, Johannes G. G. Dobbe, Geert J. Streekstra, Leendert Blankevoort, Clara I. Sánchez, Hoel Kervadec.
"Prompting with the human-touch: evaluating model-sensitivity of foundation models for musculoskeletal CT segmentation." ArXiv (2026). [paper] [code] [2026.03]
VQ-SAM: Bing Fan, Minghao Li, Hanzhi Zhang, Shaohua Dong, Naga Prudhvi Mareedu, Weishi Shi, Yunhe Feng, Yan Huang, Heng Fan.
"Towards Visual Query Segmentation in the Wild." ArXiv (2026). [paper] [2026.03]
RPG-SAM: Weikun Lin, Yunhao Bai, Yan Wang.
"RPG-SAM: Reliability-Weighted Prototypes and Geometric Adaptive Threshold Selection for Training-Free One-Shot Polyp Segmentation." ArXiv (2026). [paper] [2026.03]
Haoran Ding, Liang Ma, Yaxun Yang, Wen Yang, Tianyu Liu, Anqing Duan, Xiaodan Liang, Dezhen Song, Ivan Laptev, Yoshihiko Nakamura.
"Choose What to Observe: Task-Aware Semantic-Geometric Representations for Visuomotor Policy." ArXiv (2026). [paper] [2026.03]
StructSAM: Duy M. H. Nguyen, Tuan A. Tran, Duong Nguyen, Siwei Xie, Trung Q. Nguyen, Mai T. N. Truong, Daniel Palenicek, An T. Le, Michael Barz, TrungTin Nguyen, Tuan Dam, Ngan Le, Minh Vu, Khoa Doan, Vien Ngo, Pengtao Xie, James Zou, Daniel Sonntag, Jan Peters, Mathias Niepert.
"StructSAM: Structure- and Spectrum-Preserving Token Merging for Segment Anything Models." ArXiv (2026). [paper] [2026.03]
OPTED: Kibrom Gebremedhin, Hadush Hailu, Bruk Gebregziabher.
"OPTED: Open Preprocessed Trachoma Eye Dataset Using Zero-Shot SAM 3 Segmentation." ArXiv (2026). [paper] [2026.03]
VINE: Hongli Liu, Yu Wang, Shengjie Zhao.
"Unify the Views: View-Consistent Prototype Learning for Few-Shot Segmentation." CVPR (2026). [paper] [code] [2026.03]
HCF-RES: Keshen Zhou, Runnan Chen, Mingming Gong, Tongliang Liu.
"Hierarchical Collaborative Fusion for 3D Instance-aware Referring Expression Segmentation." ArXiv (2026). [paper] [2026.03]
Yonghuang Wu, Zhenyang Liang, Wenwen Zeng, Xuan Xie, Jinhua Yu.
"Prompt Group-Aware Training for Robust Text-Guided Nuclei Segmentation." ArXiv (2026). [paper] [2026.03]
Byeongseong Lee, Jihong Min.
"Training-Free Target Emphasis with SAM2 Pseudo-Masks for Robust Single Object Tracking." WACV workshop (2026). [paper] [2026.03]
Akash Sharma, Pranjal Naman, Roopkatha Banerjee, Priyanshu Pansari, Sankalp Gawali, Mayank Arya, Sharath Chandra, Arun Josephraj, Rakshit Ramesh, Punit Rathore, Anirban Chakraborty, Raghu Krishnapuram, Vijay Kovvali, Yogesh Simmhan.
"Scaling Real-Time Traffic Analytics on Edge-Cloud Fabrics for City-Scale Camera Networks." CCGRID Workshops (2026). [paper] [2026.03]
Akif Islam, Raufun Nahar, Md. Ekramul Hamid.
"When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper." ArXiv (2026). [paper] [2026.03]
GarmentPile++: Mingleyang Li, Yuran Wang, Yue Chen, Tianxing Chen, Jiaqi Liang, Zishun Shen, Haoran Lu, Ruihai Wu, Hao Dong.
"GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning." ICRA (2026). [paper] [code] [2026.03]
VANGUARD: Yifei Chen, Xupeng Chen, Feng Wang, Niangang Jiao, Jiayin Liu.
"VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments." ArXiv (2026). [paper] [2026.03]
L2G-Det: Qifan Zhang, Sai Haneesh Allu, Jikai Wang, Yangxiao Lu, Yu Xiang.
"From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes." ArXiv (2026). [paper] [code] [2026.03]
SMART: Yu Luo, Guangyu Wei, Yangfan Li, Jieyu He, Yueming Lyu.
"Uncertainty-Aware Concept and Motion Segmentation for Semi-Supervised Angiography Videos." ArXiv (2026). [paper] [code] [2026.03]
Carlos Monroy, Benjamin Navarro.
"Leveraging GenAI for Segmenting and Labeling Centuries-old Technical Documents." IEEE-CH (2026). [paper] [2026.03]
STMI: Xingguo Xu, Zhanyu Liu, Weixiang Zhou, Yuansheng Gao, Junjie Cao, Yuhao Wang, Jixiang Luo, Dell Zhang.
"STMI: Segmentation-Guided Token Modulation with Cross-Modal Hypergraph Interaction for Multi-Modal Object Re-Identification." AAAI (2026). [paper] [2026.03]
Abhinav Munagala.
"Zero-Shot and Supervised Bird Image Segmentation Using Foundation Models: A Dual-Pipeline Approach with Grounding DINO 1.5, YOLOv11, and SAM 2.1." ArXiv (2026). [paper] [code] [2026.03]
MFT: Li, Guoqiang and Yuan, Hao and Chen, Suyang and Hu, Qi and Wang, Jun and Jiang, Kunming.
"MFT: Memory-Aware Fine-Tuning of SAM2 for Efficient Long-Sequence Video Object Segmentation." IEEE SPL (2026). [paper] [2026.03]
ReSeg-CLIP: Mohammadreza Heidarianbaei, Mareike Dorozynski, Hubert Kanyamahanga, Max Mehltretter, Franz Rottensteiner.
"Open-Vocabulary Semantic Segmentation in Remote Sensing via Hierarchical Attention Masking and Model Composition." BMVC Workshops (2026). [paper] [code] [2026.03]
SAM2-FNet: Shaoli Li, Zihua Zhang, Dejian Li, Bin Liu, Luyao He, Siying Guo.
"SAM2-FNet: Medical Image Lesion Segmentation Model Based on Frequency Domain Expert Fusion Network." IMA (2026). [paper] [code] [2026.02]
SAM-Zero3D: Zhang, Dejun and Xu, Shifeng and Bai, Yanzi and Wu, Yiqi and Liu, Jun.
"SAM-Zero3D: Extending Segment Anything to Zero Shot 3D Scene Segmentation via Iterative Global–Local Interaction." TCSVT (2026). [paper] [2026.02]
SAM2-ARAFNet: Shi, W., Ding, J., Lei, J. et al.
"SAM2-ARAFNet: adapting SAM2 with an attention-enhanced residual ASPP fusion network for high-resolution remote sensing semantic segmentation." Sci Rep(2026). [paper] [2026.02]
TextureSAM: Inbal Cohen, Boaz Meivar, Peihan Tu, Shai Avidan, Gal Oren.
"Decoupling Shape and Texture in SAM-2 via Controlled Texture Replacement." WACV (2026). [paper] [code] [2026.02]
Interactive Medical-SAM2 GUI: Woojae Hong, Jong Ha Hwang, Jiyong Chung, Joongyeon Choi, Hyunngun Kim, Yong Hwy Kim.
"Interactive Medical-SAM2 GUI: A Napari-based semi-automatic annotation tool for medical images." ArXiv (2026). [paper] [code] [2026.02]
Katja Kossira, Yunxuan Zhu, Jürgen Seiler, André Kaup.
"Towards Object Segmentation Mask Selection Using Specular Reflections." VCIP (2026). [paper] [2026.02]
L2RP: Lokesha Rasanjalee, Jin Lin Tan, Dileepa Pitawela, Rajvinder Singh, Hsiang-Ting Chen.
"Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation." ISBI (2026). [paper] [2026.02]
Yida Lin, Bing Xue, Mengjie Zhang, Sam Schofield, Richard Green.
"Progressive Per-Branch Depth Optimization for DEFOM-Stereo and SAM3 Joint Analysis in UAV Forestry Applications." ArXiv (2026). [paper] [2026.02]
CAD-Prompted SAM3: Zhenran Tang, Rohan Nagabhirava, Changliu Liu.
"CAD-Prompted SAM3: Geometry-Conditioned Instance Segmentation for Industrial Objects." ArXiv (2026). [paper] [2026.02]
YOLO–SAM2: Shiyu Liu, Dylan Lester, Husnu Narman, Ammar Alzarrad, Pingping Zhu.
"Depth-Enhanced YOLO-SAM2 Detection for Reliable Ballast Insufficiency Identification ." ArXiv (2026). [paper] [2026.02]
SMBlurDetect: Ganesh Samarth, Sibendu Paul, Solale Tabarestani, Caren Chen .
"Subtle Motion Blur Detection and Segmentation from Static Image Artworks." WACV (2026). [paper] [2026.02]
TactEx: Felix Verstraete, Lan Wei, Wen Fan, Dandan Zhang.
"TactEx: An Explainable Multimodal Robotic Interaction Framework for Human-Like Touch and Hardness Estimation." ICRA (2026). [paper] [2026.02]
SegMoTE: Yujie Lu, Jingwen Li, Sibo Ju, Yanzhou Su, he yao, Yisong Liu, Min Zhu, Junlong Cheng.
"SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation." ArXiv (2026). [paper] [2026.02]
WOFTSAM: Jonas Serych, Jiri Matas.
"Accurate Planar Tracking With Robust Re-Detection." ArXiv (2026). [paper] [code] [2026.02]
DSS: Yilong Yang, Jianxin Tian, Shengchuan Zhang, Liujuan Cao.
"Discover, Segment, and Select: A Progressive Mechanism for Zero-shot Camouflaged Object Segmentation." CVPR (2026). [paper] [2026.02]
SegSEM: Da Chen, Guangyu Hu, Kaihong Xu, Kaichao Liang, Songjiang Li, Wei Yang, XiangYu Wen, Mingxuan Yuan.
"SegSEM: Enabling and Enhancing SAM2 for SEM Contour Extraction." ISCAS (2026). [paper] [2026.02]
CL-MC: Huayu Wang, Bahaa Alattar, Cheng-Yen Yang, Hsiang-Wei Huang, Jung Heon Kim, Linda Shapiro, Nathan White, Jenq-Neng Hwang.
"Detector-in-the-Loop Tracking: Active Memory Rectification for Stable Glottic Opening Localization." MIDL (2026). [paper] [code] [2026.02]
Hadi Shokati, et al.
"Rapid flood mapping from aerial imagery using fine-tuned SAM and ResNet-backboned U-Net." Hydrology and Earth System Sciences (2026). [paper] [2026.02]
Lin, C., Yang, H., Wu, H. et al.
"Horizontal nystagmus identification with joint SAM segmentation and time series classification." Eur Arch Otorhinolaryngol (2026). [paper] [2026.02]
LDFSAM: Xuanbo Zhao, et al.
"LDFSAM: Localization Distillation-Enhanced Feature Prompting SAM for Medical Image Segmentation." Journal of Imaging (2026). [paper] [2026.02]
DCS: Yan Wan, Yingqi Lang, and Li Yao.
"DCS: A Zero-Shot Anomaly Detection Framework with DINO-CLIP-SAM Integration." Applied Sciences (2026). [paper] [2026.02]
DAS-SAM: Chen, Z., Zhou, N., Fan, Y. et al.
"DAS-SAM: fine-tuning SAM towards drivable area segmentation via efficient multi-scale traffic scene-aware adaptation." Vis. Intell.(2026). [paper] [2026.02]
SAM-IAD: Yichi Chen, et al.
"SAM-IAD: Injecting specific knowledge into SAM for industrial anomaly detection." KBS (2026). [paper] [2026.02]
SynSAM: Krishnan, C., Onuoha, E., Hung, A. et al.
"SynSAM: a hybrid synchronous learning framework with knowledge retention for prostate zonal segmentation leveraging the segment anything model." Med Biol Eng Comput (2026). [paper] [2026.02]
HCCP-SAM2: Rui Zhai, et al.
"SAM2-driven dual-teacher framework using hierarchical cross-slice context for semi-supervised 3D medical image segmentation." Neurocomputing (2026). [paper] [2026.02]
Yili Yang, et al.
"Keeping pace with a changing planet: An interactive segmentation framework for refining delineations of dynamic Earth features with the Segment Anything Model." International Journal of Applied Earth Observation and Geoinformation (2026). [paper] [2026.02]
LG-SAM: Chen Yi, et al.
"Clinically oriented LG-SAM for lung CT tumor segmentation with 2D training achieving 3D-level performance." Biomedical Signal Processing and Control(2026). [paper] [2026.02]
MUOT-3M: Ahsan Baidar Bakht, Mohamad Alansari, Muhayy Ud Din, Muzammal Naseer, Sajid Javed, Irfan Hussain, Jiri Matas, Arif Mahmood.
"MUOT-3M: A 3 Million Frame Multimodal Underwater Benchmark and the MUTrack Tracking Method ." ArXiv (2026). [paper] [code] [2026.02]
Jose Sosa, Danila Rukhovich, Anis Kacem, Djamila Aouada.
"Enabling Training-Free Text-Based Remote Sensing Segmentation." ArXiv (2026). [paper] [code] [2026.02]
Phoenix Yu, Tilo Burghardt, Andrew W Dowsey, Neill W Campbell.
"Automated Re-Identification of Holstein-Friesian Cattle in Dense Crowds." ArXiv (2026). [paper] [code] [2026.02]
TikArt: Hao Ding, Zhichuan Yang, Weijie Ge, Ziqin Gao, Chaoyi Lu, Lei Zhao.
"TikArt: Aperture-Guided Observation for Fine-Grained Visual Reasoning via Reinforcement Learning." ArXiv (2026). [paper] [2026.02]
SAM4Dcap: Li Wang, HaoYu Wang, Xi Chen, ZeKun Jiang, Kang Li, Jian Li.
"SAM4Dcap: Training-free Biomechanical Twin System from Monocular Video." ArXiv (2026). [paper] [code] [2026.02]
SAILS: Shishir Muralidhara, Didier Stricker, René Schuster.
"SAILS: Segment Anything with Incrementally Learned Semantics for Task-Invariant and Training-Free Continual Learning." IEEE CAI (2026). [paper] [2026.02]
Julius Pesonen, Stefan Rua, Josef Taher, Niko Koivumäki, Xiaowei Yu, Eija Honkavaara.
"Learning Image-based Tree Crown Segmentation from Enhanced Lidar-based Pseudo-labels." ArXiv (2026). [paper] [2026.02]
SAM3-LiteText: Chengxi Zeng, Yuxuan Jiang, Ge Gao, Shuai Wang, Duolikun Danier, Bin Zhu, Stevan Rudinac, David Bull, Fan Zhang.
"SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation." ArXiv (2026). [paper] [code] [2026.02]
DBTANet: Yun-Cheng Li, Sen Lei, Heng-Chao Li, Ke Li.
"A Dual-Branch Framework for Semantic Change Detection with Boundary and Temporal Awareness." ArXiv (2026). [paper] [2026.02]
Hi-SAM: Pingjun Pan, Tingting Zhou, Peiyao Lu, Tingting Fei, Hongxiang Chen, Chuanjiang Luo.
"Hi-SAM: A Hierarchical Structure-Aware Multi-modal Framework for Large-Scale Recommendation." ArXiv (2026). [paper] [2026.02]
Yiming Zhou, Xuenjie Xie, Panfeng Li, Albrecht Kunz, Ahmad Osman, Xavier Maldague.
"Efficient Segment Anything with Depth-Aware Fusion and Limited Training Data." ArXiv (2026). [paper] [2026.02]
Efficient-SAM2: Jing Zhang, Zhikai Li, Xuewen Liu, Qingyi Gu.
"Efficient-SAM2: Accelerating SAM2 with Object-Aware Visual Encoding and Memory Retrieval." ICLR (2026). [paper] [code] [2026.02]
RECITYGEN: Di Mo, Mingyang Sun, Chengxiu Yin, Runjia Tian, Yanhong Wu, Liyan Xu.
"RECITYGEN -- Interactive and Generative Participatory Urban Design Tool with Latent Diffusion and Segment Anything." ArXiv (2026). [paper] [code] [2026.02]
Thomas H. Schmitt, Maximilian Bundscherer, Tobias Bocklet.
"Learning to Detect Baked Goods with Limited Supervision." ArXiv (2026). [paper] [2026.02]
IR-SIS: Ange Lou, Yamin Li, Qi Chang, Nan Xi, Luyuan Xie, Zichao Li, Tianyu Luan.
"VLM-Guided Iterative Refinement for Surgical Image Segmentation with Foundation Models." ArXiv (2026). [paper] [2026.02]
Yan Luo, Advaith Ravishankar, Serena Liu, Yutong Yang, Mengyu Wang.
"Single-Slice-to-3D Reconstruction in Medical Imaging and Natural Objects: A Comparative Benchmark with SAM 3D." ArXiv (2026). [paper] [2026.02]
GenSeg-R1: Sandesh Hegde, Jaison Saji Chacko, Debarshi Banerjee, Uma Mahesh.
"GenSeg-R1: RL-Driven Vision-Language Grounding for Fine-Grained Referring Segmentation." ArXiv (2026). [paper] [2026.02]
ConceptBank: Gensheng Pei, Xiruo Jiang, Yazhou Yao, Xiangbo Shu, Fumin Shen, Byeungwoo Jeon.
"Taming SAM3 in the Wild: A Concept Bank for Open-Vocabulary Segmentation." ArXiv (2026). [paper] [code] [2026.02]
AdaptOVCD: Mingyu Dou, Shi Qiu, Ming Hu, Yifan Chen, Huping Ye, Xiaohan Liao, Zhe Sun.
"AdaptOVCD: Training-Free Open-Vocabulary Remote Sensing Change Detection via Adaptive Information Fusion." ArXiv (2026). [paper] [code] [2026.02]
SPDA-SAM: Yihan Shang, Wei Wang, Chao Huang, Xinghui Dong.
"SPDA-SAM: A Self-prompted Depth-Aware Segment Anything Model for Instance Segmentation." ArXiv (2026). [paper] [2026.02]
CPAC-SAM: Juzheng Miao and Cheng Chen and Yuchen Yuan and Quanzheng Li and Pheng-Ann Heng.
"SAM-Driven Cross Prompting with Adaptive Sampling Consistency for Semi-supervised Medical Image Segmentation." Medical Image Analysis(2026). [paper] [code] [2026.02]
SAMM: Jiahao Tu, et al.
"SAMM: A General-Purpose Segmentation Model for Material Micrographs Based on the Segment Anything Model 2." Advanced Powder Materials (2026). [paper] [2026.02]
SAM2-PFF: Henghao Sun, et al.
"SAM2-PFF: Bridging SAM2 and Progressive Feature Fusion for Robust Indoor Salient Object Detection." ArXiv (2026). [paper] [code] [2026.02]
Semi-MedSAM: Junhao Li, et al.
"Semi-MedSAM: Adapting SAM-assisted semi-supervised multi-modality learning for medical endoscopic image segmentation." Pattern Recognition (2026). [paper] [2026.02]
SamFusion: Yucheng Zhang, You Ma, Lin Chai.
"SamFusion: A model for multimodal image fusion guided by SAM’s rich semantics." Infrared Physics & Technology (2026). [paper] [2026.02]
Takahashi, H., Kato, T., Yamashita, M. et al.
"Floating object removal in underwater ROV video images using segment anything model and generative image in-painting." Artif Life Robotics (2026). [paper] [2026.02]
Binzagr, Faisal, and Majed Hariri.
"Foundation-Model-Driven Skin Lesion Segmentation and Classification Using SAM-Adapters and Vision Transformers." Diagnostics (2026). [paper] [2026.02]
StructSAM: Liu, M., Yao, Y., Jia, J. et al.
"StructSAM: structure-aware prompt adaptation for robust lung cancer lesion segmentation in CT." npj Digit. Med.(2026). [paper] [2026.02]
Raza, Tayyab and Ul Haq, Muhammad Arslan and Qanitah Naqvi, Syeda and Ramzan, Hafiz Arslan and Rehman, Abdul and Ramzan, Sadia.
"Brain Tumor Segmentation and Classification Using Multi-Scale SAM and VGG16." ICoDT2 (2026).[paper] [2026.02]
Fast-SAM3D: Weilun Feng, Mingqiang Wu, Zhiliang Chen, Chuanguang Yang, Haotong Qin, Yuqi Li, Xiaokun Liu, Guoxin Fan, Zhulin An, Libo Huang, Yulun Zhang, Michele Magno, Yongjun Xu.
"Fast-SAM3D: 3Dfy Anything in Images but Faster." ArXiv (2026). [paper] [code] [2026.02]
CPS: Jiahao Nie, Yun Xing, Wenbin An, Qingsong Zhao, Jiawei Shao, Yap-Peng Tan, Alex C. Kot, Shijian Lu, Xuelong Li.
"Boosting SAM for Cross-Domain Few-Shot Segmentation via Conditional Point Sparsification." ArXiv (2026). [paper] [2026.02]
AtlasPatch: Ahmed Alagha, Christopher Leclerc, Yousef Kotp, Omar Metwally, Calvin Moras, Peter Rentopoulos, Ghodsiyeh Rostami, Bich Ngoc Nguyen, Jumanah Baig, Abdelhakim Khellaf, Vincent Quoc-Huy Trinh, Rabeb Mizouni, Hadi Otrok, Jamal Bentahar, Mahdi S. Hosseini.
"AtlasPatch: An Efficient and Scalable Tool for Whole Slide Image Preprocessing in Computational Pathology." ArXiv (2026). [paper] [code] [2026.02]
FSOD-VFM: Chen-Bin Feng, Youyang Sha, Longfei Liu, Yongjun Yu, Chi Man Vong, Xuanlong Yu, Xi Shen.
"FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion." ICLR (2026). [paper] [code] [2026.02]
MedSAM-Agent: Shengyuan Liu, Liuxin Bao, Qi Yang, Wanting Geng, Boyun Zheng, Chenxin Li, Wenting Chen, Houwen Peng, Yixuan Yuan.
"MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning." ArXiv (2026). [paper] [code] [2026.02]
WATS-DA: Ganggang Huang, Fasheng Wang, Binbin Wang, Hanwei Li, Mingshu Zhang, Mengyin Wang, Fuming Sun & Haojie Li.
"Wild Animal Tracking with High-Quality Segment Anything Model and Domain Adaptation." IJCV (2026). [paper] [code]
S^3SPOT: Lingsong Wang, Mancheng Meng, Ziyan Wu, Terrence Chen, Fan Yang, Dinggang Shen.
"S^3POT: Contrast-Driven Face Occlusion Segmentation via Self-Supervised Prompt Learning." ArXiv (2026). [paper] [code] [2026.02]
Mamba-SAM: Mohammadreza Gholipour Shahraki, Mehdi Rezaeian, Mohammad Ghasemzadeh.
"A Hybrid Mamba-SAM Architecture for Efficient 3D Medical Image Segmentation." ArXiv (2026). [paper] [2026.02]
ZS-TreeSeg: Pengyu Chen, Fangzheng Lyu, Sicheng Wang, Cuizhen Wang.
"ZS-TreeSeg: A Zero-Shot Framework for Tree Crown Instance Segmentation." ArXiv (2026). [paper] [2026.02]
Penghao Deng, Jidong J. Yang, Jiachen Bian.
"Cross-Paradigm Evaluation of Gaze-Based Semantic Object Identification for Intelligent Vehicles." ArXiv (2026). [paper] [2026.02]
Samuel Church, Joshua D. Warner, Danyal Maqbool, Xin Tie, Junjie Hu, Meghan G. Lubner, Tyler J. Bradshaw.
"Opportunistic Promptable Segmentation: Leveraging Routine Radiological Annotations to Guide 3D CT Lesion Segmentation." ArXiv (2026). [paper] [2026.02]
SEAL: Seungjun Lee, Gim Hee Lee.
"Segment Any Events with Language." ICLR (2026). [paper] [code] [2026.01]
OpenVTON-Bench: Jin Li, Tao Chen, Shuai Jiang, Weijie Wang, Jingwen Luo, Chenhui Wu.
"OpenVTON-Bench: A Large-Scale High-Resolution Benchmark for Controllable Virtual Try-On Evaluation." ArXiv (2026). [paper] [2026.01]
Hive: Kai Li, Jintao Cheng, Chang Zeng, Zijun Yan, Helin Wang, Zixiong Su, Bo Zheng, Xiaolin Hu.
"A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation." ArXiv (2026). [paper] [code] [2026.01]
RectiFine-SAM: Lihong Qiao et al.
"RectiFine-SAM: Feature Rectification and Boundary Refinement for Prompt-Free Medical Lesion Segmentation." ArXiv (2026). [paper] [2026.01]
Yung-Chen Cheng et al.
"Automatic pore characterization in SEM images of foams using a fine-tuned segment anything model." Materials & Design (2026). [paper] [2026.01]
AerOSeg++: Saikat Dutta et al.
"AerOSeg++: Scale-Aware and Texture-Guided Open-Vocabulary Segmentation with SAM Features for Remote Sensing Images." ArXiv (2026). [paper] [2026.01]
AutoPromptSeg: Junan Zhu et al.
"AutoPromptSeg: Automated Decoupling of Uncertainty Prompts with SAM for semi-supervised medical image segmentation." Computerized Medical Imaging and Graphics (2026). [paper] [2026.01]
Scrap-SAM-CLIP: Guangda Bao et al.
"Scrap-SAM-CLIP: Assembling Foundation Models for Typical Shape Recognition in Scrap Classification and Rating." Sensors (2026). [paper] [2026.01]
HL-SAM-Seg: Xiong, Siting and Wu, Linfeng and Zhang, Bochen and Zhang, Dejin and Tao, Yu and Tang, Yuzhi.
"HL-SAM-Seg: Complementary High- and Low-Resolution Features Based on SAM for Remote Sensing Image Semantic Segmentation." TGRS (2026). [paper] [2026.01]
Guo, Pengyu and Jiang, Cuicui and Long, Chenrong and Hu, Qinglei and Li, Dongyu.
"Noncooperative Spacecraft Pose Measurement Without Prior Knowledge Based on SAM2." TIM (2026). [paper] [code] [2026.01]
KTVFR: Guoqing Zhang et al.
"Advancing open-set object detection with SAM knowledge transfer and variational feature reconstruction." Neurocomputing (2026). [paper] [2026.01]
PriorSAM-DBNet: Zhang, Qiwei, Yisong Wang, Ning Li, Quanwen Jiang, and Yong He.
"PriorSAM-DBNet: A SAM-Prior-Enhanced Dual-Branch Network for Efficient Semantic Segmentation of High-Resolution Remote Sensing Images." Sensors (2026). [paper] [2026.01]
Weiping M.A.
"Study on salient object segmentation based on depth information guidance and SAM low-rank adaptation fine-tuning." ArXiv (2026). [paper] [2026.01]
BLO-Inst: Li Zhang, Pengtao Xie.
"BLO-Inst: Bi-Level Optimization Based Alignment of YOLO and SAM for Robust Instance Segmentation." ArXiv (2026). [paper] [code] [2026.01]
DeepSeek-OCR 2: Haoran Wei, Yaofeng Sun, Yukun Li.
"DeepSeek-OCR 2: Visual Causal Flow." ArXiv (2026). [paper] [code] [2026.01]
SAJ: Helin Wang, Bowen Shi, Andros Tjandra, John Hoffman, Yi-Chiao Wu, Apoorv Vyas, Najim Dehak, Ann Lee, Wei-Ning Hsu.
"SAM Audio Judge: A Unified Multimodal Framework for Perceptual Evaluation of Audio Separation." ArXiv (2026). [paper] [code] [2026.01]
DSTCS: Yalin Luo, Shun Long, Huijin Wang, Jieyun Bai.
"DSTCS: Dual-Student Teacher Framework with Segment Anything Model for Semi-Supervised Pubic Symphysis Fetal Head Segmentation." ArXiv (2026). [paper] [2026.01]
Puzhen Wu, Han Weng, Quan Zheng, Yi Zhan, Hewei Wang, Yiming Li, Jiahui Han, Rui Xu.
"CLIP-Guided Unsupervised Semantic-Aware Exposure Correction." ICASSP (2026). [paper] [2026.01]
Zeineb Dridi, Jihen Bennaceur, Amine Ben Hassouna.
"Dynamic Mask-Based Backdoor Attack Against Vision AI Models: A Case Study on Mushroom Detection." ArXiv (2026). [paper] [2026.01]
C-RADIOv4: Mike Ranzinger, Greg Heinrich, Collin McCarthy, Jan Kautz, Andrew Tao, Bryan Catanzaro, Pavlo Molchanov.
"C-RADIOv4 (Tech Report)." ArXiv (2026). [paper] [code] [2026.01]
StealthMark: Qinkai Yu, Chong Zhang, Gaojie Jin, Tianjin Huang, Wei Zhou, Wenhui Li, Xiaobo Jin, Bo Huang, Yitian Zhao, Guang Yang, Gregory Y. H. Lip, Yalin Zheng, Aline Villavicencio, Yanda Meng.
"StealthMark: Harmless and Stealthy Ownership Verification for Medical Segmentation via Uncertainty-Guided Backdoors." ArXiv (2026). [paper] [code] [2026.01]
Rabin Dulal, Wenfeng Jia, Lihong Zheng, Jane Quinn.
"Agreement-Driven Multi-View 3D Reconstruction for Live Cattle Weight Estimation." ArXiv (2026). [paper] [2026.01]
SC-SAM: Vi Vu, Thanh-Huy Nguyen, Tien-Thinh Nguyen, Ba-Thinh Lam, Hoang-Thien Nguyen, Tianyang Wang, Xingjian Li, Min Xu.
"From Specialist to Generalist: Unlocking SAM's Learning Potential on Unlabeled Medical Images." ISBI (2026). [paper] [code] [2026.01]
MV-SAM: Yoonwoo Jeong, Cheng Sun, Yu-Chiang Frank Wang, Minsu Cho, Jaesung Choe.
"MV-SAM: Multi-view Promptable Segmentation using Pointmap Guidance." ArXiv (2026). [paper] [code] [2026.01]
Takato Yasuno.
"Multi-stage Bridge Inspection System: Integrating Foundation Models with Location Anonymization." ArXiv (2026). [paper] [2026.01]
MPS-CLIP: Yifan Li, Shiying Wang, Jianqiang Huang.
"Multi-Perspective Subimage CLIP with Keyword Guidance for Remote Sensing Image-Text Retrieval." ArXiv (2026). [paper] [code] [2026.01]
AutoPromptSeg: Junan Zhu, Zhizhe Tang, Ping Ma, Zheng Liang, Chuanjian Wang.
"AutoPromptSeg: Automated Decoupling of Uncertainty Prompts with SAM for semi-supervised medical image segmentation." Computerized Medical Imaging and Graphics (2026). [paper] [2026.01]
AM-SAM: Li, Y., Zhang, L., Liang, Y. et al.
"Am-sam: a spatially-aware prompt learning and mask calibration framework for few-shot semantic segmentation." International Journal of Machine Learning and Cybernetics (2026). [paper] [code] [2026.01]
Huanyu Li, Li Li, Hao Wang, Weibo Zhang & Peng Ren.
"Large Foundation Model Empowered Region-aware Underwater Image Captioning." IJCV (2026). [paper] [2026.01]
SAMTok: Yikang Zhou, Tao Zhang, Dengxian Gong, Yuanzheng Wu, Ye Tian, Haochen Wang, Haobo Yuan, Jiacong Wang, Lu Qi, Hao Fei, Anran Wang, Zhuochen Wang, Yujing Wang, Cheng Chen, Shunping Ji, Xiangtai Li.
"SAMTok: Representing Any Mask with Two Words." ArXiv (2026). [paper] [code] [2026.01]
FeTal-SAM: Qi Zeng, Weide Liu, Bo Li, Ryne Didier, P. Ellen Grant, Davood Karimi.
"Atlas-Assisted Segment Anything Model for Fetal Brain MRI (FeTal-SAM)." ArXiv (2026). [paper] [2026.01]
BREPS: Andrey Moskalenko, Danil Kuznetsov, Irina Dudko, Anastasiia Iasakova, Nikita Boldyrev, Denis Shepelev, Andrei Spiridonov, Andrey Kuznetsov, Vlad Shakhuro.
"BREPS: Bounding-Box Robustness Evaluation of Promptable Segmentation." AAAI (2026). [paper] [code] [2026.01]
BBoxMaskPose v2: Miroslav Purkrabek, Constantin Kolomiiets, Jiri Matas.
"BBoxMaskPose v2: Expanding Mutual Conditioning to 3D." ArXiv (2026). [paper] [code] [2026.01]
OmniOVCD: Xu Zhang, Danyang Li, Yingjie Xia, Xiaohang Dong, Hualong Yu, Jianye Wang, Qicheng Li.
"OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3." ArXiv (2026). [paper] [2026.01]
OCCAM: Michail Spanakis, Iason Oikonomidis, Antonis Argyros.
"OCCAM: Class-Agnostic, Training-Free, Prior-Free and Multi-Class Object Counting." ArXiv (2026). [paper] [code] [2026.01]
DepthCropSeg++: Jiafei Zhang, Songliang Cao, Binghui Xu, Yanan Li, Weiwei Jia, Tingting Wu, Hao Lu, Weijuan Hu, Zhiguo Han.
"DepthCropSeg++: Scaling a Crop Segmentation Foundation Model With Depth-Labeled Data." IEEE Journal of Selected Topics in Signal Processing (2026). [paper] [2026.01]
SynthFM-3D: Satrajit Chakrabarty, Sourya Sengupta, Gopal Avinash, Ravi Soni.
"Synthetic Volumetric Data Generation Enables Zero-Shot Generalization of Foundation Models in 3D Medical Image Segmentation." ISBI (2026). [paper] [2026.01]
SAMA: Zezhong Fan, Xiaohan Li, Topojoy Biswas, Kaushiki Nag, Kannan Achan.
"Segment and Matte Anything in a Unified Model." AAAI (2026). [paper] [2026.01]
VideoMaMa: Sangbeom Lim, Seoung Wug Oh, Jiahui Huang, Heeji Yoon, Seungryong Kim, Joon-Young Lee.
"VideoMaMa: Mask-Guided Video Matting via Generative Prior." ArXiv (2026). [paper] [code] [2026.01]
MQC-SAM: H. Jiang, Y. Sun, Z. Dong, T. Liu, Y. Gu.
"CroBIM-V: Memory-Quality Controlled Remote Sensing Referring Video Object Segmentation." ArXiv (2026). [paper] [2026.01]
3D LPA: Yanrui Lu, Danyang Chen, Haowen Xiao, Jiarui Zhu, Fukang Ge, Binqian Zou, Jiali Guan, Jiayin Liang, Yuting Wang, Ziqian Guan, Xiangcheng Bao, Jinhao Bi, Lin Gu, Jun He, Yingying Zhu.
"Large-scale EM Benchmark for Multi-Organelle Instance Segmentation in the Wild." ArXiv (2026). [paper] [2026.01]
Raffaele Mazza, Ciro Natale, Pietro Falco.
"Active Cross-Modal Visuo-Tactile Perception of Deformable Linear Objects." ArXiv (2026). [paper] [2026.01]
Medical SAM3: Chongcong Jiang, Tianxingjian Ding, Chuhan Song, Jiachen Tu, Ziyang Yan, Yihua Shao, Zhenyi Wang, Yuzhang Shang, Tianyu Han, Yu Tian.
"Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation." ArXiv (2026). [paper] [code] [2026.01]
PRISM-CAFO: Oishee Bintey Hoque, Nibir Chandra Mandal, Kyle Luong, Amanda Wilson, Samarth Swarup, Madhav Marathe, Abhijin Adiga.
"PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs." ArXiv (2026). [paper] [code] [2026.01]
WetSAM: Shuai Yuan, Tianwu Lin, Shuang Chen, Yu Xia, Peng Qin, Xiangyu Liu, Xiaoqing Xu, Nan Xu, Hongsheng Zhang, Jie Wang, Peng Gong.
"Wetland mapping from sparse annotations with satellite image time series and temporal-aware segment anything model." ArXiv (2026). [paper] [2026.01]
SAMannot: Gergely Dinya, András Gelencsér, Krisztina Kupán, Clemens Küpper, Kristóf Karacs, Anna Gelencsér-Horváth.
"SAMannot: A Memory-Efficient, Local, Open-source Framework for Interactive Video Instance Segmentation based on SAM2." ArXiv (2026). [paper] [2026.01]
VMPicker: Bo Zhu et al.
"VMPicker: A novel cryo-EM particle picker leveraging vision mamba and the segment anything model." Micron (2026). [paper] [2026.01]
Jiuyi Zhang, Jiaqi Ji, Sijia Feng, Huiying Ru.
"3D Gaussian-Driven SAM2 Repair Method]{3D Gaussian-Driven SAM2 Multi-View Fusion Detection and Triple-Constrained Symmetry Plane Generation Repair Method." ArXiv (2026). [paper] [2026.01]
FILFArch: Huang, Junqing and Ji, Shucheng and Wang, Yapeng and Xia, Min and Yuan, Xiaochen.
"An SAM Fine-Tuning Framework With Frequency-Domain Interactive LoRA for Remote Sensing Change Detection." TGRS (2026). [paper] [code] [2026.01]
Miao Liu et al.
"Settlements Extraction and Spatiotemporal Analysis with SAM and Random Forest from High-Resolution Remote Sensing." Agriculture Communications (2026). [paper] [2026.01]
PMG-SAM: Gao, Jixue, Xiaoyan Jiang, Anjie Wang, Yongbin Gao, Zhijun Fang, and Michael S. Lew.
"PMG-SAM: Boosting Auto-Segmentation of SAM with Pre-Mask Guidance." Sensors (2026). [paper] [2026.01]
TextSAM: Xiang, Y., Xian, Y., Cairang, X. et al.
"Handwritten text line segmentation with TextSAM: An enhanced segment anything model via multi-module fusion." IJDAR (2026). [paper] [2026.01]
Deboch Eyob Abera et al.
"Automated prompt-guided multi-modality cell segmentation with shape-aware classification and boundary-aware SAM adaptation." Displays (2026). [paper] [code] [2026.01]
DescriptorMedSAM: Zhang, W., Luo, L., He, M. et al.
"DescriptorMedSAM: language-image fusion with multi-aspect text guidance for medical image segmentation." Sci Rep (2026). [paper] [code] [2026.01]
TA-MedSAM: Siyuan Tang et al.
"TA-MedSAM: Text-augmented improved MedSAM for pulmonary lesion segmentation." Computerized Medical Imaging and Graphics (2026). [paper] [2026.01]
SAMURAI: Yang, Cheng-Yeng and Huang, Hsiang-Wei and Jiang, Zhongyu and Chai, Wenhao and Hwang, Jenq-Neng.
"SAMURAI: Motion-Aware Memory for Training-Free Visual Object Tracking with SAM 2." TIP (2026). [paper] [code] [2026.01]
MedVL-SAM2: Yang Xing, Jiong Wu, Savas Ozdemir, Ying Zhang, Yang Yang, Wei Shao, Kuang Gong.
"MedVL-SAM2: A unified 3D medical vision-language model for multimodal reasoning and prompt-driven segmentation." ArXiv (2026). [paper] [2026.01]
SAM-guided-RGB-D-COD: Dongdong Zhang and Chunping Wang and Qiang Fu and Yao Song.
"SAM-guided Depth-aware Weakly Supervised Camouflaged Object Detection with Spatial-Frequency Exploration." KBS (2026). [paper] [code] [2026.01]
SAM3-DMS: Ruiqi Shen, Chang Liu, Henghui Ding.
"SAM3-DMS: Decoupled Memory Selection for Multi-target Video Segmentation of SAM3." ArXiv (2026). [paper] [code] [2026.01]
BrainSegNet: Yucheng Li, Xiaofan Wang, Junyi Wang, Yijie Li, Xi Zhu, Mubai Du, Dian Sheng, Wei Zhang, Fan Zhang.
"BrainSegNet: A Novel Framework for Whole-Brain MRI Parcellation Enhanced by Large Models." ArXiv (2026). [paper] [2026.01]
SAM-Aug: Kai Hu, Yaozu Feng, Vladimir Lysenko, Ya Guo Member, Huayi Wu.
"SAM-Aug: Leveraging SAM Priors for Few-Shot Parcel Segmentation in Satellite Time Series." ArXiv (2026). [paper] [2026.01]
SAM-pose2seg: Constantin Kolomiiets, Miroslav Purkrabek, Jiri Matas.
"SAM-pose2seg: Pose-Guided Human Instance Segmentation in Crowds." ArXiv (2026). [paper] [code] [2026.01]
3AM: Yang-Che Sun, Cheng Sun, Chin-Yang Lin, Fu-En Yang, Min-Hung Chen, Yen-Yu Lin, Yu-Lun Liu.
"3AM: Segment Anything with Geometric Consistency in Videos." ArXiv (2026). [paper] [code] [2026.01]
Sunusi Ibrahim Muhammad, Ismail Ismail Tijjani, Saadatu Yusuf Jumare, Fatima Isah Jibrin.
"Sesame Plant Segmentation Dataset: A YOLO Formatted Annotated Dataset." ICCAIT (2026). [paper] [code] [2026.01]
SAM-RefiSeR: Dillan Imans, Phuoc-Nguyen Bui, Duc-Tai Le, Hyunseung Choo.
"Unsupervised Domain Adaptation with SAM-RefiSeR for Enhanced Brain Tumor Segmentation." ArXiv (2026). [paper] [2026.01]
PanoSAMic: Mahdi Chamseddine, Didier Stricker, Jason Rambach.
"PanoSAMic: Panoramic Image Segmentation from SAM Feature Encoding and Dual View Fusion." ArXiv (2026). [paper] [code] [2026.01]
Aizierjiang Aiersilan, Ruting Cheng, James Hahn.
"Investigating Anthropometric Fidelity in SAM 3D Body." ArXiv (2026). [paper] [2026.01]
Sanjay Pradeep, Chen Wang, Matthew M. Dahm, Jeff D. Eldredge, Candace S. J. Tsai.
"Quantification and Classification of Carbon Nanotubes in Electron Micrographs using Vision Foundation Models." ArXiv (2026). [paper] [2026.01]
WaveRNet: Chanchan Wang, Yuanfang Wang, Qing Xu, Guanxin Chen.
"WaveRNet: Wavelet-Guided Frequency Learning for Multi-Source Domain-Generalized Retinal Vessel Segmentation." ESWA (2026). [paper] [code] [2026.01]
Prompt-Free SAM: Samuel E. Johnny, Bernes L. Atabonfack, Israel Alagbe, Assane Gueye.
"Prompt-Free SAM-Based Multi-Task Framework for Breast Ultrasound Lesion Segmentation and Classification." ArXiv (2026). [paper] [2026.01]
SSP-SAM: Tang, Wei and Liu, Xuejing and Sun, Yanpeng and Li, Zechao.
"SSP-SAM: SAM with Semantic-Spatial Prompt for Referring Expression Segmentation." TCSVT (2026). [paper] [code] [2026.01]
DivAS: Ayush Pande.
"DivAS: Interactive 3D Segmentation of NeRFs via Depth-Weighted Voxel Aggregation." ArXiv (2026). [paper] [2026.01]
Detector-Augmented SAMURAI: Tamara R. Lenhard, Andreas Weinmann, Hichem Snoussi, Tobias Koch.
"Detector-Augmented SAMURAI for Long-Duration Drone Tracking." WACV Workshop (2026). [paper] [2026.01]
HyperCOD & HSC-SAM: Shuyan Bai, Tingfa Xu, Peifu Liu, Yuhao Qiu, Huiyan Bai, Huan Chen, Yanyan Peng, Jianan Li.
"HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object Detection." ArXiv (2026). [paper] [code] [2026.01]
SemiBCP-SAM2: Guangqi Yang et al.
"SemiBCP-SAM2: Semi-supervised model via enhanced bidirectional copy-paste based on SAM2 for medical image segmentation." Information Processing & Management (2026). [paper] [code] [2026.01]
SAMOIS: Bing He et al.
"SAMOIS: efficient fine-tuned SAM with multi-scale enhancement for optical remote sensing image segmentation." European Journal of Remote Sensing (2026). [paper] [2026.01]
LoGoSAM: Khang Ta Gia, Quan Nguyen Dinh, Giang Kang Dong & Tho Quan Thanh.
"LoGoSAM: Enhancing Prototypical Networks for Medical Image One-Shot Segmentation Using Local-Global Encoder Integration and Visual Prompting." ICTIS (2026). [paper] [2026.01]
XSegTx-SAM2: Devis Salierno et al.
"Ego-Exo Object Correspondence bySAM2 and Cross-View Prompting." ICIAP (2026). [paper] [2026.01]
E2SAM: Ziyi Li, Yinghui Xing, Feng Sang, Shizhou Zhang, Lingyan Ran & Yanning Zhang.
"E2SAM: Edge-Enhanced SAM with FFC Adapter for Few-Shot Infrared Small Target Detection." JCRAI (2026). [paper] [2026.01]
Jiaxuan Wang et al.
"Weakly Supervised Blue-Carbon Mapping of Reef Algae with SAM-Bootstrapped NnU-Net." ACIVS (2026). [paper] [2026.01]
Jieming Yu et al.
"SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation." EMA4MICCAI (2026). [paper] [2026.01]
EvSAM: Yi Ding, Bowen Yao, Yuhan Liu, Hao Chen, Ding Ding, Zhen Yang, Youfu Li, Yongjian Deng.
"EvSAM: Segment Anything Model with Event-based Assistance." ACM Trans. Multimedia Comput. Commun. Appl. (2026). [paper] [2026.01]
Fatih Fehmi Şimşek, Melih Altay.
"Phenology aware agricultural boundary extraction using segment anything model and planet scope imagery (zero shot learning approach)." Advances in Space Research (2026). [paper] [2026.01]
Yangxin Liu, De Li, and Xun Jin.
"Research on game object segmentation method based on SAM." ICEEIE (2026). [paper] [2026.01]
DGA-Net: Yuetong Li, Qing Zhang, Yilin Zhao, Gongyang Li, Zeming Liu.
"DGA-Net: Enhancing SAM with Depth Prompting and Graph-Anchor Guidance for Camouflaged Object Detection." ArXiv (2026). [paper] [2026.01]
Longzhen Li, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama.
"Foreground-Aware Dataset Distillation via Dynamic Patch Selection." ArXiv (2026). [paper] [2026.01]
PatchAlign3D: Souhail Hadgi, Bingchen Gong, Ramana Sundararaman, Emery Pierson, Lei Li, Peter Wonka, Maks Ovsjanikov.
"PatchAlign3D: Local Feature Alignment for Dense 3D Shape Understanding." ArXiv (2026). [paper] [code] [2026.01]
TopoLoRA-SAM: Salim Khazem.
"TopoLoRA-SAM: Topology-Aware Parameter-Efficient Adaptation of Foundation Segmenters for Thin-Structure and Cross-Domain Binary Semantic Segmentation." ArXiv (2026). [paper] [code] [2026.01]
GleSAM++: Guangqian Guo, Aixi Ren, Yong Guo, Xuehui Yu, Jiacheng Tian, Wenli Li, Yaoxing Wang, Shan Gao.
"Towards Any-Quality Image Segmentation via Generative and Adaptive Latent Space Enhancement." ArXiv (2026). [paper] [code] [2026.01]
Riccardo Gelato, Carlo Sgaravatti, Jakob Grahn, Giacomo Boracchi, Filippo Maria Bianchi.
"Promptable Foundation Models for SAR Remote Sensing: Adapting the Segment Anything Model for Snow Avalanche Segmentation." ArXiv (2026). [paper] [2026.01]
Devis Salierno, Matteo Dunnhofer & Christian Micheloni.
"Ego-Exo Object Correspondence by SAM2 and Cross-View Prompting." ICIAP (2026). [paper] [2026.01]
VNS-SAM: Guangqian Guo, Pengfei Chen, Yong Guo, Huafeng Chen, Boqiang Zhang, Shan Gao.
"Boosting Segment Anything Model to Generalize Visually Non-Salient Scenarios." TIP (2026). [paper] [code] [2026.01]
Miguel Abreu Cardenas, et al.
"Few-Shot Cataract Detection via Feature Density Learning: Evaluating SAM Models and Backbone Embeddings." ArXiv (2026). [paper] [2026.01]

2025

Evol-SAM3: Kai Ye, Xiaotong You, Jianghang Lin, Jiayi Ji, Pingyang Dai, Liujuan Cao.
"Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting." ArXiv (2025). [paper] [code] [2025.12]
Edit3r: Jiageng Liu, Weijie Lyu, Xueting Li, Yejie Guo, Ming-Hsuan Yang.
"Edit3r: Instant 3D Scene Editing from Sparse Unposed Images." ArXiv (2025). [paper] [code] [2025.12]
OFL-SAM2: Meng Lan, Lefei Zhang, Xiaomeng Li.
"OFL-SAM2: Prompt SAM2 with Online Few-shot Learner for Efficient Medical Image Segmentation." AAAI (2026). [paper] [code] [2025.12]
Hilbert-VLM: Hao Wu, Hui Li, Yiyun Su.
"Bridging the Perception-Cognition Gap:Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis." ArXiv (2025). [paper] [2025.12]
CF-SAM: Mao, Yanliang, Kubwimana Olivier, Guangzhi Niu, and Liping Chen.
"CF-SAM: An Efficient and Precise SAM Model for Instance Segmentation of Cotton Top Leaves." Agronomy (2025). [paper] [2025.12]
Shoreline-SAM: Chen, Guoquan and Tang, Hutian and Wang, Weijun and Xie, Daoshun.
"A Temporal-Enhanced Segment Anything Model for Precise and Robust Shoreline Segmentation." ArXiv (2025). [paper] [2025.12]
Huibin Li, et al.
"Achieving precise cropland parcel extraction from remote sensing images through integration of segment anything model and adaptive mask refinement." CEA (2025). [paper] [2025.12]
YSAM-SLAM: Zhang Lin, Chen Tao, Yang Ming, Ma ZongFang, Zhang YingJie, Chen Yun, Gao Xiao, Xin MeiTing.
"YSAM-SLAM: A real-time performance enhancement algorithm for visual SLAM in dynamic environments." Information Sciences (2025). [paper] [2025.12]
Jeongbae Jeon and Solhee Kim and STaegon Kim.
"Improving agricultural land use detection through YOLO–SAM fusion framework." Information Processing in Agriculture (2025). [paper] [[code](https://data.mendeley.com/datasets/znxyv9rtwp/1 (DOI: 10.17632/znxyv9rtwp.1)] [2025.12]
UncertSAM: Jesse Brouwers, Xiaoyan Xing, Alexander Timans .
"Towards Integrating Uncertainty for Domain-Agnostic Segmentation." NeurIPS Workshop (2025). [paper] [code] [2025.12]
SOFTooth: Xiaolan Li, Wanquan Liu, Pengcheng Li, Pengyu Jie, Chenqiang Gao .
"SOFTooth: Semantics-Enhanced Order-Aware Fusion for Tooth Instance Segmentation." ArXiv (2025). [paper] [2025.12]
Tiny-YOLOSAM:: Kenneth Xu, Songhan Wu .
"Tiny-YOLOSAM: Fast Hybrid Image Segmentation." ArXiv (2025). [paper] [code] [2025.12]
SAM3_Tracking_Zoo: Mohamad Alansari, Muzammal Naseer, Hasan Al Marzouqi, Naoufel Werghi, Sajid Javed .
"Rethinking Memory Design in SAM-Based Visual Object Tracking." ArXiv (2025). [paper] [code] [2025.12]
Junsheng Yao, Lichao Mou, Qingyu Li .
"SAM 3D for 3D Object Reconstruction from Remote Sensing Images ." ArXiv (2025). [paper] [2025.12]
Brayden Miao, Zain Rehman, Xin Miao, Siming Liu, Jianjie Wang.
"MedSAM-based lung masking for multi-label chest X-ray classification." ArXiv (2025). [paper] [2025.12]
Scalpel-SAM: Zihan Liu, Xiangning Ren, Dezhang Kong, Yipeng Zhang, Meng Han .
"Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection ." ArXiv (2025). [paper] [2025.12]
BadVSFM: Zongmin Zhang, Zhen Sun, Yifan Liao, Wenhan Dong, Xinlei He, Xingshuo Han, Shengmin Xu, Xinyi Huang.
"Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models." ArXiv (2025). [paper] [2025.12]
Zhangzheng Tum, Kailun Su, Shaolong Zhu, Yukun Zheng.
"World-Coordinate Human Motion Retargeting via SAM 3D Body." ArXiv (2025). [paper] [2025.12]
Marek SOMPOLSKI, Michał TYMPALSKI, and Wojciech MILCZAREK.
"Exploratory Research on the Implementation of Segment Anything Model (SAM) 2 for Glacier Calving Front Detection using SAR Imagery." ArXiv (2025). [paper] [2025.12]
SAM-MedClass: Zhu, H., Zhu, Z. & Lu, SY.
"Multi-stage attention for efficient brain tumor classification with SAM-Med2D." Multimedia Systems(2025). [paper] [2025.12]
Zhong, Xiukun, Guohong Liang, Lingbei Meng, Wei Xi, Lin Gu, Nana Tian, Yong Zhai, Yutong He, Yuqiong Huang, Fengmin Jin, and et al.
"Automated Particle Size Analysis of Supported Nanoparticle TEM Images Using a Pre-Trained SAM Model." Nanomaterials (2025). [paper] [2025.12]
SAM-FireAdapter: Yanan Wu, Chaoqun Hong, Yongfeng Chen, Haixi Cheng.
"SAM-FireAdapter: An adapter for fire segmentation with SAM." Journal of Visual Communication and Image Representation (2025). [paper] [2025.12]
M. Cihad Arslanoglu and Abdulkadir Albayrak and Huseyin Acar.
"Towards automated metaphase cell detection using foundation models: A SAM and DINO-based approach." Engineering Science and Technology, an International Journal (2025). [paper] [2025.12]
Avilasha Mandal, Chaoning Zhang, Fachrina Dewi Puspitasari, Xudong Wang, Jiaquan Zhang, Caiyan Qin, Guoqing Wang, Yang Yang, Heng Tao Shen.
"Fast SAM2 with Text-Driven Token Pruning." ArXiv (2025). [paper] [2025.12]
D3ETOR: Jiawei Ge, Jiuxin Cao, Xinyi Li, Xuelin Zhu, Chang Liu, Bo Liu, Chen Feng, Ioannis Patras.
"D3ETOR: Debate-Enhanced Pseudo Labeling and Frequency-Aware Progressive Debiasing for Weakly-Supervised Camouflaged ObjectDetection with Scribble Annotations." ArXiv (2025). [paper] [2025.12]
Think2Seg-RS: Xu Zhang, Junyao Ge, Yang Zheng, Kaitai Guo, Jimin Liang.
"Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing." ArXiv (2025). [paper] [code] [2025.12]
Charilaos Kapelonis, Marios Antonakakis, Konstantinos Politof, Aristomenis Antoniadis, Michalis Zervakis.
"Automated Mosaic Tesserae Segmentation via Deep Learning Techniques." ArXiv (2025). [paper] [2025.12]
FALCON-SFOD: Sairam VCR, Rishabh Lalla, Aveen Dayal, Tejal Kulkarni, Anuj Lalla, Vineeth N Balasubramanian, Muhammad Haris Khan.
"Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection." ArXiv (2025). [paper] [2025.12]
ReMeDI-SAM3: Valay Bundele, Mehran Hosseinzadeh, Hendrik P. A. Lensch.
"Memory-Enhanced SAM3 for Occlusion-Robust Surgical Instrument Segmentation." ArXiv (2025). [paper] [code] [2025.12]
SNOW: Tin Stribor Sohn, Maximilian Dillitzer, Jason J. Corso, Eric Sax.
"SNOW: Spatio-Temporal Scene Understanding with World Knowledge for Open-World Embodied Reasoning." ArXiv (2025). [paper] [2025.12]
SegGraph: Yueyang Hu, Haiyong Jiang, Haoxuan Song, Jun Xiao, Hao Pan.
"SegGraph: Leveraging Graphs of SAM Segments for Few-Shot 3D Part Segmentation." NeurIPS (2025). [paper] [code] [2025.12]
Roni Blushtein-Livnon, Osher Rafaeli, David Ioffe, Amir Boger, Karen Sandberg Esquenazi, Tal Svoray.
"On the Effectiveness of Textual Prompting with Lightweight Fine-Tuning for SAM3 Remote Sensing Segmentation." ArXiv (2025). [paper] [2025.12]
Leo Segre, Or Hirschorn, Shai Avidan.
"Multi-View Foundation Model." ArXiv (2025). [paper] [code] [2025.12]
MoonSeg3R: Zhipeng Du, Duolikun Danier, Jan Eric Lenssen, Hakan Bilen.
"MoonSeg3R: Monocular Online Zero-Shot Segment Anything in 3D with Reconstructive Foundation Priors." ArXiv (2025). [paper] [2025.12]
Hao Tian and Tingting Zhao and Tongming Qu and Shaoteng Liu and Kaixuan Ju and Zhiqiang Li and Yuntian Feng.
"Virtual dispersion and gradation prediction of stacked particles via improved Pix2Pix and SAM framework." Computers and Geotechnics (2025). [paper] [2025.12]
SAM-SPOT: Felix Y. Zhou, Adam Norton-Steele, Lewis Marsh, Helen M. Byrne, Heather A. Harrington, Xin Lu.
"Development of a universal imaging “phenome” using Shape, Appearance and Motion (SAM) features and the SAM Observation Tool (SPOT)." ArXiv (2025). [paper] [2025.12]
Quang, N.H., Kim, N., Lee, H. et al.
"Semantic water body extraction by the high-quality segment anything model using multiple optical and SAR imagery." Acta Geophys (2025). [paper] [2025.12]
CellSAM: Marks, M., Israel, U., Dilip, R. et al..
"CellSAM: a foundation model for cell segmentation." Nature Methods (2025). [paper] [code] [2025.12]
SaSAM: You Ma and Hongwei Tong and Lin Chai and Shihan Mao and Yucheng Zhang.
"SaSAM: Scale-aware segmentation anything model for multimodal remote sensing images." Information Fusion (2025). [paper] [code] [2025.12]
DASAM: Yang, Lihong, Pengfei Liu, Guilong Zhang, Huaici Zhao, and Chunyang Zhao.
"Domain-Adaptive Segment Anything Model for Cross-Domain Water Body Segmentation in Satellite Imagery." Journal of Imaging (2025). [paper] [2025.12]
Hongyu Chen and Zhen Zhang and Jie Su and Siya Wen.
"Robust water level measurement using adaptive prompt staff gauge image segmentation based on EdgeSAM." Journal of Hydrology (2025). [paper] [2025.12]
Buyukpatpat, H., Sezer, E.A. & Guzel, M.S.
"A Comparative Evaluation of Zero-Shot Performance of SAM, SAM2, MedSAM, and MedSAM2 Models on Lung Segmentation." J Digit Imaging. Inform. med.(2025). [paper] [2025.12]
Gauthami Vijayakumar Kuttuva , Prawin J.
"Structural defect segmentation using a semi-supervised algorithm integrating YOLO and the segment anything model." Automation in Construction (2025). [paper] [2025.12]
Mandal, S., Saha, A.
"A novel deep learning based spatial ensemble approach and segment anything model for landslide risk assessment in Chamoli district of Garhwal Himalayas." Sci Rep (2025). [paper] [2025.12]
SemanticHRI: Hengxu You, et al.
"Semantic: An Advanced Human-Robot Collaboration System Based on FastSAM." Computing in Civil Engineering (2025). [paper] [2025.12]
SAM2-DEGNet: Zhong, Z., Jiao, G., Li, G. et al.
"SAM2-DEGNet: dual-stage edge guidance network for camouflaged object detection using SAM2." Vis Comput (2025). [paper] [code] [2025.12]
UniVCD: Ziqiang Zhu, Bowei Yang.
"UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era." ArXiv (2025). [paper] [code] [2025.12]
Ranjan Sapkota, Konstantinos I. Roumeliotis, Manoj Karkee, Nikolaos D. Tselikas.
"Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO Detectors." ArXiv (2025). [paper] [code] [2025.12]
SAM2VideoX: Yang Fei, George Stoica, Jingyuan Liu, Qifeng Chen, Ranjay Krishna, Xiaojuan Wang, Benlin Liu.
"Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation." ArXiv (2025). [paper] [code] [2025.12]
3DTeethSAM: Zhiguo Lu, Jianwen Lou, Mingjun Ma, Hairong Jin, Youyi Zheng, Kun Zhou.
"3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation." AAAI (2025). [paper] [code] [2025.12]
Depth-Copy-Paste: Qiushi Guo.
"Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection." ArXiv (2025). [paper] [2025.12]
SSL-MedSAM2: Zhendi Gong, Xin Chen.
"SSL-MedSAM2: A Semi-supervised Medical Image Segmentation Framework Powered by Few-shot Learning of SAM2." MICCAI workshop (2025). [paper] [code] [2025.12]
MaGRoad: Wenfei Guan, Jilin Mei, Tong Shen, Xumin Wu, Shuo Wang, Cheng Min, Yu Hu.
"Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction." ArXiv (2025). [paper] [code] [2025.12]
Polyp-DiFoM: Shivanshu Agnihotri, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha.
"From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation." ArXiv (2025). [paper] [code] [2025.12]
Zihao Ding, Mufeng Zhu, Zhongze Tang, Sheng Wei, Yao Liu.
"A Distributed Framework for Privacy-Enhanced Vision Transformers on the Edge." SEC (2025). [paper] [2025.12]
Senem Aktas, Charles Markham, John McDonald, Rozenn Dahyot.
"Benchmarking SAM2-based Trackers on FMOX." AICS (2025). [paper]
MultiMotion: Penghui Liu, Jiangshan Wang, Yutong Shen, Shanhui Mo, Chenyang Qi, Yue Ma.
"MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer." ArXiv (2025). [paper] [2025.12]
SAM-Body4D: Mingqi Gao, Yunqi Miao, Jungong Han.
"SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos." ArXiv (2025). [paper] [code] [2025.12]
LapFM: Qing Xu, Kun Yuan, Yuxiang Luo, Yuhao Zhai, Wenting Duan, Nassir Navab, Zhen Chen.
"LapFM: A Laparoscopic Segmentation Foundation Model via Hierarchical Concept Evolving Pre-training." ArXiv (2025). [paper] [code] [2025.12]
OpenMonoGS-SLAM: Jisang Yoo, Gyeongjin Kang, Hyun-kyu Ko, Hyeonwoo Yu, Eunbyung Park.
"OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics." ArXiv (2025). [paper] [2025.12]
SegEarth-OV3: Kaiyu Li, Shengqi Zhang, Yupeng Deng, Zhi Wang, Deyu Meng, Xiangyong Cao.
"SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images." ArXiv (2025). [paper] [code] [2025.12]
SAM+CSRT: Chamath Ranasinghe, Uthayasanker Thayasivam.
"Team-Aware Football Player Tracking with SAM: An Appearance-Based Approach to Occlusion Recovery." ArXiv (2025). [paper] [2025.12]
YOLO-World+SAM: Yu Zhu, Naoya Chiba, Koichi Hashimoto.
"Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion." BMVC (2025). [paper] [2025.12]
Wenzhen Dong, Jieming Yu, Yiming Huang, Hongqiu Wang, Lei Zhu, Albert C. S. Chung, Hongliang Ren, Long Bai.
"More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery." ArXiv (2025). [paper] [2025.12]
Ranjan Sapkota, Konstantinos I. Roumeliotis, Manoj Karkee.
"The SAM2-to-SAM3 Gap in the Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation." ArXiv (2025). [paper] [code] [2025.12]
DYNAPO: Zhuoyuan Wu, Xurui Yang, Jiahui Huang, Yue Wang, Jun Gao.
"The Dynamic Prior: Understanding 3D Structures for Casual Dynamic Videos." ArXiv (2025). [paper] [code] [2025.12]
DepSeg: Kunyi Yang, Qingyu Wang, Cheng Yuan, Yutong Ban.
"See in Depth: Training-Free Surgical Scene Segmentation with Monocular Depth Priors." ArXiv (2025). [paper] [2025.12]
SAM3-I: Jingjing Li, Yue Feng, Yuchen Guo, Jincai Huang, Yongri Piao, Qi Bi, Miao Zhang, Xiaoqi Zhao, Qiang Chen, Shihao Zou, Wei Ji, Huchuan Lu, Li Cheng.
"SAM3-I: Segment Anything with Instructions." ArXiv (2025). [paper] [code] [2025.12]
BA-TTA-SAM: Chenlin Xu, Lei Zhang, Lituan Wang, Xinyu Pu, Pengfei Ma, Guangwu Qian, Zizhou Wang, Yan Wang.
"Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation." ArXiv (2025). [paper] [code] [2025.12]
Sheng Hang, Chaoxiang He, Hongsheng Hu, Hanqing Hu, Bin Benjamin Zhu, Shi-Feng Sun, Dawu Gu, Shuo Wang.
"Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot." ArXiv (2025). [paper] [2025.12]
Motion4D: Haoran Zhou, Gim Hee Lee.
"Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding." NeurIPS (2025). [paper] [code] [2025.12]
Kwaku Opoku-Ware, Gideon Opoku.
"AfroBeats Dance Movement Analysis Using Computer Vision: A Proof-of-Concept Framework Combining YOLO and Segment Anything Model." ArXiv (2025). [paper] [2025.12]
NAS-LoRA: Renqi Chen, Haoyang Su, Shixiang Tang.
"NAS-LoRA: Empowering Parameter-Efficient Fine-Tuning for Visual Foundation Models with Searchable Adaptation." ArXiv (2025). [paper] [2025.12]
SAM2Grasp: Shengkai Wu, Jinrong Yang, Wenqiu Luo, Linfeng Gao, Chaohui Shang, Meiyu Zhi, Mingshan Sun, Fangping Yang, Liangliang Ren, Yong Zhao.
"SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction." ArXiv (2025). [paper] [2025.12]
LISA-3D: Zhongbin Guo, Jiahe Liu, Wenyu Gao, Yushan Li, Chengzhi Li, Ping Jian.
"LISA-3D: Lifting Language-Image Segmentation to 3D via Multi-View Consistency." ArXiv (2025). [paper] [code] [2025.12]
SAM3-UNet: Xinyu Xiong, Zihuang Wu, Lei Lu, Yufa Xia.
"SAM3-UNet: Simplified Adaptation of Segment Anything Model 3." ArXiv (2025). [paper] [code] [2025.12]
Syed Hesham Syed Ariff, Yun Liu, Guolei Sun, Jing Yang, Henghui Ding, Xue Geng, Xudong Jiang.
"Evaluating SAM2 for Video Semantic Segmentation." ArXiv (2025). [paper] [2025.12]
Qi Song, Ziyuan Luo, Renjie Wan.
"Creating Blank Canvas Against AI-enabled Image Forgery." AAAI (2026). [paper] [code] [2025.11]
Satrajit Chakrabarty, Ravi Soni.
"Comparing SAM 2 and SAM 3 for Zero-Shot Segmentation of 3D Medical Data." ArXiv (2025). [paper] [2025.11]
Yaqi Wang, Zhi Li, Chengyu Wu, Jun Liu, Yifan Zhang, Jiaxue Ni, Qian Luo, Jialuo Chen, Hongyuan Zhang, Jin Liu, Can Han, Kaiwen Fu, Changkai Ji, Xinxu Cai, Jing Hao, Zhihao Zheng, Shi Xu, Junqiang Chen, Qianni Zhang, Dahong Qian, Shuai Wang, Huiyu Zhou.
"MICCAI STS 2024 Challenge: Semi-Supervised Instance-Level Tooth Segmentation in Panoramic X-ray and CBCT Images." ArXiv (2025). [paper] [code] [2025.11]
RobotSeg: Haiyang Mei, Qiming Huang, Hai Ci, Mike Zheng Shou.
"RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video." ArXiv (2025). [paper] [code] [2025.11]
YOLIC Labeling: Su, Kai and Zhao, Aihua and Hua, Jing and Chen, Chunqin.
"YOLIC Labeling: A Semi-Automated Image Annotation Tool with Segment Anything Model for Cell-wise Labeling." ArXiv (2025). [paper] [2025.11]
UnSAM-MoME: Jing Li and Yixuan Wu and Xiaorou Zheng and Shoubin Dong.
"Unsupervised SAM-guided mixture-of-multimodal-experts fusion network for medical image diagnosis." Neural Networks (2025). [paper] [2025.11]
ProtoSAM : Ayzenberg, L., Giryes, R. & Greenspan, H.
"ProtoSAM for automated one shot medical image segmentation using foundational models." Sci Rep (2025). [paper] [2025.11]
SAM-KDNet: Omid Halimi Milani et al.
"SAM-KDNet: A Segmentation and Knowledge Distillation Framework for Automated CVM Stages Classification from CBCT." ArXiv (2025). [paper] [2025.11]
Jun-Seok Yun et al.
"MedSAM-prior-guided coarse-to-fine CBCT synthesis from ZTE MRI." ArXiv (2025). [paper] [2025.11]
Maria Chiara Fiorentino and Lorenzo Federici and Alessandro Pietro La Camera and Enrico Gianluca Caiani.
"Adapt or specialize? A comprehensive evaluation of adapted SAM versus task-specific CNNs for fetal abdominal segmentation." Computer Methods and Programs in Biomedicine (2025). [paper] [2025.11]
UAT-SAM: Cao, Zhuoyuan, Kevin Wang, Saleh Abdelrahman, Jeffery Wu, and Dharsan Ravindran.
"Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization." Electronics (2025). [paper] [2025.11]
MAFF-Net: Yifeng Zhou and Huiling Gong and Ruiyun Qiu and Shaofeng Wei and Zhixun Li and Wei Zhang.
"MAFF-Net: SAM-powered Mixed Multi-scale Perception Adaptation and frequency-guided feature fusion for robust skin lesion segmentation." Biomedical Signal Processing and Control (2025). [paper] [2025.11]
Vision-Language SAM: Guangyu Ren · Hengyan Liu · Michalis Lazarou · Tania Stathaki.
"Multi-modal Segment Anything Model for Camouflaged Scene Segmentation." ICCV (2025). [paper] [code] [2025.11]
V^2-SAM: Jiancheng Pan, Runze Wang, Tianwen Qian, Mohammad Mahdi, Yanwei Fu, Xiangyang Xue, Xiaomeng Huang, Luc Van Gool, Danda Pani Paudel, Yuqian Fu.
"V^2-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence." ArXiv (2025). [paper] [code] [2025.11]
Futian Wang, Mengqi Wang, Xiao Wang, Haowen Wang, Jin Tang.
"SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning." ArXiv (2025). [paper] [code] [2025.11]
ReSAM: M. Naseer Subhani.
"ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images." ArXiv (2025). [paper] [2025.11]
SATA: Tianlu Zhang, Qiang Zhang, Guiguang Ding, Jungong Han.
"Tracking and Segmenting Anything in Any Modality." AAAI (2026). [paper]
VESSA: Jiaqi Guo, Mingzhen Li, Hanyu Su, Santiago López, Lexiaozi Fan, Daniel Kim, Aggelos Katsaggelos.
"Vision-Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation." ArXiv (2025). [paper] [2025.11]
DRIFT: Youngseo Kim, Dohyun Kim, Geohee Han, Paul Hongsuck Seo.
"Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos." ArXiv (2025). [paper] [2025.11]
SPROUT: Wen Zhang, Qin Ren, Wenjing Liu, Haibin Ling, Chenyu You.
"Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting." ArXiv (2025). [paper] [code] [2025.11]
SAM-MI: Lin Chen, Yingjian Zhu, Qi Yang, Xin Niu, Kun Ding, Shiming Xiang.
"SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM." ArXiv (2025). [paper] [2025.11]
BoxPromptIML: Zhiqing Guo, Dongdong Xi, Songlin Li, Gaobo Yang.
"From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations." AAAI (2026). [paper] [code] [2025.11]
Keith Moore.
"Not Quite Anything: Overcoming SAMs Limitations for 3D Medical Imaging." AIAS (2025). [paper] [2025.11]
MedSAM3: Anglin Liu, Rundong Xue, Xu R. Cao, Yifan Shen, Yi Lu, Xiang Li, Qianqian Chen, Jintai Chen.
"MedSAM3: Delving into Segment Anything with Medical Concepts." ArXiv (2025). [paper] [code] [2025.11]
Ref-SAM3D: Yun Zhou, Yaoting Wang, Guangquan Jie, Jinyu Liu, Henghui Ding.
"Ref-SAM3D: Bridging SAM3D with Text for Reference 3D Reconstruction." ArXiv (2025). [paper] [code] [2025.11]
SAM3-Adapter: Tianrun Chen, Runlong Cao, Xinda Yu, Lanyun Zhu, Chaotao Ding, Deyi Ji, Cheng Chen, Qi Zhu, Chunyan Xu, Papa Mao, Ying Zang.
"SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation." ArXiv (2025). [paper] [code] [2025.11]
SCALER: Chunming He, Rihan Zhang, Longxiang Tang, Ziyun Yang, Kai Li, Deng-Ping Fan, Sina Farsiu.
"SCALER: SAM-Enhanced Collaborative Learning for Label-Deficient Concealed Object Segmentation." ArXiv (2025). [paper] [2025.11]
SatSAM2: Tianrun Chen, Runlong Cao, Xinda Yu, Lanyun Zhu, Chaotao Ding, Deyi Ji, Cheng Chen, Qi Zhu, Chunyan Xu, Papa Mao, Ying Zang.
"SatSAM2: Motion-Constrained Video Object Tracking in Satellite Imagery using Promptable SAM2 and Kalman Priors." ArXiv (2025). [paper] [code] [2025.11]
DEAP-3DSAM: Fangda Chen, Jintao Tang, Pancheng Wang, Ting Wang, Shasha Li, Ting Deng.
"DEAP-3DSAM: Decoder Enhanced and Auto Prompt SAM for 3D Medical Image Segmentation." BIBM (2025). [paper] [2025.11]
Grc-SAM: Qiyang Yu, Yu Fang, Tianrui Li, Xuemei Cao, Yan Chen, Jianghao Li, Fan Min, Yi Zhang.
"Granular Computing-driven SAM: From Coarse-to-Fine Guidance for Prompt-Free Segmentation." ArXiv (2025). [paper] [2025.11]
AGE-VLM: Shweta Mahajan, Hoang Le, Hyojin Park, Farzad Farhadzadeh, Munawar Hayat, Fatih Porikli.
"Attention Guided Alignment in Efficient Vision-Language Models." NeurIPS workshop (2025). [paper] [2025.11]
CellFMCount: Abdurahman Ali Mohammed, Catherine Fonder, Ying Wei, Wallapak Tavanapong, Donald S Sakaguchi, Qi Li, Surya K. Mallapragada.
"CellFMCount: A Fluorescence Microscopy Dataset, Benchmark, and Methods for Cell Counting." ICDM (2025). [paper] [2025.11]
CA-SAM: Jiayi Wang, Wei Dai, Haoyu Wang, Sihan Yang, Haixia Bi, Jian Sun.
"Continual Alignment for SAM: Rethinking Foundation Models for Medical Image Segmentation in Continual Learning." ArXiv (2025). [paper] [code] [2025.11]
SVG360: Mengnan Jiang, Zhaolin Sun, Christian Franke, Michele Franco Adesso, Antonio Haas, Grace Li Zhang.
"SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG." ArXiv (2025). [paper] [2025.11]
CoroSAM: Michela Ferrari and Mario Urtis and Edoardo Spairani and Antonio Tescari and Francesca Sessa and Maurizia Grasso and Francesco Prati and Eloisa Arbustini and Giovanni Magenes.
"CoroSAM: adaptation of the Segment Anything Model for interactive segmentation in Coronary angiograms." Computer Methods and Programs in Biomedicine (2025). [paper] [2025.11]
OBIFlo-SAM: Wang, S., Zhang, X., Li, B. et al.
"OBIFlo-SAM: multi-task semantic recognition and segmentation of oracle bone inscription." npj Herit. Sci (2025). [paper] [2025.11]
Shokri, MohammadJavad and Yao, Yuchong and Desai, Nandakishor and Rao, Aravinda S. and Sharobeam, Angelos and Yan, Bernard and Palaniswami, Marimuthu.
"Zero-shot Stroke Lesion Segmentation via CAM-guided Prompting of MedSAM2." CIKM (2025). [paper] [2025.11]
Nguyen, XT. et al.
"Can Frequency Filtering Approximate CNNs for Enhancing Segment Anything?." ICCCI (2025). [paper] [2025.11]
VG-SAM: Dai, Gang, Qingfeng Wang, Yutao Qin, Gang Wei, and Shuangping Huang.
"VG-SAM: Visual In-Context Guided SAM for Universal Medical Image Segmentation." Fractal and Fractional (2025). [paper] [2025.11]
SCSU-Net: Yadong Li and Jun Tian and Yang Chen and Hongdong Wang and Dong Han and Hui Yan and Danyun Zhang.
"SCSU-Net: Skip connections and SAM-based U-Net for precise segmentation in underground coal mine locomotive safety systems." ESWA (2025). [paper] [2025.11]
Mine-SAM: Yuxiang Wang and Ke You and Yutian Jiang and Shuai Hu and Zhangang Wu and Cheng Zhou.
"Mine hazardous obstacle segmentation for automated bulldozer with segment anything model." Engineering Applications of Artificial Intelligence (2025). [paper] [2025.11]
Krupa Chary Pasunoori Rajendra Prasad Ch , * Open Modal iD and Raj Kumar K.
"Self-Prompting Hybrid YOLOv12-SAM 2 Model for MRI Brain Tumour Segmentation in Real Time." The Open Public Health Journal (2025). [paper] [2025.11]
SCMamba-SAM: Wu, Yifeng and cai, yaxin and Zhang, Xiaodong and Huang, Huimin and Huang, Yawen and Jiang, Jiaxuan and Wu, Hongtao and Sun, Wenfang and Wu, Xian and Hu, Qingmao and Zheng, Yefeng and Xu, Jinping.
"SCMamba-SAM: Transferring Knowledge from Natural Images to Medical Tasks." ArXiv (2025). [paper] [2025.11]
Seg-R1: Zuyao You.
"Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning." NeurIPS workshop (2025). [paper] [code] [2025.11]
Earth2Ocean: Bingyu Li, Tao Huo, Da Zhang, Zhiyuan Zhao, Junyu Gao, Xuelong Li.
"Exploring the Underwater World Segmentation without Extra Training." ArXiv (2025). [paper] [code] [2025.11]
LBMS-SAM: Yu Qi and Jun Zhang and Jian Kuang and Tingting Ren and Dong Wang and Zhuanti Wu and Hao Zheng and Qiaqia Zhang.
"LBMS-SAM: Segment Anything Model Guided SEM Image Segmentation for Lithium Battery Materials." Neural Networks (2025). [paper] [2025.11]
Daniil Mikhailenko, Denis Nikoshin, Nataliya N. Matveeva, Alexander A. Sovetsky, Peter A. Chizhov, Alexander L. Matveyev, Vladimir Y. Zaitsev, and Lev A. Matveev.
"Revealing tissue macro-structures by clustering of SAM-obtained foundational meso-segments from OCT attenuation maps." Optics in Health Care and Biomedical Optics (2025). [paper] [2025.11]
MS-SAM-LESS: Federica {Proietto Salanitri} and Giovanni Bellitto and Salvatore Calcagno and Ulas Bagci and Concetto Spampinato and Manuela Pennisi.
"SAM-guided prompt learning for Multiple Sclerosis lesion segmentation." Pattern Recognition Letters (2025). [paper] [code] [2025.11]
PV-SAM: Chaoyang Song and Jinxia Zhang and Shixiong Fang and Liping Chen.
"A photovoltaic defect segmentation framework integrating domain knowledge and fine-tuned SAM." Solar Energy (2025). [paper] [2025.11]
VideoSeg-R1: Zishan Xu, Yifu Guo, Yuquan Lu, Fengyu Yang, Junxin Li.
"VideoSeg-R1:Reasoning Video Object Segmentation via Reinforcement Learning." ArXiv (2025). [paper] [code] [2025.11]
SAM2S: Haofeng Liu, Ziyue Wang, Sudhanshu Mishra, Mingqi Gao, Guanyi Qin, Chang Han Low, Alex Y. W. Kong, Yueming Jin.
"SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking." ArXiv (2025). [paper] [code] [2025.11]
EfficientSAM3: Chengxi Zeng, Yuxuan Jiang, Aaron Zhang.
"EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3." ArXiv (2025). [paper] [code] [2025.11]
UniUltra: Yue Li, Qing Xu, Yixuan Zhang, Xiangjian He, Qian Zhang, Yuan Yao, Fiseha B. Tesem, Xin Chen, Ruili Wang, Zhen Chen, Chang Wen Chen.
"UniUltra: Interactive Parameter-Efficient SAM2 for Universal Ultrasound Segmentation." TMM (2025). [paper] [code] [2025.11]
Click2Graph: Raphael Ruschel, Hardikkumar Prajapati, Awsafur Rahman, B. S. Manjunath.
"Click2Graph: Interactive Panoptic Video Scene Graphs from a Single Click." ArXiv (2025). [paper] [2025.11]
SA-FARI: Dante Francisco Wasmuht, Otto Brookes, Maximillian Schall, Pablo Palencia, Chris Beirne, Tilo Burghardt, Majid Mirmehdi, Hjalmar Kühl, Mimi Arandjelovic, Sam Pottie, Peter Bermant, Brandon Asheim, Yi Jin Toh, Adam Elzinga, Jason Holmberg, Andrew Whitworth, Eleanor Flatt, Laura Gustafson, Chaitanya Ryali, Yuan-Ting Hu, Baishan Guo, Andrew Westbury, Kate Saenko, Didac Suris.
"The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification." ArXiv (2025). [paper] [code] [2025.11]
Xabier Lekunberri, Ahmad Kamal, Izaro Goienetxea, Jon Ruiz, Iñaki Quincoces, Jaime Valls Miro, Ignacio Arganda-Carreras, Jose A. Fernandes-Salvador.
"Deep Learning for Accurate Vision-based Catch Composition in Tropical Tuna Purse Seiners." ArXiv (2025). [paper] [2025.11]
CS3: Kranti Kumar Parida, Omar Emara, Hazel Doughty, Dima Damen.
"Segmenting Collision Sound Sources in Egocentric Videos." ArXiv (2025). [paper] [code] [2025.11]
SAM-Fed: Sahar Nasirihaghighi, Negin Ghamsarian, Yiping Li, Marcel Breeuwer, Raphael Sznitman, Klaus Schoeffmann.
"SAM-Fed: SAM-Guided Federated Semi-Supervised Learning for Medical Image Segmentation." ArXiv (2025). [paper] [2025.11]
WeSTAR: Yan Huang, Yongyi Su, Xin Lin, Le Zhang, Xun Xu.
"Enhancing Generalization of Depth Estimation Foundation Model via Weakly-Supervised Adaptation with Regularization." AAAI (2026). [paper] [2025.11]
SAAS: Hengrui Hu, Kaining Ying, Henghui Ding.
"Segment Anything Across Shots: A Method and Benchmark." AAAI (2026). [paper] [code] [2025.11]
FGNet: Zhenghua Li, Hang Chen, Zihao Sun, Kai Li, Xiaolin Hu.
"FGNet: Leveraging Feature-Guided Attention to Refine SAM2 for 3D EM Neuron Segmentation." AAAI (2026). [paper] [2025.11]
UnSAMv2: Junwei Yu, Trevor Darrell, XuDong Wang.
"UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity." ArXiv (2025). [paper] [code] [2025.11]
DelAnyFlow: Mykola Lavreniuk, Nataliia Kussul, Andrii Shelestov, Yevhenii Salii, Volodymyr Kuzin, Sergii Skakun, Zoltan Szantoi.
"Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source." ArXiv (2025). [paper] [2025.11]
LithoSeg: Xinyu He, Botong Zhao, Bingbing Li, Shujing Lyu, Jiwei Shen, Yue Lu.
"LithoSeg: A Coarse-to-Fine Framework for High-Precision Lithography Segmentation." ArXiv (2025). [paper] [2025.11]
SAM-DAQ: Jia Lin, Xiaofei Zhou, Jiyuan Liu, Runmin Cong, Guodao Zhang, Zhi Liu, Jiyong Zhang.
"SAM-DAQ: Segment Anything Model with Depth-guided Adaptive Queries for RGB-D Video Salient Object Detection." AAAI (2026). [paper] [code] [2025.11]
SAT3D: Himashi Peiris, Sizhe Wang, Gary Egan, Mehrtash Harandi, Meng Law, Zhaolin Chen.
"Segment Any Tumour: An Uncertainty-Aware Vision Foundation Model for Whole-Body Analysis." ArXiv (2025). [paper] [2025.11]
Cascade HQP-DETR: Zhiyuan Chen, Yuelin Guo, Zitong Huang, Haoyu He, Renhao Lu, Weizhe Zhang.
"High-Quality Proposal Encoding and Cascade Denoising for Imaginary Supervised Object Detection." ArXiv (2025). [paper] [2025.11]
Mehmet Batuhan Duman, Alejandro Carnero, Cristian Martín, Daniel Garrido, Manuel Díaz.
"Foam Segmentation in Wastewater Treatment Plants: A Federated Learning Approach with Segment Anything Model 2." ArXiv (2025). [paper] [code] [2025.11]
NOVO: Kyung-Yoon Yoon, Yeong-Jun Cho.
"NOVO: Bridging LLaVA and SAM with Visual-only Prompts for Reasoning Segmentation." ArXiv (2025). [paper] [code] [2025.11]
Xing Yao, Ahana Gangopadhyay, Hsi-Ming Chang, Ravi Soni.
"Towards Better Ultrasound Video Segmentation Foundation Model: An Empirical study on SAM2 Finetuning from Data Perspective." ArXiv (2025). [paper] [2025.11]
U(PM)^2: Chang Li, Xingtao Peng.
"U(PM)^2: Unsupervised polygon matching with pre-trained models for challenging stereo images." ArXiv (2025). [paper] [2025.11]
4D3R: Mengqi Guo, Bo Xu, Yanyan Li, Gim Hee Lee.
"4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos." NeurIPS (2025). [paper] [2025.11]
SaliencyCLIP-SAM: Ying Yuan, Yingying Zhang, Shuai Zhang & Hongjuan Wang.
"SaliencyCLIP-SAM: Bridging Text and Image Towards Text-Driven Salient Object Detection." ICIG (2025). [paper] [2025.11]
UltraSAM: Tao Jiang et al.
"UltraSAM: A foundational medical ultrasound segmentation model with limited training data." Expert Systems With Applications (2025). [paper] [2025.11]
Liu, Zongxian, Chen Chen, Huibao Huang, Jiankang Chen, Pengtao Zhang, and Jianghan Xue.
"SAM-Based Approach for Automated Fabric Anisotropy Quantification in Concrete Aggregates." Sensors (2025). [paper] [2025.11]
FM-SAM: Haohua Que et al.
"FM-SAM: individual tree crown delineation and classification based on Segmentation Anything Model (SAM) and YOLOv10 in UAV imagery for forest monitoring." Computers and Electronics in Agriculture (2025). [paper] [2025.11]

vAlahmari, Saeed S., Michael R. Gardner, and Tawfiq Salem.
"Zero-Shot SAM for Food Image Segmentation." Electronics (2025). [paper] [2025.11]

MSDP-SAM2-UNet: Liu, Shuai, Cong Zhang, and Zheng Wang.
"MSDP-SAM2-UNet: A Novel Multi-Scale and Dual-Path Model for Wheat Leaf Disease Segmentation Based on SAM2-UNet." Applied Sciences (2025). [paper] [2025.11]
SAMConvFormer+LLM: Muhammad Owais et al.
"SAMConvFormer+LLM: Exploring synergistic fusion of Segment Anything Model with joint convolutional transformer and large language model to advance dense agricultural crop analysis." Computers and Electronics in Agriculture (2025). [paper] [code] [2025.11]
HideAndSeg: Alan de Aguiar, Michaella Pereira Andrade, Charles Morphy D. Santos, João Paulo Gois.
"HideAndSeg: an AI-based tool with automated prompting for octopus segmentation in natural habitats." ArXiv (2025). [paper] [2025.11]
SRFT-GaLore: Yun-Chen Lin, Jiayuan Huang, Hanyuan Zhang, Sergi Kavtaradze, Matthew J. Clarkson, Mobarak I. Hoque.
"Subsampled Randomized Fourier GaLore for Adapting Foundation Models in Depth-Driven Liver Landmark Segmentation." ArXiv (2025). [paper] [2025.11]
SAM2-Animal-Tracking: Jan Frederik Meier, Timo Lüddecke.
"Zero-Shot Multi-Animal Tracking in the Wild." ArXiv (2025). [paper] [code] [2025.11]
Jiajia Li, Keyi Zhu, Qianwen Zhang, Dong Chen, Qi Sun, Zhaojian Li.
"Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping." ArXiv (2025). [paper] [2025.11]
CenterMamba-SAM: Yu Tian, Zhongheng Yang, Chenshi Liu, Yiyun Su, Ziwei Hong, Zexi Gong, Jingyuan Xu.
"CenterMamba-SAM: Center-Prioritized Scanning and Temporal Prototypes for Brain Lesion Segmentation." ArXiv (2025). [paper] [2025.11]
MIQ-SAM3D: Jierui Qu, Jianchun Zhao.
"MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement." ArXiv (2025). [paper] [2025.11]
VesSAM: Suzhong Fu, Rui Sun, Xuan Ding, Jingqi Dong, Yiming Yang, Yao Zhu, Min Chang Jordan Ren, Delin Deng, Angelica Aviles-Rivero, Shuguang Cui, Zhen Li.
"VesSAM: Efficient Multi-Prompting for Segmenting Complex Vessel." ArXiv (2025). [paper] [2025.11]
SpinalSAM-R1: Jiaming Liu, Dingwei Fan, Junyong Zhao, Chunlin Li, Haipeng Si, Liang Sun.
"SpinalSAM-R1: A Vision-Language Multimodal Interactive System for Spine CT Segmentation." ArXiv (2025). [paper] [code] [2025.11]
MapSAM2: Xue Xia, Randall Balestriero, Tao Zhang, Yixin Zhou, Andrew Ding, Dev Saini, Lorenz Hurni.
"MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series." ArXiv (2025). [paper] [2025.10]
AD-SAM: Mario Camarena, Het Patel, Fatemeh Nazari, Evangelos Papalexakis, Mohamadhossein Noruzoliaee, Jia Chen.
"AD-SAM: Fine-Tuning the Segment Anything Vision Foundation Model for Autonomous Driving Perception." ArXiv (2025). [paper] [2025.10]
SANSA: Claudia Cuttano, Gabriele Trivigno, Giuseppe Averta, Carlo Masone.
"SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation." NeurIPS (2025). [paper] [code]
SAM-RNet: Yanliang Ge, Yuxi Zhong, Qiao Zhang, Hongbo Bi and Tian-Zhu Xiang.
"Weakly-supervised Camouflaged Object Detection via SAM-guided Resolution Iteration Learning." IEEE Transactions on Big Data (2025). [paper] [code] [2025.10]
CHD: Chi, Hanyang and Liu, Mengyu and Wang, Jin and Gao, Xuru and Luo, Guixun and Zhang, Bingfeng and Liu, Weifeng.
"Cross-Hierarchical Decoding with SAM for Semi-Supervised Medical Image Segmentation." TCSVT (2025). [paper] [2025.10]
Cabral, Rafael, Ricardo Santos, José A. F. O. Correia, and Diogo Ribeiro.
"A Hybrid YOLO and Segment Anything Model Pipeline for Multi-Damage Segmentation in UAV Inspection Imagery." Sensors (2025). [paper] [2025.10]
SemFusion: Ting Li, Songtao Li, Shuaifeng Li, Xiaolin Qin, Maoyuan Zhao, Luping Ji, Mao Ye.
"SAM-Guided Semantic Knowledge Fusion for Visible-Infrared Object Detection." ACM MM (2025). [paper] [code] [2025.10]
SAMVSR: Hongtao Wu, Yifeng Wu, Jiaxuan Jiang, Chengyu Wu, Hong Wang, Yefeng Zheng.
"SAMVSR: Leveraging Semantic Priors to Zone-Focused Mamba for Video Snow Removal." ACM MM (2025). [paper] [2025.10]
CoMed-SAM: Kim, Minkyu and Ryu, Kanghyun and Han, Yoseob.
"CoMed-SAM: Collaborative Medical SAM for Multi-Modality Image Segmentation." IEEE Access (2025). [paper] [code] [2025.10]
Javier Rodriguez-Sanchez et al.
"Aerial Imagery and Segment Anything Model for Architectural Trait Phenotyping to Support Genetic Analysis in Peanut Breeding." Plant Phenomics (2025). [paper] [2025.10]
DIG: Jiawei Wang et al.
"An interactive framework integrating segment anything model and structure-from-motion for three-dimensional discontinuity identification in rock masses." International Journal of Mining Science and Technology (2025). [paper] [2025.10]
SAM-EM: Alexander Wang, Max Xu, Risha Goel, Zain Shabeeb, Isabel Panicker, Vida Jamali.
"SAM-EM: Real-Time Segmentation for Automated Liquid Phase Transmission Electron Microscopy." NeurIPS Workshop (2025). [paper] [2025.10]
SHAP: Yaojin Jiang, Tianyuan Liu, Jinsong Bao.
"What are the eigen visual features for penetration state recognition?." Expert Systems with Applications (2025). [paper] [2025.10]
Prompt-SAM: Uma Gurav & Sanket Jadhav .
"Prompt-SAM: A Vision-Language and SAM based Hybrid Framework for Prompt-Augmented Zero-Shot Segmentation." Human-Centric Intelligent Systems (2025). [paper] [2025.10]
ETU-SAM: Huang, Bin and Liu, Zhong and Liu, Jingming and Wen, Huiying and Chen, Xin and Huang, Bingsheng and Li, Shuo.
"ETU-SAM: Efficient and Transparent Uncertainty Estimation for Segment Anything Model in Ultrasound Segmentation." ArXiv (2025). [paper] [2025.10]
Giuseppe Martino, Niccolò Camarlinghi, Antonio Di Tommaso, Benedetto Michelozzi, Giacomo Fontanelli, Andrea Masini, Marco Cococcioni.
"From bounding boxes to semantic segmentation: leveraging SAM for weak supervision in remote sensing." Artificial Intelligence for Security and Defence Applications (2025). [paper] [2025.10]
DRF-YOLOv8n-SAM: Li, Youlin and Yang, Yang and He, Hongjie and He, Sha and Luo, Jiqing and Peng, Xin and Peng, Xiaowei and He, Jianghai and Zhong, Fengcheng.
"Spatial-aware pipeline occupancy monitoring: A dual-stage collaborative framework integrating UAV dynamic pose with DRF-YOLOv8n-SAM." ArXiv (2025). [paper] [code] [2025.10]
Qiang Fan, Yue Yang, Bo Lei.
"Beyond manual labeling: integrating multimodal foundation models with SAM for scalable data curation." AOPC (2025). [paper] [2025.10]
SAMRI: Zhao Wang, Wei Dai, Thuy Thanh Dao, Steffen Bollmann, Hongfu Sun, Craig Engstrom, Shekhar S. Chandra.
"SAMRI: Segment Anything Model for MRI." ArXiv (2025). [paper] [code] [2025.10]
Valentin Boussot, Cédric Hémon, Jean-Claude Nunes, Jean-Louis Dillenseger.
"Fine-tuning Segment Anything for Real-Time Tumor Tracking in Cine-MRI." ArXiv (2025). [paper] [code] [2025.10]
M-SAM: Hossein R. Nowdeh, Jie Ji, Xiaolong Ma, Fatemeh Afghah.
"Modality-Aware SAM: Sharpness-Aware-Minimization Driven Gradient Modulation for Harmonized Multimodal Learning." NeurIPS (2025). [paper] [2025.10]
UAP-SAM2: Ziqi Zhou, Yifan Hu, Yufei Song, Zijing Li, Shengshan Hu, Leo Yu Zhang, Dezhong Yao, Long Zheng, Hai Jin.
"Vanish into Thin Air: Cross-prompt Universal Adversarial Attacks for SAM2." NeurIPS (2025). [paper] [code] [2025.10]
FastSAM-Splat: Anthony Opipari, Aravindhan K Krishnan, Shreekant Gayaka, Min Sun, Cheng-Hao Kuo, Arnie Sen, Odest Chadwicke Jenkins.
"Explicit Memory through Online 3D Gaussian Splatting Improves Class-Agnostic Video Segmentation." IEEE Robotics and Automation Letters (2025). [paper] [code] [2025.10]
Marouane Tliba, Mohamed Amine Kerkouri, Yassine Nasser, Nour Aburaed, Aladine Chetouani, Ulas Bagci, Rachid Jennane.
"Morphology-Aware KOA Classification: Integrating Graph Priors with Vision Models." ArXiv (2025). [paper] [2025.10]
ProFSAM: Emmanuel U. Ugwu, Zhang Xinming.
"Promptable Fire Segmentation: Unleashing SAM2's Potential for Real-Time Mobile Deployment with Strategic Bounding Box Guidance." ICIGP (2026). [paper] [code] [2025.10]
I-SAM-YOLOv5: Jun Tang and Dan Li and Jiawei Yang and Jing Chen and Ruiping Yuan.
"Leveraging large visual models for enhanced object detection: An improved SAM-YOLOv5 model." Knowledge-Based Systems (2025). [paper] [2025.10]
O-MaMa: Lorenzo Mur-Labadia, Maria Santos-Villafranca, Jesus Bermudez-Cameo, Alejandro Perez-Yus, Ruben Martinez-Cantin, Jose J. Guerrero.
"O-MaMa: Learning Object Mask Matching between Egocentric and Exocentric Views." ICCV (2025). [paper] [code] [2025.10]
RISE: Ji Du, Xin Wang, Fangwei Hao, Mingyang Yu, Chunyuan Chen, Jiesheng Wu, Bin Wang, Jing Xu, Ping Li.
"Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection." ICCV (2025). [paper] [code] [2025.10]
SDFormer: Yujie Xue, Huilong Pi, Jiapeng Zhang, Yunchuan Qin, Zhuo Tang, Kenli Li, Ruihui Li.
"SDFormer: Vision-based 3D Semantic Scene Completion via SAM-assisted Dual-channel Voxel Transformer." ICCV (2025). [paper] [2025.10]
Li Yi, Jie Hu, Songan Zhang, Guannan Jiang.
"Adapt Foundational Segmentation Models with Heterogeneous Searching Space." ICCV (2025). [paper] [2025.10]
SAMora: Shuhang Chen, Hangjie Yuan, Pengwei Liu, Hanxue Gu, Tao Feng, Dong Ni.
"SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images." ICCV (2025). [paper] [code] [2025.10]
MedSAM-CL: Athira Kalladayil Shibu, Sriprabha Ramanarayanan, Vinoth Kanna, Jaikishan Jayakumar, Keerthi Ram, Mohanasankar Sivaprakasam.
"MedSAM-Guided Curriculum Learning for White Matter Tract Segmentation in Block Face Imaging of Fetal Brain." ICCVW (2025). [paper] [2025.10]
Ryota Nakai, Kazuhiro Hotta.
"Unsupervised Nuclei Segmentation by Improving Pseudo Labels from Segment Anything Model." ICCVW (2025). [paper] [2025.10]
UR-SAM: Yichi Zhang, Shiyao Hu, Le Xue, Sijie Ren, Zixin Hu, Yuan Cheng, Yuan Qi.
"Enhancing the Reliability of Auto-Prompting SAM for Medical Image Segmentation with Uncertainty Estimation and Rectification." ICCVW (2025). [paper] [2025.10]
SAM-SPJunc:: Minasadat Attari, Kannappan Palaniappan, Filiz Bunyak.
"SAM-SPJunc: Self-Prompting for Junction Detection in Retinal Images via Radius-Based Representations." ICCVW (2025). [paper] [2025.10]
Mayolo Valencia Mendoza, Alexei Skurikhin, Judith Cohn, Luther Mcdonald, Kari Sentz.
"SAM- and mSAM- Based Inference of Nuclear Materials Processing History from SEM Imagery." ICCVW (2025). [paper] [2025.10]
BioDet: Jiaqi Hu, Hongli Xu, Junwen Huang, Peter KT Yu, Slobodan Ilic, Benjamin Busam.
"BioDet: Boosting Industrial Object Detection with Image Preprocessing Strategies." ICCVW (2025). [paper] [code] [2025.10]
IC-MoE: Xinwei Zhang, Hu Chen, Zhe Yuan, Sukun Tian, Peng Feng.
"Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model." ArXiv (2025). [paper] [2025.10]
MSSAM: Wang, Dandan and Wang, Zhichao and Zhao, Xiaoming and Chen, Qi and Chen, Qiuyue and Zhang, Shiqing and Lu, Hongsheng.
"MSSAM: A Robust Multi-scale Adaptation of Segment Anything Model for General Medical Image Segmentation." ArXiv (2025). [paper] [2025.10]
Valentin Boussot, Cédric Hémon, Jean-Claude Nunes, Jean-Louis Dillenseger.
"Why Registration Quality Matters: Enhancing sCT Synthesis with IMPACT-Based Registration." ArXiv (2025). [paper] [code] [2025.10]
APSAM: Xingzheng Wang, Shaoyong Wu, Jianbin Wu, Jiahui Li.
"APSAM: Adaptive Progressive Learning for Segment Anything Model in anomaly detection." Image and Vision Computing (2025). [paper] [2025.10]
TAP-v2: Ting Pan, Lulu Tang, Xinlong Wang, Xin Liu & Shiguang Shan.
"Consistent multimodal pre-training for visual tokenization." SCIENCE CHINA Information Sciences (2025). [paper] [2025.10]
HyperET: Zelin Peng, Zhengqin Xu, Qingyang Liu, Xiaokang Yang, Wei Shen.
"HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models." NeurIPS (2025). [paper] [2025.10]
PartNeXt: Penghao Wang, Yiyang He, Xin Lv, Yukai Zhou, Lan Xu, Jingyi Yu, Jiayuan Gu.
"PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding." NeurIPS (2025). [paper] [code] [2025.10]
DecAF: Su Ho Han, Jeongseok Hyun, Pilhyeon Lee, Minho Shim, Dongyoon Wee, Seon Joo Kim.
"Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation." ArXiv (2025). [paper] [project] [code] [2025.10]
SAM 2++: Jiaming Zhang, Cheng Liang, Yichun Yang, Chenkai Zeng, Yutao Cui, Xinwen Zhang, Xin Zhou, Kai Ma, Gangshan Wu, Limin Wang.
"SAM 2++: Tracking Anything at Any Granularity." ArXiv (2025). [paper] [code] [2025.10]
EMA-SAM: Maryam Dialameh, Hossein Rajabzadeh, Jung Suk Sim, Hyock Ju Kwon.
"EMA-SAM: Exponential Moving-average for SAM-based PTMC Segmentation." ArXiv (2025). [paper] [code] [2025.10]
AttSAM: Lan, Lixiang and Yang, Yang and Zhao, Guangyu and Li, Yifeng and Liu, Wanting and Wang, Jikui.
"AttSAM: Attention-Augmented Segment Anything Model for Accurate Polyp Segmentation." WRC SARA (2025). [paper] [2025.10]
Akhila Kambhatla, Ahmed R Khaled.
"Beyond RGB: Leveraging Vision Transformers for Thermal Weapon Segmentation." ArXiv (2025). [paper] [2025.10]
UCIS-SAM: Chuhong Wang, Hua Li, Chongyi Li, Huazhong Liu, Xiongxin Tang, Sam Kwong.
"Expose Camouflage in the Water: Underwater Camouflaged Instance Segmentation and Dataset." ArXiv (2025). [paper] [code] [2025.10]
Semantic-E2VID: Jingqian Wu, Shengpeng Xu, Yunbo Jia, Edmund Y. Lam.
"Exploring The Missing Semantics In Event Modality." ArXiv (2025). [paper] [2025.10]
M Saifuzzaman Rafat, Mohd Ruhul Ameen, Akif Islam, Abu Saleh Musa Miah, Jungpil Shin.
"From Pixels to People: Satellite-Based Mapping and Quantification of Riverbank Erosion and Lost Villages in Bangladesh." ArXiv (2025). [paper] [2025.10]
Masoud Khairi Atani, Alon Harell, Hyomin Choi, Runyu Yang, Fabien Racape, Ivan V. Bajic.
"How Universal Are SAM2 Features?." IEEE PCS (2025). [paper] [2025.10]
Memory-SAM: Joongwon Chae, Lihui Luo, Xi Yuan, Dongmei Yu, Zhenglin Chen, Lian Zhang, Peiwu Qin.
"Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-Prompt." ArXiv (2025). [paper] [code] [2025.10]
MViT-AE: Gerard Comas-Quiles, Carles Garcia-Cabrera, Julia Dietlmeier, Noel E. O'Connor, Ferran Marques.
"Towards Label-Free Brain Tumor Segmentation: Unsupervised Learning with Multimodal MRI." ArXiv (2025). [paper] [2025.10]
UA-EPT: Lei Shi, Gang Li, Junxing Zhang.
"Uncertainty-Aware Extreme Point Tracing for WeaklySupervised Ultrasound Image Segmentation." ArXiv (2025). [paper] [2025.10]
CIGOcc: Rongtao Xu, Jinzhou Lin, Jialei Zhou, Jiahua Dong, Changwei Wang, Ruisheng Wang, Li Guo, Shibiao Xu, Xiaodan Liang.
"Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion." ArXiv (2025). [paper] [code] [2025.10]
Simon Ravé, Jean-Christophe Lombardo, Pejman Rasti, Alexis Joly, David Rousseau.
"Unlocking Zero-Shot Plant Segmentation with Pl@ntNet Intelligence." ArXiv (2025). [paper] [2025.10]
CurriFlow: Jinzhou Lin, Jie Zhou, Wenhao Xu, Rongtao Xu, Changwei Wang, Shunpeng Chen, Kexue Fu, Yihua Shao, Li Guo, Shibiao Xu.
"CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion." ArXiv (2025). [paper] [2025.10]
SAM2LoRA: Sayan Mandal, Divyadarshini Karthikeyan, Manas Paldhe.
"SAM2LoRA: Composite Loss-Guided, Parameter-Efficient Finetuning of SAM2 for Retinal Fundus Segmentation." ICMLA (2025). [paper] [code] [2025.10]
LM-EEC: Yijun Hu, Bing Fan, Xin Gu, Haiqing Ren, Dongfang Liu, Heng Fan, Libo Zhang.
"Robust Ego-Exo Correspondence with Long-Term Memory." NeurIPS (2025). [paper] [code] [2025.10]
SNAP: Aniket Gupta, Hanhui Wang, Charles Saunders, Aruni RoyChowdhury, Hanumant Singh, Huaizu Jiang.
"SNAP: Towards Segmenting Anything in Any Point Cloud." ArXiv (2025). [paper] [code] [2025.10]
SparseUWSeg: César Borja, Carlos Plou, Rubén Martinez-Cantín, Ana C. Murillo.
"SparseUWSeg: Active Sparse Point-Label Augmentation for Underwater Semantic Segmentation." ArXiv (2025). [paper] [2025.10]
CLIP-SAM: Yanning Hou, Ke Xu, Junfa Li, Yanran Ruan, Jianfeng Qiu.
"Enhancing Zero-Shot Anomaly Detection: CLIP-SAM Collaboration with Cascaded Prompts." PRCV (2025). [paper] [2025.10]
HuLiRAG: Suyang Xi, Chenxi Yang, Hong Ding, Yiqing Ni, Catherine C. Liu, Yunhao Liu, Chengqi Zhang.
"Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs." ArXiv (2025). [paper] [2025.10]
J-RAS: Salma J. Ahmed, Emad A. Mohammed, Azam Asilian Bidgoli.
"J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training." ArXiv (2025). [paper] [2025.10]
SAM2-3dMed: Yeqing Yang, Le Xu, Lixia Tian.
"SAM2-3dMed: Empowering SAM2 for 3D Medical Image Segmentation." ArXiv (2025). [paper] [2025.10]
Parhom Esmaeili, Virginia Fernandez, Pedro Borges, Eli Gibson, Sebastien Ourselin, M. Jorge Cardoso.
"A methodology for clinically driven interactive segmentation evaluation." MICCAI (2025). [paper] [2025.10]
ORIC340: Yupeng Du, Qiang Qu, Cong Zhang, Xu Wang, Guowen Kuang, Mengna Wen, Man Liu, Jinfeng Yang, Fengyu Liang, and Panpan Yuan.
"Oral Radiological Indices Calculation Based on SAM." Engineering Letters (2025). [paper] [2025.10]
Akshat Vashisht et al.
"Effective Segmentation of Grape Leaves Using Segment Anything Model 2." ArXiv (2025). [paper] [2025.10]
SSRA: Shaohui Jing et al.
"SSRA: Semantic Segmentation-Guided Region-Attention Colorization Method." LGRS (2025). [paper] [2025.10]
DenSiSeg: Yichao Cao et al.
"Refining the granularity of smoke representation: SAM-powered density-aware progressive smoke segmentation framework." Pattern Recognition (2025). [paper] [code] [2025.10]
CUGD: Zeng, Xiangrui and Zhu, Lingyu and Yang, Wenhan and Leung, Howard and Wang, Shiqi and Kwong, Sam.
"Low-Light Image Enhancement via Diffusion Models with Semantic Priors of Any Region." TCSVT (2025). [paper] [[code](https://github.com/lingyzhu0101/Diffusion Image Enhancement.git)] [2025.10]
SPSIS: Wang, Yinda and Pan, Yaozhong and Lei, Hao and Jin, Decai and Chen, Jiahui.
"SPSIS: Single-Point Supervised Instance Segmentation for Remote Sensing." JSTARS (2025). [paper] [2025.10]
Chai, M., Huang, R., Zeng, Z., Chang, R., Tan, W., Feng, S.
"Potential of an adapting segment anything model (SAM) for automatically extracting supraglacial lakes from satellite imagery over the Greenland ice sheet." International Journal of Digital Earth (2025). [paper] [2025.10]
SAMMed-VR: Vahid Pooryousef et al.
"SAMMed-VR: Integrated Segment Anything Model in Virtual Reality for Supervised Brain Tumour Segmentation." ArXiv (2025). [paper] [2025.10]
USD: Wang, Jin and Zhang, Bingfeng and Pang, Jian and Liu, Weifeng and Liu, Baodi and Chen, Honglong.
"Unbiased Semantic Decoding With Vision Foundation Models for Few-Shot Segmentation." TNNLS (2025). [paper] [2025.10]
Hong-Deok Seo;Eui-Myoung Kim.
"Building Segmentation Using Multiprompts and Fine-tuned Segment Anything Model 2." Sensors & Materials (2025). [paper] [2025.10]
ZSPose: Sheng Yu et al.
"ZSPose: Instance-Level Zero-Shot Object Pose Estimation with Segment Anything Model." IEEE TASE (2025). [paper] [2025.10]
Fauzia Aristalindra et al.
"Penerapan SAM-Geo untuk Delineasi Otomatis Batas Bidang Tanah Pertanian pada Ortofoto." ArXiv (2025). [paper] [2025.10]
Agus Ambarwari et al.
"Domain-Aware Transfer Learning with SAM-Assisted Mask R-CNN for Urban Tree Crown Delineation from UAV Orthophotos." IEEE Access (2025). [paper] [2025.10]
ESAM2-BLS: Lishuang Guo, Haonan Zhang, Chenbin Ma.
"ESAM2-BLS: Enhanced Segment Anything Model 2 for Efficient Breast Lesion Segmentation in Ultrasound Imaging." Computerized Medical Imaging and Graphics (2025). [paper] [2025.10]
OSLSM: Wei Y, Guo Z, Li C, Li W, Wang S.
"SAM-Based Few-Shot Learning for Coastal Vegetation Segmentation in UAV Imagery via Cross-Matching and Self-Matching." Remote Sens (2025). [paper] [2025.10]
Space-SAM: Yizhuo Zhao, Hongwei Yang.
"Space-SAM: On-Orbit Real-Time Semantic Segmentation for Space Imaging Objects." Advances in Space Research (2025). [paper] [2025.10]
Su, Y., Tan, S., Zhang, Y. et al.
"Universal forged image detection and localization via self-supervised data generation and large-scale model adaptation." Multimedia Systems (2025). [paper] [2025.10]
Cynthia Baseman, Yingtian Shi, Zikang Leng, Yaqi Liu, Gabriel Santamarina, Marcos C. Schechter, Maya Fayfman, Thomas Ploetz, Rosa I. Arriaga.
"Towards More Equitable Ulcer Recognition Models: A Dataset of Naturalistic Foot Images from People of Color Living with Diabetes." IEEE BHI (2025). [paper] [2025.10]
Ranit Karmakar , William V. Trim , Marc Kirschner, Simon F. Nørrelykke.
"Kidney Tissue Characterization using Normalized Raman Imaging and Segment-Anything." ArXiv (2025). [paper] [2025.10]
SwinMas: Mohammed Lawal, Dewei Yi.
"SwinMas: Shifted Windows and Mask Unit Attention with auxiliary supervision for medical image segmentation." Biomedical Signal Processing and Control (2025). [paper] [code] [2025.10]
Tanveer, Jawad.
"Draft Version of Refining Early Breast Cancer Detection Accuracy: Leveraging a Large Vision-Language Model Framework in Medical Imaging for Enhanced Precision." ArXiv (2025). [paper] [2025.10]
Ayhan Can Erdur et al.
"Independent Benchmarking of Prompt-Based Medical Segmentation Models." ArXiv (2025). [paper] [2025.10]
nnSAM2: Zhongyi Zhang, Julie A. Hides, Enrico De Martino, Abdul Joseph Fofanah, Gervase Tuxworth.
"nnSAM2: nnUNet-Enhanced One-Prompt SAM2 for Few-shot Multi-Modality Segmentation and Composition Analysis of Lumbar Paraspinal Muscles." ArXiv (2025). [paper] [2025.10]
SegMASt3R: Rohit Jayanti, Swayam Agrawal, Vansh Garg, Siddharth Tourani, Muhammad Haris Khan, Sourav Garg, Madhava Krishna.
"SegMASt3R: Geometry Grounded Segment Matching." NeurIPS (2025). [paper] [code] [2025.10]
Med-K2N: Feng Yuan, Yifan Gao, Yuehua Ye, Haoyue Li, Xin Gao.
"Med-K2N: Flexible K-to-N Modality Translation for Medical Image Synthesis." ArXiv (2025). [paper] [code] [2025.10]
YOLOv10+MobileSAM: Lyes Saad Saoud, Loic Lesobre, Enrico Sorato, Irfan Hussain.
"Real-Time Threaded Houbara Detection and Segmentation for Wildlife Conservation using Mobile Platforms." ArXiv (2025). [paper] [code] [2025.10]
UGround: Rui Qian, Xin Yin, Chuanhang Deng, Zhiyuan Peng, Jian Xiong, Wei Zhai, Dejing Dou.
"UGround: Towards Unified Visual Grounding with Unrolled Transformers." ArXiv (2025). [paper] [code] [2025.10]
Günel Aghakishiyeva, Jiayi Zhou, Saagar Arya, James David Poling, Holly R. Houliston, Jamie N. Womble, David W. Johnston, Brinnae Bent.
"Photorealistic Inpainting for Perturbation-based Explanations in Ecological Monitoring." NeurIPS Workshop (2025). [paper] [2025.10]
SAMSOD: Zhengyi Liu, Xinrui Wang, Xianyong Fang, Zhengzheng Tu, Linbo Wang.
"SAMSOD: Rethinking SAM Optimization for RGB-T Salient Object Detection." TMM (2025). [paper] [2025.10]
Runchen Wang, Junlin Guo, Siqi Lu, Ruining Deng, Zhengyi Lu, Yanfan Zhu, Yuechen Yang, Chongyu Qu, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo.
"Evaluating New AI Cell Foundation Models on Challenging Kidney Pathology Cases Unaddressed by Previous Foundation Models." ArXiv (2025). [paper] [2025.10]
Woowon Jang, Jiwon Im, Juseung Choi, Niki Rashidian, Wesley De Neve, Utku Ozbulak.
"When Tracking Fails: Analyzing Failure Modes of SAM2 for Point-Based Tracking in Surgical Videos." MICCAI Workshop (2025). [paper] [2025.10]
SAM2-CD: Qin, Yuan and Wang, Chaoting and Fan, Yuanyuan and Pan, Chanling.
"SAM2-CD: Remote Sensing Image Change Detection With SAM2." JSTARS (2025). [paper] [2025.09]
SGAMFNet: Liu, Yun and You, Zhi-Hui and Chen, Si-Bao and Wang, Xiao and Xu, Li-Xiang and Tang, Jin and Luo, Bin.
"SAM-Guided Attention Maps Fusion for Region Supervised Remote Sensing Image Change Detection." JSTARS (2025). [paper] [2025.09]
CHSAM: Zhang, JY., Zhang, H., Yin, F.
"CHSAM: Efficient Scene Text Segmentation via SAM with Convolutional Adapters and Hierarchical Decoding." ICDAR (2025). [paper] [2025.09]
TextSAM-LoRA: Carlos de la Fuente, Adrián Sánchez-Hernández & Jorge Calvo-Zaragoza.
"TextSAM-LoRA: Efficient Fine-Tuning of Segment Anything Model for Text Detection with Low-Rank Adaptation." ICDAR (2025). [paper] [2025.09]
SAMirror: Qiushi Meng, Yunxun Liu, Ran Hu, Mengmeng Liang, Jixue Yan & Lei Zhu.
"SAMirror: enhancing mirror detection via integrated visual-depth cues in segment anything model." The Visual Computer (2025). [paper] [2025.09]
Swapnil Biswas, Syed Muhammad Mahdi Raza, Thien Huu Nguyen, Robert LeAnder, Scott E. Umbaugh.
"Advancing skin lesion classification: the role of SAM-based segmentation in enhancing convolutional neural network performance." ArXiv (2025). [paper] [2025.09]
SAMONAIfbs: Shijie Huang; Kaicong Sun; Kai Zhang; Lingnan Kong; Fangmei Zhu; Zhongxiang Ding.
"An Integrated Automatic Framework for Super-Resolution Reconstruction of Motion-Corrupted Fetal Brain MRI with Prior Anatomical Knowledge." TBME (2025). [paper] [2025.09]
Zhenhui Jin et al.
"Non-contact evaluation of liquid transport behavior in sportswear textiles: a SAM-enhanced infrared thermography approach." International Journal of Clothing Science and Technology (2025). [paper] [2025.09]
Thuan Ha et al.
"Field boundary delineation with seasonal Sentinel 2 imagery using Segment-Anything Model (SAM)." ArXiv (2025). [paper] [2025.09]
SAM-SPL: Yimin Fu et al.
"A Unified SAM-Guided Self-Prompt Learning Framework for Infrared Small Target Detection." TGRS (2025). [paper] [code] [2025.09]
Yichen Ren.
"Exploration of lightweight SAM + CLIP: harnessing the power of advanced image understanding and segmentation." CVIT (2025). [paper] [2025.09]
CT-PromptSAM: Xin Wang; Bei Li; Li Li.
"CT-PromptSAM: Collaborative Prompting with Hybrid CNN-Transformer for Remote Sensing Semantic Segmentation." LGRS (2025). [paper] [code] [2025.09]
GastricSAM: Dan Chen, Hongpeng Yuan.
"SAM-guided feature enhancement network for precise gastric pathology segmentation." MVAID (2025). [paper] [2025.09]
UM-SAM: Jia Fu, He Li, Tao Lu, Shaoting Zhang & Guotai Wang.
"UM-SAM: Unsupervised Medical Image Segmentation Using Knowledge Distillation from Segment Anything Model." MICCAI (2025). [paper] [2025.09]
HA-SAM: Zihao Peng, Susu Kang, Xuping Huang, Xucheng Xiang, Gengyu He, Tianzhu Liu, Wei Mei & Shan Tan.
"HA-SAM: Hierarchically Adapting SAM for Nerve Segmentation in Ultrasound Images." MICCAI (2025). [paper] [2025.09]
AM-SAM: Cuong M. Pham, Phi Le Nguyen, Thanh Trung Nguyen, Vu Minh Hieu Phan & Binh P. Nguyen.
"Unleashing SAM for Few-Shot Medical Image Segmentation with Dual-Encoder and Automated Prompting." MICCAI (2025). [paper] [2025.09]
SA-Net: Huaqiang Su, Zaiyi Liu, Lisha Yao, Sunyun Li, Hun Lin, Guoliang Chen, Xin Chen, Haijun Lei & Baiying Lei.
"Sparsely Annotated Medical Image Segmentation via Cross-SAM of 3D and 2D Networks." MICCAI (2025). [paper] [code] [2025.09]
OralSAM: Logiraj Kumaralingam, Anparasy Sivaanpu, Manh-Hai Hoang, Javaneh Alavi, Kim-Cuong T. Nguyen, Kumaradevan Punithakumar, Edmond H. M. Lou, Paul Major & Lawrence H. Le.
"OralSAM: One-Shot Segmentation for Intraoral Ultrasound Videos with Adaptive Feature Correlation and Self-prompting Strategy." MICCAI (2025). [paper] [code] [2025.09]
FluoroSAM: Benjamin D. Killeen, Liam J. Wang, Blanca Iñígo, Han Zhang, Mehran Armand, Russell H. Taylor, Greg Osgood & Mathias Unberath.
"FluoroSAM: A Language-Promptable Foundation Model for Flexible X-Ray Image Segmentation." MICCAI (2025). [paper] [code] [2025.09]
MoE-SAM: Ruocheng Li, Lei Wu, Jingjun Gu, Qi Xu, Wanyi Chen, Xiaoxu Cai & Jiajun Bu.
"MoE-SAM: Enhancing SAM for Medical Image Segmentation with Mixture-of-Experts." MICCAI (2025). [paper] [code] [2025.09]
SUGFW: Xiaochuan Ma, Jia Fu, Lanfeng Zhong, Ning Zhu & Guotai Wang.
"SUGFW: A SAM-Based Uncertainty-Guided Feature Weighting Framework for Cold Start Active Learning." MICCAI (2025). [paper] [code] [2025.09]
Seg-Quality-Control: Yujia Li, Tao Zhou, Ruixuan Wang, Shuo Wang & Yizhe Zhang.
"Unsupervised Quality Control and Enhancement of Polyp Segmentation in Colonoscopy Videos Using Spatiotemporal Consistency ." MICCAI (2025). [paper] [code] [2025.09]
SR-SAM: Xixi Jiang, Chen Yang, Liang Zhang, Tim Kwang-Ting Cheng & Xin Yang.
"SR-SAM: Subspace Regularization for Domain Generalization of Segment Anything Model." MICCAI (2025). [paper] [code] [2025.09]
TGSAM-2: Runtian Yuan, Ling Zhou, Jilan Xu, Qingqiu Li, Mohan Chen, Yuejie Zhang, Rui Feng, Tao Zhang & Shang Gao.
"TGSAM-2: Text-Guided Medical Image Segmentation Using Segment Anything Model 2." MICCAI (2025). [paper] [2025.09]
FDAS: Xiaoran Qi, Guoning Zhang, Jianghao Wu, Shaoting Zhang, Xiaorong Hou & Guotai Wang.
"FDAS: Foundation Model Distillation and Anatomic Structure-Aware Multi-task Learning for Self-Supervised Medical Image Segmentation." MICCAI (2025). [paper] [code] [2025.09]
CD-PolypNet: Changpeng Yue, Jianxiang Zhao, Chao Wang, Xinglun Zhao, Axiu Mao, Jia Hou, Chenggang Yan, Kai Zhao & Shuai Wang.
"CD-PolypNet: Cross-Domain Polyp Segmentation Network with Internal Feature Distillation and Dual-Stream Boundary Focus via Large Vision Model." ArXiv (2025). [paper] [code] [2025.09]
GA-SAM: Shumeng Li, Jian Zhang, Lei Qi & Yinghuan Shi.
"GA-SAM: Geometry-Aware SAM Adaptation with Sparse Annotation-Driven Point Cloud Completion." MICCAI (2025). [paper] [code] [2025.09]
Nora: Zhikai Wei, Chao Wu, Hanyu Du, Rui Yu, Bo Du & Yongchao Xu.
"Noise-Robust Tuning of SAM for Domain Generalized Ultrasound Image Segmentation." MICCAI (2025). [paper] [code] [2025.09]
ESF-SAM: Ziqiang Wang, Zhiyu Hou, Danping Cao.
"Enhancing SAM-based digital rock image segmentation via edge-semantics fusion." Applied Computing and Geosciences (2025). [paper] [2025.09]
CA-SAM2: Hanbin Huang, Hongliang He, Liying Xu, Xudong Zhu, Siwei Feng, Guohong Fu.
"CA-SAM2: SAM2-Based Context-Aware Network with Auto-prompting for Nuclei Instance Segmentation." ArXiv (2025). [paper] [code] [2025.09]
e-CNN-GRU-SAM: Santosh Bisoyi, Amit Kumar Rathi & Swarup Mahato.
"Fault diagnosis of rolling bearing failures using a multi-stage e-CNN-GRU-SAM network." Scientific Reports (2025). [paper] [2025.09]
HalF-SAM: Mayank Golhar, Luojie Huang, Nicholas J. Durr.
"HalF-SAM: SAM-Based Haustral Fold Detection in Colonoscopy with Debris Suppression and Temporal Consistency." MICCAI (2025). [paper] [2025.09]
KSAM: Hengyuan Zhang, Peng Qiao, Wenyu Li, Yan Jia, Yong Dou.
"Knowledge Bridges the Intent Gap: Contextual Fusion in Medical Fine-Grained Segmentation." MICCAI (2025). [paper] [2025.09]
TemSAM: Liang Zhang, Xixi Jiang, Xiaohuan Ding, Zihang Huang, Tianyu Zhao, Xin Yang.
"TemSAM: Temporal-Aware Segment Anything Model for Cerebrovascular Segmentation in Digital Subtraction Angiography Sequences." MICCAI (2025). [paper] [code] [2025.09]
Zhang, X., Ali, S., Kang, Y. et al.
"Liver mask-guided SAM-enhanced dual-decoder network for landmark segmentation in AR-guided surgery." Int J CARS (2025). [paper] [2025.09]
TongueSAM_Lite: Qunsheng Ruan, Shan Cao, Zhirong Luo.
"A lightweight segmentation model based on Segment Anything Model for tongue image segmentation." Engineering Applications of Artificial Intelligence (2025). [paper] [code] [2025.09]
M. Franaszek, P. Piliptchak, P. Rachakonda, and K. S. Saidi.
"Investigating the Ambiguity of SAM when Applied to Depth and RGB Images." SEIA (2025). [paper] [2025.09]
AdaptVFMs-RSCD: Wandong Jiang, Yuli Sun, Lin Lei, Gangyao Kuang, Kefeng Ji.
"AdaptVFMs-RSCD: Advancing Remote Sensing Change Detection from binary to semantic with SAM and CLIP." ISPRS Journal of Photogrammetry and Remote Sensing (2025). [paper] [2025.09]
GC-SAM: Lanlan Li, Chongyang Wang, Yi Geng et al.
"Segment Anything Model for Gastric Cancer." Cancer Medicine (2025). [paper] [2025.09]
BiSTC-SAM: Minghao Wang et al.
"A segment anything model for transesophageal echocardiography based on bidirectional spatiotemporal context fusion." Information Fusion (2025). [paper] [2025.09]
Embed-MedSAM: Zhang, Y., Ye, F., Yu, X. et al.
"Embedded framework for clinical medical image segment anything in resource limited healthcare regions." npj Digit. Med (2025). [paper] [2025.09]
IPLC+: Zhang, Guoning and Qi, Xiaoran and Wu, Jianghao and Yan, Bo and Wang, Guotai.
"IPLC+: SAM-Guided Iterative Pseudo Label Correction for Source-Free Domain Adaptation in Medical Image Segmentation." JBHI (2025). [paper] [code] [2025.09]
Zhirong Li, Fuzhong Bai, Yue Liu, Xiaojuan Gao and Zhaoxin Xu.
"Binocular 3D reconstruction for wood ice cream sticks via speckle projection and SAM2 mask extraction." Engineering Research Express (2025). [paper] [2025.09]
PlaneSAM Zhongchen Deng et al.
"Multimodal plane instance segmentation with the Segment Anything Model." Automation in Construction (2025). [paper] [2025.09]
S-T-Simi: Cao, Yuan and He, Shuyi and Wang, Feng and Su, Shuai and Sun, Yongkui.
"A Large-Model-Enhanced Method for Rail Surface Defect Detection in Heavy-Haul Railway." TITS (2025). [paper] [2025.09]
H-fusion SEG: El�Shafai, W., Ali, A.M., Alzaben, N. et al.
"H-fusion SEG: dual-branch hyper-attention fusion network with SAM integration for robust skin disease segmentation." Scientific Reports (2025). [paper] [code] [2025.09]
Sihang Zhang, Zhi Yang, Yupeng Li, Chang Liu, Boyan Jia, Hongliang Liu.
"Flood detection for satellite and UAV remote sensing based on fine-tuned SAM." RSGPA (2025). [paper] [2025.09]
SAMAug: Lorenzo Carisi, Francesco Chiereghin, Carlo Fantozzi and Loris Nanni.
"SAM-Based Input Augmentations and Ensemble Strategies for Image Segmentation." Information (2025). [paper] [2025.09]
CGFSeg: Tingmin Li, Yixuan Li, Yang Yang.
"The 1st Solution for MOSEv1 Challenge on LSVOS 2025: CGFSeg." ArXiv (2025). [paper] [2025.09]
Point2RBox-v3: Teng Zhang, Ziqian Fan, Mingxin Liu, Xin Zhang, Xudong Lu, Wentong Li, Yue Zhou, Yi Yu, Xiang Li, Junchi Yan, Xue Yang.
"Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization." ArXiv (2025). [paper] [code] [2025.09]
EasyOcc: Seamie Hayes, Ganesh Sistu, Ciarán Eising.
"EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models." ArXiv (2025). [paper] [2025.09]
DSGA: Xintong Jiang, Yixue Liu, Mohamed Debbagh, Yu Tian, Valerio Hoyos-Villegas, Viacheslav Adamchuk, Shangpeng Sun.
"Adapting SAM with Dynamic Similarity Graphs for Few-Shot Parameter-Efficient Small Dense Object Detection: A Case Study of Chickpea Pods in Field Conditions." ArXiv (2025). [paper] [2025.09]
SeC (ZS): Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang.
"2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation via SeC." ArXiv (2025). [paper] [2025.09]
CORE-3D: Mohamad Amin Mirzaei, Pantea Amoie, Ali Ekhterachian, Matin Mirzababaei.
"CORE-3D: Context-aware Open-vocabulary Retrieval by Embeddings in 3D." ArXiv (2025). [paper] [2025.09]
BALR-SAM: Zelin Liu, Sicheng Dong, Bocheng Li, Yixuan Yang, Jiacheng Ruan, Chenxu Zhou, Suncheng Xiang.
"BALR-SAM: Boundary-Aware Low-Rank Adaptation of SAM for Resource-Efficient Medical Image Segmentation." ArXiv (2025). [paper] [2025.09]
CalSAM: Behraj Khan, Tahir Qasim Syed.
"Confidence-Calibrating Regularization for Robust Brain MRI Segmentation Under Domain Shift." ArXiv (2025). [paper] [2025.09]
RefAM: Anna Kukleva, Enis Simsar, Alessio Tonioni, Muhammad Ferjad Naeem, Federico Tombari, Jan Eric Lenssen, Bernt Schiele.
"RefAM: Attention Magnets for Zero-Shot Referral Segmentation." ArXiv (2025). [paper] [code] [2025.09]
RAU: Yiwei Li, Yikang Liu, Jiaqi Guo, Lin Zhao, Zheyuan Zhang, Xiao Chen, Boris Mailhe, Ankush Mukherjee, Terrence Chen, Shanhui Sun.
"RAU: Reference-based Anatomical Understanding with Vision Language Models." ArXiv (2025). [paper] [2025.09]
LG-CD: Yixiao Liu, Yizhou Yang, Jinwen Li, Jun Tao, Ruoyu Li, Xiangkun Wang, Min Zhu, Junlong Cheng.
"LG-CD: Enhancing Language-Guided Change Detection through SAM2 Adaptation." ArXiv (2025). [paper] [2025.09]
PartSAM: Zhe Zhu, Le Wan, Rui Xu, Yiheng Zhang, Honghua Chen, Zhiyang Dou, Cheng Lin, Yuan Liu, Mingqiang Wei.
"PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data." ArXiv (2025). [paper] [2025.09]
CubistMerge: Wenyi Gong, Mieszko Lis.
"CubistMerge: Spatial-Preserving Token Merging For Diverse ViT Backbones." ArXiv (2025). [paper] [2025.09]
Aleksa Jelaca, Ying Jiao, Chang Tian, Marie-Francine Moens.
"Automated Prompt Generation for Creative and Counterfactual Text-to-image Synthesis." ArXiv (2025). [paper] [2025.09]
KG-SAM: Yu Li, Da Chang, Xi Xiao.
"KG-SAM: Injecting Anatomical Knowledge into Segment Anything Models via Conditional Random Fields." ArXiv (2025). [paper] [2025.09]
Opt-SynSet: Chao Zhang and Lars Christian Gansel and Marc Bracke and Ricardo da Silva Torres.
"An image synthesis framework for enhanced salmon louse larvae (Lepeophtheirus Salmonis) detection in complex seawater conditions." Computers and Electronics in Agriculture (2025). [paper] [code] [2025.09]
Eleftherios Papadopoulos, Yagmur Güçlütürk.
"Interactive Semantic Segmentation for Phosphene Vision Neuroprosthetics." ArXiv (2025). [paper] [2025.09]
OSDA: Siyi Chen, Kai Wang, Weicong Pang, Ruiming Yang, Ziru Chen, Renjun Gao, Alexis Kai Hon Lau, Dasa Gu, Chenchen Zhang, Cheng Li.
"OSDA: A Framework for Open-Set Discovery and Automatic Interpretation of Land-cover in Remote Sensing Imagery." ArXiv (2025). [paper] [code] [2025.09]
Prompt-DAS: Jiabao Chen, Shan Xiong, Jialin Peng.
"Prompt-DAS: Annotation-Efficient Prompt Learning for Domain Adaptive Semantic Segmentation of Electron Microscopy Images." MICCAI (2025). [paper] [2025.09]
MOIS-SAM2: Georgii Kolokolnikov, Marie-Lena Schmalhofer, Sophie Götz, Lennart Well, Said Farschtschi, Victor-Felix Mautner, Inka Ristow, Rene Werner.
"MOIS-SAM2: Exemplar-based Segment Anything Model 2 for multilesion interactive segmentation of neurobromas in whole-body MRI." ArXiv (2025). [paper] [2025.09]
Ilhan Skender, Kailin Tong, Selim Solmaz, Daniel Watzenig.
"Investigating Traffic Accident Detection Using Multimodal Large Language Models." IAVVC (2025). [paper] [2025.09]
Ioannis Sarafis, Alexandros Papadopoulos, Anastasios Delopoulos.
"Weakly Supervised Food Image Segmentation using Vision Transformers and Segment Anything Model." ArXiv (2025). [paper] [2025.09]
PPD: Xueyu Liu, Xiaoyi Zhang, Guangze Shi, Meilin Liu, Yexin Lai, Yongfei Wu, Mingqiang Wei.
"Attack for Defense: Adversarial Agents for Point Prompt Optimization Empowering Segment Anything Model." ArXiv (2025). [paper] [2025.09]
HyPSAM: Ruichao Hou, Xingyuan Li, Tongwei Ren, Dongming Zhou, Gangshan Wu, Jinde Cao.
"HyPSAM: Hybrid Prompt-driven Segment Anything Model for RGB-Thermal Salient Object Detection." ArXiv (2025). [paper] [code] [2025.09]
STMFSAM: Tu, Z., Zong, L., Jiang, B., Wang, H., Wang, K., Zhang, C.
"Spatial-Temporal Memory Filtering SAM for Lesion Segmentation in Breast Ultrasound Videos." MICCAI (2025). [paper] [code] [2025.09]
SaSaSa2VA: Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji.
"The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA." ICCVW (2025). [paper] [code] [2025.09]
Ran Hong, Feng Lu, Leilei Cao, An Yan, Youhai Jiang, Fengjie Zhu.
"Enhancing Sa2VA for Referent Video Object Segmentation: 2nd Solution for 7th LSVOS RVOS Track." ICCVW (2025). [paper] [2025.09]
SAMSON: Yujie Xie, Hongyang Zhang, Zhihui Liu, Shihai Ruan.
"SAMSON: 3rd Place Solution of LSVOS 2025 VOS Challenge." ICCVW (2025). [paper] [2025.09]
Mingqi Gao, Jingkun Chen, Yunqi Miao, Gengshen Wu, Zhijin Qin, Jungong Han.
"The 1st Solution for MOSEv2 Challenge 2025: Long-term and Concept-aware Video Segmentation via SeC." ICCVW (2025). [paper] [code] [2025.09]
SCOPE: Chang Soo Lim, Joonyoung Moon, Donghyeon Cho.
"Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution." ICCVW (2025). [paper] [code] [2025.09]
Sa2VA-i: Alexey Nekrasov, Ali Athar, Daan de Geus, Alexander Hermans, Bastian Leibe.
"3rd Place Report of LSVOS 2025 MeViS Track: Sa2VA-i: Improving Sa2VA Results with Consistent Training and Inference." ICCVW (2025). [paper] [code] [2025.09]
SimToken: Dian Jin, Yanghao Zhou, Jinxing Zhou, Jiaqi Ma, Ruohao Guo, Dan Guo.
"SimToken: A Simple Baseline for Referring Audio-Visual Segmentation." ArXiv (2025). [paper] [project] [2025.09]
FTP4RM: Yongliang Wang, Hamidreza Kasaei.
"Fast Trajectory Planner with a Reinforcement Learning-based Controller for Robotic Manipulators." ArXiv (2025). [paper] [project] [code] [2025.09]
MirrorSAM2: Mingchen Xu, Yukun Lai, Ze Ji, Jing Wu.
"MirrorSAM2: Segment Mirror in Videos with Depth Perception." ArXiv (2025). [paper] [2025.09]
SAM-DCE: Yingzhen Hu, Yiheng Zhong, Ruobing Li, Yingxue Su, Jiabao An, Feilong Tang, Jionglong Su, Imran Razzak.
"SAM-DCE: Addressing Token Uniformity and Semantic Over-Smoothing in Medical Segmentation." ArXiv (2025). [paper] [2025.09]
RangeSAM: Paul Julius Kühn, Duc Anh Nguyen, Arjan Kuijper, Holger Graf, Dieter Fellner, Saptarshi Neil Sinha.
"RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation." ArXiv (2025). [paper] [2025.09]
ENSAM: Elias Stenhede, Agnar Martin Bjørnstad, Arian Ranjbar.
"ENSAM: an efficient foundation model for interactive segmentation of 3D medical images." ArXiv (2025). [paper] [2025.09]
TASAM: Tianyang Wang, Xi Xiao, Gaofei Chen, Hanzhang Chi, Qi Zhang, Guo Cheng, Yingrui Ji.
"TASAM: Terrain-and-Aware Segment Anything Model for Temporal-Scale Remote Sensing Segmentation." ArXiv (2025). [paper] [2025.09]
FloorSAM: Han Ye, Haofu Wang, Yunchi Zhang, Jiangjian Xiao, Yuqiang Jin, Jinyuan Liu, Wen-An Zhang, Uladzislau Sychou, Alexander Tuzikov, Vladislav Sobolevskii, Valerii Zakharov, Boris Sokolov, Minglei Fu.
"FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion." ArXiv (2025). [paper] [code] [2025.09]
pFedSAM: Tong Wang, Xingyue Zhao, Linghao Zhuang, Haoyu Zhao, Jiayi Yin, Yuyang He, Gang Yu, Bo Lin.
"pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation." ArXiv (2025). [paper] [2025.09]
ORB: Jinkai Qiu, Yungjun Kim, Gaurav Sethia, Tanmay Agarwal, Siddharth Ghodasara, Zackory Erickson, Jeffrey Ichnowski.
"ORB: Operating Room Bot, Automating Operating Room Logistics through Mobile Manipulation." IEEE CASE (2025). [paper] [2025.09]
DUR-Net+: Qin, Chuanbo and Chen, Zhuyuan and Wang, Dong and Zheng, Bin and Luo, Jun and Zeng, Junying and Jia, Xudong and Wen, Jin and Hu, Maoqing and Zhai, Yikui and Coscia, Pasquale and Genovese, Angelo.
"DUR-Net+: Semi-Supervised Abdominal CT Pheochromocytoma Segmentation Via Dynamic Uncertainty Rectified and Prior Knowledge From SAM-Med3D." JBHI (2025). [paper] [2025.09]
Ramón A. Mollineda and Karel Becerra and Boris Mederos.
"Sex classification from hand X-ray images in pediatric patients: How zero-shot Segment Anything Model (SAM) can improve medical image analysis." Computers in Biology and Medicine (2025). [paper] [2025.09]
SAM2MS: Zhang, Pengnian, Junxiang Li, Chenggang Wang, and Yifeng Niu.
"SAM2MS: An Efficient Framework for HRSI Road Extraction Powered by SAM2." Remote Sensing (2025). [paper] [2025.09]
BenchPRISM: Chengze Li et al.
"BenchPRISM: Benchmarking Physical Relationship Understanding In Segmentation Models." ArXiv (2025). [paper] [2025.09]
Komei Ryu et al.
"Enhancing Segment Anything Model (SAM) for Brain Tumor Image Segmentation." ArXiv (2025). [paper] [2025.09]
YOLOSAM: Yu, R., Chen, W., Fan, J. et al.
"YOLOSAM: A unified and efficient anomaly detection model based on auto mask prompt." Signal, Image and Video Processing (2025). [paper] [2025.09]
An Yan, Leilei Cao, Feng Lu, Ran Hong, Youhai Jiang, Fengjie Zhu.
"Pseudo-Label Enhanced Cascaded Framework: 2nd Technical Report for LSVOS 2025 VOS Track." ArXiv (2025). [paper] [2025.09]
SAM4SAM: Jovana Videnovic, Matej Kristan, Alan Lukezic.
"Distractor-Aware Memory-Based Visual Object Tracking." ArXiv (2025). [paper] [code] [2025.09]
Jeongwoo Park, Seabin Lee, Changmin Park, Wonjong Lee, Changjoo Nam.
"Reinforcement Learning for Robotic Insertion of Flexible Cables in Industrial Settings." ArXiv (2025). [paper] [2025.09]
SVP: Xiaobo Yang, Xiaojin Gong.
"Re-purposing SAM into Efficient Visual Projectors for MLLM-Based Referring Image Segmentation." ArXiv (2025). [paper] [2025.09]
SAMIR: Yue He, Min Liu, Qinghao Liu, Jiazheng Wang, Yaonan Wang, Hang Zhang, Xiang Chen.
"SAMIR, an efficient registration framework via robust feature learning from SAM." ArXiv (2025). [paper] [2025.09]
ReCOT: Xiaohan Zhang, Si-Yuan Cao, Xiaokai Bai, Yiming Li, Zhangkai Shen, Zhe Wu, Xiaoxi Hu, Hui-liang Shen.
"Recurrent Cross-View Object Geo-Localization." ArXiv (2025). [paper] [2025.09]
SPAM: Julien Walther, Rémi Giraud, Michaël Clément.
"Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation." ArXiv (2025). [paper] [code] [2025.09]
Seg2Track-SAM2: Diogo Mendonça, Tiago Barros, Cristiano Premebida, Urbano J. Nunes.
"Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization." ArXiv (2025). [paper] [code] [2025.09]
IMD: Ruimin Ma, Sebastian Zudaire, Zhen Li, Chi Zhang.
"IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects." RCAE (2025). [paper] [2025.09]
MAE-SAM2: Xin Xing, Irmak Karaca, Samira Badrloo, Quan Dong Nguyen, Mahadevan Subramaniam.
"MAE-SAM2: Mask Autoencoder-Enhanced SAM2 for Clinical Retinal Vascular Leakage Segmentation." ArXiv (2025). [paper] [2025.09]
ViSTR-GP: Navid Aftabi, Philip Samaha, Jin Ma, Long Cheng, Ramy Harik, Dan Li.
"ViSTR-GP: Online Cyberattack Detection via Vision-to-State Tensor Regression and Gaussian Processes in Automated Robotic Operations." ArXiv (2025). [paper] [2025.09]
SAM-TTT: Zhenni Yu, Li Zhao, Guobao Xiao, Xiaoqin Zhang.
"SAM-TTT: Segment Anything Model via Reverse Parameter Configuration and Test-Time Training for Camouflaged Object Detection." ACM MM (2025). [paper] [code] [2025.09]
FS-SAM2: Bernardo Forni, Gabriele Lombardi, Federico Pozzi, Mirco Planamente.
"FS-SAM2: Adapting Segment Anything Model 2 for Few-Shot Semantic Segmentation via Low-Rank Adaptation." ICIAP (2025). [paper] [code] [2025.09]
EMeRALDS: Hafza Eman, Furqan Shaukat, Muhammad Hamza Zafar, Syed Muhammad Anwar.
"EMeRALDS: Electronic Medical Record Driven Automated Lung Nodule Detection and Classification in Thoracic CT Images." ArXiv (2025). [paper] [2025.09]
Organoid Tracker: Xiaoyu Huang, Lauren M Maxson, Trang Nguyen, Cheng Jack Song, Yuankai Huo.
"Organoid Tracker: A SAM2-Powered Platform for Zero-shot Cyst Analysis in Human Kidney Organoid Videos." ArXiv (2025). [paper] [code] [2025.09]
SAM4CellTracking: Zhu Chen, Mert Edgü, Er Jin, Johannes Stegmaier.
"Segment Anything for Cell Tracking." ArXiv (2025). [paper] [code] [2025.09]
MM SAM-adapter: Iacopo Curti, Pierluigi Zama Ramirez, Alioscia Petrelli, Luigi Di Stefano.
"Multimodal SAM-adapter for Semantic Segmentation." IEEE Access (2025). [paper] [code] [2025.09]
SCOPE: Akkala, Shruthi and Chawada, Tanisha and Dutta, Saikat and Chaudhuri, Subhasis and Banerjee, Biplab.
"SCOPE: Segmenting Common Objects with Prompt-conditioned Encoding and SAM Distillation." ArXiv (2025). [paper] [2025.09]
OrthoSAM: Chan, V., Rheinwalt, A., and Bookhagen, B.
"OrthoSAM: Multi-Scale Extension of the Segment Anything Model for River Pebble Delineation from Large Orthophotos." EGUsphere (2025). [paper] [2025.09]
MEPNet: Jiang, T., Wang, Y., Hou, F. et al.
"Enhancing video salient object detection via SAM-based multimodal energy prompting." Pattern Anal Applic (2025). [paper] [2025.09]
ISA: Fan Huang, Liming Zheng, Haiying Wen, Min Dai & Zhisheng Zhang.
"A novel data augmentation method for few-shot industrial surface defect detection based on segment anything model adapter." J Intell Manuf (2025). [paper] [2025.09]
Lite ENSAM: Agnar Martin Bjørnsta, Elias Stenhede, Arian Ranjbar.
"Lite ENSAM: a lightweight cancer segmentation model for 3D Computed Tomography." MICCAI Workshop (2025). [paper] [2025.09]
EdgeSAM: Chong Zhou, Xiangtai Li, Chen Change Loy, Bo Dai .
"EdgeSAM: Prompt-In-the-Loop Distillation for SAM." IJCV (2025). [paper] [code] [2025.09]
BiSAM-CD: Qin, Yuan and Chen, Jinyun and Wang, Chaoting and Pan, Chanling.
"BiSAM-CD: Zero-Shot Remote Sensing Change Detection via Bidirectional Temporal Memory in SAM2." TGRS (2025). [paper] [code] [2025.09]
SA4L: Zhipeng Jiang, Yanlan Wu, Hui Yang.
"Adapting segment anything model for land cover classification: the SA4L model and its applications in remote sensing." RSTSM (2025). [paper] [2025.09]
MBiTGA: Tang, Ziyi and Luo, Xinyi and Yan, Zijia and Li, Shiyi and Xiao, Sujie and Li, Hao.
"An Automatic Sample Augmentation Method for Paddy Rice Mapping Based on Segment Anything Model and Phenological Features—A Case Study in Southwest China." JSTARS (2025). [paper] [2025.09]
SAMPLE: L. Guan, M. Ge and X. Yuan.
"Enhancing Semi-Supervised Instance Segmentation Through SAM-Driven Pseudo-Label Generation in Autonomous Driving Environment." TITS (2025). [paper] [2025.09]
MSCG: Xinjun Yu and Zhoushan Feng and Xiaohong Wu and Jianqiu Chen and Weidong Chen and Baisheng Li and Huan Kuang.
"Medical SAM-Clip Grafting for brain tumor segmentation." Computers in Biology and Medicine (2025). [paper] [2025.09]
Tijana Geroski and Amir A Amini.
"Optimizing Cardiac MR Image Segmentation: Fine-Tuning the Foundational Segment Anything Model (SAM)." ArXiv (2025). [paper] [2025.09]
Han, KyeongHwan, JaeHyung Lim, Jin-Soo Ahn, and Ki-Sun Lee.
"The Evaluation of a Deep Learning Approach to Automatic Segmentation of Teeth and Shade Guides for Tooth Shade Matching Using the SAM2 Algorithm." Bioengineering (2025). [paper] [2025.09]
EAGT-LG: Sun, Rui and Xiong, Jiahang and Zhu, Jing and Wang, Xueying and Ma, Xibo.
"Edge-Attention Guided Tracking Algorithm with Line Graph and SAM Segmentation for Cell Analysis." ArXiv (2025). [paper] [2025.09]
Zeru Cui, Jianfeng Xue, Shengping Wang.
"Side-scan sonar sea bottom line extraction based on Segment Anything Model 2." CVAR (2025). [paper] [2025.09]
PeftCD: Sijun Dong, Yuxuan Hu, LiBo Wang, Geng Chen, Xiaoliang Meng.
"PeftCD: Leveraging Vision Foundation Models with Parameter-Efficient Fine-Tuning for Remote Sensing Change Detection." ArXiv (2025). [paper] [code] [2025.09]
SAMONAI: Muhammad Alberb, Helen Cheung, Anne Martel.
"Live(r) Die: Predicting Survival in Colorectal Liver Metastasis." ArXiv (2025). [paper] [2025.09]
SLENet: Xinxin Huang, Han Sun, Ningzhong Liu, Huiyu Zhou, Yinan Yao.
"SLENet: A Guidance-Enhanced Network for Underwater Camouflaged Object Detection." PRCV (2025). [paper] [2025.09]
CLAPS: Zhihao Zhao, Yinzheng Zhao, Junjie Yang, Xiangtong Yao, Quanmin Liang, Shahrooz Faghihroohi, Kai Huang, Nassir Navab, M. Ali Nasseri.
"CLAPS: A CLIP-Unified Auto-Prompt Segmentation for Multi-Modal Retinal Imaging." BIBM (2025). [paper] [2025.09]
SAM: Kamyar Barakati, Utkarsh Pratiush, Sheryl L. Sanchez, Aditya Raghavan, Delia J. Milliron, Mahshid Ahmadi, Philip D. Rack, Sergei V. Kalinin.
"SAM*: Task-Adaptive SAM with Physics-Guided Rewards." ArXiv (2025). [paper] [2025.09]
Phongsakon Mark Konrad, Andrei-Alexandru Popa, Yaser Sabzehmeidani, Liang Zhong, Elisa A. Liehn, Serkan Ayvaz.
"Challenges in Deep Learning-Based Small Organ Segmentation: A Benchmarking Perspective for Medical Research with Limited Datasets." ArXiv (2025). [paper] [2025.09]
Phantom-Insight: Hua Zhang, Changjiang Luo, Ruoyu Chen.
"Phantom-Insight: Adaptive Multi-cue Fusion for Video Camouflaged Object Detection with Multimodal LLM." ArXiv (2025). [paper] [2025.09]
P3-SAM: Changfeng Ma, Yang Li, Xinhao Yan, Jiachen Xu, Yunhan Yang, Chunshi Wang, Zibo Zhao, Yanwen Guo, Zhuo Chen, Chunchao Guo.
"P3-SAM: Native 3D Part Segmentation." ArXiv (2025). [paper] [2025.09]
Probabilistic SAM: Tyler Ward, Abdullah Imran.
"A Probabilistic Segment Anything Model for Ambiguity-Aware Medical Image Segmentation." ArXiv (2025). [paper] [code] [2025.09]
UAT-SAM: Dharsan Ravindran, Kevin Wang, Zhuoyuan Cao, Saleh Abdelrahman, Jeffery Wu.
"Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization." ArXiv (2025). [paper] [2025.09]
Allabadi, G., Lucic, A., Wang, YX. et al.
" Learning to Detect Novel Species with SAM in the Wild." IJCV (2025). [paper] [2025.09]
SA3D: Cen, J., Fang, J., Zhou, Z. et al.
"Segment Anything in 3D with Radiance Fields." IJCV (2025). [paper] [code] [2025.09]
VisioFirm: Safouane El Ghazouali, Umberto Michelucci.
"VisioFirm: Cross-Platform AI-assisted Annotation Tool for Computer Vision." ArXiv (2025). [paper] [code] [2025.09]
InfraDiffusion: Yixiong Jing, Cheng Zhang, Haibing Wu, Guangming Wang, Olaf Wysocki, Brian Sheil.
"InfraDiffusion: zero-shot depth map restoration with diffusion models and prompted segmentation from sparse infrastructure point clouds." ArXiv (2025). [paper] [code] [2025.09]
SOPSeg: Chenhao Wang, Yingrui Ji, Yu Meng, Yunjian Zhang, Yao Zhu.
"SOPSeg: Prompt-based Small Object Instance Segmentation in Remote Sensing Imagery." AAAI (2025). [paper] [2025.09]
SegAssess: Bingnan Yang, Mi Zhang, Zhili Zhang, Zhan Zhang, Yuanxin Zhao, Xiangyun Hu, Jianya Gong.
"SegAssess: Panoramic quality mapping for robust and transferable unsupervised segmentation assessment." ArXiv (2025). [paper] [code] [2025.09]
TReF-6: Yuxuan Ding, Shuangge Wang, Tesca Fitzgerald.
"TReF-6: Inferring Task-Relevant Frames from a Single Demonstration for One-Shot Skill Generalization." ArXiv (2025). [paper] [2025.09]
You, Ruixi and Xu, Feng and Liu, Min.
"SAR Aircraft Segmentation With SAR-to-Optical Image Translation and Segment Anything Model." JSTARS (2025). [paper] [2025.07]
Mohammad Marjani, Masoud Mahdianpari, Daniel J. Varon, Fariba Mohammadimanesh.
"The integration of vision transformers and SAM for automated methane super-emitter detection using TROPOMI data." Journal of Environmental Management (2025). [paper] [2025.08]
Abdul Rehman, Ilona Heldal, Jerry Chun-Wei Lin.
"Spatiotemporal EEG-Based Emotion Recognition Using SAM Ratings from Serious Games with Hybrid Deep Learning." ArXiv (2025). [paper] [2025.08]
Kaouther Mouheb, Marawan Elbatel, Janne Papma, Geert Jan Biessels, Jurgen Claassen, Huub Middelkoop, Barbara van Munster, Wiesje van der Flier, Inez Ramakers, Stefan Klein, Esther E. Bron.
"Federated Fine-tuning of SAM-Med3D for MRI-based Dementia Classification." MICCAI Workshop (2025). [paper] [2025.08]
FSA: Zhixiang Chi, Yanan Wu, Li Gu, Huan Liu, Ziqiang Wang, Yang Zhang, Yang Wang, Konstantinos N. Plataniotis.
"Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation." ICCV (2025). [paper] [code] [2025.08]
Amir Jmal, Chaima Chtourou, Mahdi Louati, Abdelaziz Kallel, Houda Khmila.
"Olive Tree Satellite Image Segmentation Based On SAM and Multi-Phase Refinement." ArXiv (2025). [paper] [2025.08]
SPGrasp: Yunpeng Mei, Hongjie Cao, Yinqiu Xia, Wei Xiao, Zhaohan Feng, Gang Wang, Jie Chen.
"SPGrasp: Spatiotemporal Prompt-driven Grasp Synthesis in Dynamic Scenes." ArXiv (2025). [paper] [code] [2025.08]
FreeVPS: Qiang Hu, Ying Zhou, Gepeng Ji, Nick Barnes, Qiang Li, Zhiwei Wang.
"FreeVPS: Repurposing Training-Free SAM2 for Generalizable Video Polyp Segmentation." ArXiv (2025). [paper] [2025.08]
SPLF-SAM: Qiyao Xu, Qiming Wu, Xiaowei Li.
"SPLF-SAM: Self-Prompting Segment Anything Model for Light Field Salient Object Detection." ArXiv (2025). [paper] [code] [2025.08]
Lechun You, Zhonghua Wu, Weide Liu, Xulei Yang, Jun Cheng, Wei Zhou, Bharadwaj Veeravalli, Guosheng Lin.
"Integrating SAM Supervision for 3D Weakly Supervised Point Cloud Segmentation." ArXiv (2025). [paper] [2025.08]
AUSM: Miran Heo, Sukjun Hwang, Min-Hung Chen, Yu-Chiang Frank Wang, Albert Gu, Seon Joo Kim, Ryo Hachiuma.
"Autoregressive Universal Video Segmentation Model." ArXiv (2025). [paper] [2025.08]
Propose-Rectify: Keyang Zhang, Chenqi Kong, Hui Liu, Bo Ding, Xinghao Jiang, Haoliang Li.
"Propose and Rectify: A Forensics-Driven MLLM Framework for Image Manipulation Localization." ArXiv (2025). [paper] [2025.08]
E-BayesSAM: Bin Huang, Zhong Liu, Huiying Wen, Bingsheng Huang, Xin Chen, Shuo Li.
"E-BayesSAM: Efficient Bayesian Adaptation of SAM with Self-Optimizing KAN-Based Interpretation for Uncertainty-Aware Ultrasonic Segmentation." MICCAI (2025). [paper] [code] [2025.08]
QTT-SEG: Breenda Das, Lennart Purucker, Timur Carstensen, Frank Hutter.
"Quickly Tuning Foundation Models for Image Segmentation." AutoML (2025). [paper] [code] [2025.08]
AdvCP: Zhenghui Zhao, Chen Wu, Di Wang, Hongruixuan Chen, Cuiqun Chen, Zhuo Zheng, Bo Du, Liangpei Zhang.
"Advancing Weakly-Supervised Change Detection in Satellite Images via Adversarial Class Prompting." ArXiv (2025). [paper] [code] [2025.08]
DeH4R: Dengxian Gong, Shunping Ji.
"DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction." ArXiv (2025). [paper] [code] [2025.08]
SAMDWICH: Seunghun Lee, Jiwan Seo, Jeonghoon Kim, Siwon Kim, Haeun Yun, Hyogyeong Jeon, Wonhyeok Choi, Jaehoon Jeong, Zane Durante, Sang Hyun Park, Sunghoon Im.
"SAMDWICH: Moment-aware Video-text Alignment for Referring Video Object Segmentation." ArXiv (2025). [paper] [project] [code] [2025.08]
Q-MiniSAM2: Xuanxuan Ren , Xiangyu Li , Kun Wei , Xu Yang , Yanhua Yang.
"Q-MiniSAM2: A Quantization-based Benchmark for Resource-Efficient Video Segmentation." IJCAI (2025). [paper] [2025.08]
ODS-SAM: Chao Huang et al.
"Omni-Dimensional State Space Model-driven SAM for Pixel-level Anomaly Detection." IJCAI (2025). [paper] [2025.08]
DenseSAM: Linyun Zhou et al.
"DenseSAM: Semantic Enhance SAM For Efficient Dense Object Segmentation." IJCAI (2025). [paper] [code] [2025.08]
PG-SAM: Hadee Madadum, Fazal E Nasir, Kanjana Haruehansapong.
"Optimizing Watermelon Leaf Disease Detection Using SAM-based Augmentation with YOLO for Practical Agricultural Solutions." Smart Agricultural Technology (2025). [paper] [2025.08]
ShadowCraft-Nerf: Xun Chen et al.
"ShadowCraft-Nerf: Occlusion and Shadow Mitigation via SAM-Guided Nerf." CASA (2025). [paper] [2025.08]
SAM2Med3D: Ying Chen and Wenjing Cui and Xiaoyan Dong and Shuai Zhou and Zhongqiu Wang.
"SAM2Med3D: Leveraging video foundation models for 3D breast MRI segmentation." Computers & Graphics (2025). [paper] [2025.08]
Huang, X., Long, A., Han, W., Chen, Y., Min, G., & Yan, D.
"A segment anything model-based geological remote sensing interpretation method with a distributed data-parallel deep learning framework." International Journal of Digital Earth (2025). [paper] [2025.08]
MFB-SAC: Xutao Sun et al.
"MFB-SAC: A Multi-Scale Frequency and Boundary-Enhanced SAM for Cell Segmentation." ICIP (2025). [paper] [code] [2025.08]
Self-TrainingSAM: Mauricio Fernandez M. et al.
"SAM 2-Driven Self-Training for Mammogram Segmentation: Zero-Shot Mask Generation Via Pseudo-Video." ICIP (2025). [paper] [code] [2025.08]
PAL-SAM: Christopher Bunn et al.
"Re-Purposing Segment Anything For Skeleton Action Localization." ICIP (2025). [paper] [2025.08]
subCellSAM: Jacob Hanimann, Daniel Siegismund, Mario Wieser, Stephan Steigele.
"subCellSAM: Zero-Shot (Sub-)Cellular Segmentation for Hit Validation in Drug Discovery." GCPR (2025). [paper] [2025.08]
Lang2Lift: Huy Hoang Nguyen, Johannes Huemer, Markus Murschitz, Tobias Glueck, Minh Nhat Vu, Andreas Kugi.
"Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation." ArXiv (2025). [paper] [code] [2025.08]
All-in-SAM: Xueyuan Li, Can Cui, Ruining Deng, Yucheng Tang, Quan Liu, Tianyuan Yao, Shunxing Bao, Naweed Chowdhury, Haichun Yang, Yuankai Huo.
"Fine-grained Multi-class Nuclei Segmentation with Molecular-empowered All-in-SAM Model." Journal of Medical Imaging(2025). [paper] [2025.08]
RAG-SEG: Wutao Liu, YiDan Wang, Pan Gao.
"First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection." ArXiv (2025). [paper] [2025.08]
TOM: Jiacheng Xie, Ziyang Zhang, Biplab Poudel, Congyu Guo, Yang Yu, Guanghui An, Xiaoting Tang, Lening Zhao, Chunhui Xu, Dong Xu.
"TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation." ArXiv (2025). [paper] [code] [2025.08]
EASSA: Wang, Jinghan and Cao, Zhen and Fu, Shiyang and Kang, Zhizhong and Wang, Jingyi.
"A Novel Flexible Architecture Based on SAM for Automatic Exraction of Rampart Craters From Martian High Resolution Images." IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2025). [paper] [2025.08]
UGCA: Haowen Pang, Xiaoming Hong, Peng Zhang & Chuyang Ye.
"Cascaded Diffusion Model and Segment Anything Model for Medical Image Synthesis via Uncertainty-Guided Prompt Generation." IPMI (2025). [paper] [2025.08]
MSAM: Xing, Jiezhen, and Jicong Zhang.
"Segmentation of Brain Tumors Using a Multi-Modal Segment Anything Model (MSAM) with Missing Modality Adaptation." Bioengineering (2025). [paper] [2025.08]
Co2SAM: Liu, Chunmeng and Shen, Yao and Zhou, Haoran and Xiao, Qingguo and Chen, Qiaochuan and Li, Guangyao.
"Co2SAM: Exploring Co-occurrence Challenges With SAM in Weakly Supervised Semantic Segmentation." IEEE Internet of Things Journal (2025). [paper] [code] [2025.08]
CD-SAM: Guowei Zheng and Pengbo Bo and Songhua Xu and Linqin Wang and Zhaoyang Cong and Liangliang Liu and Ziyang Zhao and Caiming Zhang.
"Enhancing Segment Anything Model with spatial context and textural detail for cardiac MRI segmentation." Biomedical Signal Processing and Control(2025). [paper] [code] [2025.08]
LENS: Lianghui Zhu, Bin Ouyang, Yuxuan Zhang, Tianheng Cheng, Rui Hu, Haocheng Shen, Longjin Ran, Xiaoxin Chen, Li Yu, Wenyu Liu, Xinggang Wang.
"LENS: Learning to Segment Anything with Unified Reinforced Reasoning." AAAI (2026). [paper] [code] [2025.08]
GeoSAM2: Ken Deng, Yunhan Yang, Jingxiang Sun, Xihui Liu, Yebin Liu, Ding Liang, Yan-Pei Cao.
"GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation." ArXiv (2025). [paper] [code] [2025.08]
InstDrive: Hongyuan Liu, Haochen Yu, Jianfei Jiang, Qiankun Liu, Jiansheng Chen, Huimin Ma.
"InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes." ArXiv (2025). [paper] [2025.08]
CoFi: Hongjin Fang, Daniel Reisenbüchler, Kenji Ikemura, Mert R. Sabuncu, Yihe Yang, Ruining Deng.
"CoFi: A Fast Coarse-to-Fine Few-Shot Pipeline for Glomerular Basement Membrane Segmentation." ArXiv (2025). [paper] [code] [2025.08]
Cesar Alan Contreras, Manolis Chiou, Alireza Rastegarpanah, Michal Szulik, Rustam Stolkin.
"Utilizing Vision-Language Models as Action Models for Intent Recognition and Assistance." IEEE RO-MAN (2025). [paper] [2025.08]
MedSAMix: Yanwu Yang, Guinan Su, Jiesi Hu, Francesco Sammarco, Jonas Geiping, Thomas Wolfers.
"MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation." ArXiv (2025). [paper] [2025.08]
Bolt-SAM: Yangjie Xiao, Ke Zhang, Jiacun Wang, Xin Sheng, Yurong Guo, Meijuan Chen, Zehua Ren, Zhaoye Zheng, Zhenbing Zhao.
"A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection." ArXiv (2025). [paper] [code] [2025.08]
SAM-CEM-CD: Humza Naveed, Xina Zeng, Mitch Bryson, Nagita Mehrseresht.
"Adapting SAM via Cross-Entropy Masking for Class Imbalance in Remote Sensing Change Detection." ArXiv (2025). [paper] [code] [2025.08]
PG-SAM: Zhongyuan Wu, Chuan-Xian Ren, Yu Wang, Xiaohua Ban, Jianning Xiao, Xiaohui Duan.
"Multi-Sequence Parotid Gland Lesion Segmentation via Expert Text-Guided Segment Anything Model." ArXiv (2025). [paper] [2025.08]
SegDAC: Alexandre Brown, Glen Berseth.
"SegDAC: Segmentation-Driven Actor-Critic for Visual Reinforcement Learning." ArXiv (2025). [paper] [code] [2025.08]
AutoSAME: Tuo Liu, Qinghan Yang, Yu Zhang, Rongjun Ge, Yang Chen, Guangquan Zhou.
"Think as Cardiac Sonographers: Marrying SAM with Left Ventricular Indicators Measurements According to Clinical Guidelines." ArXiv (2025). [paper] [code] [2025.08]
RSFIQA: Chenyue Song, Chen Hui, Haiqi Zhu, Feng Jiang, Yachun Mi, Wei Zhang, Shaohui Liu.
"Segmenting and Understanding: Region-aware Semantic Attention for Fine-grained Image Quality Assessment with Large Language Models." ArXiv (2025). [paper] [2025.08]
CAV-SAM: Haoran Wang, Zekun Li, Jian Zhang, Lei Qi, Yinghuan Shi.
"Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild." ArXiv (2025). [paper] [code] [2025.08]
CLUE: Youqi Wang, Shunquan Tan, Rongxuan Peng, Bin Li, Jiwu Huang.
"CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization." ArXiv (2025). [paper] [code] [2025.08]
ForensicsSAM: Rongxuan Peng, Shunquan Tan, Chenqi Kong, Anwei Luo, Alex C. Kot, Jiwu Huang.
"ForensicsSAM: Toward Robust and Unified Image Forgery Detection and Localization Resisting to Adversarial Attack." ArXiv (2025). [paper] [code] [2025.08]
S2-UniSeg: Huihui Xu, Jin Ye, Hongqiu Wang, Changkai Ji, Jiashi Lin, Ming Hu, Ziyan Huang, Ying Chen, Chenglong Ma, Tianbin Li, Lihao Liu, Junjun He, Lei Zhu.
"S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision." ArXiv (2025). [paper] [code] [2025.08]
IAPF: Chao Yin, Jide Li, Xiaoqiang Li.
"A Simple yet Powerful Instance-Aware Prompting Framework for Training-free Camouflaged Object Segmentation." ArXiv (2025). [paper] [code] [2025.08]
SAGOnline: Wentao Sun, Quanyun Wu, Hanqing Xu, Kyle Gao, Zhengsen Xu, Yiping Chen, Dedong Zhang, Lingfei Ma, John S. Zelek, Jonathan Li.
"SAGOnline: Segment Any Gaussians Online." ArXiv (2025). [paper] [2025.08]
SAM-Med3D: Wang, Haoyu and Guo, Sizheng and Ye, Jin and Deng, Zhongying and Cheng, Junlong and Li, Tianbin and Chen, Jianpin and Su, Yanzhou and Huang, Ziyan and Shen, Yiqing and Fu, Bin and Zhang, Shaoting and He, Junjun.
"SAM-Med3D: A Vision Foundation Model for General-Purpose Segmentation on Volumetric Medical Images." TNNLS (2025). [paper] [code] [2025.08]
AP-SAM: Peigang Liu and Peijie Wang and Chaozhi Yang and Honghao Dong and Jing Ma and Zongmin Li.
"Automatic pore structure analysis of mudstone and shale in hydrocarbon-rich areas using SEM images and AP-SAM." Fuel (2025). [paper] [2025.08]
AP-SAM: Feng, Yunjian and Li, Jun.
"Auto-Prompting SAM for Container Detection and Localization in Container Yards." TITS (2025). [paper] [2025.08]
Y. Xia et al.
"Box-Prompt Zero-Shot Smart Segmentation in Radiation Oncology Using a SAM-Based Model: SmartSAM." AAPM (2025). [paper] [2025.08]
Chenxiao Zhang and Peng Yue.
"Toward unsupervised building extraction from very high-resolutionremote sensing images using SAM and CLIP." GIScience & Remote Sensing (2025). [paper] [2025.08]
ScribbleSAM: Tae Hun Lee , Jae Yeol Lee.
"ScribbleSAM: Weakly supervised salient object detection and localization in remote sensing images using Transformer and Segment Anything Model." Journal of Computational Design and Engineering (2025). [paper] [2025.08]
2AM: Ren, Chenyu, Liwen Zou, and Luying Gui.
"2AM: Weakly Supervised Tumor Segmentation in Pathology via CAM and SAM Synergy." Electronics (2025). [paper] [2025.08]
Ying Qu.
"Attitude Estimation for Cardan-Connected End-Effector of an Underwater Robot by Integrating SAM-based Segmentation and Neural Network." ICECET (2025). [paper] [2025.08]
Rajesh Bhayana, Bo Wang.
"Segment Anything in the Ovary: Toward Scalable AI-assisted Lesion Classification." Radiology (2025). [paper] [2025.08]
Vessel-SAM2: Zihuang Wu and Xinyu Xiong.
"Vessel-SAM2: Adapting Segment Anything 2 for Patch-Free Retinal Vessel Segmentation in Ultra-High Resolution Fundus Images." IEEE Sensors Letters (2025). [paper] [2025.08]
TFSeg: He, Jiaqi and Li, Haofeng and Yang, Liang and Chen, Bohui.
"Training-Free Breast Ultrasound Image Segmentation with Retrieval-based SAM2." IEEE Transactions on Biomedical Engineering (2025). [paper] [2025.08]
BCTDNet: Zhang, Wei, Jinsong Li, Shuaipeng Wang, and Jianhua Wan.
"BCTDNet: Building Change-Type Detection Networks with the Segment Anything Model in Remote Sensing Images." Remote Sensing (2025). [paper] [2025.08]
Prompt-DINO: Yuchen Guan, Chong Sun, Canmiao Fu, Zhipeng Huang, Chun Yuan, Chen Li.
"Text-guided Visual Prompt DINO for Generic Segmentation." ArXiv (2025). [paper] [code] [2025.08]
VeSCA: Yi Qin, Rui Wang, Tao Huang, Tong Xiao, Liping Jing.
"SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures." ICCV(2025). [paper] [code] [2025.08]
Sri Ramana Saketh Vasanthawada, Pengkun Liu, Pingbo Tang.
"Enhancing Construction Site Analysis and Understanding with 3D Segmentation." ArXiv (2025). [paper] [2025.08]
Ahmad Farooq, Kamran Iqbal.
"Integrating Vision Foundation Models with Reinforcement Learning for Enhanced Object Interaction." RCVE(2025). [paper] [2025.08]
TSMS-SAM2: Guoping Xu, Hua-Chieh Shao, You Zhang.
"TSMS-SAM2: Multi-scale Temporal Sampling Augmentation and Memory-Splitting Pruning for Promptable Video Object Segmentation and Tracking in Surgical Scenarios." ArXiv (2025). [paper] [code] [2025.08]
Ojonugwa Oluwafemi Ejiga Peter, Akingbola Oluwapemiisin, Amalahu Chetachi, Adeniran Opeyemi, Fahmi Khalifa, Md Mahmudur Rahman.
"Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation." ArXiv (2025). [paper] [2025.08]
SGDFuse: Xiaoyang Zhang, Zhen Hua, Yakun Ju, Wei Zhou, Jun Liu, Alex C. Kot.
"SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion." ArXiv (2025). [paper] [code] [2025.08]
SMOL-MapSeg: Yunshuang Yuan, Frank Thiemann, Thorsten Dahms, Monika Sester.
"SMOL-MapSeg: Show Me One Label." ArXiv (2025). [paper] [2025.08]
Semanur Küçük, Cosimo Della Santina, Angeliki Laskari.
"Segmenting the Complex and Irregular in Two-Phase Flows: A Real-World Empirical Study with SAM2." ArXiv (2025). [paper] [2025.08]
DecoupleCSS: Yifu Guo, Yuquan Lu, Wentao Zhang, Zishan Xu, Dexia Chen, Siyu Zhang, Yizhe Zhang, Ruixuan Wang.
"Decoupling Continual Semantic Segmentation." ArXiv (2025). [paper] [code] [2025.08]
TGS-Agent: Jinxing Zhou, Yanghao Zhou, Mingfei Han, Tong Wang, Xiaojun Chang, Hisham Cholakkal, Rao Muhammad Anwer.
"Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation." ArXiv (2025). [paper] [code] [2025.08]
X-SAM: Hao Wang, Limeng Qiao, Zequn Jie, Zhijian Huang, Chengjian Feng, Qingfang Zheng, Lin Ma, Xiangyuan Lan, Xiaodan Liang.
"X-SAM: From Segment Anything to Any Segmentation." ArXiv (2025). [paper] [code] [2025.08]
Edge2Prompt: Nathan Hollet, Oumeymah Cherkaoui, Philippe C. Cattin, Sidaty El hadramy.
"Edge2Prompt: Modality-Agnostic Model for Out-of-Distribution Liver Segmentation." ArXiv (2025). [paper] [2025.08]
SAV: Xiao Wang, Ziwen Wang, Wentao Wu, Anjie Wang, Jiashu Wu, Yantao Pan, Chenglong Li.
"Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark." ArXiv (2025). [paper] [code] [2025.08]
MLLMSeg: Jingchao Wang, Zhijian Wu, Dingjiang Huang, Yefeng Zheng, Hong Wang.
"Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode." ArXiv (2025). [paper] [code] [2025.08]
RAP-SAM: Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang.
"RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything." ICLR (2025). [paper] [code] [2025.08]
ParticleSAM: Yu Zhou, Pelle Thielmann, Ayush Chamoli, Bruno Mirbach, Didier Stricker, Jason Rambach.
"ParticleSAM: Small Particle Segmentation for Material Quality Monitoring in Recycling Processes." EUSIPCO(2025). [paper] [2025.08]
SAM2-UNeXT: Xinyu Xiong, Zihuang Wu, Lei Zhang, Lei Lu, Ming Li, Guanbin Li.
"SAM2-UNeXT: An Improved High-Resolution Baseline for Adapting Foundation Models to Downstream Segmentation Tasks." ArXiv (2025). [paper] [code] [2025.08]
MAUP: Yazhou Zhu, Haofeng Zhang.
"MAUP: Training-free Multi-center Adaptive Uncertainty-aware Prompting for Cross-domain Few-shot Medical Image Segmentation." MICCAI (2025). [paper] [code] [2025.08]
Freida Barnatan, Emunah Goldstein, Einav Kalimian, Orchen Madar, Avi Huri, David Zitoun, Ya'akov Mandelbaum, Moshe Amitay.
"Zero-shot Shape Classification of Nanoparticles in SEM Images using Vision Foundation Models." ArXiv (2025).
MFNet: Ma, Xianping and Zhang, Xiaokang and Pun, Man-On and Huang, Bo.
"A Unified Framework with Multimodal Fine-tuning for Remote Sensing Semantic Segmentation." TGRS (2025). [paper] [code] [2025.08]
SAMPO: Yonghuang Wu, Wenwen Zeng, Xuan Xie, Chengqian Zhao, Guoqing Wu, Jinhua Yu.
"SAMPO: Visual Preference Optimization for Intent-Aware Segmentation with Vision Foundation Models." ArXiv (2025). [paper] [2025.08]
Kumail Abbas, Zeeshan Afzal, Aqeel Raza, Taha Mansouri, Andrew W. Dowsey, Chaidate Inchaisri, Ali Alameer.
"Vision transformer-based multi-camera multi-object tracking framework for dairy cow monitoring." ArXiv (2025). [paper] [2025.08]
PromptReg: Shiqi Huang, Tingfa Xu, Wen Yan, Dean Barratt, Yipeng Hu.
"Register Anything: Estimating "Corresponding Prompts" for Segment Anything Model." ArXiv (2025). [paper] [2025.08]
Rein++: Zhixiang Wei, Xiaoxiao Ma, Ruishen Yan, Tao Tu, Huaian Chen, Jinjin Zheng, Yi Jin, Enhong Chen.
"Rein++: Efficient Generalization and Adaptation for Semantic Segmentation with Vision Foundation Models." ArXiv (2025). [paper] [code] [2025.08]
VELA: Zhixuan Li, Yujia Liu, Chen Hui, Weisi Lin.
"Single Point, Full Mask: Velocity-Guided Level Set Evolution for End-to-End Amodal Segmentation." ArXiv (2025). [paper] [2025.08]
Yang Liu, Muzhi Zhu, Hao Chen, Xinlong Wang, Bo Feng, Hao Wang, Shiyu Li, Raviteja Vemulapalli & Chunhua Shen .
"Segment Anything in Context with Vision Foundation Models." IJCV (2025). [paper] [2025.08]
SAMSA 2.0: Alfie Roddan, Tobias Czempiel, Chi Xu, Daniel S. Elson, Stamatia Giannarou.
"SAMSA 2.0: Prompting Segment Anything with Spectral Angles for Hyperspectral Interactive Medical Image Segmentation." ArXiv (2025). [paper] [2025.08]
Omni-Scan: Tianshuang Qiu, Zehan Ma, Karim El-Refai, Hiya Shah, Chung Min Kim, Justin Kerr, Ken Goldberg.
"Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging ." ArXiv (2025). [paper] [code] [2025.08]
Aymane Abdali, Bartosz Boguslawski, Lucas Drumetz, Vincent Gripon.
"Object-Centric Cropping for Visual Few-Shot Classification." ArXiv (2025). [paper] [2025.08]
SAM-PTx: Shayan Jalilian, Abdul Bais.
"SAM-PTx: Text-Guided Fine-Tuning of SAM with Parameter-Efficient, Parallel-Text Adapters." ArXiv (2025). [paper] [2025.08]
SeC: Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Songxin He, Jianfan Lin, Junsong Tang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang.
"SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction." ArXiv (2025). [paper] [project] [code] [dataset] [2025.07]
SAMSA: Alfie Roddan, Tobias Czempiel, Chi Xu, Daniel S. Elson, Stamatia Giannarou.
"SAMSA: Segment Anything Model Enhanced with Spectral Angles for Hyperspectral Interactive Medical Image Segmentation." ArXiv (2025). [paper] [2025.07]
ST-SAM: Xihang Hu, Fuming Sun, Jiazhe Liu, Feilong Xu, Xiaoli Zhang.
"ST-SAM: SAM-Driven Self-Training Framework for Semi-Supervised Camouflaged Object Detection." ACM MM (2025). [paper] [code] [2025.07]
Solha Kang, Eugene Kim, Joris Vankerschaver, Utku Ozbulak.
"Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2." MICCAI Workshop (2025). [paper] [2025.07]
Lalithkumar Seenivasan, Jiru Xu, Roger D. Soberanis Mukul, Hao Ding, Grayson Byrd, Yu-Chun Ku, Jose L. Porras, Masaru Ishii, Mathias Unberath.
"Beyond Rigid AI: Towards Natural Human-Machine Symbiosis for Interoperative Surgical Assistance." ArXiv (2025). [paper] [2025.07]
MergeSAM: Meiqi Hu, Lingzhi Lu, Chengxi Han, Xiaoping Liu.
"MergeSAM: Unsupervised change detection of remote sensing images based on the Segment Anything Model." ArXiv (2025). [paper] [2025.07]
SAMITE: Qianxiong Xu, Lanyun Zhu, Chenxi Liu, Guosheng Lin, Cheng Long, Ziyue Li, Rui Zhao.
"SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking." ArXiv (2025). [paper] [code] [2025.07]
Maoquan Zhang, Bisser Raytchev, Xiujuan Sun.
"Semantic Segmentation of iPS Cells: Case Study on Model Complexity in Biomedical Imaging." MVA(2025). [paper] [2025.07]
Marcel Moran, Arunav Gupta, Jiali Qian, Debra Laefer.
"Scaling Pedestrian Crossing Analysis to 100 U.S. Cities via AI-based Segmentation of Satellite Imagery." ArXiv (2025). [paper] [2025.07]
SAMwave: Saurabh Yadav, Avi Gupta, Koteswar Rao Jerripothula.
"SAMwave: Wavelet-Driven Feature Enrichment for Effective Adaptation of Segment Anything Model." BMVC(2025). [paper] [2025.07]
HQ-SMem: Elham Soltani Kazemi, Imad Eddine Toubal, Gani Rahmon, Jaired Collins, K. Palaniappan.
"HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback." ArXiv (2025). [paper] [2025.07]
SAM2-Aug: Guoping Xu, Yan Dai, Hengrui Zhao, Ying Zhang, Jie Deng, Weiguo Lu, You Zhang.
"SAM2-Aug: Prior knowledge-based Augmentation for Target Volume Auto-Segmentation in Adaptive Radiation Therapy Using Segment Anything Model 2." ArXiv (2025). [paper] [code] [2025.07]
Lacune_Detection: Pon Deepika et al.
"Automated Detection of Lacunes in Brain MR Images Using SAM with Robust Prompts via Self-Distillation and Anatomy-Informed Priors." ArXiv (2025). [paper] [code] [2025.07]
SinkSAM-Net: Osher Rafaeli and Tal Svoray and Ariel Nahlieli.
"SinkSAM-Net: Knowledge-driven self-supervised sinkhole segmentation using topographic priors and Segment Anything Model." ISPRS Journal of Photogrammetry and Remote Sensing (2025). [paper] [code] [2025.07]
Sergi, G., Bocchino, F., Ravanelli, R., & Crespi, M.
"Monitoring water reservoirs extent with Segment Anything Model applied to Sentinel imagery." European Journal of Remote Sensing(2025). [paper] [2025.07]
DefectSAM: Yan, Feng and Jiang, Xiaoheng and Lu, Yang and Cao, Jiale and Xu, Mingliang.
"DefectSAM: Hierarchically Adapting SAM for Pixel-Wise Surface Defect Detection." TNNLS (2025). [paper] [2025.07]
GAM: Ge, Rongjun and Li, Ruiyi and Wang, Chong and Liu, Yuxin and Zhu, Heng and Coatrieux, Jean-Louis and Zhang, Daoqiang and Lu, Jian and Chen, Yang and Li, Shuo and He, Yuting.
"Adaptation follow human attention: Gaze-assisted medical segment anything model." TCSVT (2025). [paper] [code] [2025.07]
IPPIS: Hui Chen, Nannan Li, Ming An, Chengxi Xia & Kekun Zhu.
"Enhancing image dehazing with polarization awareness and SAM-guided fusion." ArXiv (2025). [paper] [code] [2025.07]
TS-SAM: Zhang, E. et al.
"Unleashing the Potential of SAM for Change Detection: A Two-Stage Approach for Enhanced Remote Sensing Analysis." ICIC (2025). [paper] [2025.07]
EdgeSAM-CASD: Zhang, J. et al.
"EdgeSAM-CASD: Lightweight Mural Damage Segmentation via Convolutional Adapter." ICIC (2025). [paper] [2025.07]
SMBA-MIL: Zhou, B., Wang, C., Wu, X., Liao, C., Wang, P., Wang, H.
"SMBA-MIL: SAM-Enhanced Multi-branch Attention Multi-instance Learning for Whole Slide Image Classification." ICIC (2025). [paper] [2025.07]
Yolo-HLSAM: Banteng Liu and Hongguang Chen and Tianyi Zhu and Zanting Ye and Haidong Cui and Ke Wang.
"Yolo-HLSAM: Adapting foundation segment anything model for semi-automatic detection and segmentation of breast cancer microcalcification clusters." Biomedical Signal Processing and Control (2025). [paper] [code] [2025.07]
Rad_SAM2: Ezekiel Chukwujindu and Khunsa Faiz and Alexandra {De Sequeira} and Stephanie Chidom and Hafsa Faiz.
"Improving medical image segmentation with SAM2: analyzing the impact of object characteristics and finetuning on multi-planar datasets." European Journal of Radiology Artificial Intelligence (2025). [paper] [code] [2025.07]
Jia, Runda and Wang, Jinglong and Zheng, Jun and Li, Jiahao and He, Dakuo.
"Feature Extraction for Mining Industry Image Based on SAM: A Case Study From Froth Flotation." TII (2025). [paper] [2025.07]
SAID-Net: Wu, Y., Zhao, T., Hu, S. et al.
"SAID-Net: enhancing segment anything model with implicit decoding for echocardiography sequences segmentation." Med Biol Eng Comput (2025). [paper] [2025.07]
TP-SAM: Xiao, T., Ling, Y.
"TP-SAM: Fine-Tuning SAM with Task-Specific Prompt in the Loop." ICIC (2025). [paper] [2025.07]
3DGS-SAM 2: Duan, D., Wang, Z., Xin, Y. et al.
"Defect segmentation and 3D reconstruction in concrete structures using SAM 2 and 3D Gaussian splatting." J Civil Struct Health Monit(2025). [paper] [2025.07]
DCVA-SAM: Sudipan Saha; Kanishk Awadhiya.
"Integrating Deep Change Vector Analysis and SAM for Class-Specific Change Detection." IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2025). [paper] [code] [2025.07]
KD-MedSAM: Zhou, W., Zhu, J., Chen, W., Li, C., He, Y., He, M.
"KD-MedSAM: Lightweight Knowledge Distillation of Segment Anything Model for Multi-modality Medical Image Segmentation." ICIC (2025). [paper] [2025.07]
FreqSAM2-UNet: Wang, C., Cao, J., Gao, Y., Wang, J.
"FreqSAM2-UNet: Adapter Fine-Tuning Frequency-Aware Network of SAM2 for Universal Medical Segmentation." ICIC (2025). [paper] [2025.07]
SCCAM: Liu, Y., Fei, Y., Dai, X., Zhou, Y., Wang, X., Huang, X.
"Annotation-Free Salient Object Detection via Spatial-Enhanced Contrastive Learning and SAM." ICIC (2025). [paper] [2025.07]
YOSAM: Li, Z., Chen, L., Lu, L., Ding, Y., Hao, X.
"YOSAM: A YOLO and MedSAM-Based Framework for Automatic Measurement of Fetal Head Circumference in Ultrasound Images." ICIC (2025). [paper] [2025.07]
Li, J., Yan, F., Zhang, X., Yang, L.
"Two-Stage Multi-stained Cell Analysis with the Segment Anything Model for Pathological Image Segmentation." ICIC (2025). [paper] [2025.07]
FB-SAM: Wen, Z., Ma, J.
"FB-SAM: An Effective Learning Framework for First Break Picking Based on the SAM Model with Limited Data." ICIC (2025). [paper] [2025.07]
SAM2-DFBCNet: Yuan, Cao, Libang Liu, Yaqin Li, and Jianxiang Li.
"SAM2-DFBCNet: A Camouflaged Object Detection Network Based on the Heira Architecture of SAM2." Sensors (2025). [paper] [2025.07]
Bolutife Atoki, Jenny Benois-Pineau, Renaud Péteri, Fabien Baldacci, Aymar de Rugy.
"Object segmentation in the wild with foundation models: application to vision assisted neuro-prostheses for upper limbs." ArXiv (2025). [paper] [code] [2025.07]
TextSAM-EUS: Pascal Spiegler, Taha Koleilat, Arash Harirpoush, Corey S. Miller, Hassan Rivaz, Marta Kersten-Oertel, Yiming Xiao.
"TextSAM-EUS: Text Prompt Learning for SAM to Accurately Segment Pancreatic Tumor in Endoscopic Ultrasound." ICCVW (2025). [paper] [2025.07]
FA-SAM: Huanli Zhuo, Leilei Ma, Haifeng Zhao, Shiwei Zhou, Dengdi Sun, Yanping Fu.
"Fully Automated SAM for Single-source Domain Generalization in Medical Image Segmentation." IEEE SMC (2025). [paper] [2025.07]
ScSAM: Bo Fang, Jianan Fan, Dongnan Liu, Hang Chang, Gerald J. Shami, Filip Braet, Weidong Cai.
"ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation." ECAI (2025). [paper] [2025.07]
MARSCalib: Seokhwan Jeong, Hogyun Kim, Younggun Cho.
"MARSCalib: Multi-robot, Automatic, Robust, Spherical Target-based Extrinsic Calibration in Field and Extraterrestrial Environments." ArXiv (2025). [paper] [code] [2025.07]
SpelkeNet: Rahul Venkatesh, Klemen Kotar, Lilian Naing Chen, Seungwoo Kim, Luca Thomas Wheeler, Jared Watrous, Ashley Xu, Gia Ancone, Wanhee Lee, Honglin Chen, Daniel Bear, Stefan Stojanov, Daniel Yamins.
"Discovering and using Spelke segments." ArXiv (2025). [paper] [code] [2025.07]
CMP: Shuai Chen, Fanman Meng, Chunjin Yang, Haoran Wei, Chenhao Wu, Qingbo Wu, Hongliang Li.
"CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation." ArXiv (2025). [paper] [2025.07]
DFR: Shuai Chen, Fanman Meng, Xiwei Zhang, Haoran Wei, Chenhao Wu, Qingbo Wu, Hongliang Li.
"DFR: A Decompose-Fuse-Reconstruct Framework for Multi-Modal Few-Shot Segmentation." ArXiv (2025). [paper] [2025.07]
PlantSAM: Youcef Sklab, Florian Castanet, Hanane Ariouat, Souhila Arib, Jean-Daniel Zucker, Eric Chenin, Edi Prifti.
"PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens." ArXiv (2025). [paper] [2025.07]
OP-SAM: Xinyu Mao, Xiaohan Xing, Fei Meng, Jianbang Liu, Fan Bai, Qiang Nie, Max Meng.
"One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Iterative Prompt Evolution." ICCV (2025). [paper] [code] [2025.07]
HFS-SAM2: Wu, Zihuang and Xiong, Xinyu and Gao, Guangwei and Li, Hongwei and Chen, Hua.
"HFS-SAM2: Segment Anything Model 2 with High-Frequency Feature Supplementation for Camouflaged Object Detection." IEEE SPL (2025). [paper] [code] [2025.07]
PseudonuScenes: Atharv Goel, Mehar Khurana.
"Just Add Geometry: Gradient-Free Open-Vocabulary 3D Detection Without Human-in-the-Loop." ArXiv (2025). [[paper](Just Add Geometry: Gradient-Free Open-Vocabulary 3D Detection Without Human-in-the-Loop)] [code] [2025.07]
ConformalSAM: Danhui Chen, Ziquan Liu, Chuxi Yang, Dan Wang, Yan Yan, Yi Xu, Xiangyang Ji.
"ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction." ICCV (2025). [paper] [2025.07]
FastSmoothSAM: Jiasheng Xu, Yewang Chen.
"FastSmoothSAM: A Fast Smooth Method For Segment Anything Model." ArXiv (2025). [paper] [code] [2025.07]
DD-SAM2: Guoping Xu, Christopher Kabat, You Zhang.
"Depthwise-Dilated Convolutional Adapters for Medical Object Tracking and Segmentation Using the Segment Anything Model 2." ArXiv (2025). [paper] [code] [2025.07]
SAM2Plus: Yin, Jun, Fei Wu, Hao Su, Peng Huang, and Yuetong Qixuan.
"Improvement of SAM2 Algorithm Based on Kalman Filtering for Long-Term Video Object Segmentation." Sensors (2025). [paper] [2025.07]
BreastSegNet: Qihang Li, Jichen Yang, Yaqian Chen, Yuwen Chen, Hanxue Gu, Lars J. Grimm, Maciej A. Mazurowski.
"BreastSegNet: Multi-label Segmentation of Breast MRI." ArXiv (2025). [paper] [2025.07]
CSW-SAM: Tianyi Zhang and Yi Ren and Weibin Li and Chenhao Qin and Licheng Jiao and Hua Su.
"CSW-SAM: a cross-scale algorithm for very-high-resolution water body segmentation based on segment anything model 2." ISPRS Journal of Photogrammetry and Remote Sensing(2025). [paper] [2025.07]
Hanxue Gu, Yaqian Chen, Nicholas Konz, Qihang Li, Maciej A. Mazurowski.
"Are Vision Foundation Models Ready for Out-of-the-Box Medical Image Registration?." ArXiv (2025). [paper] [code] [2025.07]
RegCL: Yuan-Chen Shu, Zhiwei Lin, Yongtao Wang.
"RegCL: Continual Adaptation of Segment Anything Model via Model Merging." ArXiv (2025). [paper] [2025.07]
Antonio Finocchiaro, Giovanni Maria Farinella, Antonino Furnari.
"Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation." International Conference on Image Analysis and Processing(2025). [paper] [code] [2025.07]
SAMST: Jun Yin, Fei Wu, Yupeng Ren, Jisheng Huang, Qiankun Li, Heng jin, Jianhai Fu, Chanjie Cui.
"SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation." IGARSS(2025). [paper] [2025.07]
Ekaterina Stansfield, Jennifer A. Mitterer, Abdulrahman Altahhan.
"Landmark Detection for Medical Images using a General-purpose Segmentation Model." ArXiv (2025). [paper] [2025.07]
CSCPNet: Jinglin Zhang et al.
"Controlled-SAM and Context Promoting Network for Fine-Grained Semantic Segmentation." JSTARS (2025). [paper] [2025.07]
Wang Zhicheng, Satoshi Yagi, Satoshi Yamamori, Jun Morimoto.
"Object-Centric Mobile Manipulation through SAM2-Guided Perception and Imitation Learning." ArXiv (2025). [paper] [2025.07]
StaRFM: Behraj Khan, Tahir Syed.
"Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift." ArXiv (2025). [paper] [code] [2025.07]
DEARLi: Ivan Martinović, Josip Šarić, Marin Oršić, Matej Kristan, Siniša Šegvić.
"DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation." ICCV Workshop (2025). [paper] [code] [2025.07]
FOCAL: Utkarsh Singhal, Ryan Feng, Stella X. Yu, Atul Prakash.
"Test-Time Canonicalization by Foundation Models for Robust Perception." ICML (2025). [paper] [code] [2025.07]
Inter2Former: You Huang, Lichao Chen, Jiayi Ji, Liujuan Cao, Shengchuan Zhang, Rongrong Ji.
"Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive." ICCV (2025). [paper] [2025.07]
MA-SAM2: Ming Yin, Fu Wang, Xujiong Ye, Yanda Meng, Zeyu Fu.
"Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation." ArXiv (2025). [paper] [code] [2025.07]
Yidong Jiang.
"Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges." ArXiv (2025). [paper] [2025.07]
Birkhoff: Juntong Fan, Zhiwei Hao, Jianqiang Shen, Shang-Ling Jui, Yi Zhang, Jing-Xiao Liao, Feng-Lei Fan.
"Compress Any Segment Anything Model (SAM)." ArXiv (2025). [paper] [code] [2025.07]
Wu, Xiaoqin, Dacheng Wang, Caihong Ma, Yi Zeng, Yongze Lv, Xianmiao Huang, and Jiandong Wang.
"Parcel Segmentation Method Combined YOLOV5s and Segment Anything Model Using Remote Sensing Image." ArXiv (2025). [paper] [2025.07]
PlantSAM: Daniel J Petti, Changying Li, Alina Zare.
"PlantSAM: Towards Real-Time Plant Segmentation with Efficient Vision-Language Foundation Models." ArXiv (2025). [paper] [2025.07]
HiM2SAM: Ruixiang Chen, Guolei Sun, Yawei Li, Jie Qin, Luca Benini.
"HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking." ArXiv (2025). [paper] [code] [2025.07]
Objectomaly: Jeonghoon Song, Sunghun Kim, Jaegyun Im, Byeongjoon Noh.
"Objectomaly: Objectness-Aware Refinement for OoD Segmentation with Structural Consistency and Boundary Precision." ArXiv (2025). [paper] [2025.07]
Seg-Wild: Yongtang Bao, Chengjie Tang, Yuze Wang, Haojie Li.
"Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections." ArXiv (2025). [paper] [code] [2025.07]
LangSplatV2: Wanhua Li, Yujie Zhao, Minghan Qin, Yang Liu, Yuanhao Cai, Chuang Gan, Hanspeter Pfister.
"LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS." ArXiv (2025). [paper] [code] [2025.07]
Raps-3D: Théo Danielou, Daniel Tordjman, Pierre Manceron, Corentin Dancette.
"RAPS-3D: Efficient interactive segmentation for 3D radiological imaging." MIUA (2025). [paper] [2025.07]
SMML: Guoyan Liang, Qin Zhou, Jingyuan Chen, Bingcang Huang, Kai Chen, Lin Gu, Zhe Wang, Sai Wu, Chang Yao.
"Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities." AAAI (2025). [paper] [2025.07]
SpatialReasoner: Zhenyang Liu, Sixiao Zheng, Siyu Chen, Cairong Zhao, Longfei Liang, Xiangyang Xue, Yanwei Fu.
"A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding." ACM MM (2025). [paper] [code] [2025.07]
RSRefSeg 2: Keyan Chen, Chenyang Liu, Bowen Chen, Jiafan Zhang, Zhengxia Zou, Zhenwei Shi.
"RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models." ArXiv (2025). [paper] [code] [2025.07]
Accordion: Jingye Chen, Zhaowen Wang, Nanxuan Zhao, Li Zhang, Difan Liu, Jimei Yang, Qifeng Chen.
"Rethinking Layered Graphic Design Generation with a Top-Down Approach." ArXiv (2025). [paper] [2025.07]
OpenWorldSAM: Shiting Xiao, Rishabh Kabra, Yuhang Li, Donghyun Lee, Joao Carreira, Priyadarshini Panda.
"OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts." ArXiv (2025). [paper] [2025.07]
SAMed-2: Zhiling Yan, Sifan Song, Dingjie Song, Yiwei Li, Rong Zhou, Weixiang Sun, Zhennong Chen, Sekeun Kim, Hui Ren, Tianming Liu, Quanzheng Li, Xiang Li, Lifang He, Lichao Sun.
"SAMed-2: Selective Memory Enhanced Medical Segment Anything Model." MICCAI (2025). [paper] [code] [2025.07]
Causal-SAM-LLM: Tao Tang, Shijie Xu, Yiting Wu, Zhixiang Lu.
"Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation." ArXiv (2025). [paper] [2025.07]
SAM2RL: Alen Adamyan, Tomáš Čížek, Matej Straka, Klara Janouskova, Martin Schmid.
"SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2." RLC Workshop on RL4RS (2025). [paper] [2025.07]
PAP-SAM: Jizhe Yu, Xiya Bu, Yu Liu, and Kaiping Xu.
"PAP-SAM: Global-Local Prior Adaptive Perception SAM for Co-Salient Object Detection." ICMR (2025). [paper] [2025.07]
Light SAM: Jiahong Chen, Hui Li, Shengnan Shen, Yingjie Wang, Rongzhe Ma, Haoyuan Chen, Hong Yin, Liwei Xia.
"Universal Light SAM algorithm for in-situ melting pool monitoring of L-DED/L-PBF/PAM processes." Journal of Computational Design and Engineering(2025). [paper] [2025.07]
Zhanghao Qin.
"Diffusion-Based Adversarial Generation with SAM-Guided Spatial Semantics for Text-to-Image Models." ICMR (2025). [paper] [2025.07]
EFI-SAM: Huang, Junqing and Bao, Junqi and Xia, Min and Yuan, Xiaochen.
"SAM-Based Efficient Feature Integration Network for Remote Sensing Change Detection: A Case Study on Macao Sea Reclamation." JSTARS (2025). [paper] [code] [2025.07]
FlexiSAM: Zhan Zhang et al.
"FlexiSAM: A flexible SAM-based semantic segmentation model for land cover classification using high-resolution multimodal remote sensing imagery." ISPRS Journal of Photogrammetry and Remote Sensing (2025). [paper] [2025.07]
Yuan Meng et al.
"Unsupervised SAM segmentation of zebrafish body: Application to melanin analysis." Environmental Pollution (2025). [paper] [2025.07]
Gandul, Luis Villanueva, Antonio Madueño-Luna, José Miguel Madueño-Luna, Miguel Calixto López-Gordillo, and Manuel Jesús González-Ortega.
"Diagnosis by SAM Linked to Machine Vision Systems in Olive Pitting Machines." Applied Sciences (2025). [paper] [2025.07]
Sui, Y., Hu, Q. & Zhang, Y.
"Cross-domain subcortical brain structure segmentation algorithm based on low-rank adaptation fine-tuning SAM." BMC Med Imaging (2025). [paper] [2025.07]
SAMUSA: Baptiste Podvin et al.
"SAMUSA: Segment Anything Model 2 for UltraSound Annotation." ArXiv (2025). [paper] [2025.07]
Sway: Gupta, J., Sharma, S. Sway.
"Sway: efficient pedestrian detection using SAM-based walkable area segmentation and YOLO." Int. j. inf. tecnol.(2025). [paper] [2025.07]
Luis Villanueva Gandul et al.
"Diagnosis by SAM Linked to Machine Vision Systems in Olive Pitting Machines." Appl. Sci. (2025). [paper] [2025.07]
FESS-SAM: Hangbin Wu and Shaojun Zhou and Zhengwen Xu and Haili Sun and Lianbi Yao.
"FESS-SAM: Full-element semantic segmentation of tunnel linear array images based on the segment anything model." Advanced Engineering Informatics (2025). [paper] [2025.07]
SAMOccNet: Qifan Tan and Wenzhuo Liu and Han Bi and Lening Wang and Lei Yang and Yicheng Qiao and Zhuo Zhao and Yanhuan Jiang and Qiannan Guo and Huaping Liu and Zhiwei Li and Cheng Qiu.
"SAMOccNet: Refined SAM-based Surrounding Semantic Occupancy Perception for Autonomous Driving." Neurocomputing (2025). [paper] [2025.07]
Training-free: Miguel Espinosa, Chenhongyi Yang, Linus Ericsson, Steven McDonagh, Elliot J. Crowley.
"No time to train! Training-Free Reference-Based Instance Segmentation." ArXiv (2025). [paper] [code] [2025.07]
WeCoL: Weiwei Duan, Luping Ji, Shengjia Chen, Sicheng Zhu, Jianghong Huang, Mao Ye.
"Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection." ArXiv (2025). [paper] [code] [2025.07]
ViRefSAM: Hanbo Bi, Yulong Xu, Ya Li, Yongqiang Mao, Boyuan Tong, Chongyang Li, Chunbo Lang, Wenhui Diao, Hongqi Wang, Yingchao Feng, Xian Sun.
"ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation." ArXiv (2025). [paper] [2025.07]
NOCTIS: Max Gandyra, Alessandro Santonicola, Michael Beetz.
"NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation." ArXiv (2025). [paper] [code] [2025.07]
ADA-SAM: Tyler Ward, Meredith K. Owen, O'Kira Coleman, Brian Noehren, Abdullah-Al-Zubaer Imran.
"Autoadaptive Medical Segment Anything Model." ArXiv (2025). [paper] [code] [2025.07]
SGP: Zihong Guo, Chen Wan, Yayin Zheng, Hailing Kuang, Xiaohai Lu.
"Boosting Adversarial Transferability Against Defenses via Multi-Scale Transformation." ArXiv (2025). [paper] [2025.07]
SAM-MaGuP: Tapas K. Dutta, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha.
"Mamba Guided Boundary Prior Matters: A New Perspective for Generalized Polyp Segmentation." MICCAI (2025). [paper] [code] [2025.07]
Seg-R1: Zuyao You, Zuxuan Wu.
"Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning." ArXiv (2025). [paper] [code] [2025.06]
SFMS: Zhan, Tianming and Qi, Jiaqiang and Zhang, Jinjin and Yu, Xiaobin and Du, Qian and Wu, Zebin.
"Spatial–Spectral Feature-Enhanced Mamba and SAM-Guided Hyperspectral Multiclass Change Detection." TGRS (2025). [paper] [2025.06]
CRISP-SAM2: Xinlei Yu, Chanmiao Wang, Hui Jin, Ahmed Elazab, Gangyong Jia, Xiang Wan, Changqing Zou, Ruiquan Ge.
"CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for Multi-Organ Segmentation." ArXiv (2025). [paper] [code] [2025.06]
MaTIR: Li-Cheng Shen, Jih-Kang Hsieh, Wei-Hua Li, Chu-Song Chen.
"Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval." ArXiv (2025). [paper] [code] [2025.06]
DeSa2VA: Dang Jisheng, Wu Xudong, Wang Bimei, Lv Ning, Chen Jiayu, Jingwen Zhao, Yichu liu, Jizhao Liu, Juncheng Li, Teng Wang.
"Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder." ArXiv (2025). [paper] [code] [2025.06]
GroundingDINO-US-SAM: Hamza Rasaee, Taha Koleilat, Hassan Rivaz.
"GroundingDINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models." ArXiv (2025). [paper] [2025.06]
VoteSplat: Minchao Jiang, Shunyu Jia, Jiaming Gu, Xiaoyuan Lu, Guangming Zhu, Anqi Dong, Liang Zhang.
"VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding." ICCV (2025). [paper] [code] [2025.06]
Shubhabrata Mukherjee, Jack Lang, Obeen Kwon, Iryna Zenyuk, Valerie Brogden, Adam Weber, Daniela Ushizima.
"Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data." ICPPW (2025). [paper] [2025.06]
MedSAM-CA: Peiting Tian, Xi Chen, Haixia Bi, Fan Li.
"MedSAM-CA: A CNN-Augmented ViT with Attention-Enhanced Multi-Scale Fusion for Medical Image Segmentation." ArXiv (2025). [paper] [2025.06]
Fangyijie Wang, Kevin Whelan, Félix Balado, Guénolé Silvestre, Kathleen M. Curran.
"Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation." ArXiv (2025). [paper] [2025.06]
SurgTPGS: Yiming Huang, Long Bai, Beilei Cui, Kun Yuan, Guankun Wang, Mobarakol Islam, Nicolas Padoy, Nassir Navab, Hongliang Ren.
"SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting." MICCAI (2025). [paper] [code] [2025.06]
DC-TTA: Jihun Kim, Hoyong Kwon, Hyeokjun Kweon, Wooseong Jeong, Kuk-Jin Yoon.
"DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation." ArXiv (2025). [paper] [2025.06]
TASeg: Meng Yu, Te Cui, Qitong Chu, Wenjie Song, Yi Yang, Yufeng Yue.
"TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models." lROS(2025). [paper] [2025.06]
ProSAM: Xiaoqi Wang, Clint Sebastian, Wenbin He, Liu Ren.
"ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts." ArXiv (2025). [paper] [2025.06]
Xianjun Han and Can Bai and Jie Wang and Zijian Wu.
"Improving a segment anything model for segmenting low-quality medical images via an adapter." CVIU(2025). [paper] [2025.06]
LiSegAgr: Yunkai Wang, Yanfeng Lu.
"LiSegAgr: Labeled Instance Segmentation for Agricultural Remote Sensing Images Through Iterative SAM." Neural Information Processing (2025). [paper] [code] [2025.06]
DoRL: Yongcheng Li and Lingcong Cai and Ying Lu and Cheng Lin and Yupeng Zhang and Jingyan Jiang and Genan Dai and Bowen Zhang and Jingzhou Cao and Xiangzhong Zhang and Xiaomao Fan.
"Domain-invariant representation learning via SAM for blood cell classification." Pattern Recognition (2025). [paper] [code] [2025.06]
Michal Stastny, Sean Harrell.
"Application of SAM2 for Defect Detection in Manufacturing." ArXiv (2025). [paper] [2025.06]
SAM4D: Jianyun Xu, Song Wang, Ziqian Ni, Chunyong Hu, Sheng Yang, Jianke Zhu, Qiang Li.
"SAM4D: Segment Anything in Camera and LiDAR Streams." ICCV (2025). [paper] [code] [2025.06]
FFCL-SAM: Tyler Ward, Xiaoqin Wang, Braxton McFarland, Md Atik Ahamed, Sahar Nozad, Talal Arshad, Hafsa Nebbache, Jin Chen, Abdullah Imran.
"Detection of Breast Cancer Lumpectomy Margin with SAM-incorporated Forward-Forward Contrastive Learning." ArXiv (2025). [paper] [code] [2025.06]
PathSegmentor: Zhixuan Chen, Junlin Hou, Liqi Lin, Yihui Wang, Yequan Bie, Xi Wang, Yanning Zhou, Ronald Cheong Kin Chan, Hao Chen.
"Segment Anything in Pathology Images with Natural Language." ArXiv (2025). [paper] [code] [2025.06]
Connor Ludwig, Khashayar Namdar, Farzad Khalvati.
"AI-Driven MRI-based Brain Tumour Segmentation Benchmarking." ArXiv (2025). [paper] [2025.06]
CON-SAM: Zihang Huang, Yaning Feng, Lilin Guo, Qiutao Shi, Wei Jin.
"Fully Automated Mandibular Condyle Segmentation: More Detailed Extraction With Hybrid Customized SAM." International Journal of Imaging Systems and Technology (2025). [paper] [2025.06]
MLFA-SAM: Hui Yang and Zhipeng Jiang and Yaobo Zhang and Yanlan Wu and Heng Luo and Peng Zhang and Biao Wang.
"A high-resolution remote sensing land use/land cover classification method based on multi-level features adaptation of segment anything model." International Journal of Applied Earth Observation and Geoinformation(2025). [paper] [2025.06]
Siwei Xie.
"Accelerating Segment Anything Models via Token Merging: A Comparative Study and a Spectrum Preservation-Based Approach." ArXiv (2025). [paper] [2025.06]
PanSAM3D: Zheng, Yifeng and Liu, Yuanyuan and Zhang, Yangfan and Qian, Huiying and Cao, Yi and Hou, Jue and Mei, Ying and Wang, Shuxin and Liu, Xiaoqing and Qian, Haifeng and Zhong, Jing and Yan, Qiang.
"PanSAM3D: A SAM Foundation Model-Based Framework for Automatic 3D Pancreatic Segmentation Across Multi-Sequence MRI." ArXiv (2025). [paper] [2025.06]
Wang, Shengyi; Huang, Yuxiang; and El-Gohary, Nora.
"SAM-based Segmentation of Multi-Class Bridge Components from Diverse Real-Scene Inspection Images." CIB Conferences (2025). [paper] [2025.06]
Bamwenda, Julius, Mehmet Siraç Özerdem, Orhan Ayyıldız, and Veysı Akpolat.
"A Hybrid Deep Learning Framework for Accurate Cell Segmentation in Whole Slide Images Using YOLOv11, StarDist, and SAM2." Bioengineering (2025). [paper] [2025.06]
SAM-MyoNet: Yuhan Ying and Xingyu Fang and Yiwen Zhao and XinGang Zhao and Yufeng Zhou and Gang Du and Ying Zhan and Tian Gao and Andi Li and Dandan Sun and Guoli Song.
"SAM-MyoNet: A fine-grained perception myocardial ultrasound segmentation network based on segment anything model with prior knowledge driven." Biomedical Signal Processing and Control (2025). [paper] [code] [2025.06]
adaptive-SAM: Chenqi Fang, Kai Duan, Zhipeng Lv, Juncai Huang, Qirui Zhong, Jing Chen, Di Lon.
"Improving image-based water-level monitoring by coupling water-line detection techniques and the Segment Anything Model." Environmental Modelling and Software (2025). [paper] [2025.06]
AL-SAM: Yuze Sun, Hongwei Zhao, Jianhang Zhou.
"Segment Anything Model for detecting salient objects with accurate prompting and Ladder Directional Perception." PRL (2025). [paper] [2025.06]
SAM2-SGP: Yang Xing, Jiong Wu, Yuheng Bu, Kuang Gong.
"SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting." ArXiv (2025). [paper] [code] [2025.06]
COCUS: Kai Zhao, Wubang Yuan, Zheng Wang, Guanyi Li, Xiaoqiang Zhu, Deng-ping Fan, Dan Zeng.
"Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models." ArXiv (2025). [paper] [code] [2025.06]
GeNIE: Jiaming Wang, Diwen Liu, Jizhuo Chen, Jiaxuan Da, Nuowen Qian, Tram Minh Man, Harold Soh.
"GeNIE: A Generalizable Navigation System for In-the-Wild Environments." ArXiv (2025). [paper] [2025.06]
Scene-R1: Zhihao Yuan, Shuyi Jiang, Chun-Mei Feng, Yaolun Zhang, Shuguang Cui, Zhen Li, Na Zhao.
"Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations." ArXiv (2025). [paper] [2025.06]
Yufan Liu, Yi Wu, Gweneth Ge, Haoliang Cheng, Rui Liu.
"Reflective VLM Planning for Dual-Arm Desktop Cleaning: Bridging Open-Vocabulary Perception and Precise Manipulation." ArXiv (2025). [paper] [2025.06]
SafeClick: Yifan Gao, Jiaxi Sheng, Wenbin Wu, Haoyue Li, Yaoxian Dong, Chaoyang Ge, Feng Yuan, Xin Gao.
"SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus." MICCAI (2025). [paper] [code] [2025.06]
PicoSAM2: Pietro Bonazzi, Nicola Farronato, Stefan Zihlmann, Haotong Qi, Michele Magno.
"PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications." ArXiv (2025). [paper] [2025.06]
MedSeg-R: Hao Shao, Qibin Hou.
"MedSeg-R: Medical Image Segmentation with Clinical Reasoning." ArXiv (2025). [paper] [code] [2025.06]
ERAS: Carmelo Scribano, Elena Govi, Paolo bertellini, Simone Parisi, Giorgia Franchini, Marko Bertogna.
"Segment Anything for Satellite Imagery: A Strong Baseline and a Regional Dataset for Automatic Field Delineation." ICIAP (2025). [paper] [code] [2025.06]
STAR: Qiwei Liang and Rulin Zhou and Yijing Zhou and Guankun Wang and Peng Peng and Xiaopin Zhong.
"STAR: Empowering Semi-Supervised Medical Image Segmentation with SAM-based Teacher-Student Architecture and Contrastive Consistency Regularization." Expert Systems with Applications(2025). [paper] [2025.06]
FreqWeaver Adapter: Junhao Wu, Aboagye-Ntow Stephen, Chuyuan Wang, Gang Chen, Xin Huang.
"Baltimore Atlas: FreqWeaver Adapter for Semi-supervised Ultra-high Spatial Resolution Land Cover Classification." ArXiv (2025). [paper] [2025.06]
Leader360V: Weiming Zhang, Dingwen Xiao, Aobotao Dai, Yexin Liu, Tianbo Pan, Shiqi Wen, Lei Chen, Lin Wang.
"Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment." ArXiv (2025). [paper] [2025.06]
SASAM: Xingyi Zhang, Changqi Yu, Shihao Chen and Bin Pang.
"SASAM: A Semantic-Aware SAM Framework for Weld Seam Vision Measurement." Measurement Science and Technology(2025). [paper] [2025.06]
SAMSelect: van Dalen, Joost and Asano, Yuki M. and Rußwurm, Marc.
"SAMSelect: A Spectral Index Search for Marine Debris Visualization Using Segment Anything." IEEE Geoscience and Remote Sensing Letters (2025). [paper] [2025.06]
SAAF: Peilin Li, Jun Yin, Jing Zhong, Ran Luo, Pengyu Zeng, Miao Zhang.
"Segment Any Architectural Facades (SAAF):An automatic segmentation model for building facades, walls and windows based on multimodal semantics guidance." ArXiv (2025). [paper] [2025.06]
Text3DSAM: Yu Xin, Gorkem Can Ates, Wei Shao.
"Text3DSAM: Text-Guided 3D Medical Image Segmentation Using SAM-Inspired Architecture." CVPRW (2025). [paper] [code] [2025.06]
SIT-SAM: Wentao Shi, Junjun He, Yiqing Shen.
"SIT-SAM: A semantic-integration transformer that adapts the Segment Anything Model to zero-shot medical image semantic segmentation." Biomedical Signal Processing and Control (2025). [paper] [code] [2025.06]
Chang, Christian, Hudson Law, Connor Poon, Sydney Yen, Kaustubh Lall, Armin Jamshidi, Vadim Malis, Dosik Hwang, and Won C. Bae.
"Segment Anything Model (SAM) and Medical SAM (MedSAM) for Lumbar Spine MRI." Sensors (2025). [paper] [2025.06]
conSAMme: Josh Myers-Dean, Kangning Liu, Brian Price, Yifei Fan, Jason Kuen, Danna Gurari.
"conSAMme: Achieving Consistent Segmentations with SAM." CVPRW (2025). [paper] [2025.06]
T-SAM: Rangel Daroya, Deepak Chandran, Subhransu Maji, Andrea Fanelli.
"T-SAM: Transductive Learning for Segment Anything Model." CVPRW (2025). [paper] [2025.06]
U-SAM: Rohit Kundu, Sudipta Paul, Arindam Dutta, Amit Roy-Chowdhury.
"Repurposing SAM for User-Defined Semantics Aware Segmentation." CVPRW (2025). [paper] [code] [2025.06]
Almazroey, Alaa Atallah, Salma kammoun Jarraya, and Reem Alnanih.
"SAM for Road Object Segmentation: Promising but Challenging." Journal of Imaging (2025). [paper] [2025.06]
RouGE: Guo, Yizhen and Guo, Hang and Dai, Tao and Wang, Zhi and Chen, Bin and Xia, Shutao.
"Learning Gated Experts for Segment Anything in the Wild." ArXiv (2025). [paper] [2025.06]
Niu, Xuewei and Zhang, Jianyuan and Bai, Yu and Gao, Mengman and Yang, Xin.
"SAM-Guided Accurate Pulmonary Nodule Image Segmentation." IEEE Access (2025). [paper] [2025.06]
MorphSAM: Dingwei Fan, Junyong Zhao, Chunlin Li, Xinlong Wang, Ronghan Zhang, Mingliang Wang, Qi Zhu, Haipeng Si, Daoqiang Zhang, Liang Sun.
"MorphSAM: Learning the Morphological Prompts from Atlases for Spine Image Segmentation." ArXiv (2025). [paper] [2025.06]
TAViS: Ziyang Luo, Nian Liu, Xuguang Yang, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Junwei Han.
"TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models." ArXiv (2025). [paper] [2025.06]
Occ: Yunhan Ren, Ruihuang Li, Lingbo Liu, Changwen Chen.
"Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling." ICME(2025). [paper] [code] [2025.06]
PSLGSAM: Shuyang Li, Shuang Wang, Zhuangzhuang Sun, Jing Xiao.
"Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation." ArXiv (2025). [paper] [2025.06]
Andrea Moglia, Matteo Leccardi, Matteo Cavicchioli, Alice Maccarini, Marco Marcon, Luca Mainardi, Pietro Cerveri.
"Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches." ArXiv (2025). [paper] [2025.06]
Q-SAM2: Nicola Farronato, Florian Scheidegger, Mattia Rigotti, Cristiano Malossi, Michele Magno, Haotong Qin.
"Q-SAM2: Accurate Quantization for Segment Anything Model 2." ArXiv (2025). [paper] [2025.06]
SRPL-SFDA: Xinya Liu, Jianghao Wu, Tao Lu, Shaoting Zhang, Guotai Wang.
"SRPL-SFDA: SAM-Guided Reliable Pseudo-Labels for Source-Free Domain Adaptation in Medical Image Segmentation." Neurocomputing (2025). [paper] [code] [2025.06]
SemanticSplat: Qijing Li, Jingxiang Sun, Liang An, Zhaoqi Su, Hongwen Zhang, Yebin Liu.
"SemanticSplat: Feed-Forward 3D Scene Understanding with Language-Aware Gaussian Fields." ArXiv (2025). [paper] [code] [2025.06]
SSS: Hongjie Zhu, Xiwei Liu, Rundong Xue, Zeyu Zhang, Yong Xu, Daji Ergu, Ying Cai, Yang Zhao.
"SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation." ArXiv (2025). [paper] [code] [2025.06]
SEE: Chunming He, Kai Li, Yachao Zhang, Ziyun Yang, Youwei Pang, Longxiang Tang, Chengyu Fang, Yulun Zhang, Linghe Kong, Xiu Li, Sina Farsiu.
"Segment Concealed Objects with Incomplete Supervision." IEEE TPAMI (2025). [paper] [2025.06]
SAMSelect: Joost van Dalen, Yuki M. Asano, Marc Russwurm.
"SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything." ArXiv (2025). [paper] [2025.06]
SAM-SVN: Jintao Tong, Ran Ma, Yixiong Zou, Guangyao Chen, Yuhua Li, Ruixuan Li.
"Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation." ICML (2025). [paper] [2025.06]
Mingqi Gao, Haoran Duan, Tianlu Zhang, Jungong Han.
"THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation." ArXiv (2025). [paper] [2025.06]
RDVP-MSD: Chao Yin, Hao Li, Kequan Yang, Jide Li, Pinpin Zhu, Xiaoqiang Li.
"Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation." ArXiv (2025). [paper] [code] [2025.06]
Beining Xu, Junxian Li.
"Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods." ArXiv (2025). [paper] [2025.06]
OpenSplat3D: Jens Piekenbrinck, Christian Schmidt, Alexander Hermans, Narunas Vaskevicius, Timm Linder, Bastian Leibe.
"OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting." ArXiv (2025). [paper] [2025.06]
SAM2Auto: Arash Rocky, Q.M. Jonathan Wu.
"SAM2Auto: Auto Annotation Using FLASH." ArXiv (2025). [paper] [2025.06]
Yannis Spyridis, Vasileios Argyriou.
"Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models." IEEE DCOSS IoTi5(2025). [paper] [2025.06]
Talk2SAM: Luka Vetoshkin, Dmitry Yudin.
"Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation." ArXiv (2025). [paper] [code] [2025.06]
TSAM: Abduljalil Radman, Jorma Laaksonen.
"TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation." CVPR (2025). [paper] [code] [2025.06]
STT: Tanner Schmidt, Richard Newcombe.
"Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation." CVPR (2025). [paper] [2025.06]
ESC-Net: Minhyeok Lee, Suhwan Cho, Jungho Lee, Sunghun Yang, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee.
"Effective SAM Combination for Open-Vocabulary Semantic Segmentation." CVPR (2025). [paper] [2025.06]
UNICL-SAM: Dianmo Sheng, Dongdong Chen, Zhentao Tan, Qiankun Liu, Qi Chu, Tao Gong, Bin Liu, Jing Han, Wenbin Tu, Shengwei Xu, Nenghai Yu.
"UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery." CVPR (2025). [paper] [2025.06]
PPO: Xueyu Liu, Rui Wang, Yexin Lai, Guangze Shi, Feixue Shao, Fang Hao, Jianan Zhang, Jia Shen, Yongfei Wu, Wen Zheng.
"Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater." CVPR (2025). [paper] [code] [2025.06]
SAM2Object: Jihuai Zhao, Junbao Zhuo, Jiansheng Chen, Huimin Ma.
"SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation." CVPR (2025). [paper] [code] [2025.06]
TAO: Yuzhi Huang, Chenxin Li, Haitao Zhang, Zixu Lin, Yunlong Lin, Hengyu Liu, Wuyang Li, Xinyu Liu, Jiechao Gao, Yue Huang, Xinghao Ding, Yixuan Yuan.
"Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline." CVPR (2025). [paper] [code] [2025.06]
EntitySAM: Mingqiao Ye, Seoung Wug Oh, Lei Ke, Joon-Young Lee.
"EntitySAM: Segment Everything in Video." CVPR (2025). [paper] [code] [2025.06]
SAM-REF: Chongkai Yu, Ting Liu, Anqi Li, Xiaochao Qu, Chengjing Wu, Luoqi Liu, Xiaolin Hu.
"SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model." CVPR (2025). [paper] [2025.06]
VideoMolmo: Ghazi Shazan Ahmad, Ahmed Heakl, Hanan Gani, Abdelrahman Shaker, Zhiqiang Shen, Ranjay Krishna, Fahad Shahbaz Khan, Salman Khan.
"VideoMolmo: Spatio-Temporal Grounding Meets Pointing." ArXiv (2025). [paper] [code] [2025.06]
ORES: Shengcao Cao, Zijun Wei, Jason Kuen, Kangning Liu, Lingzhi Zhang, Jiuxiang Gu, HyunJoon Jung, Liang-Yan Gui, Yu-Xiong Wang.
"Refer to Anything with Vision-Language Prompts." ArXiv (2025). [paper] [code] [2025.06]
PAM: Weifeng Lin, Xinyu Wei, Ruichuan An, Tianhe Ren, Tingwei Chen, Renrui Zhang, Ziyu Guo, Wentao Zhang, Lei Zhang, Hongsheng Li.
"Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos." ArXiv (2025). [paper] [code] [2025.06]
SAM-TTA: Jianghao Wu, Yicheng Wu, Yutong Xie, Wenjia Bai, You Zhang, Feilong Tang, Yulong Li, Yasmeen George, Imran Razzak.
"SAM-aware Test-time Adaptation for Universal Medical Image Segmentation." ArXiv (2025). [paper] [code] [2025.06]
BalSAM: Mélisande Teng, Arthur Ouaknine, Etienne Laliberté, Yoshua Bengio, David Rolnick, Hugo Larochelle.
"Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery." ArXiv (2025). [paper] [2025.06]
Michelle Chen, David Russell, Amritha Pallavoor, Derek Young, Jane Wu.
"Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery." ICLR Workshop (2025). [paper] [code] [2025.06]
GaRA-SAM: Sohyun Lee, Yeho Kwon, Lukas Hoyer, Suha Kwak.
"GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation." ArXiv (2025). [paper] [2025.06]
HSP-SAM: Mengmeng Zhang, Xingyuan Dai, Yicheng Sun, Jing Wang, Yueyang Yao, Xiaoyan Gong, Fuze Cong, Feiyue Wang, Yisheng Lv.
"Hierarchical Self-Prompting SAM: A Prompt-Free Medical Image Segmentation Framework." ArXiv (2025). [paper] [2025.06]
SAMJ: Carlos Garcia-Lopez-de-Haro, Caterina Fuster-Barcelo, Curtis T. Rueden, Jonathan Heras, Vladimir Ulman, Daniel Franco-Barranco, Adrian Ines, Kevin W. Eliceiri, Jean-Christophe Olivo-Marin, Jean-Yves Tinevez, Daniel Sage, Arrate Munoz-Barrutia.
"SAMJ: Fast Image Annotation on ImageJ/Fiji via Segment Anything Model." ArXiv (2025). [paper] [2025.06]
SAM2-LOVE: Yuji Wang, Haoran Xu, Yong Liu, Jiaze Li, Yansong Tang.
"SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes." CVPR (2025). [paper] [code] [2025.06]
SAM-I2V: Haiyang Mei, Pengyu Zhang, Mike Zheng Shou.
"SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost." CVPR (2025). [paper] [code] [2025.06]
AuralSAM2: Yuyuan Liu, Yuanhong Chen, Chong Wang, Junlin Han, Junde Wu, Can Peng, Jingkun Chen, Yu Tian, Gustavo Carneiro.
"AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting." ArXiv (2025). [paper] [code] [2025.06]
Gang Xu and Yingshui Zhang and Qingrui Yue and Xiaogang Liu.
"Automated detection and quantification of structural component dimensions using segment anything model (SAM)-based segmentation." Automation in Construction (2025). [paper] [2025.06]
EASAM: Jianguo Zhang, Meng Lin, Hu Hou, Benhao Sun, Fengling Hu, Youcheng Yu & Menghan Li .
"EASAM: an edge-aware SAM-based paradigm for tooth segmentation." Signal, Image and Video Processing (2025). [paper] [2025.06]
Daixin Fu, Bowen Kuang, Lin Mi, Yongjie Liu, Qingyuan Wang, Junnan Lv, Lang Li.
"A threshold-based prompt generation method for segment anything model to identify fission gas bubbles." ArXiv (2025). [paper] [2025.06]
Liangyang Ouyang, Yuki Sakai, Ryosuke Furuta, Hisataka Nozawa, Hikoro Matsui, Yoichi Sato.
"Leadership Assessment in Pediatric Intensive Care Unit Team Training." CVPRW (2025). [paper] [2025.05]
KairosAD: Uzair Khan, Franco Fummi, Luigi Capogrosso.
"KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices." ICIAP (2025). [paper] [code] [2025.05]
Gang Xu and Yingshui Zhang and Qingrui Yue and Xiaogang Liu.
"Automated detection and quantification of structural component dimensions using segment anything model (SAM)-based segmentation." Automation in Construction(2025). [paper] [2025.05]
SAMamba: Wenhao Xu, Shuchen Zheng, Changwei Wang, Zherui Zhang, Chuan Ren, Rongtao Xu, Shibiao Xu.
"SAMamba: Adaptive State Space Modeling with Hierarchical Vision for Infrared Small Target Detection." Information Fusion(2025). [paper] [code] [2025.05]
TextRegion: Yao Xiao, Qiqian Fu, Heyi Tao, Yuqun Wu, Zhen Zhu, Derek Hoiem.
"TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models." ArXiv (2025). [paper] [code] [2025.05]
MIAS-SAM: Marco Colussi, Dragan Ahmetovic, Sergio Mascetti.
"MIAS-SAM: Medical Image Anomaly Segmentation without thresholding." ArXiv (2025). [paper] [code] [2025.05]
SAM-R1: Jiaqi Huang, Zunnan Xu, Jun Zhou, Ting Liu, Yicheng Xiao, Mingwen Ou, Bowen Ji, Xiu Li, Kehong Yuan.
"SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning." ArXiv (2025). [paper] [2025.05]
ELE-SAM: Hang Chen, Maoyuan Ye, Peng Yang, Haibin He, Juhua Liu, Bo Du.
"Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation." ArXiv (2025). [paper] [code] [2025.05]
InfoSAM: Yuanhong Zhang, Muyao Yuan, Weizhan Zhang, Tieliang Gong, Wen Wen, Jiangyong Ying, Weijie Shi.
"InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective." ICML(2025). [paper] [2025.05]
SANSA: Claudia Cuttano, Gabriele Trivigno, Giuseppe Averta, Carlo Masone.
"SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation." NeurIPS (2025). [paper] [code] [2025.05]
TSP-SAM: Bilin Wang, Changda Lei, Yunbo Guo, Kaicheng Hong, Xiuji Kan, Yifan Ouyang, Junbo Li, Rui Li.
"Task-Specific Prompting SAM for Multi-task Gastric Cancer Diagnosis in Endoscopic Images." Expert Systems with Applications(2025). [paper] [2025.05]
Kenneth Ball, Erin Taylor, Nirav Patel, Andrew Bartels, Gary Koplik, James Polly, Jay Hineman.
"Geometric Feature Prompting of Image Segmentation Models." ArXiv (2025). [paper] [code] [2025.05]
ReaMOT: Sijia Chen, Yanqiu Yu, En Yu, Wenbing Tao.
"ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking." ArXiv (2025). [paper] [code] [2025.05]
RoBiS: Xurui Li, Zhonesheng Jiang, Tingxuan Ai, Yu Zhou.
"RoBiS: Robust Binary Segmentation for High-Resolution Industrial Images." ArXiv (2025). [paper] [code] [2025.05]
CCL-LGS: Lei Tian, Xiaomin Li, Liqian Ma, Hefei Huang, Zirui Zheng, Hao Yin, Taiqing Li, Huchuan Lu, Xu Jia.
"CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting." ArXiv (2025). [paper] [code] [2025.05]
A3Tune: Aofei Chang, Le Huang, Alex James Boyd, Parminder Bhatia, Taha Kass-Hout, Cao Xiao, Fenglong Ma.
"Focus on What Matters: Enhancing Medical Vision-Language Models with Automatic Attention Alignment Tuning." ACL (2025). [paper] [code] [2025.05]
VL-SAM-V2: Zhiwei Lin, Yongtao Wang.
"VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion." ArXiv (2025). [paper] [2025.05]
SAMA: Ye Sun, Hao Zhang, Henghui Ding, Tiehua Zhang, Xingjun Ma, Yu-Gang Jiang.
"SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models." ArXiv (2025). [paper] [code] [2025.05]
FHGS: Q. G. Duan, Benyun Zhao, Mingqiao Han Yijun Huang, Ben M. Chen.
"FHGS: Feature-Homogenized Gaussian Splatting." ArXiv (2025). [paper] [code] [2025.05]
Nagito Saito, Shintaro Ito, Koichi Ito, Takafumi Aoki.
"Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation." ICIP(2025). [paper] [2025.05]
PolyCL: Tyler Ward, Aaron Moseley, Abdullah-Al-Zubaer Imran.
"Domain and Task-Focused Example Selection for Data-Efficient Contrastive Medical Image Segmentation." ArXiv (2025). [paper] [code] [2025.05]
Mobina Mansoori, Sajjad Shahabodini, Farnoush Bayatmakou, Jamshid Abouei, Konstantinos N. Plataniotis, Arash Mohammadi.
"Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models." ArXiv (2025). [paper] [code] [2025.05]
ThinkVideo: Shiu-hong Kao, Yu-Wing Tai, Chi-Keung Tang.
"ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts." ArXiv (2025). [paper] [code] [2025.05]
REN: Savya Khosla, Sethuraman TV, Barnett Lee, Alexander Schwing, Derek Hoiem.
"REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders." ArXiv (2025). [paper] [code] [2025.05]
TAGS: Sirui Li, Linkai Peng, Zheyuan Zhang, Gorkem Durak, Ulas Bagci.
"TAGS: 3D Tumor-Adaptive Guidance for SAM." ArXiv (2025). [paper] [2025.05]
Mahmoud Chick Zaouali, Todd Charter, Homayoun Najjaran.
"From Flight to Insight: Semantic 3D Reconstruction for Aerial Inspection via Gaussian Splatting and Language-Guided Segmentation." ArXiv (2025). [paper] [2025.05]
UWIPL_ETRI: Cheng-Yen Yang, Hsiang-Wei Huang, Pyong-Kun Kim, Chien-Kai Kuo, Jui-Wei Chang, Kwang-Ju Kim, Chung-I Huang, Jenq-Neng Hwang.
"Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking." ICPRW (2025). [paper] [2025.05]
RemoteSAM: Liang Yao, Fan Liu, Delong Chen, Chuanyi Zhang, Yijun Wang, Ziyun Chen, Wei Xu, Shimin Di, Yuhui Zheng.
"RemoteSAM: Towards Segment Anything for Earth Observation." ACM MM (2025). [paper] [code] [2025.05]
Track Anything Annotate: Nikita Ivanov, Mark Klimov, Dmitry Glukhikh, Tatiana Chernysheva, Igor Glukhikh.
"Track Anything Annotate: Video annotation and dataset generation of computer vision models." ArXiv (2025). [paper] [code] [2025.05]
H2-COMPACT: Geeta Chandra Raju Bethala, Hao Huang, Niraj Pudasaini, Abdullah Mohamed Ali, Shuaihang Yuan, Congcong Wen, Anthony Tzes, Yi Fang.
"H2-COMPACT: Human-Humanoid Co-Manipulation via Adaptive Contact Trajectory Policies." ArXiv (2025). [paper] [code] [2025.05]
Martin Villagrana, Francisco Lopez-Tiro, Clement Larose, Gilberto Ochoa-Ruiz, Christian Daul.
"Assessing the generalization performance of SAM for ureteroscopy scene understanding." MIUA (2025). [paper] [2025.05]
BuildingSAM: Feng, Wenqing and Guan, Fangli and Tu, Jihui and Xu, Wei.
"BuildingSAM: A Dual-Branch Feature-Augmented Segment Anything Model for Remote Sensing Building Extraction." IEEE Geoscience and Remote Sensing Letters (2025). [paper] [2025.05]
SAM-TS: Jianhong Gan, et al.
"A segmentation method for oral CBCT image based on Segment Anything Model and semi-supervised teacher-student model." ArXiv (2025). [paper] [2025.05]
MGNet: Xia Li, Xinran Liu, Lin Qi, Junyu Dong.
"Weakly supervised camouflaged object detection based on the SAM model and mask guidance." Image and Vision Computing (2025). [paper] [2025.05]
ESAM: Yuehong Chen, et al.
"Edge-enhanced SAM for extracting photovoltaic power plants from remote sensing imagery." International Journal of Applied Earth Observation and Geoinformation (2025). [paper] [2025.05]
UPLS: Luda Tian, et al.
"Attention-based unsupervised prompt learning for SAM in leaf disease segmentation." Knowledge-Based Systems (2025). [paper] [code] [2025.05]
DLK: Nan Mo.
"Brain image registration optimization method via SAM-Med3D multi-scale feature migration." ICBB(2025). [paper] [2025.05]
EchoSAM: Xue Li and Qian Hu and Xiangbo Lin and Yushi Li and Yu Dong and Tong Lin.
"EchoSAM: SAM adaption for unified 2D echocardiography segmentation and ejection fraction calculation." Biomedical Signal Processing and Control (2025). [paper] [2025.05]
Zheshuo Lin, et al.
"Deploying Vision Foundation AI Models on the Edge. The SAM2 Experience." ArXiv (2025). [paper] [2025.05]
Zhang, J., Chen, X., Yu, M. et al.
"Two-stage landslide satellite image recognition in the southeastern tibet region based on Cascade R-CNN and SAM2." Earth Sci Inform (2025). [paper] [2025.05]
SemiT-SAM: Jing Hao, Moyun Liu, Lei He, Lei Yao, James Kit Hon Tsoi & Kuo Feng Hung.
"SemiT-SAM: Building A Visual Foundation Model for Tooth Instance Segmentation on Panoramic Radiographs." MICCAI (2024). [paper] [code] [dataset] [2025.05]
HF-SAM: Shangwang Liu, Ruonan Xu.
"Multi-scale feature fusion based SAM for high-quality few-shot medical image segmentation." CVIU (2025). [paper] [code] [2025.05]
MapSAM: Juncheng Wang and Lei Shang and Wang Lu and Xiangyang Ji and Shujun Wang.
"Model-agnostic personalized adaptation for segment anything model." Neurocomputing (2025). [paper] [code] [2025.05]
Eff-SAM: Nisar Ahmad and Yao-Tien Chen.
"Eff-SAM: SAM-based Efficient Method for Brain Tumor Segmentation in Multimodal 3D MRI Scans." IEEE Access (2025). [paper] [2025.05]
PASS-SAM: Yin Tang; Rui Chen; Gensheng Pei; Qiong Wang.
"PASS-SAM: Integration of segment anything model for large-scale unsupervised semantic segmentation." Computational Visual Media (2025). [paper] [code] [2025.05]
MASG-SAM: Zhou, Wei and Guan, Guilin and Gao, Yuan and Si, Pengju and Xu, Mengjia and Yan, Qifeng.
"MASG-SAM: Enhancing Few-Shot Medical Image Segmentation with Multi-Scale Attention and Semantic Guidance." JBHI (2025). [paper] [code] [2025.05]
COD-SAM: Dongyang Gao and Yichao Zhou and Hui Yan and Chen Chen and Xiyuan Hu.
"COD-SAM: Camouflage object detection using SAM." Pattern Recognition(2025). [paper] [2025.05]
SAMba-UNet: Guohao Huo, Ruiting Dai, Hao Tang.
"SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation." ArXiv (2025). [paper] [2025.05]
TextureSAM: Inbal Cohen, Boaz Meivar, Peihan Tu, Shai Avidan, Gal Oren.
"TextureSAM: Towards a Texture Aware Foundation Model for Segmentation." ArXiv (2025). [paper] [2025.05]
InstructSAM: Yijie Zheng, Weijie Wu, Qingyun Li, Xuehui Wang, Xu Zhou, Aiai Ren, Jun Shen, Long Zhao, Guoqing Li, Xue Yang.
"InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition." NeurIPS (2025). [paper] [code] [2025.05]
GEN2SEG: Om Khangaonkar, Hamed Pirsiavash.
"GEN2SEG: Generative Models Enable Generalizable Instance Segmentation." ArXiv (2025). [paper] [code] [2025.05]
VP Lab: Niccolo Avogaro, Thomas Frick, Yagmur G. Cinar, Daniel Caraballo, Cezary Skura, Filip M. Janicki, Piotr Kluska, Brown Ebouky, Nicola Farronato, Florian Scheidegger, Cristiano Malossi, Konrad Schindler, Andrea Bartezzaghi, Roy Assaf, Mattia Rigotti.
"VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation." ArXiv (2025). [paper] [2025.05]
UWSAM: Hua Li, Shijie Lian, Zhiyuan Li, Runmin Cong, Sam Kwong.
"UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset." ArXiv (2025). [paper] [code] [2025.05]
Tatyana Shmykova, Leila Khaertdinova, Ilya Pershin.
"Zero-Shot Gaze-based Volumetric Medical Image Segmentation." CVPRW (2025). [paper] [2025.05]
FSSAM: Qianxiong Xu, Lanyun Zhu, Xuanyi Liu, Guosheng Lin, Cheng Long, Ziyue Li, Rui Zhao.
"Unlocking the Power of SAM 2 for Few-Shot Segmentation." ICML (2025). [paper] [2025.05]
IPENS: Wentao Song, He Huang, Youqiang Sun, Fang Qu, Jiaqi Zhang, Longhui Fang, Yuwei Hao, Chenyang Peng.
"IPENS: Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion." ArXiv (2025). [paper] [2025.05]
Long-RVOS: Tianming Liang, Haichao Jiang, Yuting Yang, Chaolei Tan, Shuai Li, Wei-Shi Zheng, Jian-Fang Hu.
"Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation." ArXiv (2025). [paper] [code] [2025.05]
iSegMan: Yian Zhao, Wanshi Xu, Ruochong Zheng, Pengchong Qiao, Chang Liu, Jie Chen.
"iSegMan: Interactive Segment-and-Manipulate 3D Gaussians." CVPR (2025). [paper] [code] [2025.05]
InsCore: Shinichi Mae, Ryosuke Yamada, Hirokatsu Kataoka.
"Industry-focused Synthetic Segmentation Pre-training." ArXiv (2025). [paper] [2025.05]
Yijie Zheng, Jinxuan Yang, Yu Chen, Yaxuan Wang, Yihang Lu, Guoqing Li.
"Beluga Whale Detection from Satellite Imagery with Point Labels." IGARSS (2025). [paper] [code] [2025.05]
AoP-SAM: Yi Chen, Mu-Young Son, Chuanbo Hua, Joo-Young Kim.
"AoP-SAM: Automation of Prompts for Efficient Segmentation." AAAI (2025). [paper] [2025.05]
SurgPose: Utsav Rai, Haozheng Xu, Stamatia Giannarou.
"SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision." ICRA (2025). [paper] [2025.05]
uLLSAM: Manyu Li, Ruian He, Zixian Zhang, Weimin Tan, Bo Yan.
"Unifying Segment Anything in Microscopy with Multimodal Large Language Model." ArXiv (2025). [paper] [code] [2025.05]
ISGR: Dayong Liang, Changmeng Zheng, Zhiyuan Wen, Yi Cai, Xiao-Yong Wei, Qing Li.
"Seeing Beyond the Scene: Enhancing Vision-Language Models with Interactional Reasoning." ArXiv (2025). [paper] [code] [2025.05]
PPT-net: Guoying Liang ,Su Yang.
"Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance." ArXiv (2025). [paper] [2025.05]
Mohammad Wasil, Ahmad Drak, Brennan Penfold, Ludovico Scarton, Maximilian Johenneken, Alexander Asteroth, Sebastian Houben.
"Parameter-Efficient Fine-Tuning of Vision Foundation Model for Forest Floor Segmentation from UAV Imagery." IEEE ICRA Workshop (2025). [paper] [2025.05]
ReSurgSAM2: Haofeng Liu, Mingqi Gao, Xuxiao Luo, Ziyue Wang, Guanyi Qin, Junde Wu, Yueming Jin.
"ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking." MICCAI (2025). [paper] [code] [2025.05]
DFG: Zheang Huai, Hui Tang, Yi Li, Zhuangzhuang Chen, Xiaomeng Li.
"Leveraging Segment Anything Model for Source-Free Domain Adaptation via Dual Feature Guided Auto-Prompting." ArXiv (2025). [paper] [code] [2025.05]
SLAG: Laszlo Szilagyi, Francis Engelmann, Jeannette Bohg.
"SLAG: Scalable Language-Augmented Gaussian Splatting." ArXiv (2025). [paper] [code] [2025.05]
CPC-SAM: Jingyao Wang, Jianqi Zhang, Wenwen Qiang, Changwen Zheng.
"Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation." ArXiv (2025). [paper] [code] [2025.05]
MarkMatch: Fei Zhao, Runlin Zhang, Chengcui Zhang, Nitesh Saxena.
"MarkMatch: Same-Hand Stuffing Detection." ArXiv (2025). [paper] [2025.05]
MAIS: Mauricio Orbes-Arteaga, Oeslle Lucena, Sabastien Ourselin, M. Jorge Cardoso.
"MAIS: Memory-Attention for Interactive Segmentation." MIDL (2025). [paper] [2025.05]
ABS-Mamba: Feng Yuan, Yifan Gao, Wenbin Wu, Keqing Wu, Xiaotong Guo, Jie Jiang, Xin Gao.
"ABS-Mamba: SAM2-Driven Bidirectional Spiral Mamba Network for Medical Image Translation." ArXiv (2025). [paper] [code] [2025.05]
SAMSR: Zihang Liu, Zhenyu Zhang, Hao Tang.
"Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution." ArXiv (2025). [paper] [code] [2025.05]
BrainSegDMlF: Hongming Wang, Yifeng Wu, Huimin Huang, Hongtao Wu, Jia-Xuan Jiang, Xiaodong Zhang, Hao Zheng, Xian Wu, Yefeng Zheng, Jinping Xu, Jing Cheng.
"BrainSegDMlF: A Dynamic Fusion-enhanced SAM for Brain Lesion Segmentation." ArXiv (2025). [paper] [2025.05]
SLCA: Pengfei Gu, Haoteng Tang, Islam A. Ebeid, Jose A. Nunez, Fabian Vazquez, Diego Adame, Marcus Zhan, Huimin Li, Bin Fu, Danny Z. Chen.
"Adapting a Segmentation Foundation Model for Medical Image Classification." ArXiv (2025). [paper] [2025.05]
SAMSelect: Joost van Dalen, Yuki M. Asano, Marc Rußwurm.
"SAMSELECT: A SPECTRAL INDEX SEARCH FOR MARINE DEBRIS VISUALIZATION USING SAM." ICLR Workshop (2025). [paper] [2025.05]
TP-SA3M: Li, T., Jiang, Z., Jin, Y. et al.
"TP-SA3M: text prompts-assisted SAM for myopic maculopathy segmentation." Vis Comput (2025). [paper] [2025.05]
Wu, Z., Yang, JY., Yan, CB. et al.
"Integrating SAM priors with U-Net for enhanced multiclass cell detection in digital pathology." Sci Rep (2025). [paper] [2025.05]
SAMSAR: Mahdi Rahimi, Saeed Sharifian.
"SAMSAR: A modified SAM architecture for oceanic ship segmentation of satellite SAR images using CNN-based Cross-Fused Attention." ESWA (2025). [paper] [2025.05]
SAMSnake: Yejun Wu and Jiao Zhan and Chi Guo and Huyin Zhang.
"SAMSnake: A generic contour-based instance segmentation network assistedby Efficient Segment Anything Model." Neural Networks (2025). [paper] [code] [2025.05]
Tao, Kunjian, He Li, Chong Huang, Qingsheng Liu, Junyan Zhang, and Ruoqi Du.
"Extraction of Cropland Based on Multi-Source Remote Sensing and an Improved Version of the Deep Learning-Based Segment Anything Model (SAM)." Agronomy (2025). [paper] [2025.05]
UncertainSAM: Timo Kaiser, Thomas Norrenbrock, Bodo Rosenhahn.
"UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model." ICML (2025). [paper] [2025.05]
Mix-QSAM: Navin Ranjan, Andreas Savakis.
"Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model." ArXiv (2025). [paper] [2025.05]
CAPNet: Baoshun Shi, Zheng Liu, Xin Meng, Yan Yang.
"Cross-organ all-in-one parallel compressed sensing magnetic resonance imaging." ArXiv (2025). [paper] [code] [2025.05]
MAISY: Andrew Zhang, Hao Wang, Shuchang Ye, Michael Fulham, Jinman Kim.
"MAISY: Motion-Aware Image SYnthesis for Medical Image Motion Correction." ArXiv (2025). [paper] [2025.05]
CaRaFFusion: Huawei Sun, Bora Kunter Sahin, Georg Stettinger, Maximilian Bernhard, Matthias Schubert, Robert Wille.
"CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting." RA-L(2025). [paper] [2025.05]
Siming He, Zachary Osman, Fernando Cladera, Dexter Ong, Nitant Rai, Patrick Corey Green, Vijay Kumar, Pratik Chaudhari.
"Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera." ICRA Workshop (2025). [paper] [2025.05]
SARTM: Dong Xing, Xianxun Zhu, Wei Zhou, Qika Lin, Hang Yang, Yuqing Wang.
"Segment Any RGB-Thermal Model with Language-aided Distillation." ArXiv (2025). [paper] [2025.05]
SLM-SAM 2: Yuwen Chen, Zafer Yildiz, Qihang Li, Yaqian Chen, Haoyu Dong, Hanxue Gu, Nicholas Konz, Maciej A. Mazurowski.
"Accelerating Volumetric Medical Image Annotation via Short-Long Memory SAM 2." TMI (2025). [paper] [2025.05]
SAM-TIFF: Michael Marinaccio, Fatemeh Afghah.
"Seeing Heat with Color - RGB-Only Wildfire Temperature Inference from SAM-Guided Multimodal Distillation using Radiometric Ground Truth." ArXiv (2025). [paper] [2025.05]
ZS-VCOS: Wenqi Guo, Shan Du.
"ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation." ArXiv (2025). [paper] [code] [2025.05]
Malte Mosbach, Sven Behnke.
"Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning." ArXiv (2025). [paper] [code] [2025.05]
MoSAM: Qiushi Yang, Yuan Yao, Miaomiao Cui, Liefeng Bo.
"MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection." ArXiv (2025). [paper] [2025.05]
Yamagishi Y, Hanaoka S, Kikuchi T, Nakao T, Nakamura Y, Nomura Y, Miki S, Yoshikawa T, Abe O.
"Using Segment Anything Model 2 for Zero-Shot 3D Segmentation of Abdominal Organs in Computed Tomography Scans to Adapt Video Tracking Capabilities for 3D Medical Imaging: Algorithm Development and Validation." JMIR AI (2025). [paper] [2025.05]
Chen, Jinlong, Fuqiang Jin, Yingjie Jiao, Yongsong Zhan, and Xingguo Qin.
"Improving Dynamic Gesture Recognition with Attention-Enhanced LSTM and Grounding SAM." Electronics (2025). [paper] [2025.05]
Angelo Moroncelli and Sylvain Populus and Armand Rossi and Emanuele Carpanzano and Loris Roveda.
"Vision-based robotic disassembly of aircraft engines with YOLO-SAM: a novel method for task orientation estimation." CIRP Annals (2025). [paper] [2025.05]
ICA-SAMv7: Xiaotian Yan and Yuting Guo and Ziyi Pei and Xinyu Zhang and Jinghao Li and Zitao Zhou and Lifang Liang and Shuai Li and Peng Lun and Aimin Hao.
"ICA-SAMv7: Internal carotid artery segmentation with coarse to fine network." Computerized Medical Imaging and Graphics (2025). [paper] [code] [2025.05]
Muturi, T. W., & Adu-Gyamfi, Y.
"Enhanced Crack Segmentation Using Meta’s Segment Anything Model with Low-Cost Ground Truths and Multimodal Prompts." ArXiv (2025). [paper] [2025.05]
UN-SAM: Zhen Chen and Qing Xu and Xinyu Liu and Yixuan Yuan.
"UN-SAM: Domain-adaptive self-prompt segmentation for universal nuclei images." Medical Image Analysis (2025). [paper] [code] [2025.05]
Milman, Oded, Dovi Yellin, and Yehudit Aperstein.
"Adapting SAM for Visible-Light Pupil Segmentation Baseline." Electronics (2025). [paper] [2025.05]
SV-Unet: Wei Wang and Chong Yu and Tengyu Zhang and Feiyu Chen and Yufan Liu and Zongze Wu.
"Oversized ore segmentation using SAM-enhanced U-Net with self-supervised pre-training and semi-supervised self-training." Expert Systems with Applications (2025). [paper] [2025.05]
SAM-Brain3D: Zhongying Deng, Haoyu Wang, Ziyan Huang, Lipei Zhang, Angelica I. Aviles-Rivero, Chaoyu Liu, Junjun He, Zoe Kourtzi, Carola-Bibiane Schönlieb.
"Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis." ArXiv (2025). [paper] [2025.05]
Shuang Zhang, Carleton Coffin, Karyn L. Rogers, Catherine Ann Royer, Ge Wang.
"AI-Driven High-Resolution Cell Segmentation and Quantitative Analysis." ArXiv (2025). [paper] [2025.05]
IDRA-H: Marc Glocker, Peter Hönig, Matthias Hirschmanner, Markus Vincze.
"LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics." ArXiv (2025). [paper] [code] [2025.04]
SAM4EM: Uzair Shah, Marco Agus, Daniya Boges, Vanessa Chiappini, Mahmood Alzubaidi, Jens Schneider, Markus Hadwiger, Pierre J. Magistretti, Mowafa Househ, Corrado Calı.
"SAM4EM:Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks." CVPRW (2025). [paper] [code] [2025.04]
UniBiomed: Linshan Wu, Yuxiang Nie, Sunan He, Jiaxin Zhuang, Hao Chen.
"UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation." ArXiv (2025). [paper] [2025.04]
RadSAM: Julien Khlaut, Elodie Ferreres, Daniel Tordjman, Hélène Philippe, Tom Boeken, Pierre Manceron, Corentin Dancette.
"RadSAM: Segmenting 3D radiological images with a 2D promptable model." ArXiv (2025). [paper] [2025.04]
RRL-MedSAM: Jia Wang, Yunan Mei, Jiarui Liu, and Xin Fan.
"SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation." ArXiv (2025). [paper] [2025.04]
Follow Everything: Qianyi Zhang, Shijian Ma, Boyi Liu, Jingtai Liu, Jianhao Jiao, Dimitrios Kanoulas.
"Follow Everything: A Leader-Following and Obstacle Avoidance Framework with Goal-Aware Adaptation." ArXiv (2025). [paper] [code] [2025.04]
Res-SAM: Xiren Zhou, Shikang Liu, Xinyu Yan, Yizhan Fan, Xiangyu Wang, Yu Kang, Jian Cheng, Huanhuan Chen.
"Reservoir-enhanced Segment Anything Model for Subsurface Diagnosis." ArXiv (2025). [paper] [2025.04]
RSFR: Jiahao Huang, Fanwen Wang, Pedro F. Ferreira, Haosen Zhang, Yinzhe Wu, Zhifan Gao, Lei Zhu, Angelica I. Aviles-Rivero, Carola-Bibiane Schonlieb, Andrew D. Scott, Zohya Khalique, Maria Dwornik, Ramyah Rajakulasingam, Ranil De Silva, Dudley J. Pennell, Guang Yang, Sonia Nielles-Vallespin.
"RSFR: A Coarse-to-Fine Reconstruction Framework for Diffusion Tensor Cardiac MRI with Semantic-Aware Refinement." Medical Image Analysis (2025). [paper] [2025.04]
Tooth-ASAM: Peijuan Wang, Hanjie Gu & Yuliang Sun.
"Tooth segmentation on multimodal images using adapted segment anything model." Scientific Reports (2025). [paper] [2025.04]
Zhang, N., Ling, H., Zhang, W. et al.
"A prediction method for radiation proctitis based on SAM-Med2D model." Scientific Reports (2025). [paper] [2025.04]
ToothSC-SAM: Li, C., Cheng Wang and Zhanchuan Cai.
"ToothSC-SAM: A Novel Network Model Based on Skip-Connections and SAM for Tooth Segmentation in CBCT Images." ArXiv (2025). [paper] [2025.04]
FLSSAM: Li, Jiayuan and Wang, Zhen and Xu, Nan and You, Zhuhong.
"Fine-Tuning SAM for Forward-Looking Sonar with Collaborative Prompts and Embedding." LGRS (2025). [paper] [code] [2025.04]
IIIM-SAM: Zhang, Zhe and Zhou, Yuhang and Yue, Jiahe and Zhang, Runchu and Ma, Jie.
"IIIM-SAM: Zero-shot Texture Anomaly Detection Without External Prompts." IEEE TASE (2025). [paper] [2025.04]
MIT-SAM: Zhou, Xichuan and Yan, Lingfeng and Ding, Rui and Atabansi, Chukwuemeka Clinton and Nie, Jing and Chen, Lihui and Feng, Yujie and Liu, Haijun.
"MIT-SAM: Medical Image-Text SAM with Mutually Enhanced Heterogeneous Features Fusion for Medical Image Segmentation." IEEE JBHI (2025). [paper] [code] [2025.04]
PTSAM: Tristan Piater, Björn Barz, Alexander Freytag.
"Prompt-Tuning SAM: From Generalist to Specialist with only 2,048 Parameters and 16 Training Images." ArXiv (2025). [paper] [2025.04]
Boyue Xu, Ruichao Hou, Tongwei Ren, Gangshan Wu.
"RGB-DVideo Object Segmentation via Enhanced Multi-store Feature Memory." ArXiv (2025). [paper] [2025.04]
AffordanceSAM: Dengyang Jiang, Mengmeng Wang, Teli Ma, Hengzhuang Li, Yong liu, Guang Dai, Lei Zhang.
"AffordanceSAM: Segment Anything Once More in Affordance Grounding." ArXiv (2025). [paper] [2025.04]
FS-DINO: Wei Zhuo, Zhiyue Tang, Wufeng Xue, Hao Ding, Linlin Shen.
"DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining." ArXiv (2025). [paper] [2025.04]
LSR-ST: Guoyi Zhang, Siyang Chen, Guangsheng Xu, Han Wang, Xiaohu Zhang.
"Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation." ArXiv (2025). [paper] [2025.04]
EmoSEM: Jing Zhang, Dan Guo, Zhangbin Li, Meng Wang.
"EmoSEM: Segment and Explain Emotion Stimuli in Visual Art." ArXiv (2025). [paper] [2025.04]
SAC: Ghodsiyeh Rostami, Po-Han Chen, Mahdi S. Hosseini.
"Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection." ArXiv (2025). [paper] [2025.04]
HSACNet: Qi'ao Xu, Pengfei Wang, Yanjun Li, Tianwen Qian, Xiaoling Wang.
"HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection." ICME (2025). [paper] [2025.04]
Oliver Mills, Philip Conaghan, Nishant Ravikumar, Samuel Relton.
"Putting the Segment Anything Model to the Test with 3D Knee MRI -- A Comparison with State-of-the-Art Performance." BMVC (2024). [paper] [code] [2025.04]
ProtoSAM-2D: Yiqing Shen, David Dreizin, Blanca Inigo, Mathias Unberath.
"ProtoSAM-2D: 2D semantic Segment Anything Model with mask-level prototype-learning and distillation." ArXiv (2025). [paper] [2025.04]
GlomSAM: Pan, S., Tang, X., Chen, B., Lai, X., & Jin, W.
"GlomSAM: Hybrid customized SAM for multi-glomerular detection and segmentation in immunofluorescence images." PLOS ONE (2025). [paper] [2025.04]
STSAMNet: Yang, M., Yang, R., Wang, M., Xu, H., & Xu, G.
"Integrating unsupervised domain adaptation and SAM technologies for image semantic segmentation: a case study on building extraction from high-resolution remote sensing images." International Journal of Digital Earth (2025). [paper] [2025.04]
SAMBV: Yuhan Wang and Shoujun Zhou and Ke Lu and Yuanquan Wang and Lei Zhang and Weipeng Liu and Zhida Wang.
"SAMBV: A Fine-tuned SAM with Interpolation Consistency Regularization for Semi-supervised Bi-ventricle Segmentation from Cardiac MRI." Medical Engineering & Physics (2025). [paper] [2025.04]
Gutiérrez, Juan D., Emilio Delgado, Carlos Breuer, José M. Conejero, and Roberto Rodriguez-Echeverria.
"Prompt Once, Segment Everything: Leveraging SAM 2 Potential for Infinite Medical Image Segmentation with a Single Prompt." Algorithms(2025). [paper] [2025.04]
DepthForge: Siyu Chen, Ting Han, Changshe Zhang, Xin Luo, Meiliu Wu, Guorong Cai, Jinhe Su.
"Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation." ArXiv (2025). [paper] [code] [2025.04]
SAM-ESP: Xinyu Zhao, Jun Liu, Faqiang Wang, Li Cui, Yuping Duan.
"Contour Field based Elliptical Shape Prior for the Segment Anything Model." ArXiv (2025). [paper] [2025.04]
FAEWNet: Yun-Cheng Li, Sen Lei, Yi-Tao Zhao, Heng-Chao Li, Jun Li, Antonio Plaza.
"SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping." ArXiv (2025). [paper] [code] [2025.04]
DC-SAM: Mengshi Qi, Pengfei Zhu, Xiangtai Li, Xiaoyang Bi, Lu Qi, Huadong Ma, Ming-Hsuan Yang.
"DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency." TPAMI (2026). [paper] [code] [2025.04]
CAGS: Wei Sun, Yanzhao Zhou, Jianbin Jiao, Yuan Li.
"CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting." ArXiv (2025). [paper] [2025.04]
PathSeqSAM: Mingyang Zhu, Yinting Liu, Mingyu Li, Jiacheng Wang.
"PathSeqSAM: Sequential Modeling for Pathology Image Segmentation with SAM2." ArXiv (2025). [paper] [code] [2025.04]
Robust SAM: Long, J., Xu, Z., Jiang, T., Yao, W., Jia, S., Ma, C., & Chen, X.
"Robust SAM: On the Adversarial Robustness of Vision Foundation Models." AAAI (2025). [paper] [2025.04]
Jiahuan Long, Tingsong Jiang, Wen Yao, Yizhe Xiong, Zhengqin Xu, Shuai Jia, Chao Ma.
"Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models." ArXiv (2025). [paper] [2025.04]
AerOSeg: Saikat Dutta, Akhil Vasim, Siddhant Gole, Hamid Rezatofighi, Biplab Banerjee.
"AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images." CVPRW (2025). [paper] [2025.04]
MoSE: Jia Wei, Xiaoqi Zhao, Jonghye Woo, Jinsong Ouyang, Georges El Fakhri, Qingyu Chen, Xiaofeng Liu.
"Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation." CVPRW (2025). [paper] [2025.04]
ToolTipNet: Zijian Wu, Shuojue Yang, Yueming Jin, Septimiu E Salcudean.
"ToolTipNet: A Segmentation-Driven Deep Learning Baseline for Surgical Instrument Tip Detection." ArXiv (2025). [paper] [2025.04]
ATOMIC: Jingyun Yang, Ruoyan Avery Yin, Chi Jiang, Yuepeng Hu, Xiaokai Zhu, Xingjian Hu, Sutharsika Kumar, Xiao Wang, Xiaohua Zhai, Keran Rong, Yunyue Zhu, Tianyi Zhang, Zongyou Yin, Jing Kong, Neil Zhenqiang Gong, Zhichu Ren, Haozhe Wang.
"Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials." ArXiv (2025). [paper] [2025.04]
Yiwen Wang, Ying Liang, Yuxuan Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song.
"Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution." ArXiv (2025). [paper] [code] [2025.04]
STSeg: Kehuan Song, Xinglin Xie, Kexin Zhang, Licheng Jiao, Lingling Li, Shuyuan Yang.
"STSeg- Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge." ArXiv (2025). [paper] [2025.04]
MASSeg: Xuqiang Cao, Linnan Zhao, Jiaxuan Zhao, Fang Liu, Puhua Chen, Wenping Ma.
"MASSeg: 2nd Technical Report for 4th PVUW MOSE Track." ArXiv (2025). [paper] [code] [2025.04]
FVOS: Mengjiao Wang, Junpei Zhang, Xu Liu, Yuting Yang, Mengru Ma.
"FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution." ArXiv (2025). [paper] [2025.04]
SynthFM: Sourya Sengupta, Satrajit Chakrabarty, Keerthi Sravan Ravi, Gopal Avinash, Ravi Soni.
"SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data." ArXiv (2025). [paper] [2025.04]
FMLGS: Xin Tan, Yuzhou Ji, He Zhu, and Yuan Xie.
"FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents." ArXiv (2025). [paper] [2025.04]
ChildlikeSHAPES: Astitva Srivastava, Harrison Jesse Smith, Thu Nguyen-Phuoc, Yuting Ye.
"ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings." ArXiv (2025). [paper] [2025.04]
FindAnything: Sebastián Barbas Laina, Simon Boche, Sotiris Papatheodorou, Simon Schaefer, Jaehyung Jung, Stefan Leutenegger.
"FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment." ArXiv (2025). [paper] [2025.04]
WS-SAM: Zhang, Z., Ma, F., Liu, H. et al.
"WS-SAM: self-prompting SAM with wavelet and spatial domain for OCTA retinal vessel segmentation." The Journal of Supercomputing (2025). [paper] [2025.04]
Anbalagan, N., Balraj, N., Jayasingh, N., & Thangavel, P.
"Brain tumor detection using segment anything model." AIP Conference Proceedings(2025). [paper] [2025.04]
Amirkhosro Kazemi, Aryan Ghazipour, Tyler Settle, Marcus F. Stoddard, and Amir A.
"Semi-automated segmentation of magnitude images in 4D flow MR scans using segment anything model 2 (SAM 2)." Medical Imaging 2025: Clinical and Biomedical Imaging(2025). [paper] [2025.04]
MIRSAM: Zhang, M., Xu, Q., Wang, Y. et al.
"MIRSAM: multimodal vision-language segment anything model for infrared small target detection." Visual Intelligence.(2025). [paper] [2025.04]
DPSAM: Peng Liu, Jinhong Deng, Lixin Duan, Wen Li, and Fengmao Lv.
"Segmenting Anything in the Dark via Depth Perception." TMM (2025). [paper] [code] [2025.04]
Jinlong Huang, Xiao Sun, and Lisheng Wang.
"Enhancing Foundation Model Robustness for Multi-center Real-World Medical Image Analysis." ArXiv (2025). [paper] [2025.04]
UV-AdaptFormer: Feng, W., Guan, F., Tu, J., & Xu, W.
"UV-AdaptFormer: adapting the segment anything model for urban village identification from high-resolution satellite imagery." Remote Sensing Letters (2025). [paper] [2025.04]
LORA-MedSAM: Hu, Jiamin, Xu, Xuwei, and Zou, Zhenmin.
"LORA-MedSAM: Efficient Medical Image Segmentation." Lecture Notes in Electrical Engineering(2025). [paper] [2025.04]
BuildWin-SAM: Li, Zhengnan and Yan, Yizhen and Huang, Bo.
"BuildWin-SAM: An Improved SAM-Based Method for Extracting Building Windows From Street View Images." IEEE Access (2025). [paper] [code] [2025.04]
Li, Hao, Jianxi Yang, Shixin Jiang, and Xiaoxia Yang.
"SAM-Guided Concrete Bridge Damage Segmentation with Mamba–ResNet Hierarchical Fusion Network." Electronics (2025). [paper] [2025.04]
Takashi Nagaoka.
"Accurate Segmentation of Pigmented Skin Lesions Using Grounded-segment-anything." J Med Imaging Case Rep (2025). [paper] [2025.04]
SAMJAM: Joshua Li, Fernando Jose Pena Cantu, Emily Yu, Alexander Wong, Yuchen Cui, Yuhao Chen.
"SAMJAM:Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos." ArXiv (2025). [paper] [2025.04]
RP-SAM2: Nuren Zhaksylyk, Ibrahim Almakky, Jay Paranjape, S. Swaroop Vedula, Shameema Sikder, Vishal M. Patel, Mohammad Yaqub.
"RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation." ArXiv (2025). [paper] [code] [2025.04]
MovSAM: Chang Nie, Yiqing Xu, Guangming Wang, Zhe Liu, Yanzi Miao, Hesheng Wang.
"MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking." ArXiv (2025). [paper] [code] [2025.04]
Marco Acerbis, Nataša Sladoje, Joakim Lindblad.
"A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology." SCIA (2025). [paper] [2025.04]
Wheat3DGS: Daiwei Zhang, Joaquin Gajardo, Tomislav Medic, Isinsu Katircioglu, Mike Boss, Norbert Kirchgessner, Achim Walter, Lukas Roth.
"Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting." CVPRW (2025). [paper] [code] [2025.04]
econSG: Can Zhang, Gim Hee Lee.
"econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians." ICLR (2025). [paper] [code] [2025.04]
CAT-V: Yunlong Tang, Jing Bi, Chao Huang, Susan Liang, Daiki Shimada, Hang Hua, Yunzhong Xiao, Yizhi Song, Pinxin Liu, Mingqian Feng, Junjia Guo, Zhuo Liu, Luchuan Song, Ali Vosoughi, Jinxi He, Liu He, Zeliang Zhang, Jiebo Luo, Chenliang Xu.
"Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting." ArXiv (2025). [paper] [code] [2025.04]
KAN-SAM: Xingyuan Li, Ruichao Hou, Tongwei Ren, Gangshan Wu.
"KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection." ICME (2025). [paper] [2025.04]
HRMedSeg: Qing Xu, Zhenye Lou, Chenxin Li, Xiangjian He, Rong Qu, Tesema Fiseha Berhanu, Yi Wang, Wenting Duan, Zhen Chen.
"HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling." ArXiv (2025). [paper] [code] [2025.04]
SAM2MOT: Junjie Jiang, Zelin Wang, Manqi Zhao, Yin Li, DongSheng Jiang.
"SAM2MOT:ANovelParadigm of Multi-Object Tracking by Segmentation." ArXiv (2025). [paper] [2025.04]
S4M: Heeji Yoon, Heeseong Shin, Eunbeen Hong, Hyunwook Choi, Hansang Cho, Daun Jeong, Seungryong Kim.
"S^4M: Boosting Semi-Supervised Instance Segmentation with SAM." ArXiv (2025). [paper] [code] [2025.04]
Hamza Riaz, Alan F. Smeaton.
"Resilience of Vision Transformers for Domain Generalisation in the Presence of Out-of-Distribution Noisy Images." ArXiv (2025). [paper] [2025.04]
UCS: Dianshuo Li, Li Chen, Yunxiang Cao, Kai Zhu, Jun Cheng.
"UCS: A Universal Model for Curvilinear Structure Segmentation." ArXiv (2025). [paper] [2025.04]
Mengyuan Liu, Yixiao Chen, Anning Tian, Xinmeng Wu, Mozhi Shen, Tianchou Gong, Jeongkyu Lee.
"Performance Analysis of Deep Learning Models for Femur Segmentation in MRI Scan." ArXiv (2025). [paper] [2025.04]
CMaP-SAM: Shuai Chen, Fanman Meng, Haoran Wei, Chenhao Wu, Qingbo Wu, Linfeng Xu, Hongliang Li.
"CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation." ArXiv (2025). [paper] [2025.04]
MedSAM2: Jun Ma, Zongxin Yang, Sumin Kim, Bihui Chen, Mohammed Baharoon, Adibvafa Fallahpour, Reza Asakereh, Hongwei Lyu, Bo Wang.
"MedSAM2: Segment Anything in 3D Medical Images and Videos." ArXiv (2025). [paper] [code] [2025.04]
GraphSeg: Haozhan Tang, Tianyi Zhang, Oliver Kroemer, Matthew Johnson-Roberson, Weiming Zhi.
"GraphSeg: Segmented 3D Representations via Graph Edge Addition and Contraction." ArXiv (2025). [paper] [code] [2025.04]
Delineate-Anything: Mykola Lavreniuk, Nataliia Kussul, Andrii Shelestov, Bohdan Yailymov, Yevhenii Salii, Volodymyr Kuzin, Zoltan Szantoi.
"Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery." ArXiv (2025). [paper] [code] [2025.04]
APSeg: Liying Xu, Hongliang He, Wei Han, Hanbin Huang, Siwei Feng, Guohong Fu.
"APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification." ArXiv (2025). [paper] [2025.04]
F-ViTA: Jay N. Paranjape, Celso de Melo, Vishal M. Patel.
"F-ViTA: Foundation Model Guided Visible to Thermal Translation." ArXiv (2025). [paper] [code] [2025.04]
MVP-Lab: Hao Fang, Runmin Cong, Xiankai Lu, Zhiyang Chen, Wei Zhang.
"The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation." ArXiv (2025). [paper] [2025.04]
Sa2VA: Haobo Yuan, Tao Zhang, Xiangtai Li, Lu Qi, Zilong Huang, Shilin Xu, Jiashi Feng, Ming-Hsuan Yang.
"4th PVUW MeViS 3rd Place Report: Sa2VA." ArXiv (2025). [paper] [code] [2025.04]
BiSeg-SAM: Encheng Su, Hu Cao, Alois Knoll.
"BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models." BIBM (2024). [paper] [code] [2025.04]
DBF-UNet: Haoxuan Li, Wei Song, Aofan Liu, Peiwu Qin.
"DBF-UNet: A Two-Stage Framework for Carotid Artery Segmentation with Pseudo-Label Generation." ArXiv (2025). [paper] [code] [2025.04]
SAL-4D: Yushan Zhang, Aljoša Ošep, Laura Leal-Taixé, Tim Meinhardt.
"Zero-Shot 4D Lidar Panoptic Segmentation." ArXiv (2025). [paper] [2025.04]
CamoSAM2: Xin Zhang, Keren Fu, Qijun Zhao.
"CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection." ArXiv (2025). [paper] [2025.04]
HybridGL: Ting Liu, Siyuan Li.
"Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation." ArXiv (2025). [paper] [code] [2025.04]
SmartScan: Savinay Nagendra, Kashif Rashid.
"SmartScan: An AI-based Interactive Framework for Automated Region Extraction from Satellite Images." ArXiv (2025). [paper] [2025.04]
ReferDINO-Plus: Tianming Liang, Haichao Jiang, Wei-Shi Zheng, Jian-Fang Hu.
"ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025." ArXiv (2025). [paper] [code] [2025.03]
DSU-Net: Yimin Xu, Fan Yang, Bin Xu.
"DSU-Net: An Improved U-Net Model Based on DINOv2 and SAM2 with Multi-scale Cross-model Feature Enhancement." ArXiv (2025). [paper] [code] [2025.3]
SALT: Yanbo Wang, Yongtao Chen, Chuan Cao, Tianchen Deng, Wentao Zhao, Jingchuan Wang, Weidong Chen.
"SALT: A Flexible Semi-Automatic Labeling Tool for General LiDAR Point Clouds with Cross-Scene Adaptability and 4D Consistency." ArXiv (2025). [paper] [code] [2025.03]
MedCL: Ke Zhang, Vishal M. Patel.
"MedCL: Learning Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation." ArXiv (2025). [paper] [code] [2025.03]
ReasonGrounder: Zhenyang Liu, Yikai Wang, Sixiao Zheng, Tongying Pan, Longfei Liang, Yanwei Fu, Xiangyang Xue.
"ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning." ArXiv (2025). [paper] [code] [2025.03]
Uxue Delaquintana-Aramendi, Leire Benito-del-Valle, Aitor Alvarez-Gila, Javier Pascau, Luisa F Sánchez-Peralta, Artzai Picón, J Blas Pagador, Cristina L Saratxaga.
"AI-Assisted Colonoscopy: Polyp Detection and Segmentation using Foundation Models." ArXiv (2025). [paper] [2025.03]
IMPACT: Valentin Boussot, Cédric Hémon, Jean-Claude Nunes, Jason Downling, Simon Rouzé, Caroline Lafond, Anaïs Barateau, Jean-Louis Dillenseger.
"IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration." ArXiv (2025). [paper] [2025.03]
MGD-SAM2: Haoran Shen, Peixian Zhuang, Jiahao Kou, Yuxin Zeng, Haoying Xu, Jiangyun Li.
"MGD-SAM2: Multi-view Guided Detail-enhanced Segment Anything Model 2 for High-Resolution Class-agnostic Segmentation." ArXiv (2025). [paper] [code] [2025.03]
Motion-Seg: Nan Huang, Wenzhao Zheng, Chenfeng Xu, Kurt Keutzer, Shanghang Zhang, Angjoo Kanazawa, Qianqian Wang.
"Segment Any Motion in Videos." CVPR (2025). [paper] [code] [2025.03]
SCHNet: Kunliang Liu, Jianming Wang, Rize Jin, Wonjun Hwang, Tae-Sun Chung.
"SCHNet: SAM Marries CLIP for Human Parsing." ArXiv (2025). [paper] [2025.03]
BlooDet: Jialun Pei, Zhangjun Zhou, Diandian Guo, Zhixi Li, Jing Qin, Bo Du, Pheng-Ann Heng.
"Synergistic Bleeding Region and Point Detection in Surgical Videos." ArXiv (2025). [paper] [2025.03]
Erosion-SAM: Hadi Shokati and Andreas Engelhardt and Kay Seufferheld and Ruhollah Taghizadeh-Mehrjardi and Peter Fiener and Hendrik P.A. Lensch and Thomas Scholten.
"Erosion-SAM: Semantic segmentation of soil erosion by water." Catena(2025). [paper] [2025.03]
SD-YOLOv8: Zhang, Xintong, Dasheng Wu, and Fengya Xu.
"SD-YOLOv8: SAM-Assisted Dual-Branch YOLOv8 Model for Tea Bud Detection on Optical Images." Agriculture(2025). [paper] [2025.03]
WS-SAM: Zhang, Z., Ma, F., Liu, H. et al.
"WS-SAM: self-prompting SAM with wavelet and spatial domain for OCTA retinal vessel segmentation." The Journal of Supercomputing (2025). [paper] [2025.03]
DGSUnet: Yimin Xu.
"DGSUnet: An Improved Unet Model with DINO-Guided SAM2 for Multi-Scale Feature Collaboration." ArXiv (2025). [paper] [code] [2025.03]
seconGS: Hairong Yin, Huangying Zhan, Yi Xu, Raymond A. Yeh.
"Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying." ArXiv (2025). [paper] [code] [2025.03]
AMA-SAM: Jiahe Qian, Yaoyu Fang, Jinkui Hao, Bo Zhou.
"AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation." ArXiv (2025). [paper] [2025.03]
Feature4X: Shijie Zhou, Hui Ren, Yijia Weng, Shuwang Zhang, Zhen Wang, Dejia Xu, Zhiwen Fan, Suya You, Zhangyang Wang, Leonidas Guibas, Achuta Kadambi.
"Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields." CVPR (2025). [paper] [code] [2025.03]
CABL: Xinghao Wang, Changtao Miao, Dianmo Sheng, Tao Gong, Qi Chu, Bin Liu, Nenghai Yu.
"Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement." ArXiv (2025). [paper] [2025.03]
DSMPrompter: Mélisande Teng, Arthur Ouaknine, Etienne Laliberté, Yoshua Bengio, David Rolnick, Hugo Larochelle.
"Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery." ICLR ML4RS workshop (2025). [paper] [2025.03]
Optimized MedSAM: Boyi Li, Ye Yuan, Wenjun Tan.
"Optimization of MedSAM model based on bounding box adaptive perturbation algorithm." ArXiv (2025). [paper] [2025.03]
BiPrompt-SAM: Suzhe Xu, Jialin Peng, Chengyuan Zhang.
"BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts." ArXiv (2025). [paper] [2025.03]
CamSAM2: Yuli Zhou, Guolei Sun, Yawei Li, Yuqian Fu, Luca Benini, Ender Konukoglu.
"CamSAM2: Segment Anything Accurately in Camouflaged Videos." NeurIPS (2025). [paper] [code] [2025.03]
Siamese-SAM: Wei, Gang and Miao, Yuqi and Wang, Zhicheng.
"Siamese-SAM: Remote Sensing Image Change Detection with Siamese Structure Segment Anything Model." Applied Sciences (2025). [paper] [2025.03]
PanoGS: Hongjia Zhai, Hai Li, Zhenzhe Li, Xiaokun Pan, Yijia He, Guofeng Zhang.
"PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding." CVPR (2025). [paper] [code] [2025.03]
OCRT: Luyao Tang, Yuxuan Yuan, Chaoqi Chen, Zeyu Zhang, Yue Huang, Kun Zhang.
"OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad." CVPR (2025). [paper] [code] [2025.03]
HU-MCD: Arne Grobrügge, Niklas Kühl, Gerhard Satzger, Philipp Spitzer.
"Towards Human-Understandable Multi-Dimensional Concept Discovery." ArXiv (2025). [paper] [code] [2025.03]
PG-SAM: Yiheng Zhong, Zihong Luo, Chengzhi Liu, Feilong Tang, Zelin Peng, Ming Hu, Yingzhen Hu, Jionglong Su, Zongyuan Geand, Imran Razzak.
"PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation." ArXiv (2025). [paper] [code] [2025.03]
MambaSAM: Liang, Pengchen and Shi, Leijun and Pu, Bin and Wu, Renkai and Chen, Jianguo and Zhou, Lixin and Xu, Lite and Chen, Zhuangzhuang and Chang, Qing and Li, Yiwei.
"MambaSAM: A Visual Mamba-Adapted SAM Framework for Medical Image Segmentation." JBHI (2025). [paper] [2025.03]
SFA-Net: Tian Gao and Chaozhen Lan and Wenjun Huang and Sheng Wang.
"SFA-Net: A SAM-guided focused attention network for multimodal remote sensing image matching." ISPRS Journal of Photogrammetry and Remote Sensing (2025). [paper] [code] [2025.03]
DSATNet: Li Y, Huang J, Zhang Y, Deng J, Zhang J, Dong L, Wang D, Mei L, Lei C.
"Dual branch segment anything model-transformer fusion network for accurate breast ultrasound image segmentation." Med Phys (2025). [paper] [code] [2025.03]
M2N2V2: Markus Karmann, Peng-Tao Jiang, Bo Li, Onay Urfalioglu.
"M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation." ArXiv (2025). [paper] [2025.03]
USAM-Net: Joseph Emmanuel DL Dayo, Prospero C. Naval Jr.
"USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network." ArXiv (2025). [paper] [2025.03]
TransCaGNet: Annalena Blänsdorf, Tristan Wirth, Arne Rak, Thomas Pöllabauer, Volker Knauthe, Arjan Kuijper.
"Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning." ArXiv (2025). [paper] [2025.03]
SED-MVS: Zhenlong Yuan, Zhidong Yang, Yujun Cai, Kuangxin Wu, Mufan Liu, Dapeng Zhang, Hao Jiang, Zhaoxin Li, Zhaoqi Wang.
"SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint." ArXiv (2025). [paper] [2025.03]
Yizhou Li, Yusuke Monno, Masatoshi Okutomi, Yuuichi Tanaka, Seiichi Kataoka, Teruaki Kosiba.
"Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis." VISAPP (2025). [[paper]([2503.14219] Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis)] [code] [2025.03]
OMT-SAM: Wenjie Zhang, Ziyang Zhang, Mengnan He, Jiancheng Ye.
"Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering." ArXiv (2025). [paper] [2025.03]
ROS-SAM: Zhe Shan, Yang Liu, Lei Zhou, Cheng Yan, Heng Wang, Xia Xie.
"ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object." CVPR (2025). [paper] [code] [2025.03]
SPC-GS: Guibiao Liao, Qing Li, Zhenyu Bao, Guoping Qiu, Kanglin Liu.
"SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs." CVPR (2025). [paper] [code] [2025.03]
GleSAM: Guangqian Guo, Yoong Guo, Xuehui Yu, Wenbo Li, Yaoxing Wang, Shan Gao.
"Segment Any-Quality Images with Generative Latent Space Enhancement." CVPR (2025). [paper] [2025.03]
MSMV-Swin: Farnoush Bayatmakou, Reza Taleei, Milad Amir Toutounchian, Arash Mohammadi.
"Integrating AI for Human-Centric Breast Cancer Diagnostics: A Multi-Scale and Multi-View Swin Transformer Framework." ArXiv (2025). [paper] [2025.03]
3DAxisPrompt: Dingning Liu, Cheng Wang, Peng Gao, Renrui Zhang, Xinzhu Ma, Yuan Meng, Zhihui Wang.
"3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o." ArXiv (2025). [paper] [2025.03]
CL-Net: Dazhou Guo, Zhanghexuan Ji, Yanzhou Su, Dandan Zheng, Heng Guo, Puyang Wang, Ke Yan, Yirui Wang, Qinji Yu, Zi Li, Minfeng Xu, Jianfeng Zhang, Haoshen Li, Jia Ge, Tsung-Ying Ho, Bing-Shen Huang, Tashan Ai, Kuaile Zhao, Na Shen, Qifeng Wang, Yun Bian, Tingyu Wu, Peng Du, Hua Zhang, Feng-Ming Kong, Alan L. Yuille, Cher Heng Tan, Chunyan Miao, Perry J. Pickhardt, Senxiang Yan, Ronald M. Summers, Le Lu, Dakai Jin, Xianghua Ye.
"AContinual Learning-driven Model for Accurate and Gen eralizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT." ArXiv (2025). [paper] [2025.03]
SAM2-ELNet: Jianhao Yang, Wenshuo Yu, Yuanchao Lv, Jiance Sun, Bokang Sun, Mingyang Liu.
"SAM2-ELNet: Label Enhancement and Automatic Annotation for Remote Sensing Segmentation." ArXiv (2025). [paper] [2025.03]
E-SAM: Weiming Zhang, Dingwen Xiao, Lei Chen, Lin Wang.
"E-SAM: Training-Free Segment Every Entity Model." ArXiv (2025). [paper] [2025.03]
EgoSplat: Di Li, Jie Feng, Jiahao Chen, Weisheng Dong, Guanbin Li, Guangming Shi, Licheng Jiao.
"EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting." ArXiv (2025). [paper] [2025.03]
PPO: Xueyu Liu · Rui Wang · Yexin Lai · Guangze Shi · Feixue Shao · Fang Hao · Jianan Zhang · Jia Shen · Yongfei Wu · Wen Zheng.
"Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater." CVPR (2025). [paper] [code] [2025.03]
EPAF: Li, Dongsheng and Zang, Chunyan and Zhang, Huijie and Lin, Yiming and Xia, Qiushi.
"An Efficient Pore Annotation Framework for Tight Sandstone Images with Segment Anything Model." ICASSP (2025). [paper] [code] [2025.03]
SAM2-SP: Wei, Sheng and Qiu, Song and Zhou, Mei and Zhang, He and Wang, Yan and Li, Qingli.
"Self-Prompting Driven SAM2 for 3D Medical Image Segmentation." ICASSP (2025). [paper] [2025.03]
EP-SAM: Wang, Zhitao and Wen, Jiangtao and Han, Yuxing.
"EP-SAM: An Edge-Detection Prompt SAM Based Efficient Framework for Ultra-Low Light Video Segmentation." ICASSP (2025). [paper] [code] [2025.03]
U-SAM: Jin, Xiaofeng and Hu, Jie and Lin, Jianghang and Zhang, Shengchuan and Cao, Liujuan.
"U-SAM: Upgrade Segment Anything Model With Semantic-Aware and Memory-Efficient." ICASSP (2025). [paper] [2025.03]
VLIMNet: Chen, Zhongyuan and Zhang, Zhan and Zuo, Decheng and Wang, Ning and Fan, Liufeng and Liu, Zhiwei.
"VLIMNet: A Visible Light And Infrared Image Matching Network Based On Segment Anything Model And SuperPoint." ICASSP (2025). [paper] [2025.03]
Ling, Yinzhou and Luo, Jingjing and Han, Yuan and Li, Wenxian and Wang, Hongbo.
"Instance Segmentation of Airway Anatomies Using Mask R-CNN Prompt Adaptation-SAM." ICASSP (2025). [paper] [2025.03]
SeqSAM: Benjamin Towle, Xin Chen, Ke Zhou.
"SeqSAM: Autoregressive Multiple Hypothesis Prediction for Medical Image Segmentation using SAM." ISBI (2025). [paper] [code] [2025.03]
NVP-HRI: Yuzhi Lai, Shenghai Yuan, Youssef Nassar, Mingyu Fan, Thomas Weber, Matthias Rätsch.
"NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model." ESWA(2025). [paper] [code] [2025.03]
nnInteractive: Fabian Isensee, Maximilian Rokuss, Lars Krämer, Stefan Dinkelacker, Ashis Ravindran, Florian Stritzke, Benjamin Hamm, Tassilo Wald, Moritz Langenberg, Constantin Ulrich, Jonathan Deissler, Ralf Floca, Klaus Maier-Hein.
"nnInteractive: Redefining 3D Promptable Segmentation." ArXiv (2025). [paper] [code] [2025.03]
Qipeng Mei, Dimitri Bulatov, Dorota Iwaszczuk.
"Polygonizing roof segments from high-resolution aerial images using YOLOv8-based edge detection." VISAPP(2025). [paper] [2025.03]
Julian Rene Cuellar Buritica, Vu Dinh, Manjula Burri, Julie Roelandts, James Wendling, Jon D. Klingensmith.
"Evaluation of state-of-the-art deep learning models in the segmentation of the heart ventricles in parasternal short-axis echocardiograms." ArXiv (2025). [paper] [2025.03]
VTPSeg: Xing Zi, Kairui Jin, Xian Tao, Jun Li, Ali Braytee, Rajiv Ratn Shah, Mukesh Prasad.
"Visual and Text Prompt Segmentation: A Novel Multi-Model Framework for Remote Sensing." ArXiv (2025). [paper] [2025.03]
SAM-RD: Zhu, Liangshan and Wu, Xing and Wang, Chengliang and Wang, Haidong.
"SAM Adaptation with Refocused Attention and Diverse Prompts for Medical Image Segmentation." ICASSP (2025). [paper] [2025.03]
YOLOE: Ao Wang, Lihao Liu, Hui Chen, Zijia Lin, Jungong Han, Guiguang Ding.
"YOLOE:Real-Time Seeing Anything." ArXiv (2025). [paper] [code] [2025.03]
RS2-SAM 2: Fu Rong, Meng Lan, Qian Zhang, Lefei Zhang.
"RS2-SAM2: Customized SAM2 for Referring Remote Sensing Image Segmentation." AAAI (2026). [paper] [2025.03]
OmniSAM: Ding Zhong, Xu Zheng, Chenfei Liao, Yuanhuiyi Lyu, Jialei Chen, Shengyang Wu, Linfeng Zhang, Xuming Hu.
"OmniSAM:Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation." ArXiv (2025). [paper] [2025.03]
MemorySAM: Chenfei Liao, Xu Zheng, Yuanhuiyi Lyu, Haiwei Xue, Yihong Cao, Jiawen Wang, Kailun Yang, Xuming Hu.
"MemorySAM:MemorizeModalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation." ArXiv (2025). [paper] [2025.03]
SAQ-SAM: Jing Zhang, Zhikai Li, Qingyi Gu.
"SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model." ArXiv (2025). [paper] [2025.03]
SAMEO: Wei-En Tai, Yu-Lin Shih, Cheng Sun, Yu-Chiang Frank Wang, Hwann-Tzong Chen.
"Segment Anything, Even Occluded." CVPR (2025). [paper] [2025.03]
EvoSAM: Zhaori Liu, Mengyang Li, Hu Han, Enli Zhang, Shiguang Shan, Zhiming Zhao.
"Dynamically evolving segment anything model with continuous learning for medical image segmentation." ArXiv (2025). [paper] [2025.03]
PDFNet: Xianjie Liu, Keren Fu, Qijun Zhao.
"Patch-Depth Fusion: Dichotomous Image Segmentation via Fine-Grained Patch Strategy and Depth Integrity-Prior." ArXiv (2025). [paper] [code] [2025.03]
SAM-COD: Jiaming Liu, Linghe Kong, Guihai Chen.
"Improving SAM for Camouflaged Object Detection via Dual Stream Adapters." ICCV (2025). [paper] [2025.03]
OpenVocabCT: Yuheng Li, Yuxiang Lai, Maria Thor, Deborah Marshall, Zachary Buchwald, David S. Yu and Xiaofeng Yang.
"Towards Universal Text-driven CT Image Segmentation." TMI (2025). [paper] [code] [2025.03]
SAS: Danielle L. Ferreira, Ahana Gangopadhyay, Hsi-Ming Chang, Ravi Soni, Gopal Avinash.
"SAS: Segment Anything Small for Ultrasound- A Non-Generative Data Augmentation Technique for Robust Deep Learning in Ultrasound Imaging." ArXiv (2025). [paper] [2025.03]
Melvin Reka, Tessa Pulli, Markus Vincze.
"Multi-Modal 3D Mesh Reconstruction from Images and Text." ArXiv (2025). [paper] [2025.03]
Yuchen Mao, Hongwei Li, Yinyi Lai, Giorgos Papanastasiou, Peng Qi, Yunjie Yang, Chengjia Wang.
"Semi-Supervised Medical Image Segmentation via Knowledge Mining from Large Models." ArXiv (2025). [paper] [code] [2025.03]
LPANet: Wentao Wu, Chenglong Li, Xiao Wang, Bin Luo, Qi Liu.
"Large Language Model Guided Progressive Feature Alignment for Multimodal UAVObject Detection." ArXiv (2025). [paper] [2025.03]
S4M: Adrien Meyer, Lorenzo Arboit, Giuseppe Massimiani, Francesco Brucchi, Luca Emanuele Amodio, Didier Mutter, Nicolas Padoy.
"S4M: Segment Anything with 4 Extreme Points." ArXiv (2025). [paper] [2025.03]
Haiyue Zu, Jun Ge, Heting Xiao, Jile Xie, Zhangzhe Zhou, Yifan Meng, Jiayi Ni, Junjie Niu, Linlin Zhang, Li Ni, Huilin Yang.
"Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching." ArXiv (2025). [paper] [2025.03]
APG-SAM: Danping Yin and Qingqing Zheng and Long Chen and Ying Hu and Qiong Wang.
"APG-SAM: Automatic prompt generation for SAM-based breast lesion segmentation with boundary-aware optimization." Expert Systems with Applications (2025). [paper] [2025.03]
Meta-CD: Gao, Junyu and Zhang, Da and Wang, Feiyu and Ning, Lichen and Zhao, Zhiyuan and Li, Xuelong.
"Combining SAM with Limited Data for Change Detection in Remote Sensing." TGRS (2025). [paper] [code] [2025.03]
SAM-MedUS: Feng Tian, Jintao Zhai, Jinru Gong, Weirui Lei, Shuai Chang, Fangfang Ju, Shengyou Qian, Xiao Zou.
"SAM-MedUS: a foundational model for universal ultrasound image segmentation." Journal of Medical Imaging (2025). [paper] [2025.03]
Yolo-MLSAM: Hongguang Chen, Banteng Liu, Ke Wang.
"Yolo-MLSAM: SAM Based Breast Cancer Microcalcification Cluster-Segmentation Method." JCEIM (2025). [paper] [2025.03]
Ananth Kochuparambil Biju; Vishnu Prasad Vasu; Sreekumar Krishnan.
"Consolidated extraterrestrial planetary crater detection using SAM (segment anything model)." AIP Conf. Proc.(2025). [paper] [2025.03]
Chen Jiang, Tianling Lyu, Gege Ma, Zhan Wu, Xinyun Zhong, Yan Xi, Yang Chen, Wentao Zhu.
"CBCT projection domain metal segmentation for metal artifact reduction using hessian-inspired dual-encoding network with guidance from segment anything model." Medical Physics (2025). [paper] [2025.03]
SegLGAD: Xiao Du and Bing Li and Tongkun Liu and Yi Ding and Liuyi Jin and Zhuo Zhao.
"SegLGAD: Local-to-global industrial anomaly detection with visual segmentation model." Optics & Laser Technology(2025). [paper] [code] [2025.03]
Det-SAM-Ore: Li, Fei and Liu, Xiaoyan and Li, Zongping.
"A Two-Stage Framework with Ore-Detect and Segment Anything Model for Ore Particle Segmentation and Size Measurement." IEEE Sensors Journal (2025). [paper] [2025.03]
MAET-SAM: Shuaiyu Bu, Yuanyuan Li, Guoqiang Liu , Yifan Li.
"MAET-SAM: Magneto-Acousto-Electrical Tomography segmentation network based on the segment anything model." Mathematical Biosciences and Engineering (2025). [paper] [2025.03]
PCCFU: Sulan Zhai and Chengzhuang Liu and Zhengzheng Tu and Chenglong Li and Liuxuanqi Gao.
"Weakly Supervised RGBT Salient Object Detection via SAM-Guided Label Optimization and Progressive Cross-modal Cross-scale Fusion." Information Fusion (2025). [paper] [code] [2025.03]
Aishik Konwer, Zhijian Yang, Erhan Bas, Cao Xiao, Prateek Prasanna, Parminder Bhatia, Taha Kass-Hout.
"Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation." CVPR (2025). [paper] [2025.03]
Steve Andreas Immanuel, Woojin Cho, Junhyuk Heo, Darongsae Kwon.
"Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model." ICLRW (2025). [paper] [code] [2025.03]
SurgiSAM2: Devanish N. Kamtam, Joseph B. Shrager, Satya Deepya Malla, Xiaohan Wang, Nicole Lin, Juan J. Cardona, Serena Yeung-Levy, Clarence Hu.
"SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection." ArXiv (2025). [paper] [2025.03]
GBT-SAM: Cecilia Diana-Albelda, Roberto Alcover-Couso, Álvaro García-Martín, Jesus Bescos, Marcos Escudero-Viñolo.
"GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI." ArXiv (2025). [paper] [code] [2025.03]
WeakMedSAM: Haoran Wang, Lian Huai, Wenbin Li, Lei Qi, Xingqun Jiang, Yinghuan Shi.
"WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining ." ArXiv (2025). [paper] [code] [2025.03]
AHCPTQ: Wenlun Zhang, Shimpei Ando, Kentaro Yoshioka.
"AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model." ArXiv (2025). [paper] [2025.03]
SPD-VFM: Pengchen Liang, Leijun Shi, Huiping Yao, Bin Pu, Jianguo Chen, Lei Zhao, Haishan Huang, Zhuangzhuang Chen, Zhaozhao Xu, Lite Xu, Qing Chang, Yiwei Li.
"Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration." ArXiv (2025). [paper] [2025.03]
SHIFNet: Jiayi Zhao, Fei Teng, Kai Luo, Guoqiang Zhao, Zhiyong Li, Xu Zheng, Kailun Yang.
"Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance." ArXiv (2025). [paper] [code] [2025.03]
ReID-SAM: Kunjun Li, Cheng-Yen Yang, Hsiang-Wei Huang, Jenq-Neng Hwang.
"Technical Report for ReID-SAM on SkiTB Visual Tracking Challenge 2025." ArXiv (2025). [paper] [2025.03]
Clayton Bromley, Alexander Moore, Amar Saini, Doug Poland, Carmen Carrano.
"An Analysis of Segment Anything 2." ArXiv (2025). [paper] [2025.03]
SAGE: Guanyao Wu, Haoyu Liu, Hongming Fu, Yichuan Peng, Jinyuan Liu, Xin Fan, Risheng Liu.
"Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond." CVPR (2025). [paper] [code] [2025.03]
SparseMamba-PCL: Luyi Qiu, Tristan Till, Xiaobao Guo, Adams Wai-Kin Kong.
"SparseMamba-PCL: Scribble-Supervised Medical Image Segmentation via SAM-Guided Progressive Collaborative Learning." ArXiv (2025). [paper] [code] [2025.03]
SemiSAM+: Yichi Zhang, Bohao Lv, Le Xue, Wenbo Zhang, Yuchen Liu, Yu Fu, Yuan Cheng, Yuan Qi.
"SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models." MIA (2025). [paper] [2025.02]
Silius M. Vandeskog, Magne Aldrin, Daniel Howell, Edvin Fuglebakk.
"Adding smoothing splines to the SAM model improves stock assessment." ArXiv (2025). [paper] [2025.02]
Utku Ozbulak, Seyed Amir Mousavi, Francesca Tozzi, Nikdokht Rashidian, Wouter Willaert, Wesley De Neve, Joris Vankerschaver.
"Less is More? Revisiting the Importance of Frame Rate in Real-Time Zero-Shot Surgical Video Segmentation." ArXiv (2025). [paper] [2025.02]
BudSAM: Zhou, Chenxi and Wan, Tianjiao and Xu, Kele and Qiao, Peng and Dou, Yong.
"Segment Anything for Visual Bird Sound Denoising." IEEE SPL (2025). [paper] [code] [2025.02]
LORENZA: Yehonathan Refael, Iftach Arbel, Ofir Lindenbaum, Tom Tirer.
"LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training and Fine-Tuning via Efficient Zeroth-Order Adaptive SAM Optimization." ArXiv (2025). [paper] [2025.02]
HumanCLIP: Keito Suzuki, Bang Du, Girish Krishnan, Kunyao Chen, Runfa Blark Li, Truong Nguyen.
"Open-Vocabulary Semantic Part Segmentation of 3D Human." 3DV (2025). [paper] [2025.02]
CLIP+Grad-CAM+SAM: Muhammad A. Muttaqien, Tomohiro Motoda, Ryo Hanai, Domae Yukiyasu.
"Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation." 2025 IEEE/SICE International Symposium on System Integration (2025). [paper] [2025.02]
VesselSAM: Adnan Iltaf, Rayan Merghani Ahmed, Bin Li, Shoujun Zhou.
"VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention." ArXiv (2025). [paper] [code] [2025.02]
DICEPTION: Canyu Zhao, Mingyu Liu, Huanyi Zheng, Muzhi Zhu, Zhiyue Zhao, Hao Chen, Tong He, Chunhua Shen.
"DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks." ArXiv (2025). [paper] [code] [project] [2025.02]
AV2T-SAM: Kyungbok Lee, You Zhang, Zhiyao Duan.
"AUDIO VISUAL SEGMENTATION THROUGH TEXT EMBEDDINGS." ArXiv (2025). [paper] [2025.02]
LVM-MSC: Feibo Jiang, Siwei Tu, Li Dong, Kezhi Wang, Kun Yang, Ruiqi Liu, Cunhua Pan, Jiangzhou Wang.
"Lightweight Vision Model-based Multi-user Semantic Communication Systems." ArXiv (2025). [paper] [2025.02]
USegMix: Jiamu Wang, Jin Tae Kwak.
"USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images." ArXiv (2025). [paper] [2025.02]
SESSRS: Qiao, Yang and Zhong, Bo and Du, Bailin and Cai, He and Jiang, Jinxiong and Liu, Qinhuo and Yang, Aixia and Wu, Junjun and Wang, Xiaoya.
"SAM Enhanced Semantic Segmentation for Remote Sensing Imagery Without Additional Training." TGRS (2025). [paper] [code] [2025.02]
UrbanSAM: Chenyu Li, Danfeng Hong, Bing Zhang, Yuxuan Li, Gustau Camps-Valls, Xiao Xiang Zhu, Jocelyn Chanussot.
"UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction." ArXiv (2025). [paper] [code] [2025.02]
Ufaq Khan, Umair Nawaz, Adnan Qayyum, Shazad Ashraf, Muhammad Bilal, Junaid Qadir.
"Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review." ArXiv (2025). [paper] [2025.02]
YOLO-SAM: Tianyou Jiang, Mingshun Shao, Tianyi Zhang, Xiaoyu Liu, Qun Yu.
"Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks." ArXiv (2025). [paper] [2025.02]
SIYO: Mayankeyshwar, Mridul and Kumar, Lookinder and Wagh, Mamata P. and Behuria, Swatishree and Yadav, Dev.
"Brain Tumor Detection and Segmentation using SAM integrated YOLOv9 Scheme." ASPCC (2024). [paper] [2025.02]
FieldSeg: Lucas B. Ferreira and Vitor S. Martins and Uilson R.V. Aires and Nuwan Wijewardane and Xin Zhang and Sathish Samiappan.
"FieldSeg: A scalable agricultural field extraction framework based on the Segment Anything Model and 10-m Sentinel-2 imagery." Computers and Electronics in Agriculture (2025). [paper] [2025.02]
GDPGO-SAM: Hua, Shuzhen, Biao Yang, Xinchang Zhang, Ji Qi, Fengxi Su, Jing Sun, and Yongjian Ruan.
"GDPGO-SAM: An Unsupervised Fine Segmentation of Desert Vegetation Driven by Grounding DINO Prompt Generation and Optimization Segment Anything Model." Remote Sensing (2025). [paper] [2025.02]
Raphael Stock, et al.
"Segment Anything in Medical Images with nnUNet." ArXiv (2025). [paper] [2025.02]
MedfcientSAM: Bao-Hiep Le, et al.
"MedfcientSAM: A Robust Medical Segmentation Model with Optimized Inference Pipeline for Limited Clinical Settings." ArXiv (2025). [paper] [code] [2025.02]
SegAnyPET: Yichi Zhang, Le Xue, Wenbo Zhang, Lanlan Li, Yuchen Liu, Chen Jiang, Yuan Cheng, Yuan Qi.
"SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images." ArXiv (2025). [paper] [code] [2025.02]
Pengchen Liang, Bin Pu, Haishan Huang, Yiwei Li, Hualiang Wang, Weibo Ma, Qing Chang.
"Vision Foundation Models in Medical Image Analysis: Advances and Challenges." ArXiv (2025). [paper] [2025.02]
SASVi: Ssharvien Kumar Sivakumar, Yannik Frisch, Amin Ranem, Anirban Mukhopadhyay.
"SASVi - Segment Any Surgical Video." ArXiv (2025). [paper] [2025.02]
SpeHeatal: Yi Shi, Yunkai Wang, Xupeng Tian, Tieyi Zhang, Bing Yao, Hui Wang, Yong Shao, Cencen Wang, Rong Zeng.
"SpeHeatal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis." AAAI (2025). [paper] [2025.02]
MaizeEar-SAM: Hossein Zaremehrjerdi, Lisa Coffey, Talukder Jubery, Huyu Liu, Jon Turkus, Kyle Linders, James C. Schnable, Patrick S. Schnable, Baskar Ganapathysubramanian.
"MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping." ArXiv (2025). [paper] [2025.02]
PRISM: Kangning Cui, Rongkun Zhu, Manqi Wang, Wei Tang, Gregory D. Larsen, Victor P. Pauca, Sarra Alqahtani, Fan Yang, David Segurado, David Lutz, Jean-Michel Morel, Miles R. Silman.
"Detection and Geographic Localization of Natural Objects in the Wild: A Case Study on Palms." ArXiv (2025). [paper] [2025.02]
SAM-Assisted-Registration: Hao Xu, Tengfei Xue, Jianan Fan, Dongnan Liu, Yuqian Chen, Fan Zhang, Carl-Fredrik Westin, Ron Kikinis, Lauren J. O'Donnell, Weidong Cai.
"Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness." IPMI (2025). [paper] [code] [2025.02]
WRT-SAM: Yunyi Zhou, Kun Shi, Gang Hao.
"WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing ." ArXiv (2025). [paper] [2025.02]
MITO: Laura Dodds, Tara Boroushaki, Fadel Adib.
"MITO: Enabling Non-Line-of-Sight Perception using Millimeter-waves through Real-World Datasets and Simulation Tools." ArXiv (2025). [paper] [2025.02]
SAM2Refiner: Yuan Yao, Qiushi Yang, Miaomiao Cui, Liefeng Bo.
"Towards Fine-grained Interactive Segmentation in Images and Videos." ArXiv (2025). [paper] [2025.02]
SAM-QA: Emil Mededovic, Valdy Laurentius, Yuli Wu, Marcin Kopaczka, Zhu Chen, Mareike Schulz, René Tolba, Johannes Stegmaier.
"No Free Lunch in Annotation either: An objective evaluation of foundation models for streamlining annotation in animal tracking." ArXiv (2025). [paper] [code] [2025.02]
CBCT-US: Feng Li, Yuan Bi, Dianye Huang, Zhongliang Jiang, Nassir Navab.
"Robotic CBCT Meets Robotic Ultrasound." ArXiv (2025). [paper] [2025.02]
IDCC-SAM: Fanijo, Samuel, Ali Jannesari, and Julie Dickerson.
"IDCC-SAM: A Zero-Shot Approach for Cell Counting in Immunocytochemistry Dataset Using the Segment Anything Model." Bioengineering (2025). [paper] [2025.02]
LV-SAM: Yagang Wu, Tianli Zhao, Shijun Hu, Qin Wu, Yingxu Chen, Xin Huang & Zhoushun Zheng.
"Integrating multi-scale information and diverse prompts in large model SAM-Med2D for accurate left ventricular ejection fraction estimation." Med Biol Eng Comput(2025). [paper] [2025.02]
LangRS: Mohanad Diab and Polychronis Kolokoussis and Maria Antonia Brovelli.
"Optimizing zero-shot text-based segmentation of remote sensing imagery using SAM and Grounding DINO." Artificial Intelligence in Geosciences (2025). [paper] [code] [2025.02]
Xia, Sijie, Rufu Qin, Yang Lu, Lianjiang Ma, and Zhenghu Liu.
"A Monocular Vision-Based Safety Monitoring Framework for Offshore Infrastructures Utilizing Grounded SAM." Journal of Marine Science and Engineering(2025). [paper] [2025.02]
Yufang He and Bo Chen and Mahdi Motagh and Yuyan Zhu and Songdong Shao and Jiaye Li and Bing Zhang and Hermann Kaufmann.
"International Journal of Applied Earth Observation and Geoinformation." International Journal of Applied Earth Observation and Geoinformation (2025). [paper] [2025.02]
Save: Park, Chae Jung and Nguyen, Khanh-Binh.
"Save: Segment Audio-Visual Easy Way Using The Segment Anything Model." SSRN (2025). [paper] [2025.02]
CAB-USRI: Jinxin Shao, Haosu Zhang & Jianming Miao.
"Depthanything and SAM for UIE: exploring large model information contributes to underwater image restoration." Machine Vision and Applications (2025). [paper] [2025.02]
REMOTE SENSING LETTERS: Hui Zhang.
"A SAM-based dual-branch network for remote sensing semantic segmentation." REMOTE SENSING LETTERS (2025). [paper] [2025.02]
SAMCell: Alexandra D. VandeLoo, Nathan J. Malta, Emilio Aponte, Caitlin van Zyl, Danfei Xu, Craig R. Forest.
"SAMCell: Generalized Label-Free Biological Cell Segmentation with Segment Anything." ArXiv (2025). [paper] [2025.02]
AutoMedSAM: Peng Huang, Shu Hu, Bo Peng, Jiashu Zhang, Hongtu Zhu, Xi Wu, Xin Wang.
"Diffusion-empowered AutoPrompt MedSAM." ArXiv (2025). [paper] [code] [2025.02]
SAMRefiner: Yuqi Lin, Hengjia Li, Wenqi Shao, Zheng Yang, Jun Zhao, Xiaofei He, Ping Luo, Kaipeng Zhang.
"SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement." ICLR (2025). [paper] [code] [2025.02]
MTRMB: You Zhou, Jiangshan Zhao, Deyu Zeng, Zuo Zuo, Weixiang Liu, Zongze Wu.
"Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection." ArXiv (2025). [paper] [2025.02]
FunduSAM: Jinchen Yu, Yongwei Nie, Fei Qi, Wenxiong Liao, Hongmin Cai.
"FunduSAM: A Specialized Deep Learning Model for Enhanced Optic Disc and Cup Segmentation in Fundus Images." ArXiv (2025). [paper] [2025.02]
GlandSAM: Zhang, Qixiang and Li, Yi and Xue, Cheng and Wang, Haonan and Li, Xiaomeng.
"GlandSAM: Injecting Morphology Knowledge Into Segment Anything Model for Label-Free Gland Segmentation." TMI.(2025). [paper] [2025.02]
LAM: Wei-Bin Kou, Guangxu Zhu, Rongguang Ye, Shuai Wang, Ming Tang, Yik-Chung Wu.
"Label Anything: An Interpretable, High-Fidelity and Prompt-Free Annotator." ICRA (2025). [paper] [2025.02]
PP: Wang Xinyi, Kang Hongyu, Wei Peishan, Shuai Li, Yu Sun, Sai Kit Lam, Yongping Zheng.
"Proxy Prompt: Endowing SAM and SAM 2 with Auto-Interactive-Prompt for Medical Segmentation." ArXiv (2025). [paper] [2025.02]
FE-UNet: Guohao Huo, Ruiting Dai, Ling Shao, Hao Tang.
"FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation." ArXiv (2025). [paper] [2025.02]
ZISVFM: Ying Zhang, Maoliang Yin, Wenfu Bi, Haibao Yan, Shaohan Bian, Cui-Hua Zhang, Changchun Hua.
"ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models." IEEE Transactions on Robotics (2025). [paper] [code] [2025.02]
RFMedSAM 2: Bin Xie, Hao Tang, Yan Yan, Gady Agam.
"RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2." ArXiv (2025). [paper] [2025.02]
FLIP: Manuel Traub, Martin V. Butz.
"Rethinking Vision Transformer for Object Centric Foundation Models." ArXiv (2025). [paper] [2025.02]
Tell2Reg: Wen Yan, Qianye Yang, Shiqi Huang, Yipei Wang, Shonit Punwani, Mark Emberton, Vasilis Stavrinides, Yipeng Hu, Dean Barratt.
"Tell2Reg: Establishing spatial correspondence between images by the same language prompts." ArXiv (2025). [paper] [code] [2025.02]
Functional-SAM: Sidak Pal Singh, Hossein Mobahi, Atish Agarwala, Yann Dauphin.
"Avoiding spurious sharpness minimization broadens applicability of SAM." ArXiv (2025). [paper] [2025.02]
IMDPrompter: Quan Zhang, Yuxin Qi, Xi Tang, Jinwei Fang, Xi Lin, Ke Zhang, Chun Yuan.
"IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning." ICLR (2025). [paper] [2025.02]
LBG: Rohan Chacko, Nicolai Haeni, Eldar Khaliullin, Lin Sun, Douglas Lee.
"Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation." WACV(2025). [paper] [2025.02]
GFDS: Tongkun Liu, Bing Li, Xiao Jin, Yupeng Shi, Qiuying Li, Xiang Wei.
"Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models." ArXiv (2025). [paper] [code] [2025.02]
SAM-PLE: Mingyu Yang, Jitong Lu, and Hun-Seok Kim.
"SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation." ICRA (2025). [paper] [2025.02]
VLP-SAM: Kosuke Sakurai, Ryotaro Shimizu, Masayuki Goto.
"Vision and Language Reference Prompt into SAM for Few-shot Segmentation." ArXiv (2025). [paper] [code] [2025.02]
Self-Prompt-SAM: Bin Xie, Hao Tang, Dawen Cai, Yan Yan, Gady Agam.
"Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation." ArXiv (2025). [paper] [2025.02]
PEFT-SAM: Carolin Teuber, Anwai Archit, Constantin Pape.
"Parameter Efficient Fine-Tuning of Segment Anything Model." ArXiv (2025). [paper] [code] [2025.02]
PathoSAM: Titus Griebel, Anwai Archit, Constantin Pape.
"Segment Anything for Histopathology." ArXiv (2025). [paper] [code] [2025.02]
AVSBench-Robust: Jia Li, Wenjie Zhao, Ziru Huang, Yunhui Guo, Yapeng Tian.
"Do Audio-Visual Segmentation Models Truly Segment Sounding Objects?." ArXiv (2025). [paper] [2025.02]
Diogo Ebert Gatti; Eduardo Lobo Lustosa Cabral.
"SISTEMAS PARA PERCEPÇÃO DO ESPAÇO LIVRE À FRENTE DE UM VEÍCULO E CÁLCULO DA DISTÂNCIA DE SEUS LIMITES." ArXiv (2025). [paper] [2025.01]
OHIF-SAM2: Jaeyoung Cho, Aditya Rastogi, Jingyu Liu, et al.
"OHIF-SAM2: Accelerating Radiology Workflows with Segment Anything Model 2." ArXiv (2025). [paper] [code] [2025.01]
Joseph Lundy.
"Foosball Robot Object Detection and Angle Estimation." ArXiv (2025). [paper] [2025.01]
Niu, Ziang; Huang, Ting; Xu, Chengjia; Sun, Xinyue; Taha, Mohamed Farag; He, Yong; Qiu, Zhengjun.
"A Novel Approach to Optimize Key Limitations of Azure Kinect DK for Efficient and Precise Leaf Area Measurement." ArXiv (2025). [paper] [2025.01]
J Valero Casas-Aljama.
"AI-powered 2D animation editor." ArXiv (2025). [paper] [2025.01]
Tavakoli, Neda et al.
"Automated quantification of left ventricular scar volume in cardiac MRI using large vision models." Journal of Cardiovascular Magnetic Resonance (2025). [paper] [2025.01]
Mehrnia, Mehri et al.
"Evaluating foundational 'segment anything' (Med-SAM1, Med-SAM2) deep learning models for left atrial segmentation in 3d LGE CMR." Journal of Cardiovascular Magnetic Resonance (2025). [paper] [2025.01]
SAM2Act: Haoquan Fang, Markus Grotz, Wilbert Pumacay, Yi Ru Wang, Dieter Fox, Ranjay Krishna, Jiafei Duan.
"SAM2Act: Integrating Visual Foundation Model with A MemoryArchitecture for Robotic Manipulation." ArXiv (2025). [paper] [code] [2025.01]
FlexiCrackNet: Xinlong Wan, Xiaoyan Jiang, Guangsheng Luo, Ferdous Sohel, Jenqneng Hwang.
"FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM." ArXiv (2025). [paper] [2025.01]
DeepSketchCamo: Ying Zang, Runlong Cao, Jianqi Zhang, Yidong Han, Ziyue Cao, Wenjun Hu, Didi Zhu, Lanyun Zhu, Zejian Li, Deyi Ji, Tianrun Chen.
"Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches." ArXiv (2025). [paper] [2025.01]
Tongxu Zhang, Bei Wang.
"Point Cloud Upsampling as Statistical Shape Model for Pelvic." ArXiv (2025). [paper] [2025.01]
Marker Track: Aimee Guo, Weihua Mao.
"Marker Track: Accurate Fiducial Marker Tracking for Evaluation of Residual Motions During Breath-Hold Radiotherapy." Biomedical Physics & Engineering Express (2024). [paper] [code] [2025.01]
CLISC: Xiaochuan Ma, Jia Fu, Wenjun Liao, Shichuan Zhang, Guotai Wang.
"CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation." ISBI (2025). [paper] [2025.01]
KD-SAM: Kunal Dasharath Patil, Gowthamaan Palani, Ganapathy Krishnamurthi.
"Efficient Knowledge Distillation of SAM for Medical Image Segmentation." ArXiv (2025). [paper] [2025.01]
EG-SAM: Longyi Chen and Xiandong Wang and Fengqin Yao and Mingchen Song and Jiaheng Zhang and Shengke Wang.
"An Edge-Guided SAM for effective complex object segmentation." Expert Systems With Applications (2025). [paper] [2025.01]
Yijie Zhu, Shan E Ahmed Raza.
"Gland Segmentation Using SAM With Cancer Grade as a Prompt." ISBI (2025). [paper] [2025.01]
MPG-SAM 2: Fu Rong, Meng Lan, Qian Zhang, Lefei Zhang.
"MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation." ArXiv (2025). [paper] [2025.01]
Gabrielle Hoyer, Michelle W Tong, Rupsa Bhattacharjee, Valentina Pedoia, Sharmila Majumdar.
"Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility." ArXiv (2025). [paper] [2025.01]
APSAM: Jian Wang, Xiaokang Zhang, Xianping Ma, Weikang Yu, Pedram Ghamisi.
"Auto-Prompting SAM for Weakly Supervised Landslide Extraction." ArXiv (2025). [paper] [code] [2025.01]
MONA: Boxun Hu, Mingze Xia, Ding Zhao, Guanlin Wu.
"MONA: Moving Object Detection from Videos Shot by Dynamic Camera." ArXiv (2025). [paper] [2025.01]
DynamicEarth: Kaiyu Li, Xiangyong Cao, Yupeng Deng, Chao Pang, Zepeng Xin, Deyu Meng, Zhi Wang.
"DynamicEarth: How Far are We from Open-Vocabulary Change Detection?." ArXiv (2025). [paper] [code] [2025.01]
fabSAM: Yufeng Xie, Hanzhi Wu, Hongxiang Tong, Lei Xiao, Wenwen Zhou, Ling Li, Thomas Cherico Wanger.
"fabSAM: A Farmland Boundary Delineation Method Based on the Segment Anything Model." ArXiv (2025). [paper] [2025.01]
MedicoSAM: Anwai Archit, Luca Freckmann, Constantin Pape.
"MedicoSAM: Towards foundation models for medical image segmentation." ArXiv (2025). [paper] [code] [2025.01]
UW-COT220 & VL-SAM2: Chunhui Zhang, Li Liu, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang.
"Towards Underwater Camouflaged Object Tracking: Benchmark and Baselines." CVPR Workshop (2025). [paper] [ResearchGate] [project] [2025.01]
HOPOMOP: Michael Schwingshackl, Fabio Francisco Oberweger, Markus Murschitz.
"Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks." WACV (2025). [paper] [code] [2025.01]
CableSAM: Aihua Ling, Junwen Wang, Jiaming Lu & Ruyu Liu.
"CableSAM: an efficient automatic segmentation method for aircraft cabin cables." Optoelectronics Letters (2025). [paper] [2025.01]
Wang, Yunlong, and Zhiyong Zhang.
"Segment Any Leaf 3D: A Zero-Shot 3D Leaf Instance Segmentation Method Based on Multi-View Images." Sensors (2025). [paper] [2025.01]
SegmentAnyTooth: Khoa Dang Nguyen and Hung Trong Hoang and Thi-Phuong Hong Doan and Khai Quang Dao and Ding-Han Wang and Ming-Lun Hsu.
"SegmentAnyTooth: An open-source deep learning framework for tooth enumeration and segmentation in intraoral photos." Journal of Dental Sciences (2025). [[paper](SegmentAnyTooth: An open-source deep learning framework for tooth enumeration and segmentation in intraoral photos - ScienceDirect)] [2025.01]
SAM-Glomeruli: Sun, Rui, and Tianzhu Zhang.
"SAM-Glomeruli: Enhanced Segment Anything Model for Precise Glomeruli." MICCAI Workshop (2024). [paper] [2025.01]
FATE-SAM: Xingxin He, Yifan Hu, Zhaoye Zhou, Mohamed Jarraya, Fang Liu.
"Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation." ArXiv (2025). [paper] [2025.01]
Pengru Deng, Jiapeng Yao, Chun Li, Su Wang, Xinrun Li, Varun Ojha, Xuhui He, Takashi Matsumoto.
"Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures." ArXiv (2025). [paper] [2025.01]
VRS-HQ: Sitong Gong, Yunzhi Zhuge, Lu Zhang, Zongxin Yang, Pingping Zhang, Huchuan Lu.
"The Devil is in Temporal Token: High Quality Video Reasoning Segmentation." ArXiv (2025). [paper] [code] [2025.01]
SuperSAM: Waqwoya Abebe, Sadegh Jafari, Sixing Yu, Akash Dutta, Jan Strube, Nathan R. Tallent, Luanzheng Guo, Pablo Munoz, Ali Jannesari.
"SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization." ArXiv (2025). [paper] [2025.01]
SkipClick: Robin Schön, Julian Lorenz, Daniel Kienzle, Rainer Lienhart.
"SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts." ArXiv (2025). [paper] [code] [2025.01]
SAM-DA: Javier Gamazo Tejero, Moritz Schmid, Pablo Márquez Neila, Martin S. Zinkernagel, Sebastian Wolf, Raphael Sznitman.
"SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation." WACV (2025). [paper] [2025.01]
PGP-SAM: Zhonghao Yan, Zijin Yin, Tianyu Lin, Xiangzhu Zeng, Kongming Liang, Zhanyu Ma.
"PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical Image Segmentation." ISBI (2025). [paper] [2025.01]
SST: Zhenyang Feng, Zihe Wang, Saul Ibaven Bueno, Tomasz Frelek, Advikaa Ramesh, Jingyan Bai, Lemeng Wang, Zanming Huang, Jianyang Gu, Jinsu Yoo, Tai-Yu Pan, Arpita Chowdhury, Michelle Ramirez, Elizabeth G. Campolongo, Matthew J. Thompson, Christopher G. Lawrence, Sydne Record, Neil Rosser, Anuj Karpatne, Daniel Rubenstein, Hilmar Lapp, Charles V. Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao.
"Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation." ArXiv (2025). [paper] [2025.01]
OCORD: Shuo Zhang, Runpu Wei, Kongming Liang.
"OCORD: Open-Campus Object Removal Dataset." ArXiv (2025). [paper] [code] [2025.01]
Guided SAM: S.B. van Rooij, G.J. Burghouts.
"Guided SAM: Label-Efficient Part Segmentation." ArXiv (2025). [paper] [2025.01]
EdgeTAM: Chong Zhou, Chenchen Zhu, Yunyang Xiong, Saksham Suri, Fanyi Xiao, Lemeng Wu, Raghuraman Krishnamoorthi, Bo Dai, Chen Change Loy, Vikas Chandra, Bilge Soran.
"EdgeTAM: On-Device Track Anything Model." CVPR (2025). [paper] [code] [2025.01]
RSRefSeg: Keyan Chen, Jiafan Zhang, Chenyang Liu, Zhengxia Zou, Zhenwei Shi.
"RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models." ArXiv (2025). [paper] [code] [2025.01]
CCT: Olivier Morelle, Justus Bisten, Maximilian W. M. Wintergerst, Robert P. Finger, Thomas Schultz.
"Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and SAM2." German Conference on Medical Image Computing(2025). [paper] [2025.01]
FLAIR: Chinmay K Lalgudi, Mark E Leone, Jaden V Clark, Sergio Madrigal-Mora, Mario Espinoza.
"Zero-shot Shark Tracking and Biometrics from Aerial Imagery." ArXiv (2025). [paper] [2025.01]
SPA: Hu, Jihong and Li, Yinhao and Jain, Rahul Kumar and Lin, Lanfen and Chen, Yen-wei.
"SPA: Leveraging the SAM with Spatial Priors Adapter for Enhanced Medical Image Segmentation." JBHI(2025). [paper] [2025.01]
SAM-Upflow Splitter: Wenhui Liu, Yulong Qiao, Zhengyi Xing, Yue Zhao.
"Zero-shot moving ship segmentation based on segment anything network and optical flow network." ELECTRONICS LETTERS (2025). [paper] [2025.01]
Naddaf-Sh, Amir-M., Vinay S. Baburao, and Hassan Zargarzadeh.
"Leveraging Segment Anything Model (SAM) for Weld Defect Detection in Industrial Ultrasonic B-Scan Images." Sensors (2025). [paper] [2025.01]
Sa2VA: Haobo Yuan, Xiangtai Li, Tao Zhang, Zilong Huang, Shilin Xu, Shunping Ji, Yunhai Tong, Lu Qi, Jiashi Feng, Ming-Hsuan Yang.
"Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos." ArXiv (2025). [paper] [code] [project] [hugging face] [2025.01]
AutoFish: Stefan Hein Bengtson, Daniel Lehotský, Vasiliki Ismiroglou, Niels Madsen, Thomas B. Moeslund, Malte Pedersen.
"AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish." WACV Workshop (2025). [paper] [code] [2025.01]
MedFocusCLIP : Aadya Arora, Vinay Namboodiri.
"MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention." ArXiv (2025). [paper] [code] [2025.01]
Risha Goel, Zain Shabeeb, Isabel Panicker, Vida Jamali.
"Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy." ArXiv (2025). [paper] [2025.01]
SAM4EM: Javier Montalvo, Álvaro García-Martín, Pablo Carballeira, Juan C. SanMiguel.
"Unsupervised Class Generation to Expand Semantic Segmentation Datasets." ArXiv (2025). [paper] [2025.01]
EdgeSAM: Yang, Wenya and Chen, Xiao-Diao and Wu, Wen and Qin, Hongshuai and Yan, Kangming and Mao, Xiaoyang and Song, Haichuan.
"Boosting Deep Unsupervised Edge Detection via Segment Anything Model." IEEE TII (2024). [paper] [2025.01]
PowerSAM: Nannan Yan, Yuhao Li, Yingke Mao, et al.
"PowerSAM: Edge-Efficient Segment Anything for Power Systems Through Visual Model Distilla- tion PowerSAM: Edge-Efficient Segment Anything for Power Systems Through Visual Model Distillation." ArXiv (2025). [paper] [2025.01]
PG-SAG: Tengfei Wang, Xin Wang, Yongmao Hou, Yiwei Xu, Wendi Zhang, Zongqian Zhan.
"PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware Grouping." ArXiv (2025). [paper] [code] [2025.01]
MA-SAM: D. Fan et al.
"MA-SAM: A Multi-atlas Guided SAM Using Pseudo Mask Prompts without Manual Annotation for Spine Image Segmentation." TMI (2025). [paper] [2025.01]
ReferSAM: S. -A. Liu, H. Xie, J. Ge and Y. Zhang.
"ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation." TCSVT (2025). [paper] [code] [2025.01]
YS3AM: Mu S, Liu J, Zhang P, et al.
"YS3AM: Adaptive 3D Reconstruction and Harvesting Target Detection for Clustered Green Asparagus." ArXiv (2025). [paper] [2025.01]
FCP: Suho Park, SuBeen Lee, Hyun Seok Seong, Jaejoon Yoo, Jae-Pil Heo.
"Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation." AAAI (2025). [paper] [code] [2025.01]
ScarNet: Neda Tavakoli, Amir Ali Rahsepar, Brandon C. Benefield, Daming Shen, Santiago López-Tapia, Florian Schiffers, Jeffrey J. Goldberger, Christine M. Albert, Edwin Wu, Aggelos K. Katsaggelos, Daniel C. Lee, Daniel Kim.
"ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI." ArXiv (2025). [paper] [2025.01]
EUGIS: Jiang Shang, Yuanmeng Wu, Xiaoxiang Han, Xi Chen and Qi Zhang.
"Evidential Calibrated Uncertainty-Guided Interactive Segmentation paradigm for Ultrasound Images." ArXiv (2025). [paper] [code] [2025.01]

2024

Paper list 2024

2023

Paper list 2023

Open Source Projects

No.	Project	Title	Project page	Code base	Affiliation	Description
000	SAM	Segment Anything	Project page	Code	Meta	A foundation model for general image segmentation.
001	SAM2	Segment Anything Model 2	Project page	Code	Meta	A video foundation model.
002	SAM-Track	Segment and Track Anything	Colab	Code	Zhejiang University	A project dedicated to tracking and segmenting any objects in videos, either automatically or interactively.
003	Grounded-SAM	Grounded-Segment-Anything	Colab	Code	IDEA-Research	A project by combining Grounding DINO and SAM which aims to detect and segment Anything with text inputs.
004	MMDet-SAM	-	-	Code	OpenMMLab	A new way of instance segmentation by combining SAM with Closed-Set Object Detection, Open-Vocabulary Object Detection, Grounding Object Detection.
005	MMRotate-SAM	Zero-shot Oriented Object Detection with SAM	-	Code	OpenMMLab	A project join SAM and weakly supervised horizontal box detection to achieve rotated box detection.
006	MMOCR-SAM	-	-	Code	OpenMMLab	A solution of Text Detection/Recognition and SAM that segments every text character, with striking text removal and text inpainting demos driven by diffusion models and Gradio.
007	MMEditing-SAM	-	-	Code	OpenMMLab	A project join SAM and image generation to create awesome images and edit any part of them.
008	Label-Studio-SAM	OpenMMLab PlayGround: Semi-Automated Annotation with Label-Studio and SAM	-	Code	OpenMMLab	A project combining Label-Studio and SAM to achieve semi-automated annotation.
009	PaddleSeg	Segment Anything with PaddleSeg	-	Code	PaddlePaddle	A pretrained model parameters of PaddlePaddle format.
010	SegGPT	Segmenting Everything In Context	Hugging Face	Code	BAAI-Vision	SAM In Context based on Painter.
011	SEEM	Segment Everything Everywhere All at Once	Hugging Face	Code	Microsoft	A project can Segment Everything Everywhere with Multi-modal prompts all at once.
012	CLIP Surgery	CLIP Surgery for Better Explainability with Enhancement in Open Vocabulary Tasks	Project page	Code	HKUST	A work about SAM based on CLIP's explainability to achieve text to mask without manual points.
013	SAMCOD	Can SAM Segment Anything? When SAM Meets Camouflaged Object Detection	-	Code	-	SAM +Camouflaged object detection (COD) task.
014	Inpaint Anything	Segment Anything Meets Image Inpainting	Hugging Face	Code	USTC and EIT	SAM combines Inpainting, which is able to remove the object smoothly.
015	PerSAM	Personalize Segment Anything Model with One Shot	Hugging Face	Code	-	SAM with specific concepts.
016	MedSAM	Segment Anything in Medical Images	-	Code	-	A step-by-step tutorial with a small dataset to help you quickly utilize SAM.
017	Segment-Any-Anomaly	GroundedSAM Anomaly Detection	Colab	Code	HUST	Grounding DINO + SAM to segment any anomaly.
018	SSA	Semantic Segment Anything	-	Code	Fudan University	A dense category annotation engine.
019	Magic Copy	-	-	Code	-	Magic Copy is a Chrome extension that uses SAM to extract a foreground object from an image and copy it to the clipboard.
020	Segment Anything with Clip	Segment Anything with Clip	Hugging Face	Code	-	SAM combined with CLIP.
021	MetaSeg	Segment Anything Video	Hugging Face	Code	-	Packaged version of the SAM.
022	SAM in Napari	Segment Anything Model (SAM) in Napari	Project page	Code	Applied Computer Vision Lab and German Cancer Research Center	Extended SAM's click-based foreground separation to full click-based semantic segmentation and instance segmentation.
023	SAM Medical Imaging	SAM Medical Imaging	-	Code	-	SAM for Medical Imaging.
024	3D-Box	3D-Box via Segment Anything	-	Code	-	SAM is extended to 3D perception by combining it with VoxelNeXt.
025	Anything-3D	-	-	Code	-	Anything 3DNovel View, Anything-NeRF, Any 3DFace.
026	L2SET	Learning to Segment EveryThing	-	Code	UC Berkeley, FAIR	A new partially supervised training paradigm for instance segmentation.
027	Edit Anything	Edit Anything by Segment-Anything	-	Code	-	Edit anything in images powered by SAM, ControlNet, StableDiffusion, \etc.
028	Image Edit Anything	IEA: Image Editing Anything	-	Code	-	Using stable diffusion and SAM for image editing.
029	SAM for Stable Diffusion Webui	Segment Anything for Stable Diffusion WebUI	-	Code	-	This extension aim for connecting AUTOMATIC1111 Stable Diffusion WebUI and Mikubill ControlNet Extension with SAM and GroundingDINO to enhance Stable Diffusion/ControlNet inpainting.
030	Earth Observation Tools	Segment Anything EO tools	Colab	Code	-	An earth observation tools for SAM.
031	Moving Object Detection	Towards Segmenting Anything That Moves	-	Code	-	A project about SAM + Moving Object Detection.
032	OCR-SAM	Optical Character Recognition with Segment Anything	Project page	Code	-	Combining MMOCR with SAM and Stable Diffusion.
033	SALT	Segment Anything Labelling Tool	-	Code	-	A project uses the SAM Model and adds a barebones interface to label images and saves the masks in the COCO format.
034	Prompt Segment Anything	Prompt Segment Anything	-	Code	-	An implementation of zero-shot instance segmentation using SAM.
035	SAM-RBox	-	-	Code	-	A project uses SAM for generating rotated bounding boxes with MMRotate, which is a comparison method of H2RBox-v2.
036	VISAM	MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors	-	Code	-	Combining SAM with MOT, it create the era of "MOTS".
037	SegEO	Segment Anything EO tools	-	Code	-	The tools are developed to ease the processing of spatial data (GeoTIFF and TMS) with SAM using sliding window algorithm for big files.
038	Napari Segment Anything	Napari Segment Anything	Project page	Code	-	SAM native Qt UI.
039	Segment-Anything-U-Specify	Segment-Anything-U-Specify	-	Code	-	Using CLIP and SAM to segment any instance you specify with text prompt of any instance names.
040	SegDrawer	Simple static web-based mask drawer	Colab	Code	-	Simple static web-based mask drawer, supporting semantic segmentation with SAM.
041	Track Anything	Segment Anything Meets Videos	Hugging Face	Code	SUSTech	Track-Anything is a flexible and interactive tool for video object tracking and segmentation.
042	Count Anything	-	-	Code	-	A method uses SAM and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.
043	RAM	Relate Anything Model	Hugging Face	Code	MMLab, NTU and VisCom Lab, KCL/TongJi	Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
044	Segment Any RGBD	Segment Any RGBD	Project page	Code	-	Segment AnyRGBD is a toolbox to segment rendered depth images based on SAM.
045	Show Anything	Show Anything	Hugging Face	Code	Showlab, NUS	Some Applications that are compatible with both SAM and Generation.
046	Transfer Any Style	Any-to-Any Style Transfer: Making Picasso and Da Vinci Collaborate	-	Code	LV-lab, NUS	An interactive demo based on Segment-Anything for style transfer which enables different content regions apply different styles.
047	Caption Anything	-	Colab	Code	VIP lab, SUSTech	Caption-Anything is a versatile image processing tool that combines the capabilities of SAM, Visual Captioning, and ChatGPT.
048	Image2Paragraph	Transform Image Into Unique Paragraph	Project page	Code	-	Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
049	LIME SAM	Local Interpretable Model-agnostic Explanations Segment Anything	Colab	Code	-	LIME-SAM aims to create an Explainable Artificial Intelligence (XAI) framework for image classification using LIME (Local Interpretable Model-agnostic Explanations) as the base algorithm, with the super-pixel method replaced by SAM.
050	Paint Anything	-	-	Code	-	An interactive demo based on SAM for stroke-based painting which enables human-like painting.
051	SAMed	Customized Segment Anything Model for Medical Image Segmentation	Colab	Code	USTC	SAMed is built upon the large-scale image segmentation model, SAM, to explore the new research paradigm of customizing large-scale models for medical image segmentation.
052	Personalize SAM	Personalize Segment Anything with 1 Shot in 10 Seconds	Hugging Face	Code	MMLab, CUHK	A training-free Personalization approach for SAM, termed as PerSAM. Given only a single image with a reference mask, PerSAM can segment specific visual concepts.
053	Open-vocabulary-Segment-Anything	Open-vocabulary-Segment-Anything	-	Code	-	Combining OwlViT with Segment Anything - Open-vocabulary Detection and Segmentation (Text-conditioned, and Image-conditioned).
054	Labal-Anything-Pipeline	Label-Anything-Pipeline	-	Code	ZJU	Annotation anything in visual tasks just all in one-pipeline with GPT-4 and SAM.
055	Grounded-Segment-Any-Parts	Grounded Segment Anything: From Objects to Parts	Project page	Code	HKU	Expand Segment Anything Model (SAM) to support text prompt input. The text prompt could be object-level(eg, dog) and part-level(eg, dog head).
056	AnyLabeling	AnyLabeling	Youtube page	Code	-	Effortless AI-assisted data labeling with AI support from Segment Anything and YOLO.
057	SSA	Semantic-Segment-Anything	Project page	Code	-	Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
058	RefSAM	Label Data with Segment Anything in Roboflow	Project page	Code	-	Referring Image Segmentation Benchmarking with Segment Anything Model (SAM).
059	Roboflow Annotate	Launch: Label Data with Segment Anything in Roboflow	Project page	APP	Roboflow	SAM-assisted labeling for training computer vision models.
060	ImageBind SAM	-	-	Code	IDEA-Research	This is an experimental demo aims to combine ImageBind and SAM to generate mask with different modalities.
061	X-AnyLabeling	X-AnyLabeling	WeChat	Code	CVHub	A new interactive automatic labeling tool based on AnyLabeling.
062	Segment Anything + NNCF	-	WeChat	Code	-	OpenVINO™ NNCF for segment anything encoder quantization acceleration.
063	YOLOv8 + SAM	-	WeChat	-	-	Use SAM in YOLOv8.
064	SearchAnything	SearchAnything	Zhihu blog, Twitter	Code	CAS and MSRA	A semantic local search engine powered by various AI models.
065	SAM Meets Stable Diffusion	-	WeChat	Code	PaddlePaddle	Segment and generate Anything.
066	Language Segment-Anything	-	-	Code	-	SAM with text prompts generates masks for specific objects in images.
067	Expedit-SAM	-	-	Code	-	Expediting SAM without Fine-tuning.
068	Segment-Anything-Fast	Accelerating Generative AI with PyTorch: Segment Anything, Fast	Project page	Code	Team PyTorch	A batched offline inference oriented version of segment-anything.
069	YOLOv9+SAM	YOLOv9+SAM	Project page	Code	-	Dynamic Detection and Segmentation with YOLOv9+SAM.
070	LiteMedSAM	LiteMedSAM	Project page	Code	-	A lightweight version of MedSAM for fast training and inference.
071	ISAT_with_segment_anything	ISAT_with_segment_anything	Project page	Code	-	An Interactive Semi-automatic Annotation Tool based on segment anything model, supporting SAM, SAM2, SAM-HQ, MobileSAM, EdgeSAM, etc.
072	Track Anything Annotate	Track Anything Annotate	Project page	Code	-	A video annotation tool combining SAM2 and Xmem++.

Awesome Repositories for SAM

License

This project is released under the MIT license. Please see the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 1,175 Commits
Paper_List		Paper_List
imgs		imgs
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Comprehensive Survey on Segment Anything Model for Vision and Beyond

If you like our project, please give us a star ⭐ on GitHub for latest update.

We strongly encourage authors of relevant works to make a pull request and add their paper's information [here].

🔥 Highlights

Contents

Citation

Survey

Paper List

Seminal Papers

Follow-up Papers

The latest papers within a week are marked with a 💥

2026

2025

2024

2023

Open Source Projects

Awesome Repositories for SAM

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

A Comprehensive Survey on Segment Anything Model for Vision and Beyond

If you like our project, please give us a star ⭐ on GitHub for latest update.

We strongly encourage authors of relevant works to make a pull request and add their paper's information [here].

🔥 Highlights

Contents

Citation

Survey

Paper List

Seminal Papers

Follow-up Papers

The latest papers within a week are marked with a 💥

2026

2025

2024

2023

Open Source Projects

Awesome Repositories for SAM

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Packages