Surgical Tool Detection on CholecTrack20 Using Lightweight Deep Learning Models
DOI:
https://doi.org/10.54097/r36zwv93
Keywords:
Minimally Invasive Surgery, Surgical Tool Detection and Tracking, Lightweight Deep Learning.
Abstract
With the rapid development of minimally invasive surgery (MIS) and robot-assisted surgery, real-time, accurate, and robust detection and tracking of surgical instruments has become a core research focus in medical AI. This review summarizes the current state of surgical tool detection and tracking, with a particular focus on the application of lightweight deep learning models to the CholecTrack20 dataset. Models such as MobileNetV2 and YOLOv8n show promising performance and deployment potential on embedded and low-compute platforms. We analyze their advantages and limitations in small-object detection, occlusion handling, multi-view tracking, and end-to-end real-time inference, and discuss potential improvements through multi-frame fusion, feature pyramid networks, lightweight attention modules, and data augmentation strategies. We also outline future research directions, including multimodal perception (vision combined with tactile or force feedback), explainable AI (XAI), and uncertainty estimation to support clinical safety and regulatory compliance. Overall, lightweight models offer practical deployment value for surgical tool detection and tracking and provide a feasible pathway toward intelligent, multimodal surgical systems.
License
Copyright (c) 2025 Academic Journal of Science and Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.








