TY - JOUR
T1 - Keyframe extraction from laparoscopic videos via diverse and weighted dictionary selection
AU - Ma, Mingyang
AU - Mei, Shaohui
AU - Wan, Shuai
AU - Wang, Zhiyong
AU - Ge, Zongyuan
AU - Lam, Vincent
AU - Feng, Dagan
N1 - Funding Information:
Manuscript received February 22, 2020; revised June 19, 2020 and August 17, 2020; accepted August 17, 2020. Date of publication August 25, 2020; date of current version May 11, 2021. This work was supported in part by the National Natural Science Foundation of China (61671383), in part by the Innovation Foundation for Doctor Dissertation of Northwestern Polytechnical University (CX201914), and in part by the Fundamental Research Funds for the Central Universities (3102018AX001). (Corresponding author: Shaohui Mei.) Mingyang Ma, Shaohui Mei, and Shuai Wan are with the School of Electronics and Information, Northwestern Polytechnical University, Xi’an, Shaanxi 710129, China (e-mail: [email protected]. edu.cn; [email protected]; [email protected]).
Publisher Copyright:
© 2013 IEEE.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2021/5
Y1 - 2021/5
N2 - Laparoscopic videos have been increasingly acquired for various purposes including surgical training and quality assurance, due to the wide adoption of laparoscopy in minimally invasive surgeries. However, it is very time consuming to view a large amount of laparoscopic videos, which prevents the values of laparoscopic video archives from being well exploited. In this paper, a dictionary selection based video summarization method is proposed to effectively extract keyframes for fast access of laparoscopic videos. Firstly, unlike the low-level feature used in most existing summarization methods, deep features are extracted from a convolutional neural network to effectively represent video frames. Secondly, based on such a deep representation, laparoscopic video summarization is formulated as a diverse and weighted dictionary selection model, in which image quality is taken into account to select high quality keyframes, and a diversity regularization term is added to reduce redundancy among the selected keyframes. Finally, an iterative algorithm with a rapid convergence rate is designed for model optimization, and the convergence of the proposed method is also analyzed. Experimental results on a recently released laparoscopic dataset demonstrate the clear superiority of the proposed methods. The proposed method can facilitate the access of key information in surgeries, training of junior clinicians, explanations to patients, and archive of case files.
AB - Laparoscopic videos have been increasingly acquired for various purposes including surgical training and quality assurance, due to the wide adoption of laparoscopy in minimally invasive surgeries. However, it is very time consuming to view a large amount of laparoscopic videos, which prevents the values of laparoscopic video archives from being well exploited. In this paper, a dictionary selection based video summarization method is proposed to effectively extract keyframes for fast access of laparoscopic videos. Firstly, unlike the low-level feature used in most existing summarization methods, deep features are extracted from a convolutional neural network to effectively represent video frames. Secondly, based on such a deep representation, laparoscopic video summarization is formulated as a diverse and weighted dictionary selection model, in which image quality is taken into account to select high quality keyframes, and a diversity regularization term is added to reduce redundancy among the selected keyframes. Finally, an iterative algorithm with a rapid convergence rate is designed for model optimization, and the convergence of the proposed method is also analyzed. Experimental results on a recently released laparoscopic dataset demonstrate the clear superiority of the proposed methods. The proposed method can facilitate the access of key information in surgeries, training of junior clinicians, explanations to patients, and archive of case files.
KW - Dictionary selection
KW - keyframe extraction
KW - laparoscopic videos
KW - video summarization
UR - http://www.scopus.com/inward/record.url?scp=85105862649&partnerID=8YFLogxK
U2 - 10.1109/JBHI.2020.3019198
DO - 10.1109/JBHI.2020.3019198
M3 - Article
C2 - 32841131
AN - SCOPUS:85105862649
SN - 2168-2194
VL - 25
SP - 1686
EP - 1698
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 5
ER -