GWO-ATDL-SBD: A GREY WOLF OPTIMIZED ADAPTIVE TEMPORAL DEEP LEARNING FRAMEWORK FOR ROBUST SHOT BOUNDARY DETECTION

Authors

  • Tushar Banik Department of Computer Science and Engineering, ICFAI University, Tripura, India
  • Saptarshi Chakraborty Department of Computer Science and Engineering, ICFAI University, Tripura, India
  • Dalton Meitei Thounaojam Department of Computer Science, Manipur University, India
  • Sapam Jitu Singh Department of Computer Science and Engineering, Manipur University, India
  • Raju Rajkumar Department of Computer Science, Pravabati College, Imphal, India

DOI:

https://doi.org/10.29121/shodhkosh.v7.i1.2026.8167

Keywords:

Shot Boundary Detection, Grey Wolf Optimizer, Bi-Lstm, Video Analysis, Temporal Deep Learning

Abstract [English]

Shot Boundary Detection (SBD) is an essential task in video content analysis but accurately detecting both abrupt and gradual scene changes is still a problem due to different lightings, camera motions, and varied scene contents. This paper introduces a hybrid framework named GWO-ATDL-SBD that combines Grey Wolf Optimizer (GWO) with adaptive temporal deep learning for performing robust shot boundary detection. At first, multi-cue features such as edge energy difference, motion vector entropy, and color histogram distance are extracted to represent different visual characteristics in combination. These features are combined dynamically using weights obtained by GWO and passed to a Bi-directional Long Short-Term Memory (Bi-LSTM) network to capture the frame-wise temporal dependencies. In contrast to the traditional methods having fixed thresholds, our method uses GWO not only to find the optimal feature fusion weights and classifier parameters but also decision thresholds for adaptive and accurate recognition of both cuts and transitions. Extensive experiments on standard video datasets show that the proposed method substantially surpasses the existing SBD methods in terms of precision, recall, and F1-score. The findings reveal the potential of marrying evolutionary optimization with temporal deep learning for tackling complex video transition detection problems.

References

Anthwal, S., Ganotra, D.: An overview of optical flow-based approaches for motion segmentation. The Imaging Science Journal 67(5), 284–294 (2019) DOI: https://doi.org/10.1080/13682199.2019.1641316

Benoughidene, A., Titouna, F.: A novel method for video shot boundary detection using cnn-lstm approach. International Journal of Multimedia Information Retrieval 11(4), 653–667 (2022) DOI: https://doi.org/10.1007/s13735-022-00251-8

Chakraborty, D., Chiracharit, W., Chamnongthai, K.: Video shot boundary detection using principal component analysis (pca) and deep learning. In: 2021, 18th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), pp. 272–275 (2021). IEEE. DOI: https://doi.org/10.1109/ECTI-CON51831.2021.9454775

Chakraborty, S., Singh, A., Thounaojam, D.M.: A novel bifold-stage shot bound- ary detection algorithm: invariant to motion and illumination. The Visual Computer 38(2), 445–456 (2022) DOI: https://doi.org/10.1007/s00371-020-02027-9

Chakraborty, S., Thounaojam, D.M.: A novel shot boundary detection system using hybrid optimization technique. Applied Intelligence 49(9), 3207–3220(2019) DOI: https://doi.org/10.1007/s10489-019-01444-1

Chan, C., Wong, A.: Shot boundary detection using genetic algorithm optimization. In: 2011 IEEE International Symposium on Multimedia, pp. 327–332 (2011). IEEE DOI: https://doi.org/10.1109/ISM.2011.58

Dubey, A. K., & Dubey, A. (2026). Digitalization in Teaching and Learning: Impact on Student Engagement and Academic Achievement. ShodhAI: Journal of Artificial Intelligence, 3(1), 37–42. https://doi.org/10.29121/shodhai.v3.i1.2026.73 DOI: https://doi.org/10.29121/shodhai.v3.i1.2026.73

Gawande, U., Hajari, K., Golhar, Y., Fulzele, P.: A novel gray wolf optimization based key frame extraction method for video classification using convlstm. Neural Computing and Applications 36(32), 20355–20385 (2024) DOI: https://doi.org/10.1007/s00521-024-10266-3

GG, L.P., Domnic, S.: Walsh–hadamard transform kernel-based feature vector for shot boundary detection. IEEE Transactions on Image Processing 23(12), 5187–5197 (2014) DOI: https://doi.org/10.1109/TIP.2014.2362652

Goswami, B., Boers, N., Rheinwalt, A., Marwan, N., Heitzig, J., Breitenbach, S.F., Kurths, J.: Abrupt transitions in time series with uncertainties. Nature communications 9(1), 48 (2018). DOI: https://doi.org/10.1038/s41467-017-02456-6

Guo, H., Liu, J., Xiao, Z., Xiao, L.: Deep cnn-based hyperspectral image classification using discriminative multiple spatial-spectral feature fusion. Remote Sensing Letters 11(9), 827–836 (2020) DOI: https://doi.org/10.1080/2150704X.2020.1779374

Hassanien, A., Elgharib, M., Selim, A., Bae, S.-H., Hefeeda, M., Matusik, W.: Large-scale, fast and accurate shot boundary detection through spatio-temporal convolutional neural networks. arXiv preprint arXiv:1705.03281 (2017)

Idan, Z.N., Abdulhussain, S.H., Mahmmod, B.M., Al-Utaibi, K.A., Al-Hadad, S.A.R., Sait, S.M.: Fast shot boundary detection based on separable moments and support vector machine. IEEe Access 9, 106412–106427 (2021) DOI: https://doi.org/10.1109/ACCESS.2021.3100139

Kar, T., Kanungo, P., Mohanty, S.N., Groppe, S., Groppe, J.: Video shot-boundary detection: issues, challenges and solutions. Artificial Intelligence Review 57(4), 104 (2024) DOI: https://doi.org/10.1007/s10462-024-10742-1

Kar, T., Kanungo, P.: A motion and illumination resilient framework for automatic shot boundary detection. Signal, Image and Video Processing 11,1237–1244 (2017) DOI: https://doi.org/10.1007/s11760-017-1080-0

Li, H., Wei, M.: Fuzzy clustering based on feature weights for multivariate time series. Knowledge-Based Systems 197, 105907 (2020) DOI: https://doi.org/10.1016/j.knosys.2020.105907

Li, Y., Li, C., Li, X., Wang, K., Rahaman, M.M., Sun, C., Chen, H., Wu, X., Zhang, H., Wang, Q.: A comprehensive review of markov random field and conditional random field approaches in pathology image analysis. Archives of Computational Methods in Engineering 29(1), 609–639 (2022) DOI: https://doi.org/10.1007/s11831-021-09591-w

Li, Y.-N., Lu, Z.-M., Niu, X.-M.: Fast video shot boundary detection framework employing pre-processing techniques. IET image processing 3(3), 121–134 (2009) DOI: https://doi.org/10.1049/iet-ipr.2007.0193

Liu, S., Liu, G., Zhou, H.: A robust parallel object tracking method for illumination variations. Mobile Networks and Applications 24(1), 5–17 (2019) DOI: https://doi.org/10.1007/s11036-018-1134-8

Mei, S., Ji, J., Hou, J., Li, X., Du, Q.: Learning sensor-specific spatial-spectral features of hyperspectral images via convolutional neural networks. IEEE Transactions on Geoscience and Remote Sensing 55(8), 4520–4533 (2017) DOI: https://doi.org/10.1109/TGRS.2017.2693346

Mirjalili, S., Mirjalili, S.M., Lewis, A.: Grey wolf optimizer. Advances in engineering software 69, 46–61 (2014) DOI: https://doi.org/10.1016/j.advengsoft.2013.12.007

Mondal, J., Kundu, M.K., Das, S., Chowdhury, M.: Video shot boundary detec- tion using multiscale geometric analysis of nsct and least squares support vector machine. Multimedia Tools and Applications 77(7), 8139–8161 (2018) DOI: https://doi.org/10.1007/s11042-017-4707-9

Navin, K., Krishnan, M., et al.: Fuzzy rule based classifier model for evidence based clinical decision support systems. Intelligent systems with applications 22,200393 (2024) DOI: https://doi.org/10.1016/j.iswa.2024.200393

Park, S., Sch¨ops, T., Pollefeys, M.: Illumination change robustness in direct visual slam. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 4523–4530 (2017). IEEE DOI: https://doi.org/10.1109/ICRA.2017.7989525

Peralta, B., Tirapegui, V., Pieringer, C., Caro, L.: A simple proposal for sentiment analysis on movies reviews with hidden markov models. In: Iberoamerican Congress on Pattern Recognition, pp. 152–162 (2019). Springer DOI: https://doi.org/10.1007/978-3-030-33904-3_14

Prasanna, C.S., Rahman, M.Z.U., Bayleyegn, M.D.: Brain epileptic seizure detection using joint cnn and exhaustive feature selection with rnn-blstm classifier. IEEE Access 11, 97990–98004 (2023) DOI: https://doi.org/10.1109/ACCESS.2023.3312187

Ray, P.P.: A review on tinyml: State-of-the-art and prospects. Journal of King Saud University-Computer and Information Sciences 34(4), 1595–1623 (2022) DOI: https://doi.org/10.1016/j.jksuci.2021.11.019

Sharma, D.M., Shandilya, S.K.: An efficient cyber-physical system using hybridized enhanced support-vector machine with ada-boost classification algorithm. Concurrency and Computation: Practice and Experience 34(21), 7134(2022) DOI: https://doi.org/10.1002/cpe.7134

Shinde, T., Shrivastava, M.: Development of an ant colony optimisation-based edge detection framework. African Journal of Applied Research 11(5), 573–595 (2025) DOI: https://doi.org/10.26437/m29ams60

Singh, A., Thounaojam, D.M., Chakraborty, S.: A novel automatic shot boundary detection algorithm: robust to illumination and motion effect. Signal, Image and Video Processing 14(4), 645–653 (2020) DOI: https://doi.org/10.1007/s11760-019-01593-3

Thounaojam, D., Khelchandra, T., Jayshree, T., Roy, S., Singh, K.: Colour histogram and modified multi-layer perceptron neural network based video shot boundary detection. International Arab Journal of Information Technology 16(4),686–693 (2019)

Warhade, K.K., Merchant, S., Desai, U.B.: Shot boundary detection in the presence of fire flicker and explosion using stationary wavelet transform. Signal, Image and Video Processing 5, 507–515 (2011) DOI: https://doi.org/10.1007/s11760-010-0163-y

Wu, L., Zhang, S., Jian, M., Lu, Z., Wang, D.: Two stage shot boundary detection via feature fusion and spatial-temporal convolutional neural networks. IEEE Access 7, 77268–77276 (2019) DOI: https://doi.org/10.1109/ACCESS.2019.2922038

Youssef, B., Fedwa, E., Driss, A., Ahmed, S.: Shot boundary detection via adaptive low rank and svd-updating. Computer Vision and Image Understanding161,20–28 (2017) DOI: https://doi.org/10.1016/j.cviu.2017.06.003

Zhao, L., Sun, X.M., Zhang, M.W.: A shot boundary detection method based on pso-svm. Applied Mechanics and Materials 130, 3821–3825 (2012) DOI: https://doi.org/10.4028/www.scientific.net/AMM.130-134.3821

Downloads

Published

2026-05-16

How to Cite

Banik, T., Chakraborty, S., Thounaojam, D. M., Singh, S. J., & Rajkumar, R. (2026). GWO-ATDL-SBD: A GREY WOLF OPTIMIZED ADAPTIVE TEMPORAL DEEP LEARNING FRAMEWORK FOR ROBUST SHOT BOUNDARY DETECTION. ShodhKosh: Journal of Visual and Performing Arts, 7(1), 75–89. https://doi.org/10.29121/shodhkosh.v7.i1.2026.8167