HYBRID SCHEMES BASED ON WAVELET TRANSFORM AND CONVOLUTIONAL AUTO-ENCODER FOR IMAGE COMPRESSION

Houda Chakib; Najlae Idrissi; Oussama Jannani

doi:10.29121/ijoest.v7.i2.2023.479

Authors

Houda Chakib Data4Earth Laboratory, Faculty of Sciences and Technics, Sulan Moulay Slimane University USMS, Morocco
Najlae Idrissi Data4Earth Laboratory, Faculty of Sciences and Technics, Sulan Moulay Slimane University USMS, Morocco
Oussama Jannani Data4Earth Laboratory, Faculty of Sciences and Technics, Sulan Moulay Slimane University USMS, Morocco

DOI:

https://doi.org/10.29121/ijoest.v7.i2.2023.479

Keywords:

Wavelet Transform, RGB Color Space, YCbCr Color Space, Convolutional Auto Encoder, Image Compression

Abstract

In recent years, image compression techniques have received a lot of attention from researchers as the number of images at hand keep growing. Digital Wavelet Transform is one of them that has been utilized in a wide range of applications and has shown its efficiency in image compression field. Moreover, used with other various approaches, this compression technique has proven its ability to compress images at high compression ratios while maintaining good visual image quality. Indeed, works presented in this paper deal with mixture between Deep Learning algorithms and Wavelets Transformation approach that we implement in different color spaces. In fact, we investigate RGB and Luminance/Chrominance YCbCr color spaces to develop three image compression models based on Convolutional Auto-Encoder (CAE). In order to evaluate the models’ performances, we used 24 raw images taken from Kodak database and applied the approaches on every one of them and compared achieved experimental results with those obtained using standard compression method. We draw this comparison in terms of performance parameters: Structural Similarity Index Metrix SSIM, Peak Signal to Noise Ratio PSNR and Mean Square Error MSE. Reached results indicates that with proposed schemes we gain significate improvement in distortion metrics over traditional image compression method especially SSIM parameter and we managed to reduce MSE values over than 50%. In addition, proposed schemes output images with high visual quality where details and textures are clear and distinguishable.

Downloads

Download data is not yet available.

References

Ahmadi, K., Javaid, A. Y., and Salari, E. (2015). An Efficient Compression Scheme Based on Adaptive Thresholding In Wavelet Domain Using Particle Swarm Optimization. Signal Processing: Image Communication, 32, 33-39. https://doi.org/10.1016/j.image.2015.01.001.

Alexandridis, A. K., and Zapranis, A. D. (2013). Wavelet Neural Networks : A Practical Guide. Neural Networks, 42, 1-27. https://doi.org/10.1016/j.neunet.2013.01.008.

Antonini, M., Barlaud, M., Mathieu, P., and Daubechies, I., ″ IEEE. (1992). Image Coding Using Wavelet Transform. IEEE Transactions on Image Processing, 1(2), 205-220. https://doi.org/10.1109/83.136597.

Chuxi, Y., Zhao, Y., and Wang, S., (2019). Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction,″ IEEE Computer Science. https://doi.org/10.1109/ACCESS.2019.2911403.

Denk, T., Parhi, K. K., and Cherkassky, V. (1993). Combining Neural Network and the Wavelet Transform for Image Compression Proceeding of International Conference, (1), 637-640. https://doi.org/10.1109/ICASSP.1993.319199.

Dimililer, K., and Khashman, A. (2008). Image Compression Using Neural Networks and Haar Wavelet. Transaction on Signal Processing, ISSN, 4, 2008. https://dl.acm.org/doi/10.5555/1466835.1466844.

Farnoush, F., (2017). ‶ Learning Activation Functions in Deep Neural Networks, ʺ Ecole Polytechnique, Montreal (Canada) ProQuest dissertations publishing, 10957109.

Feng, Q., Yin, Q., and Guo, P., (2022). Image Recognition with Haar Wavelet and Pseudoinverse Learning Algorithm Based Autoencoders,″ Journal of Physics: Conference Series, 2278. https://doi.org/10.1088/1742-6596/2278/1/012019.

Huynh-Thu, Q., and Ghanbari, M. (2008). Scope of Validity of PSNR in Image/Video Quality Assessment. Electronics Letters, 44(13), 800. https://doi.org/10.1049/el:20080522.

Huynh-Thu, Q., and Ghanbari, M. (2012). The Accuracy of PSNR in Predicting Video Quality for Different Video Scenes and Frame Rates. Telecommunication Systems. Janv 2012, 49(1), 35-48. https://doi.org/10.1007/s11235-010-9351-x.

Ioffe, S., and Szegedy, C., (2015). ‶ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift,ʺ Machine Learning, Arxiv:1502.03167[Cs.LG]. (2015). https://doi.org/10.48550/arXiv.1502.03167.

Kingma, P. D., and Ba, J. L., (2017). ADAM : A Method for Stochastic Optimization, ″ ArXiv:1412.6980v9 [Cs.LG].

Krishnanaik, V., Someswar, G. M., Purushotham, K., and Rajaiah, A. (2013). Implementation of Wavelet Transform, DPCM and Neural Network for Image Compression. International Journal of Engineering and Computer Science ISSN : 2319-7242, 2(8), 2468-2475.

Luo, H., Tang, Y. Y., Biuk-Aghai, R. P., Yang, X., Yang, L., and Wang, Y.,(2018). ‶Wavelet-Based Extended Morphological Profile and Deep AutoEncoder for Hyperspectral Image Classification, ʺ International Journal of Wavelets. (2018). Multiresolution and information processing, 16(03), 1850016. https://doi.org/10.1142/S0219691318500169.

Ma, H., Liu, D., Yan, N., Li, H., and Wu, F., (2020). ‶End-To-End Optimized Versatile Image Compression with Wavelet-Like Transform, ʺ IEEE Transactions on Pattern Analysis and Machine Intelligence, 44 (3). https://doi.org/10.1109/TPAMI.2020.3026003.

Nwankpa, C. E., Ijomah, W., Gachagan, A., and Marshall, S. (2018). Activation Functions: Comparison of Trends in Practice and Research for Deep Learning ″, Arxiv:1811.03378v1 [Cs.LG]. Activation Functions : Comparison of Trends in Practice and Research for Deep Learning ″.

Osowski, S., Waszczuk, R., and Bojarczak, P., (2006). Image Compression Using Feed Forward Neural Networks- Hierarchical Approach, ″ Lecture Notes in Computer Science. Book Chapter, Springer - Verlag, 3497, 1009-1015. https://doi.org/10.1007/3-540-59497-3_280.

Paul, A., Kundu, A., Chaki, N., Jha, C. S., and Dutta, D., (2022). ‶ Wavelet Enabled Convolutional AutoEncoder Based Deep Neural Network for Hyperspectral Image Denoising,ʺ Multimed Tools Appl 81, 2529-2555. https://doi.org/10.1007/s11042-021-11689-z.

Rabbani, M. and Jones, P.W. (1991). ‶ Digital Image Compression Techniques, ʺvol.TT07, SPIE Press Book, Bellingham, Washington, USA.

Rozema, R., Kruitbosch, H. T., Van Minnen, B., Dorgelo, B., Kraeima, J., and Van Ooijen, P. M. A., (2022). ‶Structural Similarity Analysis of Midfacial Fractures-A Feasibility Study, ʺ Quant Imaging Med Surg., 12(2), 1571-1578. https://doi.org/10.21037/qims-21-564.

Singh, A. V., and Murthy, K. S. (2012). Neuro-Wavelet Based Efficient Image Compression Using Vector Quantization. International Journal of Computers and Applications (0975-08887), 49-N°.3. https://doi.org/10.5120/7610-0653.

Wang, Z., Bovik, A. C., Sheikh, H. R., and Simoncelli, E. P., (2004). Image Quality Assessment: from Error Visibility to Structural Similarity. IEEE Transactions on Image Processing, 13(4), 600-612. https://doi.org/10.1109/tip.2003.819861.

Wang, Z., Simoncelli, E. P., and Bovik, A. C., (2004). Multiscale Structural Similarity for Image Quality Assessment,″ Conference Record of the Thirty-Seventh Asilomar Conference on Signals, Systems and computers, 2, 1398-1402. https://doi.org/10.1109/ACSSC.2003.1292216.

Williams, T., and Li, R., (2018). An Ensemble of Convolutional Neural Networks Using Wavelets for Image Classification, ″ Journal of Software Engineering and Applications, 11(02). https://doi.org/10.4236/jsea.2018.112004.

Zang, Q., and Beneveniste, A., (1992).‶Wavelet Networks, ʺ IEEE Tans. Neural Networks, 7(1), 889-898. https://doi.org/10.1109/72.165591.

Zhang, Q. (1997). Using Wavelet Network in Nonparametric Estimation. IEEE Transactions on Neural Networks, 8(2), 227-236. https://doi.org/10.1109/72.557660.

Zhu, Q., Wang, H., and Zhang, R. (2021). Wavelet Loss Function for Auto-Encoder. In IEEE Access, 9, 27101-27108. https://doi.org/10.1109/ACCESS.2021.3058604.