UNIFIED COMMUNICATION: A SURVEY ON HARMONIZING REGIONAL LANGUAGE DIVERSITY

Authors

  • Thenarasi V, Assistant Professor, Department of Computer Science, Government First Grade College, Siddartha Layout, Mysore, India
  • Santhosh Kumar B N, Assistant Professor of Computer Science, Maharani's Science College for Women, Mysore, India
  • Prakasha Raje Urs M, Assistant Professor of Computer Science, Maharani's Science College for Women, Mysore, India
  • Rashmi R, Assistant Professor of Physics, Maharani's Science College for Women, Mysore, India

DOI:

https://doi.org/10.29121/shodhkosh.v5.i1.2024.2678

Keywords:

Text Extraction, OCR, Language Detection, Translation, Summarization

Abstract [English]

This project addresses the task of extracting insights from text and images using Optical Character Recognition (OCR). Once text has been extracted, its language must be identified, which is treated as a multiclass classification problem; this step is especially important given the limitations of existing machine translation systems for Indian languages. The paper examines the challenges that arise in machine translation, morphological analysis, parsing, word sense disambiguation, and the translation process itself, with the aim of improving translation quality. Beyond translation, the project includes automatic text summarization to distill essential content. By integrating OCR, language detection, translation, and text summarization, the approach aims to facilitate unified communication by harmonizing diverse voices in multilingual settings.
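
The four stages named in the abstract (OCR, language detection, translation, summarization) can be chained as in the minimal Python sketch below. It is an illustration under stated assumptions, not the authors' implementation: `detect_script`, `summarize`, and `run_pipeline` are hypothetical names, the `ocr` and `translate` arguments are placeholders for whatever engines are actually used, detection is reduced to counting characters per Unicode script block (which identifies the script, not the language, since for example Devanagari is shared by several languages), and the summarizer is a simple word-frequency extractive heuristic applied to the translated English text.

```python
import re
from collections import Counter
from typing import Callable, Dict

# Unicode block ranges for a few Indian scripts (illustrative subset).
SCRIPT_RANGES = {
    "Devanagari": (0x0900, 0x097F),
    "Tamil": (0x0B80, 0x0BFF),
    "Telugu": (0x0C00, 0x0C7F),
    "Kannada": (0x0C80, 0x0CFF),
    "Malayalam": (0x0D00, 0x0D7F),
}


def detect_script(text: str) -> str:
    """Return the script that most characters of `text` belong to."""
    counts: Counter = Counter()
    for ch in text:
        code_point = ord(ch)
        for name, (low, high) in SCRIPT_RANGES.items():
            if low <= code_point <= high:
                counts[name] += 1
                break
        else:
            # Not in any listed Indic block: count plain ASCII letters as Latin.
            if ch.isascii() and ch.isalpha():
                counts["Latin"] += 1
    return counts.most_common(1)[0][0] if counts else "Unknown"


def summarize(text: str, max_sentences: int = 1) -> str:
    """Extractive summary: keep the sentences with the highest mean word frequency."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    freq = Counter(re.findall(r"\w+", text.lower()))

    def score(sentence: str) -> float:
        words = re.findall(r"\w+", sentence.lower())
        return sum(freq[w] for w in words) / max(len(words), 1)

    top = sorted(sentences, key=score, reverse=True)[:max_sentences]
    # Emit the selected sentences in their original order.
    return " ".join(s for s in sentences if s in top)


def run_pipeline(
    image_ref: str,
    ocr: Callable[[str], str],             # assumption: image reference -> extracted text
    translate: Callable[[str, str], str],  # assumption: (text, script) -> English text
    max_sentences: int = 1,
) -> Dict[str, str]:
    """Chain OCR, script detection, translation, and summarization."""
    text = ocr(image_ref)
    script = detect_script(text)
    english = text if script == "Latin" else translate(text, script)
    return {"script": script, "summary": summarize(english, max_sentences)}
```

Passing the OCR and translation stages in as callables keeps the sketch independent of any particular engine; a real system would replace the script heuristic with a trained multiclass language classifier, as the abstract suggests.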

Published

2024-01-31

How to Cite

V, T., B N, S. K., Urs M, P. R., & R, R. (2024). UNIFIED COMMUNICATION: A SURVEY ON HARMONIZING REGIONAL LANGUAGE DIVERSITY. ShodhKosh: Journal of Visual and Performing Arts, 5(1), 1070–1076. https://doi.org/10.29121/shodhkosh.v5.i1.2024.2678