VOICE ASSISTANTS ENRICHED WITH NLU AND INTEGRATED FACE RECOGNITION
DOI:
https://doi.org/10.29121/shodhkosh.v5.i6.2024.4429Keywords:
Voice Assistant, Natural language processing (NLP), Speech RecognitionAbstract [English]
Voice assistants (VAs) enriched with Natural Language Understanding (NLU) and integrated face recognition represent a significant advancement in human-computer interaction. NLU enhances the VA’s ability to interpret complex commands, context, and user intent, enabling more accurate responses. The integration of face recognition further personalizes the user experience by identifying individuals, allowing for tailored responses and secure access to functions. These advanced VAs streamline tasks such as sending emails, adjusting device settings, controlling media, and performing system operations like shutdowns, while also offering seamless authentication through facial recognition. This research explores the potential of combining NLU and face recognition to enhance user accessibility, security, and convenience. The study highlights how these technologies work together to provide more context-aware interactions and personalized services. It aims to demonstrate the transformative impact of NLU- and face recognition-enhanced VAs in improving usability, efficiency, and user experience across various applications.
References
Kottilingam. Kottursamy, "A review on finding efficient approach to detect customer emotion analysis using deep learning analysis", Journal of Trends in Computer Science and Smart Technology, vol. 3, no. 2, pp. 95-113, 2021. DOI: https://doi.org/10.36548/jtcsst.2021.2.003
Amrita Thakur, Pujan Budhathoki, Sarmila Upreti, Shirish Shrestha and Subarna Shakya, "Real Time Sign Language Recognition and Speech Generation", Journal of Innovative Image Processing, vol. 2, no. 2, pp. 65-76, 2020. DOI: https://doi.org/10.36548/jiip.2020.2.001
Jasmeet Kaur and Anil Kumar, "Speech Emotion Recognition Using CNN k-NN MLP and Random Forest" in Computer Networks and Inventive Communication Technologies, Singapore:Springer, pp. 499-509, 2021. DOI: https://doi.org/10.1007/978-981-15-9647-6_39
Jing Han, Zixing Zhang, Fabien Ringeval and Björn Schuller, "Reconstruction-error-based learning for continuous emotion recognition in speech", In 2017 IEEE international conference on acoustics speech and signal processing (ICASSP), pp. 2367-2371, 2017. DOI: https://doi.org/10.1109/ICASSP.2017.7952580
S. Shahnawazuddin, Rohit Sinha, Sparse coding over redundant dictionaries for fast adaptation of speech recognition system, Computer Speech & Language, Volume 43, 2017, Pages 1-17, ISSN 0885-2308, Article (CrossRefLink). DOI: https://doi.org/10.1016/j.csl.2016.10.004
Kabid Hassan Shibly, Samrat Kumar Dey, Md.Aminul Islam, Shahriar Iftekhar Showrav, Design and Development of Hand Gesture Based Virtual Mouse, ICASERT, 2019 DOI: https://doi.org/10.1109/ICASERT.2019.8934612
D. J. Atha, M. R. Jahanshahi, Evaluation of deep learning approaches based on convolutional neural networks for corrosion detection, Struct. Health Monit. 17 (5) (2018) 1110–1128. Article (CrossRef Link) [10] S. Shah DOI: https://doi.org/10.1177/1475921717737051
Deepak shende, Ria Umahiya, Monika Raghorte, Aishwarya Bhisikar, Anup Bhange.AI based voice assistant using python. JETIR, february 2019, vol 6, issue 2.
Mayank Chourasia, Shriya Haral, Srushti Bhatkar and Smita Kulkarni, "Emotion recognition fromspeech signal using deep learning", Intelligent Data Communication Technologies and Internet of Things:Proceedings of ICICI 2020, pp. 471-481, 2021. DOI: https://doi.org/10.1007/978-981-15-9509-7_39
Abhilash, S.S., Thomas, L., Wilson, N. and Chaithanya, C., 2018. Virtual Mouse Using Hand Gesture. International Research Journal of Engineering and Technology (IRJET), 5(4), pp.3903-3906. DOI: https://doi.org/10.21884/IJMTER.2017.4069.3WZ5X
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Balakrishnan S G, Sathiya Dharan K, Prasanth D, Saravanakumar M, Nandhakishore K

This work is licensed under a Creative Commons Attribution 4.0 International License.
With the licence CC-BY, authors retain the copyright, allowing anyone to download, reuse, re-print, modify, distribute, and/or copy their contribution. The work must be properly attributed to its author.
It is not necessary to ask for further permission from the author or journal board.
This journal provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.
 
							 
			
		 
			 
			 
				













 
  
  
  
  
 