Spoken Gujarati Language Processing

Upadhyay, Smit

Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/11898

Full metadata record

DC Field	Value	Language
dc.contributor.author	Upadhyay, Smit	-
dc.date.accessioned	2023-08-18T08:52:12Z	-
dc.date.available	2023-08-18T08:52:12Z	-
dc.date.issued	2023-06-01	-
dc.identifier.uri	http://10.1.7.192:80/jspui/handle/123456789/11898	-
dc.description.abstract	In the recognition of speech there are various types of techniques that translates human speech waves/frequency into readable words or in various different forms which is very easily understable by machines. Speech recognition is the very interesting field for researchers who works for different regional language processing which is easily understand by computers. For most popular languages like English and all these technologies reached at top level. For audio Classification and NLP purpose in market many other various types of Classification Techniques/methods are available. Here in this paper, I represent one model which is being best recognizer for Gujarati Spoken Digits. Here, my work finding the good and purposeful use of Artificial Neural Network (ANN) and Convolutional neural Network (CNN) for Spoken Gujarati Digit Dataset. Mainly CNN is used in the purpose for Image Classifier but here time – frequency graph of spoken Gujarati digits used in this for getting Wave graphs. In specific, wavelet transform is utilized in shaping the time-frequency representation because it gives superior recurrence localization for low recurrence signals such as speech. The time-frequency representation is resized to a common measurement utilizing bicubic addition and the coming about image-like representation, referred as Mel-spectrograms, is utilized for recognizing talked digits utilizing CNN. This model I use ANN and CNN for particular findings of finding audio classifications and similarity. At the end after testing and training period the findings are 99 percentage accuracy and it’s Val accuracy is 79 percentage have been gained by running this model at 100 epochs in CNN.	en_US
dc.publisher	Institute of Technology	en_US
dc.relation.ispartofseries	21MCED16;	-
dc.subject	Computer 2021	en_US
dc.subject	Project Report 2021	en_US
dc.subject	Computer Project Report	en_US
dc.subject	Project Report	en_US
dc.subject	21MCE	en_US
dc.subject	21MCED	en_US
dc.subject	21MCED16	en_US
dc.title	Spoken Gujarati Language Processing	en_US
dc.type	Dissertation	en_US
Appears in Collections:	Dissertation, CE (DS)

Files in This Item:

File	Description	Size	Format
21MCED16.pdf	21MCED16	2.26 MB	Adobe PDF	View/Open

Show simple item record

IR @ Nirma University