Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/11898
Full metadata record
DC FieldValueLanguage
dc.contributor.authorUpadhyay, Smit-
dc.date.accessioned2023-08-18T08:52:12Z-
dc.date.available2023-08-18T08:52:12Z-
dc.date.issued2023-06-01-
dc.identifier.urihttp://10.1.7.192:80/jspui/handle/123456789/11898-
dc.description.abstractIn the recognition of speech there are various types of techniques that translates human speech waves/frequency into readable words or in various different forms which is very easily understable by machines. Speech recognition is the very interesting field for researchers who works for different regional language processing which is easily understand by computers. For most popular languages like English and all these technologies reached at top level. For audio Classification and NLP purpose in market many other various types of Classification Techniques/methods are available. Here in this paper, I represent one model which is being best recognizer for Gujarati Spoken Digits. Here, my work finding the good and purposeful use of Artificial Neural Network (ANN) and Convolutional neural Network (CNN) for Spoken Gujarati Digit Dataset. Mainly CNN is used in the purpose for Image Classifier but here time – frequency graph of spoken Gujarati digits used in this for getting Wave graphs. In specific, wavelet transform is utilized in shaping the time-frequency representation because it gives superior recurrence localization for low recurrence signals such as speech. The time-frequency representation is resized to a common measurement utilizing bicubic addition and the coming about image-like representation, referred as Mel-spectrograms, is utilized for recognizing talked digits utilizing CNN. This model I use ANN and CNN for particular findings of finding audio classifications and similarity. At the end after testing and training period the findings are 99 percentage accuracy and it’s Val accuracy is 79 percentage have been gained by running this model at 100 epochs in CNN.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries21MCED16;-
dc.subjectComputer 2021en_US
dc.subjectProject Report 2021en_US
dc.subjectComputer Project Reporten_US
dc.subjectProject Reporten_US
dc.subject21MCEen_US
dc.subject21MCEDen_US
dc.subject21MCED16en_US
dc.titleSpoken Gujarati Language Processingen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, CE (DS)

Files in This Item:
File Description SizeFormat 
21MCED16.pdf21MCED162.26 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.