Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/8832
Full metadata record
DC FieldValueLanguage
dc.contributor.authorShah, Viral-
dc.date.accessioned2019-08-31T06:57:10Z-
dc.date.available2019-08-31T06:57:10Z-
dc.date.issued2018-06-01-
dc.identifier.urihttp://10.1.7.192:80/jspui/handle/123456789/8832-
dc.description.abstractCurrent surveillance systems can be designed to detect unusual activities in the surveillance video, such as object detection, movement tracking, and activity monitoring. However, they have a limitation in recognizing unusual audio associated with video as an example - a gunshot in the background on the scene. In such cases, identification of such abnormal audio is important for surveillance and safety. This thesis work focuses on recognition and classification of audio signals and by implementation of an audio classification model. This model is based on con- volution neural networks and has been trained by UrbanSound8K and Data speech commands data sets. The model extracts spectrograms from input audio signal and gives it as an input to the convolution neural network. The output of the network is the predicted label for corresponding audio. This model can classify the abnormal environmental sounds like a gunshot. The model can also predict small keywords like yes, no, up, and similar common audio words. This classification has been carried out on 2 different networks. One using VGG16 which is an image classification model. VGG-16 model achieved an audio classification accuracy of 100% for the cleaned data and around 80% for the noisy data. From the classification output of VGG16, it was concluded that neural net- works designed for image classification can also be employed for audio classification. The second model was a 2-layer convolution neural network, which gave audio clas- sification accuracy of 93.00%. The contribution of this thesis is an exploration of deep learning approach for task of audio classification.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries16MECE21;-
dc.subjectEC 2016en_US
dc.subjectProject Reporten_US
dc.subjectProject Report 2016en_US
dc.subjectEC Project Reporten_US
dc.subjectEC (ES)en_US
dc.subjectEmbedded Systemsen_US
dc.subjectEmbedded Systems 2016en_US
dc.subject16MECen_US
dc.subject16MECEen_US
dc.subject16MECE21en_US
dc.titleAudio Classification using Deep Neural Networken_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, EC (ES)

Files in This Item:
File Description SizeFormat 
16MECE21.pdf16MECE213.16 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.