Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/11344
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBhadiyadra, Dhwani-
dc.date.accessioned2022-11-07T07:54:35Z-
dc.date.available2022-11-07T07:54:35Z-
dc.date.issued2022-06-01-
dc.identifier.urihttp://10.1.7.192:80/jspui/handle/123456789/11344-
dc.description.abstractVoice activity detection (VAD) deals with the problem of effectively recognizing voice regions in audio. With the increase in usage of voice-driven applications, the use of voice/speech activity detection is only increasing. In this paper, a method for voice activity detection is proposed in audios or recordings where the voice is acting as both foreground and background noise. The goal is to develop a single model that can detect voiced regions in both the abovementioned cases. The proposed model is tested on two datasets, one is public dataset, where voice is acting as foreground noise and another is a private dataset, where voice is acting as background noise. Here a detailed comparison of VAD using statistical and machine learning approaches has been carried out, and it has been concluded that the machine learning approach is better with 86.38 % testing accuracy and 94% testing sensitivity on the TIMIT corpus (Public dataset) and 73.27% testing accuracy and 79.67% testing sensitivity on Philips Lumea recordings (Private dataset) using the random forest machine-learning algorithm.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries20MCED03;-
dc.subjectComputer 2020en_US
dc.subjectProject Reporten_US
dc.subjectComputer Project Reporten_US
dc.subjectProject Report 2020en_US
dc.subject20MCEen_US
dc.subject20MCEDen_US
dc.subject20MCED03en_US
dc.subjectCE (DS)en_US
dc.subjectDS 2020en_US
dc.titleVoice Activity Detection using Statistical and Machine Learning Approachesen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, CE (DS)

Files in This Item:
File Description SizeFormat 
20MCED03.pdf20MCED031.63 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.