Please use this identifier to cite or link to this item:
http://10.1.7.192:80/jspui/handle/123456789/11344
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Bhadiyadra, Dhwani | - |
dc.date.accessioned | 2022-11-07T07:54:35Z | - |
dc.date.available | 2022-11-07T07:54:35Z | - |
dc.date.issued | 2022-06-01 | - |
dc.identifier.uri | http://10.1.7.192:80/jspui/handle/123456789/11344 | - |
dc.description.abstract | Voice activity detection (VAD) deals with the problem of effectively recognizing voice regions in audio. With the increase in usage of voice-driven applications, the use of voice/speech activity detection is only increasing. In this paper, a method for voice activity detection is proposed in audios or recordings where the voice is acting as both foreground and background noise. The goal is to develop a single model that can detect voiced regions in both the abovementioned cases. The proposed model is tested on two datasets, one is public dataset, where voice is acting as foreground noise and another is a private dataset, where voice is acting as background noise. Here a detailed comparison of VAD using statistical and machine learning approaches has been carried out, and it has been concluded that the machine learning approach is better with 86.38 % testing accuracy and 94% testing sensitivity on the TIMIT corpus (Public dataset) and 73.27% testing accuracy and 79.67% testing sensitivity on Philips Lumea recordings (Private dataset) using the random forest machine-learning algorithm. | en_US |
dc.publisher | Institute of Technology | en_US |
dc.relation.ispartofseries | 20MCED03; | - |
dc.subject | Computer 2020 | en_US |
dc.subject | Project Report | en_US |
dc.subject | Computer Project Report | en_US |
dc.subject | Project Report 2020 | en_US |
dc.subject | 20MCE | en_US |
dc.subject | 20MCED | en_US |
dc.subject | 20MCED03 | en_US |
dc.subject | CE (DS) | en_US |
dc.subject | DS 2020 | en_US |
dc.title | Voice Activity Detection using Statistical and Machine Learning Approaches | en_US |
dc.type | Dissertation | en_US |
Appears in Collections: | Dissertation, CE (DS) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
20MCED03.pdf | 20MCED03 | 1.63 MB | Adobe PDF | ![]() View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.