Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/6646
Title: Speaker Diarization
Authors: Naik, Nirali
Keywords: Computer 2014
Project Report 2014
Computer Project Report
Project Report
14MCE
14MCEC
14MCEC16
Issue Date: 1-Jun-2016
Publisher: Institute of Technology
Series/Report no.: 14MCEC16;
Abstract: We live in the century, where information is strength. Information can be saved in the form of text/multimedia. One such form is Speech audio files. To access those file easily and efficiently, Speaker Diarization is the best option. Speaker Diarization is all about \who spoke when?". This system takes in the input in form of an audio file. The diarization sytem makes the segments of speech file such that each segment is ho- mogeneous. Here, homogeneous segment means that those region of speech contains speech from only one speaker. These segments will be given to the clustering module. This module will merge all the identical segments. To identify the segments from the same speaker, system has to build models for each speaker. These models are constructed using GMMs,using EM algorithm. Feature extraction techniques are ap- plied for extracting speaker specific information. VOice activity detection algorithms are used to differentiate between speech/non-speech reginons and identifying speaker change points. Diarization Systems can be used in Movie analysis, Automatic speech segmentation, Rich transcription, Audio archiving and monitoring, Audio indexing and retrieval. This system doesn't know the number of speakers involved in the sys- tem in advance. We just know is the domain of audio file(which type of recording is this, i.e. telephone conversation, meeting room conversation etc.). The evalua- tion measure used for speaker diarization systems is Diarization Error Rate. This is computed using, Miss, false alarm and confusion. The ideal output expected using diarization system is the speech regions related to speakers, and the speaker lables.
URI: http://hdl.handle.net/123456789/6646
Appears in Collections:Dissertation, CE

Files in This Item:
File Description SizeFormat 
14MCEC16.pdf14MCEC161.8 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.