Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/6646
Full metadata record
DC FieldValueLanguage
dc.contributor.authorNaik, Nirali-
dc.date.accessioned2016-07-14T07:43:01Z-
dc.date.available2016-07-14T07:43:01Z-
dc.date.issued2016-06-01-
dc.identifier.urihttp://hdl.handle.net/123456789/6646-
dc.description.abstractWe live in the century, where information is strength. Information can be saved in the form of text/multimedia. One such form is Speech audio files. To access those file easily and efficiently, Speaker Diarization is the best option. Speaker Diarization is all about \who spoke when?". This system takes in the input in form of an audio file. The diarization sytem makes the segments of speech file such that each segment is ho- mogeneous. Here, homogeneous segment means that those region of speech contains speech from only one speaker. These segments will be given to the clustering module. This module will merge all the identical segments. To identify the segments from the same speaker, system has to build models for each speaker. These models are constructed using GMMs,using EM algorithm. Feature extraction techniques are ap- plied for extracting speaker specific information. VOice activity detection algorithms are used to differentiate between speech/non-speech reginons and identifying speaker change points. Diarization Systems can be used in Movie analysis, Automatic speech segmentation, Rich transcription, Audio archiving and monitoring, Audio indexing and retrieval. This system doesn't know the number of speakers involved in the sys- tem in advance. We just know is the domain of audio file(which type of recording is this, i.e. telephone conversation, meeting room conversation etc.). The evalua- tion measure used for speaker diarization systems is Diarization Error Rate. This is computed using, Miss, false alarm and confusion. The ideal output expected using diarization system is the speech regions related to speakers, and the speaker lables.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries14MCEC16;-
dc.subjectComputer 2014en_US
dc.subjectProject Report 2014en_US
dc.subjectComputer Project Reporten_US
dc.subjectProject Reporten_US
dc.subject14MCEen_US
dc.subject14MCECen_US
dc.subject14MCEC16en_US
dc.titleSpeaker Diarizationen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, CE

Files in This Item:
File Description SizeFormat 
14MCEC16.pdf14MCEC161.8 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.