Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/4781
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTanna, Jayesh B.-
dc.date.accessioned2014-08-11T08:52:11Z-
dc.date.available2014-08-11T08:52:11Z-
dc.date.issued2014-06-01-
dc.identifier.urihttp://hdl.handle.net/123456789/4781-
dc.description.abstractText-To-Speech (TTS) conversion is a great topic of research nowadays. This project will be very useful for the illiterate and especially for the blind people. By using this system any person can read any article and understand it. The overall process of Text-To-Speech (TTS) conversion can be divided into mainly three blocks: Text Normalization, Text-To-Phoneme and Phoneme-to-Sound. Text normalization block removes the symbols and replace it with blank space and analyze all the digits and texts from the input texts. Linguistic analysis block provides intonation and prosody to the graphemes. And finally, waveform generation block will generate the original output sound. Text-To-Speech (TTS) conversion can be classified in three ways mainly: Concatenate synthesis, Formant synthesis, Hidden Markov Model (HMM) and Articulatory synthesis. As the name suggest, concatenative synthesis concatenates the different words form the database of pre-recorded words and then maps each of the matched input words with its equivalent phoneme. Due to large memory requirement of this system, this synthesis technology mainly not used in embedded systems, where memory and power is the main factors of the whole system. Formant synthesis works very well without any kind of pre-recorded word’s database. But, the drawback of this synthesis technology is that the naturalness in the output sound (robotic or not like human). HMM based synthesis is a synthesis method based on hidden markov models, also called Statistical Parametric Synthsis. In this system, the frequency spectrum (Vocal tract), fundamental frequency (vocal source) and duration (prosody) of speech are modeled simultaneously by HMMS. Speech waveforms are generated from HMMs themselves based on the maximum likelihood criterion. Articulatory synthesis is an ideal synthesis technique, in which, the whole articulatory system (mouth) of human being is modeled and sound will generate by the program. Programming point of view, this technique is very complex as compared to above methods and output is not that much accurate. Due to its complexity, this technique is rarely used for TTS system. Thus, by considering all the feasible parameters like simplicity, complexity, power, memory etc., concatenative synthesis is superior than other ones.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries12MECE23;-
dc.subjectEC 2012en_US
dc.subjectProject Reporten_US
dc.subjectProject Report 2012en_US
dc.subjectEC Project Reporten_US
dc.subjectEC (ES)en_US
dc.subjectEmbedded Systemsen_US
dc.subjectEmbedded Systems 2012en_US
dc.subject12MECen_US
dc.subject12MECEen_US
dc.subject12MECE23en_US
dc.titleText-To-Speech (TTS) Conversion for Gujarati Languageen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, EC (ES)

Files in This Item:
File Description SizeFormat 
12MECE23.pdf12MECE231.53 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.