Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/9106
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBhatt, Malhar-
dc.date.accessioned2020-07-15T07:00:22Z-
dc.date.available2020-07-15T07:00:22Z-
dc.date.issued2019-06-01-
dc.identifier.urihttp://10.1.7.192:80/jspui/handle/123456789/9106-
dc.description.abstractThe increased popularity of Deep Neural Network (DNN) has led to a good amount of research in getting the best hardware configuration to achieve the optimum throughput from server processors running an end to end DNN topology. The pri- mary focus here is to do the inference which is to know how efficiently the processor can execute the already trained models for the Text To Speech (TTS) topology. This thesis explores the benchmarking performed on Intel's server platform with DeepMind's Tacotron-2 topology as a workload. Benchmarking consists of running the topology using baseline and Intel optimized Tensor ow framework. The timeline analysis and system performance analysis helps in understanding the behavior of the topology on the server platform. The processor's system performance is measured using Intel's VTune amplifier tool. In order to understand the impact of workload on the server processor, this thesis focuses on the key parameters like Processing time, Memory bandwidth utilization and Memory latency, Hotspot analysis, and effective Central Processing Unit (CPU) utilization. Along with the above param- eters, timeline and baseline pro ling on the workload is also done. As an outcome of this benchmarking, several observations and recommendations are proposed for further research and improvements that can be done in the area of Silicon architec- ture, Software frameworks, and Kernels. The results highlight the key observations and the performance graphs and comparison using baseline and Intel Optimized framework for the tacotron module.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries17MECE02;-
dc.subjectEC 2017en_US
dc.subjectProject Reporten_US
dc.subjectProject Report 2017en_US
dc.subjectEC Project Reporten_US
dc.subjectEC (ES)en_US
dc.subjectEmbedded Systemsen_US
dc.subjectEmbedded Systems 2017en_US
dc.subject17MECen_US
dc.subject17MECEen_US
dc.subject17MECE02en_US
dc.titlePerformance Analysis of AI workload on Intel hardware platformen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, EC (ES)

Files in This Item:
File Description SizeFormat 
17MECE02.pdf17MECE021.62 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.