Please use this identifier to cite or link to this item:
http://10.1.7.192:80/jspui/handle/123456789/9233
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Sharma, Panthak | - |
dc.date.accessioned | 2020-07-24T05:43:45Z | - |
dc.date.available | 2020-07-24T05:43:45Z | - |
dc.date.issued | 2019-06-01 | - |
dc.identifier.uri | http://10.1.7.192:80/jspui/handle/123456789/9233 | - |
dc.description.abstract | Interior designers often struggle to visualize their designs. Generative Adversarial Networks (GANs) can help designers put their ideas on a computer screen in real time by giving voice commands. GANs are a trending research topic in the field of artificial intelligence. Speech recognition is an important aspect of AI today, and GANs can generate new data based on what they learn from a Gaussian distribution. Synthesizing photo-realistic images is a challenging task. This paper presents an approach to synthesizing photo-realistic images from voice commands. Two GAN models are used to generate a healthy-looking image based on the voice commands given. The Google voice API is used for voice-to-text conversion. The converted text is the input to the first GAN, which generates a low-resolution image with primitive shapes conditioned on the given text. The image generated by the first GAN serves as input to the second GAN, along with the same text used earlier. The second GAN refines the image, adds more detail, and converts it to a higher resolution. The dataset used for this purpose was created from scratch by the author and consists of sofa set images for interior design. | en_US |
dc.publisher | Institute of Technology | en_US |
dc.relation.ispartofseries | 17MCEC16; | - |
dc.subject | Computer 2017 | en_US |
dc.subject | Project Report 2017 | en_US |
dc.subject | Computer Project Report | en_US |
dc.subject | Project Report | en_US |
dc.subject | 17MCE | en_US |
dc.subject | 17MCEC | en_US |
dc.subject | 17MCEC16 | en_US |
dc.title | Synthesizing photograph via Voice commands using Generative Adversarial Networks (GANs) | en_US |
dc.type | Dissertation | en_US |
Appears in Collections: | Dissertation, CE |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
17MCEC16.pdf | 17MCEC16 | 4.19 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.