Please use this identifier to cite or link to this item:
http://10.1.7.192:80/jspui/handle/123456789/12473
Title: | Pose Correctional Captioning |
Authors: | Barot, Manav |
Keywords: | Computer 2022 Project Report Project Report 2022 Computer Project Report 22MCE 22MCES 22MCES01 CE (CCS) CCS 2022 Cyber Security |
Issue Date: | 1-Jun-2024 |
Publisher: | Institute of Technology |
Series/Report no.: | 22MCES01; |
Abstract: | Our study addresses the issue of offering real-time assistance for improving exercise form by establishing an advanced architecture that comprehends visual relationships between source and target images. Our method incorporates employing MaxVit Vision Transformers, layered cross-attention layers, and an encoder-transformer decoder fusion to construct intelligent position-correcting captions. We conducted a comprehensive literature review to inform our work, focusing on evaluating MaxVit Vision Transformers for precise pose correctional captions, assessing the impact of stacked cross-attention layers, and quantifying the advantages of transformer decoders and pretrained GPT over traditional models. Rigorous testing against numerous image datasets verifies the proposed architecture’s usefulness in providing users with descriptive information to transition seamlessly between different workout poses. This research builds upon prior breakthroughs, as we have also done rigorous assessments on metrics such as Bleu-4, Rough-L, Meteor, and Cidar, attaining considerably better results surpassing previous standards with a Bleu-4 of 16.2, Rough-L score of 44.06, Meteor score of 36.80, and Cidar score of 11.39. Our research gives a novel alternative to the absence of real-time workout form recommendations. The recommended solution demonstrates promising results, giving a practical way to increasing training routines by eliminating mistakes and reducing the possibility of accidents using intelligent posture corrections captions. |
URI: | http://10.1.7.192:80/jspui/handle/123456789/12473 |
Appears in Collections: | Dissertation, CE (CCS) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
22MCES01.pdf | 22MCES01 | 4.1 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.