Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/12473
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBarot, Manav-
dc.date.accessioned2024-08-29T05:00:52Z-
dc.date.available2024-08-29T05:00:52Z-
dc.date.issued2024-06-01-
dc.identifier.urihttp://10.1.7.192:80/jspui/handle/123456789/12473-
dc.description.abstractOur study addresses the issue of offering real-time assistance for improving exercise form by establishing an advanced architecture that comprehends visual relationships between source and target images. Our method incorporates employing MaxVit Vision Transformers, layered cross-attention layers, and an encoder-transformer decoder fusion to construct intelligent position-correcting captions. We conducted a comprehensive literature review to inform our work, focusing on evaluating MaxVit Vision Transformers for precise pose correctional captions, assessing the impact of stacked cross-attention layers, and quantifying the advantages of transformer decoders and pretrained GPT over traditional models. Rigorous testing against numerous image datasets verifies the proposed architecture’s usefulness in providing users with descriptive information to transition seamlessly between different workout poses. This research builds upon prior breakthroughs, as we have also done rigorous assessments on metrics such as Bleu-4, Rough-L, Meteor, and Cidar, attaining considerably better results surpassing previous standards with a Bleu-4 of 16.2, Rough-L score of 44.06, Meteor score of 36.80, and Cidar score of 11.39. Our research gives a novel alternative to the absence of real-time workout form recommendations. The recommended solution demonstrates promising results, giving a practical way to increasing training routines by eliminating mistakes and reducing the possibility of accidents using intelligent posture corrections captions.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries22MCES01;-
dc.subjectComputer 2022en_US
dc.subjectProject Reporten_US
dc.subjectProject Report 2022en_US
dc.subjectComputer Project Reporten_US
dc.subject22MCEen_US
dc.subject22MCESen_US
dc.subject22MCES01en_US
dc.subjectCE (CCS)en_US
dc.subjectCCS 2022en_US
dc.subjectCyber Securityen_US
dc.titlePose Correctional Captioningen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, CE (CCS)

Files in This Item:
File Description SizeFormat 
22MCES01.pdf22MCES014.1 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.