Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/11351
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSanghvi, Vidit-
dc.date.accessioned2022-11-07T09:31:46Z-
dc.date.available2022-11-07T09:31:46Z-
dc.date.issued2022-06-01-
dc.identifier.urihttp://10.1.7.192:80/jspui/handle/123456789/11351-
dc.description.abstractExtracting data in digital form is one of the needed functionality for the companies who process the documents. Many companies does this by manual content writing into computers and it requires a lot of time and one of the tedious works to do. However, since application of optical character recognition has been in trend since few years after successfully transforming content from scanned and non-scanned images and documents to digital format with good amount of accuracy, we explore popular approaches with a goal of building application from scratch to parse the documents we have. We have discussed those approaches and it’s performance, however paper is mainly focused on implementing a system which transforms and stores the content in .xlsx (excel) format. The main goal of this project is to reduce time in processing documents which is currently done by humans at a goods transportation place which classifies the documents as safe to transfer the goods or not safe. The data we have is in native portable document format and also non-native documents. We face challenges parsing them, like handling tabular contents and more overhead of annotation timing. Lastly, we analyse the results we are getting from each approach.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries20MCED09;-
dc.subjectComputer 2020en_US
dc.subjectProject Reporten_US
dc.subjectComputer Project Reporten_US
dc.subjectProject Report 2020en_US
dc.subject20MCEen_US
dc.subject20MCEDen_US
dc.subject20MCED09en_US
dc.subjectCE (DS)en_US
dc.subjectDS 2020en_US
dc.titleDocument Data Extraction With Optical Character Recognitionen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, CE (DS)

Files in This Item:
File Description SizeFormat 
20MCED09.pdf20MCED092.94 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.