Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/5872
Full metadata record
DC Field | Value | Language
dc.contributor.author | Patel, Disha | -
dc.date.accessioned | 2015-07-31T07:19:08Z | -
dc.date.available | 2015-07-31T07:19:08Z | -
dc.date.issued | 2015-06-01 | -
dc.identifier.uri | http://hdl.handle.net/123456789/5872 | -
dc.description.abstract | The web has become a large collection of unstructured data and documents. Finding useful information on it requires a search engine, and even then users must search within the retrieved documents to find what they need, which makes information hard to extract. Web data extraction techniques address this problem. To this end, a few unsupervised web data extraction systems were studied, along with the many different techniques available for information extraction. The existing Roadrunner algorithm, which is efficient for data extraction, is surveyed. It still has a few limitations; to overcome them, an approach is proposed that uses Mining Data Records and Tree Alignment techniques to process the input HTML pages. An experiment comparing the results of the two approaches shows that the proposed approach overcomes the limitations of Roadrunner, so it can be used to extract useful data and obtain the desired results. | en_US
dc.publisher | Institute of Technology | en_US
dc.relation.ispartofseries | 13MCEI25; | -
dc.subject | Computer 2013 | en_US
dc.subject | Project Report 2013 | en_US
dc.subject | Computer Project Report | en_US
dc.subject | Project Report | en_US
dc.subject | 13MCEI | en_US
dc.subject | 13MCEI25 | en_US
dc.subject | INS | en_US
dc.subject | INS 2013 | en_US
dc.subject | CE (INS) | en_US
dc.title | An Unsupervised Web Data Extraction System | en_US
dc.type | Dissertation | en_US
Appears in Collections: Dissertation, CE (INS)

Files in This Item:
File | Description | Size | Format
13MCEI25.pdf | 13MCEI25 | 2.15 MB | Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.