Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/4848
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJoshi, Rutu-
dc.date.accessioned2014-08-19T07:53:38Z-
dc.date.available2014-08-19T07:53:38Z-
dc.date.issued2014-06-01-
dc.identifier.urihttp://hdl.handle.net/123456789/4848-
dc.description.abstractClassification of web pages is essential for improving the quality of web search, focused crawling, development of web directories like Yahoo, ODP etc. This paper compares various classification techniques for the task of web page classification. The classification techniques compared include k nearest neighbours (KNN), Naive Bayes (NB), support vector machine (SVM), classification and regression trees (CART) random forest (RF) and particle swarm optimization (PSO).Impact of using different representations of web pages is also studied. The different representations of the web pages that are used comprise Boolean, bag-of-words and term frequency and inverse document frequency (TFIDF). Experiments are performed using WebKB and R8 datasets. Accuracy and f-measure are used as the evaluation measures. Impact of feature selection on the accuracy of the classifier is moreover demonstrated.en_US
dc.publisherInstitute of Technologyen_US
dc.relation.ispartofseries12MCEC11;-
dc.subjectComputer 2012en_US
dc.subjectProject Report 2012en_US
dc.subjectComputer Project Reporten_US
dc.subjectProject Reporten_US
dc.subject12MCEen_US
dc.subject12MCECen_US
dc.subject12MCEC11en_US
dc.titleWeb Page Classificationen_US
dc.typeDissertationen_US
Appears in Collections:Dissertation, CE

Files in This Item:
File Description SizeFormat 
12MCEC11.pdf12MCEC11640.75 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.