Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/5163
Title: Experimental Evaluation Of Different Classification Techniques For Web Page Classification
Authors: Joshi, Rutu
Thakkar, Priyank
Keywords: Classification and Regression Trees (CART)
K-Nearest Neighbours (KNN)
Naive Bayes (NB)
Particle Swarm Optimization (PSO)
Random Forest
Support Vector Machine (SVM)
Web Page Classification
Computer Faculty Paper
Faculty Paper
ITFCE037
Issue Date: May-2014
Publisher: IJARET
Series/Report no.: ITFCE037-3;
Abstract: Classification of web pages is essential for improving the quality of web search, focused crawling, development of web directories like Yahoo, ODP etc. This paper compares various classification techniques for the task of web page classification. The classification techniques compared include k-Nearest Neighbours (KNN), Naive Bayes (NB), Support Vector Machine (SVM), Classification and Regression Trees (CART), Random Forest (RF) and Particle Swarm Optimization (PSO).Impact of using different representations of web pages is also studied. The different representations of the web pages that are used comprise Boolean, bag-of-words and Term Frequency and Inverse Document Frequency (TFIDF). Experiments are performed using WebKB and R8 data sets. Accuracy and F-measure are used as the evaluation measures. Impact of feature selectionon the accuracy of the classifier is moreover demonstrated.
Description: International Journal of Advanced Research in Engineering and Technology (IJARET), Vol. 5 (5), May, 2014, Page No. 91 – 101
URI: http://hdl.handle.net/123456789/5163
ISSN: 0976 - 6480
Appears in Collections:Faculty Papers, CE

Files in This Item:
File Description SizeFormat 
ITFCE037-3.pdfITFCE037-3210.1 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.