Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/4079
Title: Automatic Document Classification
Authors: Jani, Jignesh
Keywords: Computer 2011
Project Report 2011
Computer Project Report
Project Report
11MICT
11MICT19
ICT
ICT 2011
CE (ICT)
Issue Date: 1-Jun-2013
Publisher: Institute of Technology
Series/Report no.: 11MICT19
Abstract: An enormous number of documents is generated in everyday life, and there is always a need to retrieve these documents over any medium. This need has led to the solution of classifying documents with appropriate labels. A lot of research has been done on classifying documents using different classifiers. The classifier used in this research is the Naïve Bayes classifier, chosen for its simplicity. The Naïve Bayes classifier assigns a document to only one class, no matter by what fraction the posterior probabilities of the other classes are smaller. By considering the fraction by which the other associated terms are smaller, we can rank a document primarily in one specific context and, to a lesser degree, in another context. The Apriori algorithm is used to find the frequent patterns in a document; these patterns give the context of the document and help in labeling it with a more appropriate classification tag. The proposed approach classifies the document with the Naïve Bayes classifier at the first level, then finds associated terms in the document and compares them with the frequent patterns already mined from the training dataset. This two-level classification gives a more precise label to the document.
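The two-level idea described in the abstract can be illustrated with a minimal sketch. It is not the implementation from the report: the toy training data, the function names (train_nb, nb_posteriors, apriori, classify_two_level), the support threshold, and the way the second level is reported (a count of matching frequent patterns next to each posterior) are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict
from itertools import combinations

def train_nb(docs):
    """Fit multinomial Naive Bayes counts from (tokens, label) pairs."""
    term_counts = defaultdict(Counter)
    class_docs = Counter()
    for tokens, label in docs:
        term_counts[label].update(tokens)
        class_docs[label] += 1
    vocab = {t for tokens, _ in docs for t in tokens}
    priors = {c: n / len(docs) for c, n in class_docs.items()}
    return priors, term_counts, vocab

def nb_posteriors(tokens, priors, term_counts, vocab):
    """Return normalized posteriors P(class | tokens) with Laplace smoothing."""
    log_scores = {}
    for c, prior in priors.items():
        total = sum(term_counts[c].values())
        score = math.log(prior)
        for t in tokens:
            if t in vocab:  # ignore terms never seen in training
                score += math.log((term_counts[c][t] + 1) / (total + len(vocab)))
        log_scores[c] = score
    top = max(log_scores.values())
    unnorm = {c: math.exp(s - top) for c, s in log_scores.items()}
    z = sum(unnorm.values())
    return {c: v / z for c, v in unnorm.items()}

def apriori(transactions, min_support):
    """Return {itemset: support count} for itemsets seen in >= min_support transactions."""
    transactions = [set(t) for t in transactions]
    current = {frozenset([i]) for t in transactions for i in t}
    frequent, k = {}, 1
    while current:
        counts = {c: sum(1 for t in transactions if c <= t) for c in current}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: keep candidates whose (k-1)-subsets are all frequent.
        current = {c for c in candidates
                   if all(frozenset(s) in level for s in combinations(c, k - 1))}
    return frequent

def classify_two_level(tokens, model, patterns):
    """Level 1: Naive Bayes posteriors. Level 2: count matching frequent patterns."""
    post = nb_posteriors(tokens, *model)
    terms = set(tokens)
    ranking = []
    for c in sorted(post, key=post.get, reverse=True):
        # Assumption: only multi-term patterns count as "associated terms".
        matched = [p for p in patterns[c] if len(p) >= 2 and p <= terms]
        ranking.append((c, post[c], len(matched)))
    return {"label": ranking[0][0], "ranking": ranking}

# Hypothetical toy training set (not from the report).
train = [
    (["goal", "match", "team", "score"], "sports"),
    (["team", "coach", "match", "win"], "sports"),
    (["stock", "market", "price", "trade"], "finance"),
    (["market", "price", "bank", "trade"], "finance"),
]
model = train_nb(train)
by_class = defaultdict(list)
for tokens, label in train:
    by_class[label].append(tokens)
patterns = {c: apriori(docs, min_support=2) for c, docs in by_class.items()}

result = classify_two_level(["team", "match", "price"], model, patterns)
print(result)
```

The ranking keeps every class with its posterior instead of discarding all but the winner, so a document can be placed mainly in one context and to a lesser degree in another. How the pattern matches should adjust the final label is left open here, since the abstract does not state the rule.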
URI: http://10.1.7.181:1900/jspui/123456789/4079
Appears in Collections: Dissertation, CE (ICT)

Files in This Item:
File: 11MICT19.pdf
Description: 11MICT19
Size: 466.4 kB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.