Please use this identifier to cite or link to this item:
http://10.1.7.192:80/jspui/handle/123456789/8359
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Prajapati, Dineshkumar Jethalal | - |
dc.date.accessioned | 2019-05-13T06:46:19Z | - |
dc.date.available | 2019-05-13T06:46:19Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | http://10.1.7.192:80/jspui/handle/123456789/8359 | - |
dc.description.abstract | Multilevel association rule discovers knowledge from conceptual hierarchical data set and thus provides more significant information than single level association rule. However, existing multilevel association rule mining algorithms have limitation of processing speed while analyzing big data. To overcome this, Hadoop-based distributed multilevel association rule mining approach is proposed which process the transactional dataset into partitions then transfers each task to all participating nodes. Thus, it reduces inter node message passing in the cluster. The proposed methodology is applied in two phases. In the first phase, the transactional dataset is generated from big sales dataset using Hadoop MapReduce framework. Then, a proposed distributed multilevel frequent pattern mining algorithms MR-MLAB (MapReduce based Multilevel Apriori using Bottom-up Approach) and MR-MLAT (MapReduce based Multilevel Apriori using Top-down Approach) are used to generate level-crossing frequent itemset for each level of concept hierarchy. Performance of the system is compared based on minimum support threshold at different level of concept hierarchy and also by varying dataset size. Moreover, time efficiency of proposed algorithms is compared with existing Traditional Multilevel Apriori (TMLA) algorithm. Due to ancestor relationship, this proposed distributed multilevel frequent pattern mining algorithm generates huge amount of hierarchical redundancy. Thus, to improve the performance of the system, such hierarchical redundancy needs to be eliminated. In second phase, distributed multilevel frequent pattern mining algorithm is applied on regional transactional dataset to generate frequent k-itemsets for each region. Then, multilevel association rules are generated for each region. These generated regional multilevel rules are so large that it becomes complex to analyze it using traditional methods. Hence, MR-MCIRD (MapReduce based Multilevel Consistent and Inconsistent Rule Detection) algorithm is proposed to derive consistent and inconsistent multilevel rules for each region. Further, extracted multilevel consistent and inconsistent rules are evaluated and compared based on confidence, all-confidence, cosine, interestingness of a rule, lift and conviction interestingness measures. The proposed methodology helps organization to improve marketing strategy for regions where inconsistent rules are relatively higher than consistent rules. Pruned interesting rules also provide useful and actionable knowledge to the users. | en_US |
dc.publisher | Institute of Technology | en_US |
dc.relation.ispartofseries | TT000062; | - |
dc.subject | Theses | en_US |
dc.subject | Computer Theses | en_US |
dc.subject | Theses IT | en_US |
dc.subject | Dr. Sanjay Garg | en_US |
dc.subject | 12EXTPHDE91 | en_US |
dc.subject | ITFCE027 | en_US |
dc.subject | TT000062 | en_US |
dc.title | Multilevel Association Rule Mining in Distributed Environment | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | Ph.D. Research Reports |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
TT000062.pdf | TT000062 | 1.35 MB | Adobe PDF | ![]() View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.