The Mining Algorithm of Frequent Itemsets based on Mapreduce and FP-tree

Publications

Share / Export Citation / Email / Print / Text size:

International Journal of Advanced Network, Monitoring and Controls

Xi'an Technological University

Subject: Computer Science, Software Engineering

GET ALERTS

eISSN: 2470-8038

DESCRIPTION

0
Reader(s)
0
Visit(s)
0
Comment(s)
0
Share(s)

SEARCH WITHIN CONTENT

FIND ARTICLE

Volume / Issue / page

Related articles

VOLUME 2 , ISSUE 3 (September 2017) > List of articles

The Mining Algorithm of Frequent Itemsets based on Mapreduce and FP-tree

Bo He / Jianhui Pei / Hongyuan Zhang

Keywords : FP-tree, Mapreduce, Frequent itemsets, Big data, Data mining

Citation Information : International Journal of Advanced Network, Monitoring and Controls. Volume 2, Issue 3, Pages 68-71, DOI: https://doi.org/10.1109/iccnea.2017.59

License : (CC BY-NC-ND 4.0)

Published Online: 11-April-2018

ARTICLE

ABSTRACT

The date mining based on big data was a very important field. In order to improve the mining efficiency, the mining algorithm of frequent itemsets based on mapreduce and FP-tree was proposed, namely, MAFIM algorithm. Firstly, the data were distributed by mapreduce. Secondly, local frequent itemsets were computed by FP-tree. Thirdly, the mining results were combined by the center node. Finally, global frequent itemsets were got by mapreduce and the search strategy. Theoretical analysis and experimental results suggest that MAFIM algorithm is fast and effective.

Content not available PDF Share

FIGURES & TABLES

REFERENCES

Han JW, Kamber M, Pei J. Data Mining: Concepts and Techniques Third Edition [M]. San Francisco: Morgan Kaufmann, 2011.

 

Big Data Across the Federal Government [EB/OL]. http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_fact_sheet_final_1.pdf, 2012.

 

Science. Special Online Collection: Dealing with Data [EB]. http://www.sciencemag.org/site/special/data/, 2011.

 

Marconi K, Lehmann H. Big Data and Health Analytics[M]. Boca Raton:CRC Press, 2014.

 

He B, Yan H. Incremental Updating Algorithm of Global Maximum Frequent Itemsets in Distributed Database[J]. Journal of Sichuan University(Engineering Science Edition), 2012,44(3):112~117. (in Chinese with English abstract)

 

McKinsey&Company. The big-data revolution in US health care: Accelerating value and innovation [R]. http://www.mckinsey.com/industries/healthcare-systems-and-services/our-insights/the-big-data-revolution-in-us-health-care, 2013.

 

He B. Fast Mining of Global Maximum Frequent Itemsets in Distributed Database [J]. Control and Decision, 2011,26(8):1214~1218. (in Chinese with English abstract)

 

Muin J. Khoury and John P. A. Ioannidis. Big data meets public health[J]. Science, 2014, 346(6213) : 1054-1055.

 

Chen ZB, Han H, Wang JX. Data Warehouse and Data Mining[M].Beijing: Tsinghua University Press, 2009.

 

Song YQ, Zhu ZH, Chen G. An algorithm and its updating algorithm based on FP-tree for mining maximum frequent itemsets[J]. Journal of software, 2003,14(9):1586~1592(in Chinese with English abstract)

 

Bayardo RJ. Efficiently mining long patterns form databases[C]. In: Haas LM, Tiwary A, eds. Proc. Of the ACM SIGMOD International Conference on Management of Data. Dallas:ACM Press, 2000. 1~12.

 

EXTRA FILES

COMMENTS