邓晓衡

教授 博士生导师 硕士生导师

入职时间:2006-01-05

所在单位:电子信息学院

职务:院长

学历:博士研究生毕业

性别:男

联系方式:Email:dxh@csu.edu.cn

学位:博士学位

在职信息:在职

主要任职:计算机学院副院长 湖南省数据传感与交换设备工程中心 主任 IEEE RS Chapter长沙 主席CCF普适计算专委 委员 CCF长沙 执委

毕业院校:中南大学

学科:计算机科学与技术
信息与通信工程

当前位置: 邓晓衡 >> 论文成果

M. Yimin, G. Junhao, Mwakapesa D S, et al. PFIMD: a parallel MapReduce-based algorithm for frequent itemset mining[J]. Multimedia Systems, 2021, 27(4): 709-722.

发布时间:2024-03-13

点击次数:

发表刊物:Multimedia Systems

摘要:Frequent itemset mining (FIM) is a significant data mining technique which is widely adopted in numerous applications for exploring frequent items. With the rapid growth and expansion of datasets, FIM has become an interesting topic for many researchers, which has triggered many innovations of numerous FIM algorithms in the big data environment. This study aims to design an optimization parallel frequent itemset mining algorithm based on MapReduce, named as PFIMD algorithm, to deal with the problem of time and space complexity during processing and computing item sets, as well as the failure to adequately balance the load among parallel tasks in the existing parallel FIM algorithms. First, a structure called DiffNodeset is adopted for avoiding the increase of N−list cardinality in the MRPrePost algorithm effectively. Then, a 2-way comparison strategy is designed to speed up the DiffNodeset generation of 2-itemsets and reduce the time complexity of the algorithm. Finally, the steps of the improved algorithm are parallelized using the cloud computing platform Hadoop and the program- ming model MapReduce. Moreover, to achieve a uniform grouping of each item in F−list, a load balancing strategy based on dynamic grouping is proposed, which solves the problem of uneven load of each node in the cluster. The experimental results show that the modified algorithm not only overcomes the shortcoming of MRPrePost in the big data environment, but also greatly reduces the time and space complexity. Finally, the specific applications of PFIMD algorithm in several multimedia data sets are listed to illustrate its universality.

备注:http://faculty.csu.edu.cn/dengxiaoheng/zh_CN/lwcg/10445/content/49265.htm

是否译文:

附件:

  • 44-PFIMD_a_parallel_MapReduce-based_algorithm_for_fre.pdf

  • 上一条: Q. Lu, C. Zhu, X. Deng. An efficient image encryption scheme based on the LSS chaotic map and single S-box[J]. IEEE Access, 2020, 8: 25664-25678.

    下一条: H. Long, H. Shen, X. Deng. ISIRS: information theory-based social influence with recommender system[J]. International Journal of Embedded Systems, 2019, 11(6): 796-805.