文章摘要
姜广坤.大数据环境下基于Hadoop平台的医学数据挖掘算法研究[J].机床与液压,2018,46(18):163-168.
.Research on medical data mining algorithm based on Hadoop platform in big data environment[J].Machine Tool & Hydraulics,2018,46(18):163-168
大数据环境下基于Hadoop平台的医学数据挖掘算法研究
Research on medical data mining algorithm based on Hadoop platform in big data environment
  
DOI:10.3969/j.issn.1001-3881.2018.18.026
中文关键词: 数据挖掘  关联规则  云平台  Apriori 算法  医学数据
英文关键词: Data mining, Association rules, Cloud platform, Apriori algorithm, Medical data
基金项目:
作者单位E-mail
姜广坤 大连海洋大学 dljianggk@qq.com 
摘要点击次数: 235
全文下载次数: 0
中文摘要:
      为了有效利用云平台Hadoop框架的并行处理能力。通过对大数据挖掘技术中的传统关联规则算法 Apriori算法进行了分析和改进,提出了一种基于Map Reduce并行模式的改进数据挖掘算法,适用于医学大数据的分析和应用。首先通过布尔排列优化数据库中事务数据的存储方式,从而有效减少数据库被扫描的次数。然后采用关联规则优化减少Apriori算法中冗余的子集。为了验证改进算法的有效性,采用医学历史数据进行实验验证。最后仿真实验结果显示,相比传统的Apriori算法,提出算法的运行效率更高,具有较好的可靠性和有效性。
英文摘要:
      In order to effectively use the parallel processing capabilities of the cloud platform Hadoop framework, an improved data mining algorithm based on Map Reduce parallel mode is proposed by analyzing and improving a traditional association rules algorithm Apriori algorithm which belong to big data mining technology, which is suitable for the analysis and application of medical big data. First, the Boolean arrangement is used to optimize the storage mode of transaction data in the database, and it will effectively reduce the number of database scanned. Then, the association rule optimization is used to reduce the redundant subsets in the Apriori algorithm. In order to verify the effectiveness of the improved algorithm, medical history data is used to verify the experiment. Finally, the simulation results show that the proposed algorithm is more efficient and has better reliability and validity as compared with the traditional Apriori algorithm.
查看全文   查看/发表评论  下载PDF阅读器
关闭

分享按钮