一种适合于科学数据的聚类算法 Clustering in scientific data mining based on grid and iterative method期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

一种适合于科学数据的聚类算法

引用本文：	李欣宇,傅彦.一种适合于科学数据的聚类算法[J].成都信息工程学院学报,2006,21(3):327-330.

作者姓名：	李欣宇傅彦

作者单位：	电子科技大学计算机学院,四川,成都,610054

基金项目：	国家高技术研究发展计划(863计划)

摘要：	聚类是科学数据挖掘中的核心问题.在已提出的聚类算法中大都是基于＂距离＂的概念,这类算法的缺点在于处理数据量大和维数高的科学数据时不够有效,因此提出迭代网格算法.这个算法与基于距离的损法有根本不同,它抛弃了距离的概念,而采取一种新的思路.它不仅能够自动发现包含有趣知识的子空间,并将里面存在的所有聚类挖掘出来;而且它能很好的处理维数高和数据量大的科学数据.
关键词：	科学数据挖掘聚类分析网格密度高维数据
文章编号：	1671-1742(2006)03-0327-04
修稿时间：	2005年5月19日
Clustering in scientific data mining based on grid and iterative method

LI Xin-yu,FU Yan.Clustering in scientific data mining based on grid and iterative method[J].Journal of Chengdu University of Information Technology,2006,21(3):327-330.

Authors:	LI Xin-yu FU Yan

Abstract:	The clustering is an important part of the scientific data mining.Among the various algorithms a main class is based on the "distance".The "K-means" and "k-medoids" are two of these kinds.However these algorithms are inefficient when dealing with the large data sets and data sets of high-dimension.This algorithm differs much from the above ones and it takes a totally different approach called a grid and density based algorithm.It can automatically find out the subspaces containing interesting patterns and discover all clusters in that subspace and it performs well when dealing with the high-dimensional data.

Keywords:	scientific data mining cluster analysis grid density high-dimension data
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏