Scaling up the DBSCAN algorithm for clustering large spatial databases based on sampling technique |
| |
Authors: | Guan Ji-hong, Zhou Shui-geng, Bian Fu-ling, He Yan-xiang |
| |
Institution: | 1. School of Computer, Wuhan University, 430072, Wuhan, China; 2. State Key Laboratory of Software Engineering, Wuhan University, 430072, Wuhan, China; 3. College of Remote Sensing and Information Engineering, Wuhan University, 430072, Wuhan, China |
| |
Abstract: | Clustering is a useful data mining technique for discovering interesting data distributions and patterns in the underlying data, and has many application fields, such as statistical data analysis, pattern recognition, and image processing. We combine a sampling technique with the DBSCAN algorithm to cluster large spatial databases, and develop two sampling-based DBSCAN (SDBSCAN) algorithms. One algorithm introduces the sampling technique inside DBSCAN, and the other applies a sampling procedure outside DBSCAN. Experimental results demonstrate that our algorithms are effective and efficient in clustering large-scale spatial databases. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|