Study and Implementation of a New SQL-Based ETL Approach
Cite this article: BAO Yubin, SONG Jie, LENG Fangling, WANG Daling, YU Ge. Study and Implementation of a New SQL-Based ETL Approach[J]. Wuhan University Journal of Natural Sciences, 2007, 12(5): 804-808. DOI: 10.1007/s11859-007-0045-5
Authors: BAO Yubin, SONG Jie, LENG Fangling, WANG Daling, YU Ge
Affiliation: College of Information Science and Engineering, Northeastern University, Shenyang 110004, Liaoning, China
Funding: Supported by the National Natural Science Foundation of China (60673139, 60573090)
Abstract: This paper analyzes the main characteristics, benefits, and drawbacks of traditional ETL (extraction, transformation, loading) methods and summarizes some factors that affect the performance of ETL tools. It then proposes a new ETL approach, E-LT (extraction, loading and transformation). E-LT applies a database mapping technique so that the loading and transformation stages of the ETL process are performed together after the extraction stage. As a result, loading and transformation can be completed with SQL commands, and the staging area that traditional ETL requires before loading is eliminated. The framework of an ETL engine based on the E-LT method is presented. The ETL process, including initial loading and incremental refresh, is discussed in detail, and an SQL-based algorithm for initial loading is given. Experiments and theoretical analysis indicate that the loading throughput of the E-LT method exceeds that of some commercial ETL approaches. Finally, a real application of the E-LT method to marine data warehousing is discussed to illustrate the validity of the proposed method.
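
The idea of loading and transforming in one SQL step, with no staging area, can be illustrated with a small sketch. This is not the authors' engine or their initial-loading algorithm. It assumes SQLite's ATTACH DATABASE as a stand-in for the database mapping technique, and the table names, column names, and the load() helper are hypothetical.

```python
import sqlite3

# The main in-memory database plays the role of the warehouse.
conn = sqlite3.connect(":memory:")
# Assumption: ATTACH stands in for "database mapping", making the source
# visible to the warehouse session under the schema name "src".
conn.execute("ATTACH DATABASE ':memory:' AS src")

# Hypothetical source and target schemas, with two sample source rows.
conn.executescript("""
    CREATE TABLE src.orders (
        id INTEGER PRIMARY KEY, customer TEXT,
        amount_cents INTEGER, updated_at TEXT);
    INSERT INTO src.orders VALUES
        (1, ' alice ', 1250, '2007-01-01'),
        (2, 'BOB',     1000, '2007-01-02');
    CREATE TABLE main.fact_orders (
        order_id INTEGER PRIMARY KEY, customer TEXT,
        amount REAL, updated_at TEXT);
""")

# One statement both transforms (trim, lower-case, cents to currency units)
# and loads; rows go from source to target without an intermediate table.
LOAD_SQL = """
    INSERT OR REPLACE INTO main.fact_orders
        (order_id, customer, amount, updated_at)
    SELECT id, lower(trim(customer)), amount_cents / 100.0, updated_at
    FROM src.orders
    WHERE updated_at > ?
"""

def load(conn, since=""):
    """Initial load when since is empty; incremental refresh otherwise."""
    conn.execute(LOAD_SQL, (since,))
    conn.commit()

load(conn)  # initial load of every source row
for row in conn.execute("SELECT * FROM fact_orders ORDER BY order_id"):
    print(row)
```

Calling load(conn, since="2007-01-02") afterwards would pick up only source rows whose updated_at is later than that date, which is one simple way to approximate incremental refresh; the paper's own refresh mechanism may differ.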

Keywords: database; ETL; E-LT; SQL
Article ID: 1007-1202(2007)05-0804-05
Received: 2007-03-01
Revised: 2007-03-01

Biography: BAO Yubin (1968–), male, Associate professor, Ph.D., research direction: data warehouse, data mining.
Keywords: data warehouse; ETL; E-LT; SQL
This article is indexed in databases including VIP (维普) and SpringerLink.