Two-ETL Phases for Data Warehouse Creation: Design and Implementation
详细信息    查看全文
  • 关键词:Extract transform and load ; Business process modeling notation ; Data warehouse design ; Transformation operations ; Correspondence table
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2015
  • 出版时间:2015
  • 年:2015
  • 卷:9282
  • 期:1
  • 页码:138-150
  • 全文大小:4,297 KB
  • 参考文献:1.Golfarelli, M.: From user requirements to conceptual design in data warehouse design-a survey. In: Data Warehousing Design and Advanced Engineering Applications: Methods for Complex Construction, pp. 6鈥?1 (2010)
    2.Nabli, A.: Approche d鈥檃ide 脿 la conception automatis茅e d鈥檈ntrep么t de donn茅es: Guide de mod猫lisation. Presses Acadmiques Francophones (2013)
    3.Favre, C., Bentayeb, F., Boussaid, O., Darmont, J., Gavin, G., Harbi, N., Kabachi, N., Loudcher, S.: Les entrep么ts de donn茅es pour les nuls. ou pas!. In: 2茅me Atelier aide 脿 la D茅cision 脿 tous les Etages (EGC/AIDE), Janvier 2013
    4. Trujillo, J., Luj谩n-Mora, S.: A uml based approach for modeling ETL processes in data warehouses. In: Song, I.-Y., Liddle, S.W., Ling, T.-W., Scheuermann, P. (eds.) ER 2003. LNCS, vol. 2813, pp. 307鈥?20. Springer, Heidelberg (2003) View Article
    5.Mallek, H., Walha, A., Ghozzi, F., Gargouri, F.: ETL-web process modeling. In: ASD Advances on Decisional Systems Conference (2014)
    6.El-Sappagh, A., Hendawi, A., Bastawissy, H.: A proposed model for data warehouse ETL processes. J. King Saud Univ. Comput. Inf. Sci. 23(2), 91鈥?04 (2011)
    7. Mu帽oz, L., Maz贸n, J.-N., Pardillo, J., Trujillo, J.: Modelling ETL processes of data warehouses with UML activity diagrams. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM-WS 2008. LNCS, vol. 5333, pp. 44鈥?3. Springer, Heidelberg (2008) View Article
    8.Munoz, L., Mazon, J., Trujillo, J.: Automatic generation of ETL processes from conceptual models. In: Data Warehousing and OLAP, pp. 33鈥?0 (2009)
    9. Atigui, F., Ravat, F., Teste, O., Zurfluh, G.: Using OCL for automatically producing multidimensional models and ETL processes. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 42鈥?3. Springer, Heidelberg (2012) View Article
    10.El Akkaoui, Z., Zimanyi, E.: Defining ETL worfklows using BPMN and BPEL. In: Data Warehousing and OLAP, pp. 41鈥?8 (2009)
    11. El Akkaoui, Z., Maz贸n, J.-N., Vaisman, A., Zim谩nyi, E.: BPMN-based conceptual modeling of ETL processes. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 1鈥?4. Springer, Heidelberg (2012) View Article
    12. Oliveira, B., Belo, O.: BPMN patterns for ETL conceptual modelling and validation. In: Chen, L., Felfernig, A., Liu, J., Ra艣, Z.W. (eds.) ISMIS 2012. LNCS, vol. 7661, pp. 445鈥?54. Springer, Heidelberg (2012) View Article
    13. Wilkinson, K., Simitsis, A., Castellanos, M., Dayal, U.: Leveraging business process models for ETL design. In: Parsons, J., Saeki, M., Shoval, P., Woo, C., Wand, Y. (eds.) ER 2010. LNCS, vol. 6412, pp. 15鈥?0. Springer, Heidelberg (2010) View Article
    14. Jovanovic, P., Romero, O., Simitsis, A., Abell贸, A.: Requirement-driven creation and deployment of multidimensional and ETL designs. In: Castano, S., Vassiliadis, P., Lakshmanan, L.V.S., Lee, M.L. (eds.) ER 2012 Workshops 2012. LNCS, vol. 7518, pp. 391鈥?95. Springer, Heidelberg (2012) View Article
  • 作者单位:Ahlem Nabli (16)
    Senda Bouaziz (16)
    Rania Yangui (17)
    Faiez Gargouri (17)

    16. MIRACL Laboratory, Faculty of Sciences, Sfax University, 1171, Sfax, Tunisia
    17. MIRACL Laboratory, Institute of Computer Science and Multimedia, Sfax University, 1030, Sfax, Tunisia
  • 丛书名:Advances in Databases and Information Systems
  • ISBN:978-3-319-23135-8
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
文摘
Building the ETL process is potentially one of the biggest tasks of building a warehouse. In fact, it is complex, time consuming, and consumes most of data warehouse projects implementation efforts, costs, and resources. Nevertheless, the difference on data structures imposes new requirements on the ETL process implementation and maintenance. What makes these tasks even more challenging is the fact that data continue to grow rapidly and business requirements change over time. In this paper, we propose a method that contains Two-ETL phases, one treats the pre-treatment phase and another deals with the actual ETL. Our method consists on determining the correspondence table, modeling new operations using the Business Process Modeling Notation (BPMN) and implementing these operations with Talend Open Source (TOS). In addition, our method allows the design of ETL process in an earlier stage, which enormously facilitates the implementation of this process. Another advantage of our proposal is the use of the BPMN which allows to cover a deficit of communication that often occurs between the design and implementation of business processes.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700