Abstract
Data warehousing populates data from source systems into a central data warehouse (DW) through extraction, transformation, and loading (ETL). Massive transaction data are routinely recorded in a variety of applications such as retail commerce, bank systems, and Website management. Transaction data records a time and relevant reference data needed for a particular transaction record. It is non-trivial for a standard ETL to process transaction data with dependencies and velocity. This paper presents a two-tiered segmentation approach for transaction data warehousing. The approach uses the so-called two-staging ETL to process the detailed records from operational source systems, followed by dimensional data process to populate a dimension data store with star or snowflake schema. The proposed approach is an all-in-one solution capable of processing fast/slowly changing data, and early/late-arriving data. The paper empirically evaluates the proposed method, and the results have shown its effectiveness for transaction data warehousing.
Originalsprog | Engelsk |
---|---|
Titel | Emerging Perspectives in Big Data Warehousing |
Redaktører | David Taniar, Wenny Rahayu |
Antal sider | 27 |
Forlag | IGI global |
Publikationsdato | jun. 2019 |
Sider | 1-27 |
Kapitel | 1 |
ISBN (Trykt) | 9781522555162 |
ISBN (Elektronisk) | 9781522555179 |
DOI | |
Status | Udgivet - jun. 2019 |
Emneord
- Teknik, ingeniørvidenskab og IT