Table of Contents Table of Contents
Previous Page  21 / 96 Next Page
Information
Show Menu
Previous Page 21 / 96 Next Page
Page Background

W H I T E P A P E R

© 2017 Persistent Systems Ltd. All rights reserved. 21

www.persistent.com

The Kimball DW/BI systemarchitecture focuses on the following components:

Backroom area

or ETL (Extract -Transform – Load) area

( ,

pages 119-133). This area performs four major

[1]

operations:

Extracting

the data from the sources,

Performing

cleansing and conforming

transformations,

Delivering

dimensional models to the presentation server and

Managing

the ETL process and back room environment (job scheduling and monitoring,

recovery/restart capabilities, versioning, and pipelining/parallelizing).

An ETL tool may be built homegrown or may be used off-the-shelf, with the key parameters being the

productivity support, usability and metadata driven approach. The data stores in this layer deal with process

history, staging tables, master tables, reference tables, audit tables and error tables.

Presentation server

(

,

pages 133-141), also called the target DW server, is where the data complying to a

[1]

logical dimensional model and queryable by the front room applications is stored. It is organized by business

process, is atomically grained, is tied together by the enterprise data warehouse bus architecture. In general,

because of the large datasets involved in most organizations, a second layer is necessary for summary level

queries to return in real time (or at least, a reasonable amount of time): the aggregation layer, where data is pre-

aggregated during the load process as opposed to being aggregated on the fly by analytic queries.

Implementation choices for the aggregation layer include the same (or another) relational database as the one

storing the atomically grained dimensional models, or an OLAPengine.

The management features are those that govern the life of the data warehouse, which include performance

monitoring, securing and auditing access to the server, backing up and recovering content, physical database

maintenance and automated systemoperation.