1. Trang chủ
  2. » Giáo án - Bài giảng

Decision support and BI systems ch08

39 210 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 39
Dung lượng 4,33 MB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Learning Objectivestransformation, and load ETL processes active data warehousing and security issues... Data MartA departmental data warehouse that stores only relevant data A subset t

Trang 1

Business Intelligence and Decision Support Systems

(9 th Ed., Prentice Hall)

Chapter 8:

Data Warehousing

Trang 2

 Describe the processes used in developing and managing data warehouses

 Explain data warehousing operations

 Explain the role of data warehouses in decision support

Trang 3

Learning Objectives

transformation, and load (ETL) processes

active) data warehousing

and security issues

Trang 5

Main Data Warehousing (DW) Topics

Trang 6

Data Warehouse Defined

are specially organized to provide wide, cleansed data in a standardized format

integrated, subject-oriented databases design

to support DSS functions, where each unit of data is non-volatile and relevant to some

moment in time”

Trang 8

Data Mart

A departmental data warehouse that stores only relevant data

A subset that is created directly from a data warehouse

A small data warehouse designed for a strategic business unit or a department

Trang 9

Data Warehousing Definitions

A type of database often used as an interim area for a data warehouse

An operational data mart

A data warehouse for the enterprise

Data about data In a data warehouse, metadata describe the contents of a data warehouse and the manner of its acquisition and use

Trang 10

A Conceptual Framework for DW

Data Sources

ERP

Legacy

POS

Other OLTP/wEB

External data

Select

Transform Extract

Integrate Load

ETL Process

Enterprise Data warehouse Metadata

Replication

Data/text mining

Custom built applications

OLAP, Dashboard, Web

Routine Business Reporting

Applications (Visualization)

Data mart (Engineering)

Data mart (Marketing)

Data mart (Finance)

Data mart ( ) Access

No data marts option

Trang 11

Generic DW Architectures

software

access and analyze data from the warehouse

Trang 14

A Web-based DW Architecture

Web Server

Client (Web browser)

Application Server

Data warehouse

Web pages

Internet/

Intranet/

Extranet

Trang 15

Alternative DW Architectures

SourceSystems

Staging Area

Independent data marts(atomic/summarized data)

End user access and applications

ETL(a) Independent Data Marts Architecture

SourceSystems

Staging Area

End user access and applications

ETL

Dimensionalized data marts linked by conformed dimentions(atomic/summarized data)

(b) Data Mart Bus Architecture with Linked Dimensional Datamarts

SourceSystems

Staging Area

End user access and applications

ETL

Normalized relational warehouse (atomic data)

(c) Hub and Spoke Architecture (Corporate Information Factory)

Trang 16

Alternative DW Architectures

SourceSystems

Staging Area

Normalized relational warehouse (atomic/some summarized data)

End user access and applications

ETL (d) Centralized Data Warehouse Architecture

End user access and applications

Logical/physical integration of common data elements

Existing data warehousesData marts and legacy systmes

Data mapping / metadata (e) Federated Architecture

Trang 17

Which Architecture is the Best?

Empirical study by Ariyachandra and Watson (2006)

Trang 18

Data Warehousing Architectures

1 Information interdependence between organizational units

2 Upper management’s information needs

3 Urgency of need for a data warehouse

4 Nature of end-user tasks

5 Constraints on resources

6 Strategic view of the data

warehouse prior to implementation

7 Compatibility with existing

Ten factors that potentially affect the

architecture selection decision:

Trang 19

Enterprise Data Warehouse (by Teradata Corporation)

Trang 20

Data Integration and the Extraction, Transformation, and Load (ETL) Process

Integration that comprises three major processes:

data access, data federation, and change capture

A technology that provides a vehicle for pushing data

from source systems into a data warehouse

An evolving tool space that promises real-time data integration from a variety of sources

A new way of integrating information systems

Trang 21

Extraction, transformation, and load (ETL) process

Data Integration and the Extraction, Transformation, and Load (ETL) Process

Packaged application

Legacy system

Other internal applications

Transient data source

Extract Transform Cleanse Load

Datawarehouse

Data mart

Trang 22

ETL

learning curve

number of data sources/architectures

functional user

Trang 23

Benefits of DW

 Allows end users to perform extensive analysis

 Allows a consolidated view of corporate data

 Better and more timely information

 Enhanced system performance

 Simplification of data access

 Enhance business knowledge

 Present competitive advantage

 Enhance customer service and satisfaction

 Facilitate decision making

 Help in reforming business processes

Trang 24

Data Warehouse Development

 There is no one-size-fits-all strategy to DW

Trang 25

DW Development Approaches

(Inmon Approach) (Kimball Approach)

Trang 26

DW Structure: Star Schema (a.k.a Dimensional Modeling)

Claim Information

Time Location

Start Schema Example for anAutomobile Insurance Data Warehouse

Dimensions:

How data will be sliced/

diced (e.g., by location, time period, type of automobile or driver)

Facts:

Central table that contains (usually summarized) information; also contains foreign keys to access each dimension table

Trang 27

object in which each

dimension of the data

Trang 28

Best Practices for Implementing DW

business professionals (a business–supplier relationship must be developed)

Trang 29

Risks in Implementing DW

(Continued …)

Trang 30

Risks in Implementing DW – Cont

Trang 31

Things to Avoid for Successful Implementation of DW

because it is available

design is the same as transactional DB design

technology oriented rather than user oriented

(…see more on page 356)

Trang 32

Real-time DW (a.k.a Active Data Warehousing)

real-time analysis and real-time decision making is growing rapidly

Trang 33

Evolution of DSS & DW

Trang 34

Active Data Warehousing (by Teradata Corporation)

Trang 35

Comparing Traditional and Active DW

Trang 36

Data Warehouse Administration

DW requires especially strong monitoring in order to sustain its efficiency, productivity and security.

management of a data warehouse entails skills and proficiency that go past what is required of a traditional database

administrator.

hardware, and networking technologies

Trang 37

DW Scalability and Security

 The amount of data in the warehouse

 How quickly the warehouse is expected to grow

 The number of concurrent users

 The complexity of user queries

data-access functions will grow linearly with the size of the warehouse

Trang 38

BI / OLAP Portal for Learning

Trang 39

End of the Chapter

Ngày đăng: 10/08/2017, 11:06

TỪ KHÓA LIÊN QUAN

TÀI LIỆU CÙNG NGƯỜI DÙNG

  • Đang cập nhật ...

TÀI LIỆU LIÊN QUAN