1. Trang chủ
  2. » Công Nghệ Thông Tin

SAS Data Integration Studio 3.3- P43 potx

5 184 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 5
Dung lượng 564,81 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

4 In the Data Transforms folder, click and drag SCD Type 2 Loader into the Process Designer window.. 206 Add SCD Columns to the Dimension Table 4 Chapter 129 In the Project tree, click a

Trang 1

Using Slowly Changing Dimensions 4 Create and Populate the Job 205

3 In this example, three additional columns will be specified for CUSTOMER_SCD: two change-tracking columns (VALID_FROM_DTTM and VALID_TO_DTTM) and

a column for generated keys (CUSTOMER_GEN_KEY)

3 CUSTOMER_TRANS and CUSTOMER_SCD have been registered in a current metadata repository

3 You have selected a default SAS application server for SAS Data Integration Studio, as described in “Selecting a Default SAS Application Server” on page 96 This server can access all tables that are used in the job

3 The main metadata repository is under change-management control For details about change management, see “Working with Change Management” on page 113

3 It is assumed that you have started SAS Data Integration Studio and have opened the appropriate metadata profile

The first task is to check out any existing metadata that must be updated for the current job

Check Out Existing Metadata That Must Be Updated

You do not have to check out the metadata for a table in order to add it as a source or

a target in a job However, the metadata for the CUSTOMER_SCD table must be checked out because (a) we assume that the metadata for this table was created and checked in earlier, and (b) the metadata for the table must be updated for the current job

Follow these steps to check out existing metadata:

1 On the SAS Data Integration Studio desktop, select the Inventory tab.

2 In the Inventory tree, open the Tables folder.

3 Select the table that must be updated for the current job: CUSTOMER_SCD

4 Select Project I Check Out from the menu bar The metadata for this table will

be checked out and will appear in the Project tree

The next task is to create and populate the job

Create and Populate the Job

Follow these steps to populate the job Load the Customer Dimension:

1 In SAS Data Integration Studio, in the Shortcuts pane, click Process Designer

to start the New Job Wizard

2 In the New Job Wizard, type the job name Load the Organization Dimension and click Finish An empty Process Designer window is displayed.

3 In the tree view, click the Process Library tab, then expand the Data

Transforms folder

4 In the Data Transforms folder, click and drag SCD Type 2 Loader into the

Process Designer window Release the mouse button to display the SCD Type 2 Loader transformation template in the Process Designer window for the new job The template displays with drop zones for a source and a target

5 In the tree view, select the Inventory tab.

6 In the Inventory tree, open the Tables folder.

7 In the Tables folder, click and drag the CUSTOMER_TRANS table into the source

drop area of the SCD Type 2 Loader

8 In the tree view, click the Project tab.

Trang 2

206 Add SCD Columns to the Dimension Table 4 Chapter 12

9 In the Project tree, click and drag CUSTOMER_SCD into the target drop area of the

SCD Type 2 Loader transformation The job is now fully populated with tables and transformations, as shown in the following display

Display 12.5 Fully Populated Job for Loading the Dimension Table CUSTOMER_SCD

The next step is to add tracking columns and a generated key column to the CUSTOMER_SCD table

Add SCD Columns to the Dimension Table Follow these steps to add tracking columns and a generated key column to the CUSTOMER_SCD table:

1 In the Process Designer window, double-click the icon for the CUSTOMER_SCD table to display its properties window

2 In the properties window, click the Columns tab Initially, the CUSTOMER_SCD

dimension table was created with the same columns as the source table CUSTOMER_TRANS (These columns are shown in Display 12.4 on page 204.) To implement slowly changing dimensions, the dimension table needs three new columns

3 To add the first new column, click CUSTOMER_ID, then click New A new untitled

column appears beneath CUSTOMER_ID

4 Replace the default name of the new column with the name VALID_FROM_DTTM.

This column will contain the date and time that each row was physically loaded into the table When the job is run, data will be provided for this column by the SCD Type 2 Loader

5 In the row for VALID_FROM_DTTM, double-click the Type column and select Numeric.

6 In the row for VALID_FROM_DTTM, double-click the Format column and type

Datetime20.

7 In the row for VALID_FROM_DTTM, click New A new untitled column appears beneath VALID_FROM_DTTM.

8 Replace the default name of the second new column with the name

VALID_TO_DTTM This column will contain the date and time that each row was

Trang 3

Using Slowly Changing Dimensions 4 Specify the Primary Key for the Dimension Table 207

superseded by a new current row When the job is run, data will be provided for this column by the SCD Type 2 Loader

9 In the row for VALID_TO_DTTM, specify the Numeric type and the Datetime20.

format

10 To add the third new column, click New and replace the default column name with the name CUSTOMER_GEN_KEY This column will provide unique key values for all

of the rows in the table The new key values will be generated by the SCD Type 2 Loader

11 Press the TAB key twice, then double click and select the Numeric data type.

12 Press the TAB key once, then click and type 12., which is a numeric format.

13Click the row number on the far left for the row CUSTOMER_GEN_KEY and drag the column up to position number one The columns of the dimension table are now configured for slowly changing dimensions, as shown in the following display

Display 12.6 Columns in the Dimension Table

14 Click Apply to save your changes.

The next step is to designate the generated column as the primary key for the

dimension table

Specify the Primary Key for the Dimension Table

Follow these steps to designate the generated column as the primary key for the dimension table:

1 In the properties window for the CUSTOMER_SCD table, click the Keys tab Locate the New button in the left pane of the tab.

Trang 4

208 Specify the Business Key for the SCD Loader 4 Chapter 12

2 Click the down arrow to the right of the New button and select Primary Key from

the pull-down menu Default metadata for a primary key appears

3 In the right pane of the tab, select the CUSTOMER_GEN_KEY column, then click the right arrow The CUSTOMER_GEN_KEY column is now specified as the primary key of the dimension table, as shown in the following display

Display 12.7 Primary Key for the Dimension Table

4 Click OK to save your changes and close the properties window for the dimension

table

You have now configured the metadata for the dimension table The next step is to configure keys and change tracking for the SCD loader

Specify the Business Key for the SCD Loader The business key is the primary key of a source table that is used to load the dimension table In this example, the business key would be the CUSTOMER_ID column in the CUSTOMER_TRANS table Follow these steps to specify the business key for the SCD loader:

1 In the Process Designer window, double-click the icon for the SCD Type 2 Loader

to display its properties window

2 In the properties window, click the Business Key tab Click New to display the

column selection window

Trang 5

Using Slowly Changing Dimensions 4 Specify the Generated Key for the SCD Loader 209

3 In the column selection window, select the CUSTOMER_ID column and click OK The CUSTOMER_ID column displays as the business key on the Business Key

tab, as shown in the following display

Display 12.8 Business Key Specified for the SCD Type 2 Loader

4 Click Apply to save your changes.

The next step is to specify a generated key for the SCD loader

Specify the Generated Key for the SCD Loader

Follow these steps to specify the generated key for the SCD loader:

1 In the properties window for the SCD Type 2 Loader, click the Generated Keys

tab

Ngày đăng: 05/07/2014, 11:20

TỪ KHÓA LIÊN QUAN