1. Trang chủ
  2. » Công Nghệ Thông Tin

SAS Data Integration Studio 3.3- P34 docx

5 123 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 5
Dung lượng 553,76 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

160 Configure the SQL Join Transformation 4 Chapter 10Specify GROUP BY Options for the Join In the previous section, the Total_Retail_Price column in the join result table was changed to

Trang 1

160 Configure the SQL Join Transformation 4 Chapter 10

Specify GROUP BY Options for the Join

In the previous section, the Total_Retail_Price column in the join result table was changed to a calculated column that sums all of its values All retail transactions for all employees would be added together However, the report that you want to create summarizes the retail transactions for each employee

To limit the scope of the Total_Retail_Price column to each employee, add a GROUP

BY option that applies to all of the other mapped columns in the join That is, you

would specify the following columns in the Group By tab of the SQL Join

transformation: Employee_Name, Employee_ID, Job_Title, Company, Department, Section, and Org_Group

Follow these steps to specify GROUP BY options in the SQL Join transformation:

1 In the properties window for the SQL Join transformation, click the Group By tab.

2 In the Columns pane, select all columns except the Total_Retail_Price column Then click the right arrow to move the columns into the Group by columns pane The Group By tab should resemble the following display.

Display 10.9 Columns Specified on the Group By Tab

3 Click Apply to save your changes without closing the properties window.

4 (Optional) To see how the SQL code is changed by the expression that you just

defined, click the SQL tab The code on the SQL tab should resemble the following

sample:

SELECT ’ORGANIZATION_DIM’n.’Employee_ID’n,

’ORGANIZATION_DIM’n.’Company’n, ’ORGANIZATION_DIM’n.’Department’n,

’ORGANIZATION_DIM’n.’Section’n, ’ORGANIZATION_DIM’n.’Org_Group’n,

’ORGANIZATION_DIM’n.’Job_Title’n, ’ORGANIZATION_DIM’n.’Employee_Name’n,

SUM(’ORDER_FACT’n.’Total_Retail_Price’n) format=8 AS

’Total_Retail_Price’n FROM ’orstar’n.’ORGANIZATION_DIM’n INNER JOIN ’orstar’n.’ORDER_FACT’n

ON (’ORDER_FACT’n.’Employee_ID’n = ’ORGANIZATION_DIM’n.’Employee_ID’n)

Trang 2

Example Process Flows 4 Configure the Loader Transformation 161

GROUP BY ’ORGANIZATION_DIM’n.’Employee_ID’n,

’ORGANIZATION_DIM’n.’Company’n, ’ORGANIZATION_DIM’n.’Department’n,

’ORGANIZATION_DIM’n.’Section’n, ’ORGANIZATION_DIM’n.’Org_Group’n,

’ORGANIZATION_DIM’n.’Job_Title’n, ’ORGANIZATION_DIM’n.’Employee_Name’n

5 The updates to the SQL Join transformation are complete Click OK to save your

changes and close the properties window for the transformation

Update the Metadata for the Total Sales By Employee Table

When the metadata for the Total Sales By Employee table was created, it was not fully optimized for the report shown in Display 10.1 on page 150 In order to produce the report, you must make the following changes to the column metadata for the

Total_Sales_By_Employee table:

3 Rename the Total_Retail_Price column to Total_Revenue to better match the

summarized data that you specified in “Change One Column to a Calculated Column” on page 157

3 Specify formatting for the Total_Revenue column so that its contents will display

properly in the report

Follow these steps to update column metadata for Total Sales By Employee table:

1 In the Process Designer window, select Total_Sales_By_Employee, then select File I Properties from the menu bar A properties window displays.

2 In the properties window, click the Columns tab.

3 Click the Total_Retail_Price column Change the name to Total_Revenue.

4 In the Total_Revenue column, scroll right to display the Format column Enter the format DOLLAR13.2 to specify the appearance of this column in the HTML

output file

5 In the Total_Revenue column, click twice in the Sort column to display a

pull-down icon Click the icon and select the DSCFORMATTED option This option sorts the rows in descending order, based on the formatted value of the

Total_Revenuecolumn

6 Click OK to save your changes and close the properties window for the table.

Configure the Loader Transformation

You must make two updates to the Loader transformation for the Total Sales By Employee table:

3 On the Mapping tab in the transformation properties window, map the source table column Total_Retail_Price to the target table column Total_Revenue The individual sales figures in the Total_Retail_Price column are summarized

in the Total_Revenue column., so these two columns must be mapped to each

other The Loader transformation will automatically map source and target

columns with the same name, but not map columns with different names

3 On the Load Techniques tab, select the Drop Table option For SAS tables such

as Total Sales By Employee, the Drop Table option conserves disk space.

Follow these steps to update the Loader transformation for the Total Sales By

Employee table:

1 In the Process Designer window, select the Loader transformation Then select File I Properties from the menu bar A properties window displays.

Trang 3

162 Run the Job and Check the Log 4 Chapter 10

2 In the properties window, click the Mapping tab.

3 On the Mapping tab, position the cursor on the name of the source column Total_Retail_Price, then click and drag the mapping arrow to the target column Total_Revenue These two columns are now mapped The Mapping tab

should resemble the following display

Display 10.10 Column Mapping in the Loader

4 In the properties window, click the Load Technique tab Select the Drop Target

radio button to replace the physical table each time the job is run

The Loader is now configured and is ready to run

Run the Job and Check the Log

At this point, you have a process flow that will join two source tables, extract information from columns, calculate total sales revenue for each employee, and write the results to the Total_Sales_By_Employee table Before you add a transformation that will produce the desired report, you might want to verify that the current process flow will produce the information that is required in the report

1 With the job displayed in the Process Designer window, select Process I Submit

from the menu bar SAS Data Integration Studio generates code for the job and submits the code to a SAS application server The server executes the code A pop-up window displays to indicate that the job is running

2 If a pop-up error message appears, or if you want to look at the log for the

completed job, click the Log tab in the Process Designer window.

3 On the Log tab, scroll through the SAS log information that was generated during

the execution of the job

The code that was executed for the job is available on the Source Code tab in

the Process Designer window

4 If you find errors in the source code for a step, select the corresponding

transformation in the process flow diagram, then select File I Properties from

the menu bar A properties window displays

Trang 4

Example Process Flows 4 Add the Publish to Archive Transformation to the Process Flow 163

5 Correct the metadata and resubmit the job until there are no more errors

6 After the job runs without error, save the job Select File I Save from the menu

bar

The next task is to verify that the job created the correct output

Verify the Contents of the Total_Sales_By_Employee Table

The data that was loaded into the Total_Sales_By_Employee table should be

appropriate for the report that you want to create (Display 10.1 on page 150) To view

data in the Total_Sales_By_Employee table, select the table, then select View I View Data from the menu bar The View Data window opens, as shown in the following

display

Display 10.11 Contents of the Total_Sales_By_Employee Table

The data in the Total_Sales_By_Employee table appears to be appropriate for the report The next task is to add a report transformation to the end of the process flow

Add the Publish to Archive Transformation to the Process Flow

The Publish to Archive transformation generates a SAS package file and an optional HTML report The package file can be published by SAS programs that use the

publishing functions in SAS Integration Technologies software

Follow these steps to add the Publish to Archive transformation to the process flow:

1 From the SAS Data Integration Studio desktop, click the Process tab to display

the Process Library

Trang 5

164 Add the Publish to Archive Transformation to the Process Flow 4 Chapter 10

2 In the Process Library, open the Publish folder Click and drag the Publish to Archivetransformation into any location in the Process Designer and release the mouse button As shown in the following display, an icon and an input drop zone appear in the Process Designer

Display 10.12 Example Job with Publish to Archive

Ngày đăng: 05/07/2014, 11:20