Main Tasks for Users 4 Updating Metadata for Jobs 105Create Match Code and Apply Lookup Standardization Transformations The Process Library tree includes two transformation templates tha
Trang 1Main Tasks for Users 4 Updating Metadata for Jobs 105
Create Match Code and Apply Lookup Standardization Transformations
The Process Library tree includes two transformation templates that require SAS Data Quality Server software: Create Match Code and Apply Lookup Standardization These transformations enable you to increase the value of your data through data analysis and data cleansing
To use these transformations, the SAS Data Quality Server software must be installed, a SAS application server must be configured to access a Quality Knowledge Base, and the Quality Knowledge Base must contain the locales that you need to reference in your SAS Data Integration Studio jobs When the prerequisites have been met, you can drag and drop these transformations into your process flow diagrams
SAS Data Quality Functions in the Expression Builder Window
SAS Data Integration Studio provides an Expression Builder window in the properties window of some transformations (For a description of this window, see
“Expression Builder Window” on page 16.) If SAS Data Quality Server software is available to you, the Expression Builder window includes a wide range of data quality functions One way to see the data quality functions is to open the properties window of
the SQL Join transformation and select the Where tab For detailed information about
the data quality functions, see the SAS Data Quality Server: Reference, which is
available in the online SAS Help and Documentation for Base SAS and in SAS OnlineDoc
Data Validation Transformation
When incorporated into SAS Data Integration Studio jobs, the Data Validation transformation enables you to detect error conditions and specify responses to those errors Error conditions include blank or missing values, duplicate values, and invalid values The actions that you can take in response to erroneous values include stopping the job, changing the value, or writing the row to an error table instead of to the target The Data Validation transformation does not require SAS Data Quality Server software
Updating Metadata
Updating Metadata for Jobs
Overview
Use the property window for a job to update its basic metadata For example, you can specify code that should be run before or after the job For a description of the job properties window, see “Job Properties Window” on page 17
Trang 2106 Updating Metadata for Tables or External Files 4 Chapter 7
Update Metadata for a Job
Assume that the metadata for the job is currently checked into a change-managed repository
1 On the SAS Data Integration Studio desktop, select the Inventory tab.
2 In the Inventory tree, open the Jobs folder.
3 Select the desired job, then select Project I Check Out The metadata for the
job is checked out A check mark is displayed next to the job in the Inventory tree
An icon indicating a checked-out job appears in the Project tree
4 In the Project tree, select the metadata for the job, then select File I Properties
from the menu bar The properties window for the job displays
5 Use the tabs in this window to update the metadata for the job Each tab has its own Help button
6 When you are finished updating the metadata, save the changes to the job by
clicking the OK button.
7 In the Project tree, select the repository icon From the menu bar on the SAS Data
Integration Studio desktop, select Project I Check In Repository.
Updating Metadata for Tables or External Files
Overview
Use the table properties window to update the metadata for a table or external file that is registered in a current metadata repository For a description of the table properties window, see “Table or External File Properties Window” on page 26
Update Metadata for a Table or External File in a Tree View
The following steps describe one way to update the metadata for a table or external file in a tree view
1 On the SAS Data Integration Studio desktop, select the Inventory tab.
2 In the Inventory tree, open the Tables folder or the External Files folder.
3 Select the table, then select Project I Check Out The metadata for the table or
file is checked out A check mark is displayed next to the table or file in the Inventory tree An icon indicating a checked-out table or file appears in the Project tree
4 In the Project tree, select the table or file, and select File I Properties from the
menu bar The properties window for the table or file is displayed
Note: You must display the table or file from the Project tree in order to update metadata Displaying the table or file from the Inventory tree enables browsing only
5 Use the tabs in this window to make changes to the metadata for the table or file
Each tab has its own Help button.
6 When you are finished updating the metadata, save the changes to the table or file
by clicking the OK button.
7 In the Project tree, select the repository icon From the menu bar on the SAS Data
Integration Studio desktop, select Project I Check In Repository.
Trang 3Main Tasks for Users 4 Updating Metadata for Tables or External Files 107
Update Metadata for a Table or External File in a Process Flow
The following steps describe one way to update the metadata for a table or external file in the process flow for a job When working under change management, you must check out both the job and the table or external file
1 On the SAS Data Integration Studio desktop, select the Inventory tab.
2 In the Inventory tree, expand the Jobs folder and the Tables folder or the
External Files folder
3 Select the job and the table or external file, then select Project I Check Out.
The metadata for the job and the table or file is checked out A check mark is displayed next to these objects in the Inventory tree Icons indicating a
checked-out job and a checked-out table or file appear in the Project tree
4 In the Project tree, select the desired job, then select View I View Job from the menu bar The process flow diagram for the job displays on the Process Editor
tab in the Process Designer window
5 Select the metadata for the desired table or external file, then select FileI
Properties from the menu bar The properties window displays the metadata for
the table or file
Note: You must display the table or file from the Project tree in order to update metadata Displaying the table or file from the Inventory tree enables browsing only
6 Use the tabs in the properties window to make changes to the metadata for the
table or file Each tab has its own Help button.
7 When you are finished updating the metadata, save the changes to the table or file
by clicking the OK button.
8 Close the Process Designer window and save your changes to the job
9 In the Project tree, select the repository icon From the menu bar on the SAS Data
Integration Studio desktop, select Project I Check In Repository.
Using a Physical Table to Update Table Metadata
The Update Table Metadata feature compares the columns in a physical table to the columns that are defined in the metadata for that table If column metadata does not match the columns in the physical table, the metadata is updated to match the physical table
For existing tables, the Update Table Metadata feature adds new columns, removes deleted columns, and records changes to all column attributes When you select and run this feature against one or more tables simultaneously, the update log lists which tables have been successfully updated and which have failed
When you use the Update Table Metadata option on a physical table in DBMS format and the DBMS table has more than one schema, the Update Table Metadata option selects the first schema The Update Table Metadata feature uses the following resources: the current metadata server and the SAS application server to read the physical table the current metadata server to update the metadata to match the
physical table
You are prompted to provide a user name and password for the metadata server if this was not already saved with the current metadata profile If you have not provided
a user name and password for the SAS application server during the current session, you are prompted to provide them A warning message displays if the SAS Workspace Server component of the SAS application server is older than SAS 9.1.3, Service Pack 3 Follow these steps to use the Update Table Metadata feature:
1 If you are working under change management, check out one or more tables that contain the metadata that you want to update
Trang 4108 Updating Metadata for Transformations 4 Chapter 7
2 From the Project tree, select one or more tables for which you want to update the
metadata and then select Update Table Metadata from the Tools menu.
You might be prompted to supply a user name and password for the relevant servers When the update is finished, you can choose to view the resulting SAS log
Updating Metadata for Transformations
Overview
A transformation is a metadata object that specifies how to extract data, transform data, or load data into data stores A number of standard transformations are provided
in the Process Library for SAS Data Integration Studio For a description of the standard transformations, see Appendix 1, “Standard Transformations in the Process Library,” on page 217
There are two main reasons to update the metadata for a transformation:
3 to update the default metadata for a transformation when it is first added to a job
3 to change the behavior of a transformation in a process flow Use the property window for a transformation to update its metadata For a description of the transformation properties window, see “Transformation Properties Window” on page 27
Update the Default Metadata for a Transformation
When you drag a transformation template from the Process Library and drop it into the process flow for a job, the default metadata for the transformation is mostly blank You must update the default metadata according to your goals for that transformation
in the process flow
The general steps for updating the default metadata for a transformation are described in “Update the Job as Needed” on page 100
Examples of how to update the default metadata for transformation are provided in Chapter 10, “Example Process Flows,” on page 149 To see more examples of how to update the default metadata for transformations, follow these steps to display the relevant Help topics:
1 From the SAS Data Integration Studio menu bar, select Help I Contents The
online Help window displays
2 In the left pane of the Help window, select Examples I Process Library Examples.
Other Updates to the Metadata for a Transformation
After you have made the initial updates to the default metadata for a transformation, you might have to update the transformation again later For example,
you might need to change the options on the Options tab for the transformation The
following steps describe one way to update the metadata for a transformation Assume that the metadata for the job that contains the transformation is currently checked into
a change-managed repository
1 On the SAS Data Integration Studio desktop, select the Inventory tab.
2 In the Inventory tree, expand the Jobs folder.
3 Select the job with the transformation that you want to update, then select
Project I Check Out The metadata for the job is checked out A check mark is
Trang 5Main Tasks for Users 4 Setting Name Options for Individual Tables 109
displayed next to the job in the Inventory tree An icon indicating a checked-out job appears in the Project tree
4 In the Project tree, select the desired job, then select View I View Job from the menu bar The process flow diagram for the job displays on the Process Editor
tab in the Process Designer window
5 Select the metadata for the transformation that you want to update, then select
File I Properties from the menu bar The properties window displays the
metadata for the transformation
Note: You must display the transformation from the Project tree in order to update metadata Displaying the transformation from the Inventory tree enables browsing only
6 Use the tabs in the properties window to make changes to the metadata for the
transformation Each tab has its own Help button.
7 When you are finished updating the metadata, save the changes to the table or file
by clicking the OK button.
8 Close the Process Designer window and save your changes to the job
9 In the Project tree, select the repository icon From the menu bar on the SAS Data
Integration Studio desktop, select Project I Check In Repository.
Setting Name Options for Individual Tables
Overview
SAS Data Integration Studio cannot access tables or columns with case-sensitive names or with special characters in the names unless the appropriate options have been specified in the metadata for the table
Prerequisites
For tables in DBMS format, it is assumed that the appropriate name options have already been set for the database library that is used to access the table, as described in
“Supporting Case and Special Characters in Table and Column Names” on page 73 Name options do not need to be set on the library that is used to access a table in SAS format
Task Summary
The following steps describe one way to enable name options for a table whose
metadata has been saved to a metadata repository It is assumed that the metadata repository is under change management
1 On the SAS Data Integration Studio desktop, select the Inventory tab.
2 In the Inventory tree, open the Tables folder or the External Files folder.
3 Select the table that you want to update, then select Project I Check Out The
metadata for the table is checked out A check mark is displayed next to the table
in the Inventory tree An icon indicating a checked-out table appears in the
Project tree
4 In the Project tree, select the metadata for the table, then select FileI
Properties from the menu bar The properties window for the table is displayed.
5 In the properties window, click the Physical Storage tab.