Developing and Deploying Data Warehouse and Business Intelligence Solutions Kerr-McGee Information Management Group Skye Brannon Jeff Bridgwater Sarena Sherrard... Data TransformationDat
Trang 1Developing and Deploying Data Warehouse and
Business Intelligence Solutions Kerr-McGee Information Management Group
Skye Brannon Jeff Bridgwater Sarena Sherrard
Trang 2Who is Kerr-McGee?
• Kerr-McGee is an Oklahoma City-based energy and
inorganic chemical company with worldwide operations and assets of approximately $10 billion.
• http://www.kerr-mcgee.com/
Trang 3• Introduction to DW/BI Concepts
• Extract, Transform & Load (ETL)
• Business Intelligence / Reporting
• A Day in the Life
Trang 4DW / BI Concepts
Trang 5Information Management Strategy
Structure the systems and data
relationships to provide user-friendly
customer access to data in order to provide decision-making information.
Trang 6Adding Value to Data
Trang 7Information Pyramid
Trang 8What is a Data Warehouse?
A copy of data from one or more On-line Transaction Processing (OLTP) systems
specifically structured for Query, Reporting and Analysis (QRA).
• Data is typically at a summarized level to limit the size and complexity of the data warehouse
• Data is usually cleansed and merged to create an
“apples to apples” comparisons
OLTP
Systems
Data Warehouse
End-User Reporting
Trang 9The Idea Behind Data Warehousing
Data Warehouse
Trang 10Data Transformation
Data Extraction Data Cleansing Data Integration Data Improvement
Operational Data Store
Operational Data Store WarehouseData
Data Warehouse
OLAP Query
Information Delivery
Information Delivery
Data Mining
Operations & Systems Management
Datamarts
Enterprise Reporting
Framework Architecture
Trang 11Business Intelligence
Information Data
Business Intelligence
Integrated Meaningful Consistent Validated Easy to Use Leveragable Timely
Integrated Meaningful Consistent Validated Easy to Use Leveragable Timely
Trang 12Domestic Oracle Financials
European Oracle Financials
Aberdeen Oracle Financials PREMAS
Intl.
Systems
Aberdeen Data Warehouse
O&G Data Warehouse
Others
Warehouse
Chemhouse
Trang 13Chemical
O&G
Oracle Financials Passport?
Domestic Oracle Financials
European Oracle Financials
Aberdeen Oracle Financials PREMAS
Intl.
Systems
Aberdeen Data Warehouse
O&G Data Warehouse
P2000
DFW
Merak Tobin
Peoplesoft
Consolidated Analysis & Reporting Solution (Cognos Business Intelligence)
Possibly Phased Out or Integrated
Possibly Phased Out or Integrated
Phased Out
HR Data Warehouse
Trang 14• Manager Planning and management of entire product or project lifecycle; May assist in ETL & BI Interface design and development
• Data Warehouse Architect – Applies knowledge of technology options,
platforms, and design techniques across product and project lifecycle;
responsible for design of overall warehouse process
• ETL Specialist – Analysis and design of extraction, transformation, and
loading strategy; development of ETL scripts and procedures
• Business Intelligence Specialist – Design and development of
multidimensional-cubes & reports; performance and tuning of chosen
technologies
• Web Interface Specialist – Design and development of application interface
elements; coordinates interfaces between application components
Data Warehouse Roles
Trang 15ETL
Trang 16Plan/Forecast/ Analysis
Plan/Forecast/ Analysis
Operational Data Store
Operational Data Store WarehouseData
Data Warehouse
OLAP Query
Datamarts
Information Analysis
Information Analysis
Data Mining
Data Visualization
Data Visualization
Global / Dept/ Business Unit Summary and Analysis
Global / Dept/ Business Unit Summary and Analysis
Metadata Management
Executive Information Systems
Executive Information Systems
Data Transformation
Data Extraction and Transformation
• Applying business rules to turn data into useable information
• Clean up and standardization of consumers, vendors, products, etc.
• Integration of disparate internal and external data
• Can be 70% - 80% of effort
• Issues
- Can be difficult and time consuming to define
business rules
- Extraction tools automate only the more simple tasks
Project Management & Quality Assurance
Operations & Systems Management
Data Extraction and Transformation
Trang 17• Smaller windows of opportunity
– Make decision in a shorter period of time due to competitive, global market
• Global marketplace (DW timing updates)
• High-profile e-Business initiatives
– Satisfying requirements
Data
Volume + Inclusion Source + Extract Timing = Warehouse Complexity
Trang 18• Challenge to develop efficient, consistent methods of gathering and cleansing
heterogeneous data
– Capture and load of data from multiple source systems (both internal and external)
– Integrates data into a single source
– Cross-system mapping to standard identifiers (surrogate keys)
– Aggregation for information delivery and BI initiatives
ETL - The “Heavy Lifting”
Trang 19ETL Tools - Only Half the Story
• Half the story: ETL Tools Extract, Transform, and Load data
• Transport data between sources and targets
• Document data element changes (metadata)
• Administer run-time processes and operations
– Scheduling
– Error management
– Audit logs
– Statistics
Trang 20ETL Tools – Core Components
Databases/Files/
Legacy Apps
Metadata Repository
Design Manager
Metadata Import/Export
Metadata Import/Export
Trang 21(Oracle Warehouse Builder / DataJunction)
• Enhanced Scheduling & Logging
• Not Multi-Warehouse Oriented
– Informatica Powermart
• Great UI
• Powerful Scheduling & Logging
• High Price
• Proprietary Transform Language
ETL - The Options
Trang 22RDBMS
me DBMS
SQLScripts
PERLScripts
InterfaceApps
LoaderUtility
LoaderUtility
COBOLCode
Data Repository
ETL - The Reality
Trang 23Informatica Powermart
Repository Manger Designer
Trang 24Business Intelligence
/ Reporting
Trang 25What is Business Intelligence?
Business Intelligence is the transformation of data into
information you can use to drive your business.
There are a number of vendors that have developed
Business Intelligence software Kerr-McGee uses
Cognos
Trang 26Data Warehouse
Data Warehouse
Data Transformation OperationalOperationalData StoreData Store
Business Intelligence Tools
• Combination of applications and tools
• Provide analysis, presentation and
reporting facilities for users
• Tailored to meet diverse needs of
executives, mgrs, analysts
• Data may reside in ODS, data
warehouse or data mart
Plan/Forecast/ Analysis
Plan/Forecast/ Analysis
Information Analysis
Information Analysis
Data Mining
Data Visualization
Data Visualization
Global / Dept/ Business Unit Summary and Analysis
Global / Dept/ Business Unit Summary and Analysis
Executive Information Systems
Executive Information Systems
Project Management & Quality Assurance
Operations & Systems Management
Business Intelligence Tools
Trang 27Highly Summarized Highly
Summarized
Moderately Summarized Moderately
Market Researchers
Management Business Analysts
Market Researchers
Executive
Categorize Information Needs
Financial analysts, product managers,
etc
Financial analysts, product managers,
etc
Senior Management
Salespersons, line managers, administrative staff, etc
Salespersons, line managers, administrative staff, etc
Trang 28Information Delivery Mechanisms
Operational Trends
Web or C/S
Wireless
Mobile
Operational Trends
Web or C/S
(in millions)
1998 1999 2000 2001 2002
Net Revenues $x,xxx $x,xxx $x,xxx$x,xxx $x,xxx
Net income xxx xxx xxx xxx x,xxx
Earnings per share x.xx x.xx x.xx x.xx x.xx
Return on net revenues xx% xx% xx% xx% xx%
Cash & s/t investments $xxx $xxx $xxx$xxx $xxx
Total Assets $xxx $xxx $xxx$xxx $xxx
Shareholder Equity xxx xxx xxx xxx x,xxxOperational Trends
Web or C/S
Predefined Summaries
Predefined Summaries
Directed Analysis
Ad-hoc Queries
Ad-hoc Queries
Delivery Mechanism
Considerations
Integrated with Operations?
Detailed Reporting only?
Real-time or based on a Periodic
Business Cycle (Financials)
Tethered or ‘disconnected’?
C/S Web Wireless
Trang 29B.I Infrastructure
Trang 30All things Cognos
*What we will cover.
Trang 31Drill Through - Linking to source data using selected filters
Powerplay Web - On-Line Analysis Tool for cubes (slice/dice, drill down, drill across & drill through)
Newsbox -A web based folder used to store views of data (reports) Every KMBI user has their own personal newsbox
Trang 32Cognos - Upfront
- Upfront - Portal Management
Trang 33Cognos - PowerPlay
- PowerPlay – web reports/slicing and dicing/data analysis, based on cubes
More Information on Cognos website: http://www.cognos.com/products/businessintelligence/analysis/
More Information on Cognos website: http://www.cognos.com/products/businessintelligence/analysis/
Trang 34Cognos - Impromptu
- Impromptu – printable reports (in PDF) that may/or may not be produced with
prompts for filtered information
Trang 35Cognos - Visualizer
Trang 36• Initial Project meeting should include:
– Client - gives input on look and feel, data requirements, timelines
– Project Manager – ensures project is feasible within budget and time
restraints at the onset and through out the project
– Data Warehouse Architect – ensures all the needed data is in the data
Trang 39Day in the Life
Trang 40Typical Day