






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
The importance of data warehouses in business and their role in providing competitive advantage, productivity enhancement, and customer relationship management. It outlines the steps for designing and constructing a data warehouse, including the top-down view, data source view, data warehouse view, and business query view. The document also covers the three-tier architecture of data warehouses, consisting of the bottom tier warehouse database server, middle tier OLAP servers, and top tier client tools. The document further compares and contrasts ROLAP (Relational Online Analytical Processing), MOLAP (Multidimensional Online Analytical Processing), and HOLAP (Hybrid Online Analytical Processing) OLAP servers.
What you will learn
Typology: Study notes
1 / 10
This page cannot be seen from the preview
Don't miss anything!
4.1 Steps for the design and construction of warehouse:
A data warehouse provides competitive advantage by presenting relevant information from which to measure performance and make critical judgments in order to win in the competitive market space. A data warehouse can enhance productivity since it can quickly gather information about the activities of the organization. Thirdly, a data warehouse provides enhanced customer relationship management since it provides consistent view of the customers and items along all lines of the business, all departments and all markets. Finally, it is possible that a data warehouse brings about cost reduction by tracking trends, patterns and exceptions over long periods of time in a reliable and a consistent manner.
The business analysis framework must be understood in order to create an efficient data warehouse. Four different views of the data warehouse must be consisted: the top-down view, the data source view, the data warehouse view and the business query view.
Since data warehouse construction is a difficult and a long term task, its implementation scope should be clearly defined in the beginning. The goals of an initial data warehouse should be specific, achievable and measurable
Data warehouses normally adopt three-tier architecture:
1. The bottom tiers is a warehouse database server that is almost always a relational database stsyem.Data from operational databases and from external sources are extracted using application program interfaces known as gateways. A gateway is supported by the underlying DBMS and allows client programs to execute code.
From the architecture point of view there are three data warehouse models: the enterprise warehouse, the data mart, and the virtual warehouse.
integration, usually from one or more operational systems and from external information providers. It takes extensive business modeling and it takes many years to design and build.
Normally business users are presented with multidimensional data from data warehouses or data marts without them being aware of the way in which i. e. how or where the data are stored. However, the physical architecture and implementation of OLAP servers need to take into consideration the issues regarding data storage. Various implementations are possible:
Data can be summarized and stored in a variety of ways in a multidimensional cube of an OLAP system. A user or analyst can search for interesting patterns in a cube by specifying a number of OLAP operations, such as drill down, roll up, slice, and dice. While tools are there to help the discovery process is not automated. The user follows his or her own intuition or hypotheses, tries to recognize exceptions or anomalies in the data. Discovery driven exploration is an alternative approach in which pre-computed measures indicating data exceptions are used to guide the user in the data analysis process. Exception indicators indicate cell values that are significantly different from the anticipated structural model. Three measures are used as exception indicators to help identify data anomalies. These measures indicate the degree of surprise that the quantity in a cell holds, with respect to its expected value. The measures are computed and associated with every cell, for all levels of aggregation. They are
SelfExp: This indicates the degree of surprise of the cell value, relative to other cells t the same level of aggregation.
InExp: This indicates the degree of surprise somewhere beneath the cell, if we were to drill down from it.
PathExp: This indicates the degree of surprise for each drill-down path from the cell.
Data cubes facilitate the answering of data mining queries as they allow the computation of aggregate data at multiple levels of granularity. Multifeature cubes compute complex queries involving multiple dependent aggregates at multiple granularities. These cubes are very useful in practice. Many complex data mining queries can be answered be answered by multifeature cubes without any significant increase in computing cost, in comparison to cube computation for simple queries with standard data cubes.
From Data Warehousing to Data Mining
In this section we study the usage of data warehousing for information processing, analytical processing and data mining. We also introduce on-line analytical mining (OLAM), a powerful paradigm that integrates OLAP with data mining technology.
Data warehouses and data marts and data marts are used in a wide range of applications. Business executives in almost every industry use the data collected, integrated, preprocessed and stored in data warehouses. Data warehouses are used extensively in banking and financial services, consumer goods and retail distribution sectors and controlled manufacturing, such as demand based production.
The more a data warehouse has been in use the more it would have evolved. This evolution takes place through a number of stages. Initially data warehouses are used for generating reports and answering predefined queries. Progressively, it used to analyze summarized and detailed data. Later, the data warehouses are put to strategic use. Finally the data warehouse may be put to use for strategic decision-making and knowledge discovery using data mining tools.
Business users need to know what exists in the data warehouse (through metadata), how to access the contents of the data warehouse, how to examine the contents using analytical tools and how to present the results of such an analysis. There are three kinds of data warehouse applications: