DW General - 1 High Level Overview

A dimensional database is a relational database that uses a dimensional data model to organize data. This model uses fact tables and dimension tables in a star or snowflake schema. So in general we can say  dimensional database is just the optimal type of
database for data warehousing(Actually from my viewpoint, this is also the drawback for the Data Warehouse, because all the current RDBMS are all target for transaction system and to track the operational data by nature, it is not designed for data warehouse
system, so even data warehouse can build on them, it suffer many limitations).

Overview

Data warehouse databases provide a decision support system (DSS) environment in which you can evaluate the performance of an entire enterprise over time. In the broadest sense, the term data warehouse is used to refer to a database that contains very large
stores of historical data. The data is stored as a series of snapshots, in which each record represents data at a specific time. By analyzing these snapshots you can make comparisons between different time periods. You can then use these comparisons to help
make important business decisions.

Data warehouse databases are optimized for data retrieval. The duplication or grouping of data, referred to as database
de-normalization, increases query performance and is a natural outcome of the dimensional design of the data warehouse. By contrast, traditional online transaction processing (OLTP) databases automate day-to-day transactional operations. OLTP
databases are optimized for data storage and strive to eliminate data duplication. Databases that achieve this goal are referred to as normalized databases.

An enterprise data warehouse (EDW) is a data warehouse that services the entire enterprise. An enterprise data warehousing environment can consist of an
EDW, an operational data store (ODS), and physical and virtual
data marts.

A data warehouse can be implemented in several different ways. You can use a single data management system, such as Informix or SQL Server, for both transaction processing and business analytics. Or, depending on your system workload requirements, you can
build a data warehousing environment that is separate from your transactional processing environment.

Data warehouse

DW is kind of database that is optimized for data retrieval to facilitate reporting and analysis. A data warehouse incorporates information about many subject areas, often the entire enterprise. Typically you use a dimensional data model to design a data
warehouse. The data is organized into dimension tables and fact tables using star and snowflake schemas. The data is de-normalized to improve query performance. The design of
a data warehouse often starts from an analysis of what data already exists and how to collected in such a way that the data can later be used. Instead of loading transactional data directly into a warehouse, the data is often integrated and
transformed before it is loaded into the warehouse.

The primary advantage of a data warehouse is that it provides easy access to and analysis of vast stores of information on many subject areas.

Data mart

A database that is oriented towards one or more specific subject areas of a business, such as tracking inventories or transactions, rather than an entire enterprise. A data mart is used by individual departments or groups. Like a data warehouse, you typically
use a dimensional data model to build a data mart. For example the data mart might use a single star schema comprised of one fact table and several dimension tables. The design of a
data mart often starts with an analysis of what data the user needs rather than focusing on the data that already exists.

Operational data store

A subject-oriented system that is optimized for looking up one or two records at a time for decision making. An operational data store (ODS) is a hybrid form of data warehouse that contains timely, current, integrated information. Including the ODS in the
data warehousing environment enables access to more current data more quickly, particularly if the data warehouse is updated by one or more batch processes rather than updated continuously. The data typically is of a higher level granularity than the transaction.
You can use an ODS for clerical, day-to-day decision making. This data can serve as the common source of data for data warehouses.

DW General - 1 High Level Overview

时间: 2024-10-08 23:37:47

DW General - 1 High Level Overview的相关文章

classloader

4 Jboss 启动及加载过程 详细参考: http://tech.it168.com/j/2007-06-27/200706271521984.shtml 1) org.jboss.Main.main(String[]) 为入口 . 2) main 函数创建一个名叫” jboss ”的线程组 , 然后创建一个属于该组的线程 , 在线程中执行 boot 方法 . 3) boot 方法首先处理 main 函数中的参数 ( 及一些其它的系统环境设置 ), 接着就用系统的属性创建了org.jboss.

OpenStack Identity API v3

Table Of Contents OpenStack Identity API v3 What’s New in Version 3.7 What’s New in Version 3.6 What’s New in Version 3.5 What’s New in Version 3.4 What’s New in Version 3.3 What’s New in Version 3.2 What’s New in Version 3.1 What’s New in Version 3.

Indexing Sensor Data

In particular embodiments, a method includes, from an indexer in a sensor network, accessing a set of sensor data that includes sensor data aggregated together from sensors in the sensor network, one or more time stamps for the sensor data, and metad

Oracle E-Business Suite Maintenance Guide Release 12.2(Patching Procedures)

更多内容参考: http://docs.oracle.com/cd/E51111_01/current/acrobat/122ebsmt.zip Preparing for Patching For patches that have manual steps, the patch readme file instructs you to use Oracle Patch Application Assistant (PAA) to create customized instructions

Java性能提示(全)

http://www.onjava.com/pub/a/onjava/2001/05/30/optimization.htmlComparing the performance of LinkedLists and ArrayLists (and Vectors) (Page last updated May 2001, Added 2001-06-18, Author Jack Shirazi, Publisher OnJava). Tips: ArrayList is faster than

PatentTips - Systems, methods, and devices for dynamic resource monitoring and allocation in a cluster system

BACKGROUND? 1. Field? The embodiments of the disclosure generally relate to computer clusters, and more particularly to systems, methods, and devices for the efficient management of resources of computer clusters.? 2. Description of the Related Art?

ClearOS Community 7.1 Install Virtual Machine

In today's linux tutorial we will follow with the help of snapshots to install, setup and configure ClearOS 7.1 the latest beta verion on VMware 64 bit. ClearOS is a next generation Network, Cloud and Systems management Operating System that can act

10gocm->session5->数据库管理实验->GC资源管理器资源消耗组演示

<GC资源管理器> 官方文档:administrator's Guide->24 Using the Database Resource Manager 使用率:在实际生产环境中使用的比较少 作用:限制数据库资源的使用,CPU  内存  undo表空间  IO  session 场合:当有多个应用使用一个数据库并且资源紧张时可用"资源管理器"调节 实现:可用GC完成也可用EM完成,还可以使用程序包完成 UI方式:简单,快捷,不容易出错 commend方式:设置复杂,命

Enhancing network controls in mandatory access control computing environments

A?Mandatory?Access?Control?(MAC) aware firewall includes an extended rule set for MAC attributes, such as a security label or path. Application labels may be used to identify processes and perform firewall rule-checking. The firewall rule set may inc