Chapter

A Model-Driven Heuristic Approach for Detecting Multidimensional Facts in Relational Data Sources

09/2010; DOI:10.1007/978-3-642-15105-7_2 pp.13-24
Source: DBLP

ABSTRACT Facts are multidimensional concepts of primary interests for knowledge workers because they are related to events occurring
dynamically in an organization. Normally, these concepts are modeled in operational data sources as tables. Thus, one of the
main steps in conceptual design of a data warehouse is to detect the tables that model facts. However, this task may require
a high level of expertise in the application domain, and is often tedious and time-consuming for designers. To overcome these
problems, a comprehensive model-driven approach is presented in this paper to support designers in: (1) obtaining a CWM model
of business-related relational tables, (2) determining which elements of this model can be considered as facts, and (3) deriving
their counterparts in a multidimensional schema. Several heuristics –based on structural information derived from data sources–
have been defined to this end and included in a set of Query/View/Transformation model transformations.

0 0
 · 
0 Bookmarks
 · 
51 Views
  • Article: Extracting the extended entity-relationship model from a legacy relational database
    [show abstract] [hide abstract]
    ABSTRACT: The maintenance of an existing database depends on the depth of understanding of its characteristics. Such an understanding is easily lost when the developers disperse. The situation becomes worse when the related documentation is missing. This paper addresses this issue by extracting the extended entity-relationship schema from the relational schema. We developed algorithms that investigate characteristics of an existing legacy database in order to identify candidate keys of all relations in the relational schema, to locate foreign keys, and to decide on the appropriate links between the given relations. Based on this analysis, a graph consistent with the entity-relationship diagram is derived to contain all possible uniary and binary relationships between the given relations. The minimum and maximum cardinalities of each link in the mentioned graph are determined, and extra links within the graph are identified and categorized, if any. The latter information is necessary to optimize foreign keys related information. Finally, the last steps in the process involve~(when applicable) suggesting improvements on the original conceptual design, deciding on relationships with attributes, many-to-many and n-ary (n⩾3) relationships, and identifying is-a links. User involvement in the process is minimized to the case of having multiple choices, where the system does not have the semantic knowledge required to decide on a certain choice.
    Information Systems.
  • Source
    Article: The Dimensional Fact Model: A Conceptual Model for Data Warehouses.
    Int. J. Cooperative Inf. Syst. 01/1998; 7:215-247.
  • Source
    Conference Proceeding: Multidimensional Design by Examples.
    Data Warehousing and Knowledge Discovery, 8th International Conference, DaWaK 2006, Krakow, Poland, September 4-8, 2006, Proceedings.; 01/2006

Full-text (2 Sources)

View
6 Downloads
Available from
17 Oct 2012

Keywords

application domain
 
business-related relational tables
 
comprehensive model-driven approach
 
concepts
 
conceptual design
 
CWM model
 
data sources–
 
data warehouse
 
events
 
main steps
 
multidimensional schema
 
operational data sources
 
primary interests
 
problems
 
Query/View/Transformation model transformations
 
structural information
 
tables
 
tedious
 
time-consuming