Journal articles

A UML-based Data Warehouse Design Method

Data warehouses are a major component of data-driven decision support systems (DSS). They rely on multidimensional models. The latter provide decision makers with a business-oriented view to data, thereby easing data navigation and analysis via On-Line Analytical Processing (OLAP) tools. They also determine how the data are stored in the data warehouse for subsequent use, not only by OLAP tools, but also by other decision support tools (e.g. reporting or data mining). Data warehouse design is a complex task, which requires a systematic method. Few such methods have been proposed to date. This paper presents a UML-based data warehouse design method that spans the three design phases (conceptual, logical and physical). Starting from user requirements, the conceptual phase leads to an object-oriented model represented using the UML notation. For this purpose, UML is enriched with concepts relevant to multidimensional modeling. The logical phase maps the enriched UML model into a multidimensional schema, independently of any implementation tool. The physical phase allows the designer to map the multidimensional schema into a physical database schema depending on the implementation target tool, typically a native multidimensional (MOLAP) or a relational (ROLAP) tool. Our method comprises a set of metamodels used at each phase (in particular a unified multidimensional metamodel used at the logical phase), as well as a set of transformations that can be semi-automated. Following our object orientation, we represent all the metamodels using UML, and illustrate the formal specification of the transformations based on OMG¿s Object Constraint Language (OCL). Throughout the paper, we illustrate the application of our method to a case study.
PRAT, N., AKOKA, J. et COMYN-WATTIAU, I. (2006). A UML-based Data Warehouse Design Method. Decision Support Systems, 42(3), pp. 1449-1473.