Escolar Documentos
Profissional Documentos
Cultura Documentos
A Framework for Building Resilient Data Warehouses using a Mandala Topology Architecture
Michel JACQUES
Information Assembly SPRL, Brussels, Belgium
Introduction
Each user type has a different Source data is in an unrefined state (to Constructing ETL work for a DW project is a complex affair, one that Data Application: DW Platform is Usage-Driven perspective and role towards the DW. various degrees) that must have its data requires planning. This framework uses a DW architecture based on a Existing users include: data stewards, elements differentiated into a DW core Mandala Topology. At its core is an agile, step-by-step approach to Integration analytical end-users, quality control data model (IN) in order to able to masters, DW administrators & integration recombine them effectively afterwards for identify ETL work units: META managers. Each user has his own set of analytics (OUT). During this transition, 1.Identifying the external users information requirements and accesses two other types of data elements are STAR 2.Positioning main data flows, the DW via a specific interface (STAG, produced: metadata (UP) and erroneous MART STAG EDM Data META, MART, or SINK). The STAR can data (DOWN) streams. No data should 3.Decomposing a flow internal layers Stewards only be accessed indirectly via the other be lost in this closed system. Thus the Analytics 4.Incorporating them within the DW application platform (Mandala) interfaces to ensure data integrity & position & direction of a data stream SINK 5.Extending conceptual EDM with a functional level of detail security and prevents dependencies determines its purpose, the means, and caused by different user demands. The its destination. This naturally leads to 6.Classifying data model entities and their dependencies users are part of an iterative feedback asymmetries between streams, which Quality 7.Combining the Functional ER model with Topic Areas & Tiers loop improving future data content and Management: Control must be accounted for in the ETL design.. 8.Enumerate the ETL work units in matrix format Just about every [DW] process has side effects; but they can be deliberate and sustaining quality. instead of unintentional and perniciousand we can also be inspired by it to design some Hence a multi-perspectives/faceted data with rates up to acting as a crossway between The method focuses on the topological relationship between all the DW Intravenous Dextrose infusion warehouse architecture250mls/hr of 20% Dextrose positive side effects to our own enterprises instead of focusing exclusively on a single end. (p.80) these end-users is comprehensive and non-discriminatory at an organizational level. artifacts, in order to comprehensive improve planning & design of DW. Ref: W. McDonough and M. Braungart, Craddle to Craddle, North Point Press, NY, 2002 Dietary intervention with frequent meals and corn starch
Diazoxide intolerant leading to hyponatraemia, oedema and nausea 4.DW Mandala Topology Architecture
The Buddhist Mandala metaphor Data Application: Platform for 5 Architectural Layers The purpose of an ETL is to increase the Enterprise Data Model: Conceptual Model The conceptual model is Octreotide/glucagon architecture intravenously in order to replace counter-regulatory hormones helps visualize an integration & analytical capability of business-oriented, while the META where topological relationships sourced data. An ETL data path consists of 5 logical model is focused on between DW components Subcutaneous Octreotide - hypoglycaemia worsened1 layers, each conducting a different set of data Authoring content and application. The including: data modeling, ETL transformations. The first two IN stages functional data model, proposed data flows, and surrounding integrate the data to enable multiple Prednisolone developed fluid retention herein, places itself in the gap Trn Maps actors & applications; functionally interpretations, while the last two OUT between the two. It extends the STAR interact. The architecture is STAG MART stages specialise the data such that it INimprovement (post-procedure insulin 29 pmol/l), but relapsed the OUT number of entity types from after 4 weeks. Hepatic Arterial Embolisation (HAE) performedcrosswaywith initial twice that EDM similar to a road becomes fit-for-purpose. This mirror-effect is Attendance initial fact and dimension with Profiles gives essential context and Extraction:::Staging:::::Integration::::Publishing:User Access referred to as ETL chirality. new types for holding hierarchies, Roster movement to data and The following are important elements when selecting the most appropriate data modelling dim. ids, details & associations. operations. The context is technique to use: a) the degree of convergence built into the data during ETL; b) the number of Trainings This improves history-keeping SINK composed of 5 distinct locations unique pathways in the dimensional model; c) increased data flow resilience by decreased data and makes the core data model (STAG, META, SINK, MART, & reliance; and d) ability for decoupling of model components. Erroneous more resilient while decoupling STAR) that provide a clear Data disturbances will occur either from external sources in an unexpected, subtle or extreme Data Repo the associated data flows. logical structure determining manner, which requires a faster data recovery by minimising data reload to only what is relevant. There is a need to extend our vocabulary of forms when modelling. This involves adding data, flows, modeling patterns, access and security, user groups, and integration methods. Resilience also involves cyclic transfer of information across data applications reinforcing each functional features so as to harmonise form with function and thus achieve a greater Moreover, locations enable data persistence facilitating data recovery and transformations. system data quality and monitoring usage. decoupling of DW artefacts, whilst maintaining data cohesion.
Metadata
(static)
T00
Dimensions
base, details & struct)
T01
Facts
(events & profiles)
T02
Grids
(associations)
T03
Data Marts
(lov, hier & msr)
T99
Conclusion: The advantages of implementing such a topological architecture include: greater scalability of additional data themes, enhanced performance of data flows, increased resilience of decoupled artifacts, sturdier quality control, and lower operating and development costs. The framework provides a comprehensive, reproducible, and proven DW architecture solution.