University Data Warehouse Engineering Process Report

Verified

Added on  2022/12/18

|8
|2044
|2
Report
AI Summary
This report provides a detailed analysis of the 'A Data Warehouse Engineering Process' article by Sergio Luján-Mora and Juan Trujillo. It examines the intention and content of the article, focusing on the complexities of data warehousing, development methods, and the use of Unified Modeling Language (UML). The report discusses the research method, which involved experimentation to assess data collection and workflow structures. It highlights the stages, levels, and diagrams used in the experiment, including the source, integration, data warehouse, customization, and client aspects. The findings reveal the implementation process and the importance of data population and tuning for optimized running. The report also explores the relevancy of the topic to software engineering, concluding with an assessment of UML usage and the front-end and back-end processes involved. The report is a comprehensive overview of the article, offering insights into the data warehouse engineering process.
Document Page
Running head: DATA WAREHOUSE
DATA WAREHOUSE
Name of the Student
Name of the University
Author’s note
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
1
DATA WAREHOUSE
Table of Contents
Introduction......................................................................................................................................2
Body.................................................................................................................................................2
Conclusion.......................................................................................................................................6
Bibliography....................................................................................................................................7
Document Page
2
DATA WAREHOUSE
Introduction
Identification of the paper that will be reviewed
The Paper that is chosen for execution of the project is ‘A Data Warehouse Engineering
Process’, written by Sergio Luj´an-Mora and Juan Trujillo.
Statement of Purpose
The goal of the work is to properly develop a data warehouse engineering process for
making the developing process highly efficient.
Outline of the report
This report will be discussing about the intention and the content that is present in the
chosen article and journal. This report also discusses about the research method that is
implemented in the process. Proper discussion regarding the observation that has been made will
also be performed. Precise description of the experiment that has been conducted will also be
stated. Finding of the project deliverable will also be stated. The result that has been received is
also well discussed in the report. Discussion regarding the relevancy of the article as per the
course is also made in this report. A proper conclusion is also provided in the report.
Body
Intention and content of the article
The entire discussion rotates around the terminology of data warehouse. The complexity
of data warehouse has been one of the prime focus points of the chosen article. Discussion
regarding the time that is consumed in the proposition of the tasks has also been a point of
discussion in this case. Chances of the project failing has also been one of the major discussion
point. The diversified Data warehouse models that have been used in the recent years are also
been discussed. Development method of data warehouse is also provided. This development
process is mainly focussed on Unified Modelling language. Unified Process is also well stated in
the paper. Proper discussion regarding the back end and the front end process is also made.
Importance of UML in designing the varied sections of data warehouse is also stated in the
paper.
Document Page
3
DATA WAREHOUSE
Research method
Experiment is conducted for performing the completion of the project. In this report, data
is not collected via any kind of secondary data research. In this case primary data research is
taken into consideration. Experiment has been performed for proper assessment of the data
collection process. With the help of experimentation, the main aspect that is considered is that
the data collection will be directly as per the data generated in the experiment. Hence wise
analysis of the same data will be helping in performing highly accurate understanding regarding
the process.
Observation
The proper assessment of the data warehouse structure can be performed as per the
workflow structure and conceptual modelling of the DW. The source of the target data is also
performed as per the target data map. This process is developed as per the conceptual level. The
main diagrams that are made includes the likes of the DW conceptual schema. The client
conceptual schema as well as the data mapping. Proper relationship is established in between the
relationship of the SCS and the DWCS. Again the relationship in between the DWCS and CCS
will also be well performed. From CCS, it can be stated that extension of the Unified Modelling
Language. The entire modelling will be performed as a set of stereotype and hence wise elegant
multi-dimensional properties as per the conceptual level is well analysed. Usage of Object
Constraint Language is also well stated as per the present stereotypes of the standard modelling
language.
Experiment
The experiment in this paper is subdivided in 3 major parts. The parts are namely Stages,
Levels and Diagrams. In the initial phase, stages are well defined. This section is mainly focused
on the Source, Integration, Data Warehouse, Customization and Clients. The Source of the
project is basically focused on the data sources that are considered for the data warehouse.
Considerations regarding the OLTP systems, external data sources are considered. In this case
proper discussion regarding the syndicate data and census data are made. After performing this
stage integration process are performed. This ensures that proper mapping in between the data
sources are performed. Mapping in between the data sources and the data warehouses as well.
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
4
DATA WAREHOUSE
After this section proper concept development of the Data warehouse is performed. This helps in
better developing of the framework of data warehouse. After this section proper analysis of the
customization process is performed. This ensures that better assessment of the mapping process
ion between the data warehouse and the client structure is performed. After this section proper
analysis of the structure of the as per the client access to the data warehouse is also stated. Proper
analysis of the OLAP application is also well stated. Proper research regarding the data mart is
also stated in the section. The three levels that are discussed in this case includes the likes of
conceptual level, logical level and the physical level. The conceptual level deals with the
conceptual point of view of the Data warehouse. The logical level is focused on addressing of the
designing section of the data warehouse system. Proper definition of the ETL processes will also
be provided in the section. This section defines the physical aspect of the data warehouse system.
This is the main reason that proper understanding of the configuration of the database servers can
be framed. After performing this stage, diagrams are provided. This ensures the fact that proper
analyzed diagrams are stated. This ensures that the designer will be playing the main role in this
section. In this case there are 3 major sections. The sections include the likes of Name of the
coined diagram, UML diagrams and the profile of the dashed boxes.
The experiment is mainly performed on the basis of Data warehouse engineering process.
This section is completely based on the processing of Unified Software Development process.
This is the main reason that industry standards are well set along with the UML vision modelling
language. This will help in bettering of the software development process with the help of UML.
The UP is considered as the generic SEP (Software Engineering Process). This SEP is
instantiated for the organization and hence wise this will be effecting the project or domain of the
organization. This will be helping in better data warehouse development. Few of the sections that
are considered in this case includes the likes of UP specifying the likes of the developing of the
data warehouse. Use case requirement, iterative, architecture centric and incremental process is
also performed. The entire project life cycle is divided in 4 major parts. The parts are namely
Interception, Elaboration, Design, Implementation and Transition. The work flows are again
diversified in 5 major sections. The sections include proper assessment of the requirement
gathering, analysis, designing, Implementation and Test process. After this 2 more work flows
are added to the system. This includes the fact that proper care in the section of the Maintenance
and post development review is provided. This is the main reason that better emphasis on the
Document Page
5
DATA WAREHOUSE
shifts over iteration can be developed. The iteration process can be performed in a same time.
This is the main reason that functioning process will be evolving in a better manner. Separate
and distinct UML diagrams are also provided in the section. The models are well modified and
hence wise better sectioning of the workflow diagrams are implemented.
The requirements of the experiment includes understanding of the exposure regarding the
final user which will be helping in understanding the interesting measures and aggregation. This
will also require the analysis of the dimensions that are considered. Proper queries that will be
required for generation of the periodical reports will also be helping in easing the project. With
the help of this process, frequency of the data can be well analyzed as well. The use case that is
considered in this case incurs proper assessing of the user needs along with the different
approaches of traditional elicitation. This will also help in understanding the wants of the user
and hence wise proper assessment of the data warehouse establishment can be made. This will
also help in understanding the vivid roles and the designations that are to be considered. With the
help of the UML proper use case diagram can be made. This will ensure that visual modeling use
case can be performed. A common template will be followed in completion of the experiment.
This will be helping in proper specification of the name, unique identifier and system state that
will be assessing the actual steps of the use case.
Finding
The aim of the workflow is to verify the implementation process of the work that is
desired. The result states implementation of the physical Data Warehouse are to be built on the
concept of the data population and the data warehouse tuning as per optimized running. The
main diagrams includes the likes of data warehouse logical schema, client logical schema, ETL
process, Exportation process and Transportation diagram. As per this sectioning, cleansing and
quality control activity is also modelled.
Problem highlighted
The main aspect that proper data warehousing is not performed properly.
More number of iterations are expected
Result
Document Page
6
DATA WAREHOUSE
The workflow of the development effort is absent. The development process is used in a
manner that the outcome is considered for the personnel who have more than 20 years of
experience. In case of maintaining the track record of time taken and effort provided for each of
the phase, proper estimation of the time and staff requirement can be performed for the future
projects.
Discuss the conclusion
The conclusion of the report is mainly concerned with the age group whose data is
considered for completion of the experiment. Proper assessing of the usage of UML is also well
stated. Usage of UML as a modelling language is also properly stated. Proper assessment of the
front end as well as the back end is also considered as an integral part.
Relevancy of the topic
Relevancy of the topic is very high as the data warehousing is mainly focused on the
processing of the queries and hence wise proper structuring and queries can be made. These are
the topics that are highly relevant to that of the software engineering.
Conclusion
From the above discussion it can be stated that a proper discussion regarding the
experimental research method is made. Proper discussion regarding the experimental process is
also provided. Again proper discussion regarding the findings of the project is also stated.
Relevancy of the topic is also stated in this report.
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
7
DATA WAREHOUSE
Bibliography
[1] S. Luján-Mora. and J.Trujillo,. A data warehouse engineering process. In International
Conference on Advances in Information Systems (pp. 14-23). Springer, Berlin, Heidelberg, 2004,
October.
[2] R. Kimball. and M. Ross. The kimball group reader: Relentlessly practical tools for data
warehousing and business intelligence remastered collection. John Wiley & Sons, 2015.
[3] O. Boussaid, J. Darmont, F. Bentayeb and S. Loudcher. Warehousing complex data from the
web. arXiv preprint arXiv:1701.00398, 2017.
chevron_up_icon
1 out of 8
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]