logo

CIS8008 - Assignment on Business Intelligence

   

Added on  2020-03-04

22 Pages3176 Words50 Views
Running head: ASSIGNMENT 2Assignment 2Name of the StudentName of the UniversityAuthor Note

1ASSIGNMENT 2Table of ContentsTask 1.........................................................................................................................................2Task 1.1......................................................................................................................................2Task 1.2......................................................................................................................................3Task 2.........................................................................................................................................5Task 2.1......................................................................................................................................5Task 2.2....................................................................................................................................13Task 3.......................................................................................................................................17References................................................................................................................................19

2ASSIGNMENT 2Task 1Task 1.1Data Warehouse: A data warehouse is a kind of relational database designed forspecific query and data analysis. It is not used for regular transactional processing of the data.Historical data collected from different sources are collected from different transactional dataand other sources. A data warehouse helps the organization to separate out the analysisworkload from the transactional workload of the servers (Kimball 2013). Apart from theanalysis capabilities, a data warehouse also has the capability to do data extraction,transportation, data transformation and data loading solutions. It also includes an onlineanalytical processing engine (OLAP), client data analysis tools and applications, which areused to process the gathering of information and to deliver it to the users. A data warehouse isdesigned to help in analyzing of the information collected from the sources. To learn moreabout a department of an organization, they can invest in a data warehouse, which willanalyses the information collected from the department (Vaisman and Zimányi 2014). Theability to analyses the information in a section wise manner helps the warehouse to be subjectoriented in nature. A data warehouse has the properties of being subject oriented in nature,has data integration procedures, stores time variant information, and has storage fornonvolatile information. To implement a correct data warehouse the organizations mustfollow correct design mechanism. Data Lake: A data lake is a new generation of data storage procedure that has beendeveloped to meet the new emerging trends in data analysis. It can be defined as a temporarystorage area for data being collected from the online resources for the analysis of theorganization. The data collected is just dropped into the data lake accompanied by a uniqueidentifier. This identifier can be used to identify the data that it holds. The identifier ca be

3ASSIGNMENT 2compared to being a metadata tag of the information collected (Miloslavskaya and Tolstoy2016). When data analysis is done of the information, the identifiers are called upon by usinga query. The relevant information is collected and the result is returned. The data fetched isanalyzed and a compact decision is provided. The term Data Lake is coined it the Hadooporiented object storage. Using a data lake can provide effective information during dataanalysis or when data mining is done on the organization (Fang 2015). The concept of a datalake is a new trend in the digital world and is being slowly accepted. As a data lake is a largestorage of information there is no need to follow any schema for designing the storage facilityof the database. Data Mart: Data mart is a small version of data warehouse that is used by a certainclass of workers to store their data analysis information. The term is often misused with datawarehouse, but they are very different terms (Ramos, Alturas and Moro 2017). However,they might to the same work but the working environment is different. For a largerorganization, there is always the option of using a data warehouse. However, the use of a datamart concept in new it is slowly being accepted into the digital world (Golfarelli and Rizzi2013).Task 1.2Data Warehouse: Data has been stored in a data warehouse at a very granular level ofdetails. During analysis, all information related to the query is extracted, changed and loaded.This means that the information is first extracted from the sources and changed into acommon format for the warehouse to read it (Ross et al. 2014). The revised information isthen loaded into the database to continue analyzing. When a query is sent to the datawarehouse, it first locates the information from the warehouse and retrieves the data. It thenpresents the information in an integrated view for the user to view. A warehouse provides abetter form of query support than the traditional database. The warehouse has access to

4ASSIGNMENT 2enhanced spreadsheet functions, structured and faster query processing, and data mining andefficient viewing. The enhanced spreadsheet function helps the organization to view theanalyzed data in a better view. An organization should have a data warehouse for doingcompetitive and comparative historical data analysis, to get real time analysis of financialinformation of the organization, to simplify the data processing methods, to identify thecompetitive market trends and to reduce the cost in the operations of the organization(Kimball and Ross 2013). Most of the organizations can benefit from the use of a datawarehouse. Data Lake: A data lake helps an organization to analyze data and information ofdifferent variety and volume of the data (O'Leary 2014). To implement a successful data lakeimplementation an organization has to use different tools to collect the information frommultiple data sources. They also have to keep in mind they need to do the data collection in adomain specific information. Searching of information in different department would causeconfusion, as the only identifier of the data is a metadata tag. There should also be animplementation of an automated management of the metadata information. The data lakeshould have the ability to scan out the new incoming information into categories, tag themand store them in the database (Roski, Bo-Linn and Andrews 2014). Following these steps,an organization will be able to implement a data lake in their organization. The schema whicha traditional a database follows is absent in such a data lake which make the implementationeasier. Data analysis on an experimental basis can also be done on the data stored in the lake. Data Mart: A data mart is targeted for a department in an organization; data analysisis easier on the information stored in the data mart. A large organization can save resourcesand time by analyzing the information department wise (Rahman, Riyadi and Prasetyo 2015).The final analysis data can be clubbed to form a better-detailed information. The data martuse the OLAP feature of the data warehouse to do data analysis of the information. Using a

5ASSIGNMENT 2data mart in an organization is helpful because the load of analyzing a data warehouse isshared between the data marts. It produces authorize able different subsets of the datawarehouse. It can be used to analyze the return of investment of a department of anorganization. The data mart provides savings by reducing time consumed for the analysis ofthe information (Zhu et al. 2015). If the data mart is not used in the right manner then thewhole s\data warehouse can collapse. Task 2Task 2.1The following set of images show the charts created using Rapid Miner:Figure 1: Initiating the connection between the database and the software

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
Data Management Architectures Assignment
|28
|2628
|100

Data Warehouse for a University Literature Review
|22
|5001
|138

Business Intelligence | Task Report | Answers
|19
|2054
|19

Data Warehouses vs Data Marts
|4
|608
|272

Database Application Direction | Data Warehouse
|5
|799
|22

Developing a Proof of Concept Data Warehouse/Data Mart using Dimensional Modelling
|11
|2251
|475