logo

Data Mining and Data Warehousing - IT 446

   

Added on  2022-08-10

9 Pages2260 Words408 Views
Instructions:
You must submit two separate copies (one Word file and one PDF file) using the Assignment Template on
Blackboard via the allocated folder. These files must not be in compressed format.
It is your responsibility to check and make sure that you have uploaded both the correct files.
Zero mark will be given if you try to bypass the SafeAssign (e.g. misspell words, remove spaces between words,
hide characters, use different character sets or languages other than English or any kind of manipulation).
Email submission will not be accepted.
You are advised to make your work clear and well-presented. This includes filling your information on the cover
page.
You must use this template, failing which will result in zero mark.
You MUST show all your work, and text must not be converted into an image, unless specified otherwise by the
question.
Late submission will result in ZERO mark.
The work should be your own, copying from students or other resources will result in ZERO mark.
Use Times New Roman font for all your answers.
Data Mining and Data Warehousing
IT 446
Student Details:
Name: ###
CRN: ###
ID: ###
Data Mining and Data Warehousing - IT 446_1
Pg. 1 Question TwoQuestion Two
Question One
Compare and contrast Star and Snowflakes schema of data warehousing model.
Star Schema: In terms of star schema, one fact table can remain in the center of
star and other linked dimension tables around it. This schema is called star
schema as the structure is similar to a star (Sanchez, 2016). Star schema is one
of the simplest type of schema in data warehousing. It has another name called
star join schema. This schema is suitable for querying large data sets.
Example of Star Schema
2 MarksLearning
Outcome(s):1
LO1: Explain
different data
mining tasks,
problems and the
algorithms most
appropriate for
addressing them.
Data Mining and Data Warehousing - IT 446_2
Pg. 2 Question TwoQuestion Two
Star schema represents every dimension using single dimension table. Attributes of
dimension should be represented within dimension table. Each dimension table should
be connected to fact table using foreign key. Dimension tables should not have
connection between each other (Sidi et al., 2016). Fact table should consist of measure
and key. Star schema is well known for optimal disk space and easily understandable
concepts. Normalization concepts should not be applied to dimension tables. Star
schema has wide support by business intelligence tools.
Snowflake Schema: Snowflake is the extended version of star schema. This is because
snowflake includes additional dimensions within star schema. The same reason is
applied to naming this schema, its structure is similar to a snowflake. Snowflake tables
are created by applying normalization concepts (Benjelloun, El Merouani & El Amin,
2017). For this reason, the dimension tables are decomposed into smaller tables which
add additional dimension tables into schema.
Snowflake Schema
The biggest feature of snowflake schema is lesser usage of disk space than star
schema. Dimension implementation is comparatively easier. Query performance is
Data Mining and Data Warehousing - IT 446_3

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
IT 446 Data Mining and Data Warehousing | Assignment
|9
|1556
|13

Computing and Informatics Assignment
|5
|1472
|23

College of Computing and Informatics | Assignment-2
|9
|2152
|27

Assignment Template | Document
|5
|1485
|69

College of Computing and Informatics Assignment
|6
|1733
|22

Computing and Informatic docx.
|9
|2172
|15