logo

ITECH 2201 Cloud Computing Assignment (Big Data)

   

Added on  2020-05-28

6 Pages2377 Words124 Views
 | 
 | 
 | 
Running head: CLOUD COMPUTING (BIGDATA)Cloud Computing (Big Data)Name of the studentName of the AssignmentAuthors Note
ITECH 2201 Cloud Computing Assignment (Big Data)_1

1CLOUD COMPUTING (BIGDATA)Part A (4 Marks)Exercise 1: 1. Data Science refers to the emerging fields of social science, statistics, computer science,information, and design. 2. In the past two years, the percent of the data IBM has been created in the world is 90%(Chen,Mao & Liu, 2014).3. The value of petabyte storage is a million gigabytes (1015) (Rajasekar, Dhanamani & Sandhya,2015).4. Foundation course: The foundation courses offer the students with the basic knowledgeand required skills, which are relevant in the area of information and data science. This courseoffers design analysis, store, retrieve and many more.Advanced course: Advance courses give deeper focus on the application and values ofdata science. The course contains skill that is more complex and analytical methods like adesign, experiment, data visualization, and big data related problem-solving. Exercise 2: 5. According to the author, the motivation comes from the use of big data and to the futureof big data. The fact that “Big Data is now the part of our life and Big Data hides a lot ofsolutions to almost many problems of the industry” brings the actual motivation to researchersand students. People are now part of the “Big Data Ocean” so this can be the other reason formotivation. The author wanted to show the developing trends of Big Data and discover the truevalue of it through this paper.6. The 7 V’s mentioned in the paper are:1)Volume: In a Big Data, Volume refers to the data size that is created from several sourcessuch as audio, video, text, social networking, research studies, crime reports, weatherforecasting and natural disaster as mentioned. 2)Velocity: Velocity of data is discussed with two perspectives.a) The Velocity of incoming data where the business needs can be prepared with thetechnology and with the process of database engine. b) The Speed of movement of big data into a big storage through a fast response when thedata arrives.3) Variety: Big Data variation depends on different shapes like images, video, text as it acquireda direct interface from the users.
ITECH 2201 Cloud Computing Assignment (Big Data)_2

2CLOUD COMPUTING (BIGDATA)4) Veracity: Veracity of Big Data mainly focuses on the reliability of the data. Thus, it is theimportant steps to process big data to cleanse data.5) Validity: Validity of Big Data refers to “data accuracy, and correctness with regard to theintended usage”. The significance of the utility of big data is valid with the relationshipbetween the data elements and the intended consumption.6) Volatility: Volatility depends on the data retention policy regardless of whether it is fortraditional data or big data. Implementation of it can be easily achieved in a relationaldatabase.7) Value: As compared to all the 6 V’s, Value has the desired outcome of the big data analysis,while in the previous ones, they have features and approaches. 7. Big data have a lot of potential in the research community and the real world industry. Ithas generated billions of data every day, which has made it a developing trend. Big data hashidden many solutions to the problem of an industry, which have made it a part of our life. Theresearchers have made understood the 7 V’s of which 7th V as ‘Value’ is the actual output of theindustry challenges and the issues. In addition to the Big Data Ocean, the data has dominated thetechnology. The value of data should exceed the ownership, cost or management. Thegovernance of mechanisms largely depends on the data value. It is necessary to write and executethe limit of the true value of enterprise for the data extraction of the policy and structure.Typically, the data can be between the layers. In short, the risk will be low for the data, whichare at the higher level. Therefore, it accepted to be on a level where the cost is associated to havehigh storage cost with a higher level of protection. The research has shown the vision to improve healthcare as a product of Big DataUtilization. Data on healthcare will eventually grow in the field of predictive medicine, makeefficient use of case studies, treatment of histories and diseases, and provide a prescription ofdata and finally will bring improvement in healthcare. Though there is a fear about the Big Dataunintended harm, there is also a believe which will benefit at the end of the day by outweighingsuch harm. So, a high amount of research is still needed to get the solutions, reasons, andinformation. Therefore, Big Data Utilization will be the future work for every researcher andeach student to move forward.Exercise 3:8. From data partners, custom connectors, data partners, HTTP, logs big data can be acquired.Once data is collected, it will be stored in the cloud and the enterprise will use it (Di Martino etal, 2014).9. The Data stored in the NoSQL and HDFS are pre-processed, transformed and then organizedto load in the data warehouse. A specific data required sessionization, which categorizes all the
ITECH 2201 Cloud Computing Assignment (Big Data)_3

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
ITECH 2201 Cloud Computing School of Science
|25
|6295
|137

Big Data and MapReduce
|23
|7772
|440

Green Computing, Big Data, and Storage Design: A Workbook for Week 6-8
|12
|2466
|122

What is Data Science? Part B Exercise 1: What is Data Science?
|24
|7358
|489

Big Data Workbook for Week 6 - ITECH 2201 Cloud Computing
|10
|3514
|434

Big Data Characteristics, Tools and Applications - ITECH 2201 Cloud Computing
|8
|1913
|494