ITECH 2201 Cloud Computing School of Science

Verified

Added on  2021/05/30

|25
|6295
|137
AI Summary

Contribute Materials

Your contribution can guide someone’s learning journey. Share your documents today.
Document Page
ITECH 2201 Cloud Computing
School of Science, Information Technology & Engineering
Workbook for Week 6 (Big Data)
Please note: All the efforts were taken to ensure the given web links are accessible. However,
if they are broken – please use any appropriate video/article and refer them in your answer
Part A (4 Marks)
Exercise 1: Data Science(1 mark)
Read the article at http://datascience.berkeley.edu/about/what-is-data-science/ and
answer the following:
What is Data Science?
Data science: Multidisciplinary integration of the data reasoning, algorithm development as well as technology
to resolve difficult problems of analysis (Kowolenko & Vouk, 2018).
According to IBM estimation, what is the percent of the data in the world today that has been
created in the past two years?
90% of data in the whole world created in last two years
What is the value of petabytestorage?
Petabyte= 1015 bytes of data
For each course, both foundation and advanced, you find at
http://datascience.berkeley.edu/academics/curriculum/briefly state (in 2 to 3 lines) what
they offer?Based on the given course description as well as from the video. The purpose
of this question is to understand the different streams available in Data Science.
CRICOS Provider No. 00103D Insert file name here Page 1 of 26

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Foundation: The Master of Information and Data Science curriculum design options can accelerate the
completion of the course. Students who are unfamiliar with object-oriented based programming will need to
complete the Python for the Data Science Foundation classes or course as a main part of their basic course
units.
Advanced: Data science roles and responsibilities are different as well as skills needed for them differ
considerably. The advanced course plays an important role in profound or deep understanding and value as
well as application of the data science.
Exercise 2: Characteristics of Big Data(2 marks)
Read the following research paper from IEEE Xplore Digital Library
Ali-ud-din Khan, M.; Uddin, M.F.; Gupta, N., "Seven V's of Big Data understanding Big
Data to extract value," American Society for Engineering Education (ASEE Zone 1), 2014
Zone 1 Conference of the , pp.1,5, 3-5 April 2014
and answer the following questions:
Summarise the motivation of the author (in one paragraph)
In this article motivation for writing this article as well as outlining pertinent arguments comes from a fact that
Big data has become part of everybody life, as well as big data hide solutions to several problems in any of
these industries. In fact, Big Data provides the raw material for building the next great machine. The author
supports the fact in which big data will ultimately take over new technology and the Internet world.
What are the 7 v’s mentioned in the paper? Briefly describe each V in one paragraph.
1. Volume: This is the main feature that defines big data as "big."
2. Velocity: This can simply be defined as the speed of change.
CRICOS Provider No. 00103D Insert file name here Page 2 of 26
Document Page
3. Variety: It can be simply defined as having different forms of data sources.
4. Veracity: It refers to the credibility of the data being used..
5. Variability: First, the variability is different from the variety.
6. Visualization: It refers to how data is presented to management for decision making.
7. Value: Values are the last, and it is important to understand that organizations need to obtain some
value after tremendous efforts and resources.
Explore the author’s future work by using the reference [4] in the research paper.
Summarise your understanding how Big Data can improve the healthcare sector in 300
words.
The healthcare industry faces many challenges, in which new diseases are spread to maintain good efficiency.
Large data analysis can help to resolve these health-care challenges. Due to the enormous data available in
the healthcare sector such as health, clean, financial, medical, research and development, administration and
operational data, meaningful insights can be found to improve the functioning of the industry. Healthcare
companies have used large numbers to check the hospital's admission rates and analyze staff performance as
a part of their professional intelligence plan. Healthcare firms can cut out the healthcare costs and provide
good care with predictive analysis. Big data assists to reduce the risk of medicines by improving administrative
performance and financial and helps to reduce reading. Medical insurance is complex and suffers from
controversial and fraudulent claims. Big Data Analysis helps facilitate the efficiency of medical insurance
claims by disclosing the trends and exposing the claims process. Patients get good returns on their insurance
claims, and keepers receive fast money, relying on EMR's adoption is filling data in healthcare, and because
carers need to keep records of historical patients, the number of data will only increase, this is good news for
solution providers selling data storage and large data It is only for analysis That's good news. The trend is
seen in EMR, diagnosis, treatment effectiveness, operational effectiveness, vendor expenses and many
things. There is everything to look for the right use case for data.
Exercise 3: Big Data Platform(1 mark)
CRICOS Provider No. 00103D Insert file name here Page 3 of 26
Document Page
In order to build a big data platform - one has to acquire, organize and analyse the big
data. Go through the following links and answer the questions that follow the links: Check
the videos and change the wordings
http://www.infochimps.com/infochimps-cloud/how-it-works/
http://www.youtube.com/watch?v=TfuhuA_uaho
http://www.youtube.com/watch?v=IC6jVRO2Hq4
http://www.youtube.com/watch?v=2yf_jrBhz5w
Please note: You are encouraged to watch all the videos in the series from Oracle.
How to acquire big data for enterprises and how it can be used?
In the above video, Big Data now shows a big problem, but there is a simple story behind Hype. For decades,
the company has been making professional decisions based on stored transaction data in relational
databases. Oracle offers a comprehensive and highly integrated product portfolio so that you can create and
manage these different types of data, explore new insights and analyze those using existing data to gain
hidden relationships (AlMahmoud, Damiani, Otrok & Al-Hammadi, 2017).
How to organize and handle the big data?
To help research institutions collect, integrate, organize, and analyze data from a variety of sources, a
comprehensive and comprehensive product portfolio is necessary. Before filtering large data research
platforms in data warehouses, users need to process, filter, and transform large amounts of data.
What are the analyses that can be done using big data?
Infrastructure involved in the analysis of large amounts of data needs to support analytics for example
statistical analysis and data mining in order to store large amounts of the data types in various systems; extent
to intense data volumes; provide fast response times; as well as automate based on an logical models
decision making.
CRICOS Provider No. 00103D Insert file name here Page 4 of 26

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Part B (4 Marks)
Part B answers should be based on well cited article/videos – name the references used
in your answer.For more information read the guidelines as given in Assignment 1.
Exercise 4: Big Data Products (1 mark)
Google is a master at creating data products. Below are few examples from Google.
Describe the below products and explain how the large scale data is used effectively in
these products.
a. Google’s PageRank
PageRank is a measurement that evaluates the number and quality of webpage, determines the
importance of the page and related authentic scores in 0-10 scale.
b. Google’s Spell Checker
Google's spell check is a very old feature that Google constantly improves. It is necessary to
understand how this works to understand keyword research process and to better understand
reputation / brand management.
c. Google’s Flu Trends
Web service operated by the Google.
d. Google’s Trends
Google Trends are publically web based facility for the Google Inc. that is completely based on the
Google Search and it also shows that certain search-periods are often used in various parts of the
world with search-volume and in different languages.
CRICOS Provider No. 00103D Insert file name here Page 5 of 26
Document Page
Like Google – Facebook and LinkedIn also uses large scale data effectively. How?
As a social platform, websites generate mass data. All data is grouped together to analyse behaviours of users
in order to provide recommendations through social media providers. For example, Facebook will notify things
about neglect, so that users can love their posts and their search terms according to their pages or products
they like.
Exercise 5: Big Data Tools(2 marks)
Briefly explain why a traditional relational database (RDBS) is not effectively used to store
big data?
First of all, size of data has been greatly increased to range of PB level, a PB level = 1,024 TB. RDBMS finds it
as very challenging to manage or handle this huge large data. To solve this problem, RDBMS adds more
centralized processing units and more memory generation to the database management systems. Secondly,
most data comes from semi-structured or non-formatted formats of social media, video, text, email and audio
(Zhao, Zhou, Li & Huang, 2018).
What is NoSQL Database?
It offers transaction manipulation, horizontal scalability, and transactional semantics for easy management and
inspection. NoSQL is a database design methodology that can adapt to various data models including various
values, documents, histograms and graphic formats.
Name and briefly describe at least 5 NoSQL Databases
Cassandra originally developed by Facebook and now became Apache open source software, which is well
suited for social networking of the cloud computing databases.
Lucene: Subproject of Apache Software. It is an open source full-text search engine toolkit.
Oracle NoSQL: Oracle NoSQL Database is Oracle's NoSQL type distributed key database.
H Base: An open source, non-relational distributed database
CRICOS Provider No. 00103D Insert file name here Page 6 of 26
Document Page
Big Table non-relational databases: These RDBMSs are mainly used in large enterprise scenarios (Lerner,
2014).
What is MapReduce and how it works?
Map Reduce jobs usually divide the input dataset into separate blocks that are easily processed by the map
task in a totally parallel manner.
Briefly describe some notable MapReduce products (at least 5)
Disco Project
Couchdb
Apache Hadoop
Infinispan
Riak
Amazon’s S3 service lets to store large chunks of data on an online service. List some 5
features for Amazon’s S3 service.
1. Versioning
2. Lifecycle
3. Tagging is a cost allocation classification as AWS billing characteristic to track AWS costs
4. Request Pricing
5. RRS is decreased redundancy storage
Getting the concise, valuable information from a sea of data can be challenging. We need
statistical analysis tool to deal with Big Data. Name and describe some (at least 3)
statistical analysis tools.
SAS has a very comprehensive system of data access,
CRICOS Provider No. 00103D Insert file name here Page 7 of 26

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
SPSS Statistics: system is convenient, more complete statistical methods
EXCEL spreadsheet is powerful spreadsheet software (Lerner, 2014)
Exercise 6: Big Data Application (1 mark)
Name 3 industries that should use Big Data – justify your claim in 250 words for each
industry using proper references.
1. Nonprofits
Even though it might seem strange to jump from the achievement of Fortune in almost five hundred companies
to non-profit firms, the technologies that make for-profit companies flourishing are not automatically mutually
exclusive. One large nonprofit that needs to be considered is Wikipedia. Wikipedia is most significant site in
the whole world. It has more data as compared to other non-profit sites. The potential is staggering, but it can
be done to compensate for its costs including: promoting fundraising hard work, adjusting marketing to
accommodate their population and isolation most likely to be updated Donated users (Dhar, 2014).
2. Sales
For profit-making organizations, large data technology used to receive donations is more effective when the
sale is applicable. Actually, if we are in sales-driven industry, and we do not use large data, we will give
ourselves a lot of pain. The sales experience is not at all how to show the sales knowledge. Like non-profit
organizations, our sales section has a commodity or service which can just be done through financial
transactions and many customers are encouraged.
3. Insurance
Insurance industry which is seen as a most popular industries in the world-data-driven industry, so it is
amazing that some people use large scale shopping solutions to increase their business. Most insurance
CRICOS Provider No. 00103D Insert file name here Page 8 of 26
Document Page
companies are like experts, mineral experts, intermediaries like me, accountants and actuators (Kirkpatrick,
2013).
From your lecture and also based on the below given video link:
https://www.youtube.com/watch?v=_sXkTSiAe-A
Write a paragraph about memory virtualization.
Memory virtualization permits networked servers and distributed servers so that they share a large memory
pool to conquer physical memory limitations, an ordinary bottleneck for software performance.
Watch the below mentioned YouTube link:
https://www.youtube.com/watch?v=wTcxRObq738
Based on the video answer the following questions:
What is RAID 0?
RAID 0, is a method or technique for decomposing files and distributing data across each and every disk
drives in RAID group.
Describe Striping, Mirroring and Parity.
Stripping: Breaks data stripe into a specific size block (known as "block size") and then blocks it in the
entire red section. This type of data storage can affect performance.
Mirroring: Storage system in which a single copy of the data is stored on RAID member simultaneously.
This form of data or information placement affects the fault tolerance and performance.
Parity: There is a storage technique that uses parity stripe as well as checksum mechanism. In Parity
Technologies, a certain parity functions are removed for data validation.
Exercise 2: Storage Design (2 marks)
CRICOS Provider No. 00103D Insert file name here Page 9 of 26
Document Page
Summarize storage repository design based on the following video link:
https://www.youtube.com/watch?v=eVQH7C3nulY
The repository is the rational disk space provided by the file system on physical storage hardware. In case we
create a repository on file system, the file system which already exists, such as an NFS share (Rouse, 2009).
Below YouTube link describes the Intelligent Storage System
https://www.youtube.com/watch?v=raTIRsMi7zk
Based on the watched video answer the following questions:
What is ISS?
Smart Storage System: Feature-rich RAID array and it provides extremely optimized output and input
processing capabilities. Also provides plenty of caching and several performance-enhancing I/O paths.
What are the 4 main components of the ISS?
4 components of ISS:
Cache
Front end
Physical disks
Back end
How cache works in ISS?
Caches are semiconductor memories that temporarily place data to decrease the time necessary to serve host
input and output requests. Caching improves data storage system execution and performance by separating
mechanical delays related to physical disk that is the lowest component of the intelligent data storage system
(RoHS Guild Compliance 2016).
CRICOS Provider No. 00103D Insert file name here Page 10 of 26

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Storage Area Network (SAN) and Network Attached Storage (NAS) are widely
used concepts in data storage arena. The following YouTube video links gives
detailed description of these concepts:
http://www.youtube.com/watch?v=csdJFazj3h0
http://www.youtube.com/watch?v=vdf6CvGQZrk
https://www.youtube.com/watch?v=KxdfGcynfJ0
https://www.youtube.com/watch?v=4RsLUTJ_Qtk
Based on the watched videos answer the following questions:
Describe NAS and SAN briefly using diagrams?
CRICOS Provider No. 00103D Insert file name here Page 11 of 26
Document Page
What are the advantages of SAN over NAS?
Advantages
Virtualization Support
Resiliency
Disaster Recovery
Disadvantages
Compression
De-duplication
Thin Provisioning
What are two common NAS file sharing protocols? How they are different from
each other?
________________________________________________________________
CRICOS Provider No. 00103D Insert file name here Page 12 of 26
Document Page
NAS is file-level computer data storage server, which gives access to data to different customer groups. NAS
cannot be run only as a single file server, but can be dedicatedly do this work by its software, hardware, or by
configuration of these components.
Part B (3 Marks)
Exercise 3: Storage Design (1 Mark)
Design Storage Solution for New Application
Scenario
An organization is deploying a new business application in their environment.
The new application requires 1TB of storage space for business and application
data. During peak workload, application is expected to generate 4900 IOPS (I/O
per second) with typical I/O data block size of 4KB.
The vendor available disk drive option is 15,000 rpm drive with 100 GB capacity.
Other specifications of the drives are:
Average Seek time = 5 millisecond and data transfer rate = 40 MB/sec.
You are required to calculate the required number of disk drives that can meet
both capacity and performance requirements of an application.
Hint:In order to calculate the IOPS from average seek time, data transfer rate,
disk rpm and data block size refer slide 28 in week 6 lecture slide. Once you
have IOPS, refer slide 29 in week 6 to calculate the required number of disks.
The organization is deploying new corporate applications in an organizational
environment. A new application requires 1TB of application data and business storage. At the peak of the
workload, the application is anticipated to create 4900 IOPS with a unique I/O size of about 4KB blocks.
Vendor-provided disk drive options are rpm 15,000 drives with approximately 100 GB of capacity.
Dc=1TB/100GB=10 disks
Ts=0.005s+0.5/(15,000rmp/60)+4kb/40mb=0.0071s
S=1/0.71=141
CRICOS Provider No. 00103D Insert file name here Page 13 of 26

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
0.7S=99
Dp=4900/99=50 disks
Hence, necessary disks drive: 50 disks.
Exercise 4: Storage Evolution (2 Marks)
Watch the following videos for Fiber Channel over Ethernet and answer the
questions that follow:
http://www.youtube.com/watch?v=hSFyf-rmjA8
http://www.youtube.com/watch?v=iCfJCzfNLrw
What is FCoE and why we need FCoE?
FCoE: A storage protocol which enables direct transmission over FCCE. FCoE enables fiber channel traffic to
move into existing very high-speed Ethernet framework and converts the storage as well as IP protocol to a
interface and cable transport.
In your opinion how FCoE is cost effective than traditional connection – give brief
explanation.
FCoE shows a way to dramatically reduce number of switches, cards, adapters, and related cables, by running
LAN as well as SAN on the same infrastructure facilities. According to the analyst company Enterprise
Strategy Group, due to the low cost of hardware, FCOE deployment is cheaper than the traditional network.
You have read and answered about SAN in part A based on your
understanding and with some research effort answers the following questions:
What is a Virtual SAN?
Virtual SAN which is a software-defining storage product that is provided by VMware that allows enterprise to
share storage capacity and provides immediate virtual machine storage through policies powered by simple
virtual machines (Saha, 2014).
CRICOS Provider No. 00103D Insert file name here Page 14 of 26
Document Page
What is IP SAN protocols and FibreChannel over IP (FCIP)?
IP SAN is a storage area network which permits numerous servers to access pool pools of the storage
gadgets using the storage protocol, depending on an Internet Engineering team's standard Internet based
Protocol suite.
Watch the below video about Introduction to Object-based and Unified Storage
and:
https://www.youtube.com/watch?v=kl9X6mzEWO4
Choose the correct answer from the following questions:
What is an advantage of a flat address space over a hierarchical address
space?
a. Highly scalable with minimal impact on performance
b. Provides access to data, based on retention policies
c. Provides access to block, file, and object with same interface
d. Consumes less bandwidth on network while accessing data
What is a role of metadata service in an OSD node?
a. Responsible for storing data in the form of objects
b. Stores unique IDs generated for objects
c. Stores both objects and objects IDs
d. Controls functioning of storage devices
What is used to generate an object ID in a CAS system?
a. File metadata
b. Source and destination address
c. Binary representation of data
d. File system type and ownership
What accurately describes block I/O access in a unified storage?
CRICOS Provider No. 00103D Insert file name here Page 15 of 26
Document Page
a. I/O traverse NAS head and storage controller to disk
b. I/O traverse OSD node and storage controller to disk
c. I/O traverse storage controller to disk
d. I/O is directly sent to the disk
What accurately describes unified storage?
a. Provides block, file, and object-based access within one platform
b. Provides block and file storage access using objects
c. Supports block and file access using flat address space
d. Specialized storage device purposely built for archiving
What is Greenhouse effect?
Greenhouse effect is that radiation from planet's atmosphere raises the temperature of the Earth
surface to a temperature without the atmosphere.
We are legally, ethically, and socially required to green our IT products,
applications, services, and practices – is this statement true? Why?
I agree with the above statement. As stated in Harnessing Green IT: Practices and principles for IT
experts have a responsibility to make IT infrastructure green as well as to reduce the impact on the
environment.
What is Green IT and what are the benefits of greening IT?
Green IT main goals are to achieve minimal environmental impacts through the production, design, and use of
displays, printers, computers, servers, storage devices, and network and communication systems. This
environmental sustainability and IT-Building focuses on production and processing efficiency on energy
efficient, sustainable business related processes and customs. IT can also assist aid and use other
environmental aspects, and can help to create green awareness. Keeping composite computing performance,
green IT "about methods and procedures" to use compatibility with environmentally friendly systems. In other
words, it is a concept that creates a sustainable and environmentally friendly IT practice program. Benefits of
CRICOS Provider No. 00103D Insert file name here Page 16 of 26

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
green IT can be summarized briefly. It does not matter whether it helps reduce greenhouse emissions,
environmental pollution and energy conservation. It is competitive in the industry because investors and
governments are increasing the demand for environmental efficiency.
The benefits of greening IT are:
Reduce green gas emissions
Competitiveness in the industry
Enduring benefits reduce costs
Meets buyer's requirements
Exercise 2: Environmental Sustainability (0.5 Marks)
Read the article in the below link and answer the questions that follow:
http://www.computer.org/csdl/mags/it/2010/02/mit2010020004.html
According to the article how do you build a greener environment?
In this article author Muruges has mentioned four points to create green environment: Green color,
green treatment, and green design moreover green building Green usage means improving energy efficiency
as well as improving the efficiency of various IT industry. Reduce energy consumption in areas Green
Processing means recycling of older IT tools to reduce redress. Green design implies IT. Consolidate energy
conservation and environmental protection concepts in the designing of systems as well as components. Also
Green manufacturing aims to reduce the environmental impact of creating electronic goods (Murugesan 2008).
Summarize the article in 150 words
Muruges use concept of "green IT" to think about it, and how IT reduces its effect on the professional
environment and creates sustainable programs in its IT activities. He explained the concept of green IT for the
first time and said that every step of the computer life has a major impact on our environment, such as
consumption of energy and resources and dangerous waste. Therefore, the author places four points ahead in
CRICOS Provider No. 00103D Insert file name here Page 17 of 26
Document Page
Information Technology activities: Green Use, Green Treatment, Green Construction and Green Design.
These four stages give advice to IT professionals and organizations about how to create a sustainable
environment. Furthermore, as more universities start to offer appropriate courses, they also mention the
possibility of green IT.
Exercise 3: Environmentally Sound Practices (1 Mark)
The questions in this exercise can be answered by doing internet search.
Briefly explain the following terms – a paragraph for each term:
Power usage effectiveness (PUE) and its reciprocal
As author describes, the power consumption is "metrics used to identify the efficiency of energy in the data
centers." This concept was first proposed by members of the Green Grid, which is an industry organization
trying to get better energy efficiency. PE is identified by the energy required to turn on all the IT equipment by
part of the total facility energy in the data center.
Data center efficiency (DCE)
Data center efficiency is basically an indicator that enables data center operators to quickly understand their
energy efficiency so that they can monitor, compare, and enhance performance. It is also proposed by the
green grid, of all the reciprocity of the first PUE. However, due to its misconception, DCE data did not achieve
PUE's success due to its misconception of the data center's energy efficiency.
Data center infrastructure efficiency (DCiE)
The data center framework is interconnected with PUE. It is determined by using the total energy consumption
to divide the actual energy consumption of the data center. It can also be used to evaluate the data center's
energy efficiency.
List 5 universities who offers Green Computing course. You should name the university,
the course name and the brief description about the course.
CRICOS Provider No. 00103D Insert file name here Page 18 of 26
Document Page
Kent University: CO847 Green Computing: This course provides a review of the fundamental concepts related
to computer components' electricity consumption and current low-energy technologies. It also focuses on low-
power networks, building technologies for green systems with virtual machines and energy-efficient large
storage systems.
Australian National University: COMP7310 ICT Continuity: The ability to retain ICTs for evaluating and
reducing the use of carbon footprint as well as materials utilized by computers & telecommunications. Policies
can reduce computers impact on the environment and make businesses more energy-efficient.
University of Chicago: CS 290 N Green Computing: Course studies "green computing" from the system
perspective, but will try to learn from many topics like mechanical engineering, industrial eco-science as well
as economics. It searches the energy-saving system's design on embedded devices from data centers.
Massachusetts Amherst University: CMPSC 69GC Green Computing: Green computing is a science of
creating carbon-efficiency hardware as well as software, moreover the science of computing, which makes the
physical infrastructure more energy-efficient. This will predict and reduce the "green computing" -for complete
server, storage and network design-and "green calculation" -through the Centre Network and reduce the
footprint of carbon structures such as buildings. It also includes related topics for example green energy and
its use in the computing environments. Students will be given documents describing the level of art in this field.
Linkoping University: TDDD 50 Green Computing: This is an introduction to Information and Communication
Technology that guides the concept of sustainable development. This will inspire the global ICT footprint,
emphasizing the energy savings index, the ICT's lifecycle, energy-aware computing, particularly the data
centers and tools to examine energy costs of networks, applications and mobile devices (Linkoping University,
2015).
Exercise 4: Major Cloud APIs (1 Mark)
The following companies are the major cloud service provider: Amazon, GoGrid,
Google, and Microsoft.
CRICOS Provider No. 00103D Insert file name here Page 19 of 26

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
List and briefly describe (3 lines for each company) the Cloud APIs provided by
the above major vendors.
Amazon: Developers Publishing API helps Android Application Directors develop and update applications in
the Developer and Application App Store.
Microsoft: Windows Azure Service Management API, cloud service operating system
Azure Internet-level computing and service platform hosted by a Microsoft-powered or supported data center.
GoGrid API: GoGrid is a grid hosting platform. Developers can use Web Control Panel to access multi-tiered
Internet infrastructure on demand.
Google API: Google Compute Engine provides Google's new information centers and virtual machines running
on a global fiber network. Calculate Engine's tools as well as workflow support can grow from single examples
to worldwide load-barreled cloud computing (Microsoft. 2015).
Part B (3 Marks)
Exercise 1: Greening IT Standards and Regulations (0.5 Marks)
To design green computers and other IT hardware – the following standards and
regulations are mainly used EPEAT (www.epeat.net), the Energy Star 4.0
standard, and the Restriction of Hazardous Substances Directive
(https://www.gov.uk/guidance/rohs-compliance-and-guidance). Use the link
provide with some internet search – summarize each standards and regulations
in 150 words.
EPEAT: Reliable standard used to examine the performance of electronic products and to help the long-
standing sustainability of companies (EPEAT, 2016). These products fall into three types: PCs and monitors,
televisions and video equipment (Australian Government, 2016).
Bronze level: Complete all required standards
CRICOS Provider No. 00103D Insert file name here Page 20 of 26
Document Page
Silver grade: Not only fulfills all the necessary criteria, however also meets at least 70% of the standard
requirements.
Gold Level: 75% of all required criteria and optional criteria must be completed.
ENERGY STARS 4.0 Standard: Dell's right products and study standards that help in reducing greenhouse
gas emissions following the US strict energy efficiency guidelines.
The Restriction of Hazardous Substances Directive: Hazardous substances are an environmental standard
that prohibits the use of six hazardous substances in electrical and electronic products.
Exercise 2: Green cloud computing (0.5 Marks)
Xiong, N.; Han, W.; Vandenberg, A, "Green cloud computing schemes based on
networks: a survey," Communications, IET, vol.6, no.18, pp.3294,3300, Dec. 18 2012
Most part of power consumption in data centers comes from computation processing, disk
storage, network and cooling systems. Nowadays, there are new technologies and
methods proposed to reduce energy cost in data centers. From the above paper
summarize (in 300 words) the recent work done in these fields.
Computational processing, Microprocessor: A multi-clock domain processor, presented by Semipro.
Memory units, which have independent voltage and frequency scaling Comes from
Computing Processing, Job Scheduling: Recently introduced a new method called Data Center Thermal
Sensing Resource Management.
Calculation process, virtualization: Virtualization of computer assets has contributed towards GCC process, as
recent research in this field is flourishing.
Cooling Systems: Cooling in the high end mainframe is the most common topics in the area.
Disk Storage: Fusion solid storage devices can reduce carbon by 80%.
Exercise 3: Cloud API Functionalities (2 Marks)
List the functionalities that can be achieved by using the APIs mentioned in the
following link:
CRICOS Provider No. 00103D Insert file name here Page 21 of 26
Document Page
https://code.google.com/p/sainsburys-nectar-api/
Retrieving the account details
How to decide on the offers
Retrieving offers
What API is used in the following link and how it is used?
https://pypi.python.org/pypi/python-novaclient
In the least complex terms, APIs are sets of necessities that represent how one
application can converse with another. APIs aren't at all new; at whatever point
you utilize a work area or workstation, APIs are what make it conceivable to
move data between programs—for example, by reordering a piece of a
LibreOffice archive into an Excel spreadsheet.
Openstack is an open source collaborative software project which meets many
of the cloud needs. Below links gives vast information about Openstack.
https://support.rc.nectar.org.au/docs/openstack
http://docs.openstack.org/api/quick-start/content/
Write a report (1 page) about the Openstack features and functionalities.
Open Stack is cloud based operating system which controls a large pool of computing, storage, and network
assets throughout the data center. Such operations are managed through dashboards that allow
administrators to control and allow users to prerequisite resources through the web interface.
Nova: main computing engine
Network: Neutron
Glance: image service
Heat: orchestration component
Cinder: block storage component
After successful authentication, users can get access to other Open Stack APIs to create and manage
resources in the Open Stack cloud.
CRICOS Provider No. 00103D Insert file name here Page 22 of 26

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
According to the SUSE site, Open Stack features are:
1. Enterprise Ready
2. Automated Install Process
CRICOS Provider No. 00103D Insert file name here Page 23 of 26
Document Page
References
Amazon. (2016). Developer Publishing API. Retrieved 5 30, 2016, from Amazon:
https://developer.amazon.com/public/apis/manage/developer-publishing
Australian Government. (2016). Greenhouse effect. Retrieved 5 30, 2016, from Australian Government
Department of the Environment :
http://www.environment.gov.au/climate-change/climate-science/greenhouse-effect
AlMahmoud, A., Damiani, E., Otrok, H., & Al-Hammadi, Y. (2017). Spamdoop: A privacy-preserving Big Data
platform for collaborative spam detection. IEEE Transactions On Big Data, 1-1. doi:
10.1109/tbdata.2017.2716409
Australian National University. (2016). COMP7310: ICT Sustainability. Retrieved 5 30, 2016, from Australian
National University: https://cs.anu.edu.au/courses/comp7310/
DELL. (2016). Energy Star 4.0. Retrieved 5 30, 2016, from DELL:
http://sustainability.tufts.edu/wp-content/uploads/EnergyStar_Computers.pdf
EPEAT. (2016). EPEAT Criteria. Retrieved 5 30, 2016, from EPEAT: http://www.epeat.net/resources/criteria/
Google. (2016). COMPUTE ENGINE. Retrieved 5 30, 2016, from Google Cloud Platform:
https://cloud.google.com/compute/
Linköping University. (2015). TDDD50 Green Computing. Retrieved 5 30, 2016, from Linköping University:
https://www.ida.liu.se/~TDDD50/info/courseinfo.en.shtml
Microsoft. (2015). What Is Azure? Retrieved 5 30, 2016, from Microsoft Azure: https://msdn.microsoft.com/en-
us/library/azure/dd163896.aspx
Murugesan , S. (2008). Harnessing Green IT: Principles and Practices. IT professional , 10 (1), 24-33.
Murugesan, S. (2010). Making IT Green. IT Professional , 12 (2), 4-5.
NectarCloud. (2016). Openstack. Retrieved from NectarCloud: https://support.rc.nectar.org.au/docs/openstack
CRICOS Provider No. 00103D Insert file name here Page 24 of 26
Document Page
OpenStack. (2016). What is OpenStack? Retrieved from OpenStack: OpenStack is a cloud operating system
that controls large pools of compute, storage, and networking resources throughout a data center, all
managed through a dashboard that gives administrators control while empowering their users to
provision resources t
Programmableweb. (2016). GoGrid API . Retrieved 5 30, 2016, from Programmableweb:
http://www.programmableweb.com/api/gogrid
Programmableweb. (2016). Windows Azure Service Management API. Retrieved 5 30, 2016, from
Programmableweb: http://www.programmableweb.com/api/windows-azure-service-management
RoHS Guild Compliance. (2016). Is Your Facility RoHS Compliant for 2016? Retrieved 5 30, 2016, from
RoHS Guild Compliance: http://www.rohsguide.com/
Rouse, M. (2009). power usage effectiveness (PUE). Retrieved 5 30, 2016, from TechTarget:
http://searchdatacenter.techtarget.com/definition/power-usage-effectiveness-PUE
Saha, B. (2014). Green Computing. International Journal of Computer Trends and Technology , 14 (2), 46-50.
SUSE. (2016). The OpenStack solution of choice for business–critical private clouds. Retrieved from SUSE:
https://www.suse.com/products/suse-openstack-cloud/features
The Green Grid. (2016). The Green Grid Data Center Power Efficiency Metrics PUE and DCiE. Retrieved 5
40, 2016, from The Green Grid: http://www.thegreengrid.org/sitecore/content/Global/Content/white-
papers/The-Green-Grid-Data-Center-Power-Efficiency-Metrics-PUE-and-DCiE.aspx
The University of Chicago. (2010). CS 290N: Green Computing . Retrieved 5 30, 2016, from The University of
Chicago: http://people.cs.uchicago.edu/~ftchong/290N-W10/index.html
University of Kent. (2016). Green Computing - CO847 . Retrieved 5 30, 2016, from University of Kent:
https://www.kent.ac.uk/courses/modules/module/CO847
University of Massachusetts Amherst. (2009). CMPSCI 691GC. Retrieved 5 31, 2016, from University of
Massachusetts Amherst: http://lass.cs.umass.edu/~shenoy/courses/fall09/691gc/
CRICOS Provider No. 00103D Insert file name here Page 25 of 26
1 out of 25
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]