Comprehensive Analysis of Big Data Concepts and Technologies
VerifiedAdded on 2022/12/22
|7
|1446
|38
Report
AI Summary
This report provides a comprehensive overview of Big Data, exploring its key features, data processing techniques, and analytical tools. The report begins by defining Big Data and its importance in the current business landscape, emphasizing its role in understanding consumer behavior and making effective business decisions. It then delves into the main features of Big Data, including data processing, analytics, and reporting. The report compares Hadoop with Relational Database systems, providing application scenarios for Hadoop, and explains the MapReduce technique, highlighting its importance and working. Furthermore, it contrasts relational and NoSQL databases, discussing their respective strengths and weaknesses. The conclusion summarizes the report's findings and emphasizes the significance of Big Data in modern data management and analysis.

BIG DATA
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser

Table of Contents
INTRODUCTION...........................................................................................................................3
TASK 1............................................................................................................................................3
The main features of Big Data....................................................................................................3
TASK 2............................................................................................................................................4
Compare Hadoop and Relational Database system and provide application scenario for
Hadoop .......................................................................................................................................4
TASK 3............................................................................................................................................4
Map reduce importance and working..........................................................................................4
TASK 4............................................................................................................................................5
Compare relational and NoSQL databases ................................................................................5
CONCLUSION................................................................................................................................5
REFERENCES ...............................................................................................................................6
INTRODUCTION...........................................................................................................................3
TASK 1............................................................................................................................................3
The main features of Big Data....................................................................................................3
TASK 2............................................................................................................................................4
Compare Hadoop and Relational Database system and provide application scenario for
Hadoop .......................................................................................................................................4
TASK 3............................................................................................................................................4
Map reduce importance and working..........................................................................................4
TASK 4............................................................................................................................................5
Compare relational and NoSQL databases ................................................................................5
CONCLUSION................................................................................................................................5
REFERENCES ...............................................................................................................................6

INTRODUCTION
This report focuses on importance of Big Data in current scenario for organisation and
how it plays crucial role in making effective business decisions. Big data is explained as large
data sets which are used to analyse and understand pattern, trends and figures which can be
utilised to understand consumer behaviour and their buying pattern by organisation. This type of
data is also used by organisation to resolve issues of organisation. This is used by organisation
on day to day basis which assist organisation in maintaining effective and efficient operations in
organisation. This report focuses on different software which are used to store and analyse data.
TASK 1
The main features of Big Data
Big Data help in collecting and analysing large data sets to understand and explore
different patterns and it also focuses on providing insights into problems and new trends and help
organisation in building effective strategies to tackle and resolve issues. This data includes
information about consumer preferences to market trends and help in making effective business
decision for organisation which contributes in growth and development of organisation. The
main features of Big Data are mentioned below:
Data processing – It is explained as collection of data and organising that data to produce
effective results. Data processing involves handling of data by graphs and charts which
help in interpretation of data and help organisation in making effective business
decisions. It also help in extracting and analysing data from different perspectives and it
useful for large set of unstructured data. It help in reducing the cost and maximize the
speed for extracting the useful data from raw information (Ardito and et. al., 2019).
Analytics – Big Data analytics tools offer variety of packages which provides several
options to users and analytics is used to understand and analyse data which is useful for
organisation and help in making effective business decisions. Packages which are
provided by Big Data analytics help organisation in effective decision management.
Reporting Features – Reporting help organisation in keeping their business up to the
mark and assists in gathering effective information on timely basis which assist
organisation in effective process of operations and help in making effective decision for
business of organisation (Wang and et. al., 2018).
This report focuses on importance of Big Data in current scenario for organisation and
how it plays crucial role in making effective business decisions. Big data is explained as large
data sets which are used to analyse and understand pattern, trends and figures which can be
utilised to understand consumer behaviour and their buying pattern by organisation. This type of
data is also used by organisation to resolve issues of organisation. This is used by organisation
on day to day basis which assist organisation in maintaining effective and efficient operations in
organisation. This report focuses on different software which are used to store and analyse data.
TASK 1
The main features of Big Data
Big Data help in collecting and analysing large data sets to understand and explore
different patterns and it also focuses on providing insights into problems and new trends and help
organisation in building effective strategies to tackle and resolve issues. This data includes
information about consumer preferences to market trends and help in making effective business
decision for organisation which contributes in growth and development of organisation. The
main features of Big Data are mentioned below:
Data processing – It is explained as collection of data and organising that data to produce
effective results. Data processing involves handling of data by graphs and charts which
help in interpretation of data and help organisation in making effective business
decisions. It also help in extracting and analysing data from different perspectives and it
useful for large set of unstructured data. It help in reducing the cost and maximize the
speed for extracting the useful data from raw information (Ardito and et. al., 2019).
Analytics – Big Data analytics tools offer variety of packages which provides several
options to users and analytics is used to understand and analyse data which is useful for
organisation and help in making effective business decisions. Packages which are
provided by Big Data analytics help organisation in effective decision management.
Reporting Features – Reporting help organisation in keeping their business up to the
mark and assists in gathering effective information on timely basis which assist
organisation in effective process of operations and help in making effective decision for
business of organisation (Wang and et. al., 2018).
⊘ This is a preview!⊘
Do you want full access?
Subscribe today to unlock all pages.

Trusted by 1+ million students worldwide

TASK 2
Compare Hadoop and Relational Database system and provide application scenario for Hadoop
Hadoop is explained as open source java based framework which is used to store and
process big data. In this, data is stored on commodity servers which runs as clusters and it is
inspired by Google Map reducing programming model.
Relational database is explained as the tool used to organise data and use data which can be
linked, related and based on common data. This database helps in storing information in tables
and focuses on utilising stored data effectively (Choi, Wallace and Wang, 2018).
Relational Database System Hadoop
It is considered as row and column based
databases and used for data storage.
Explained as open-source software which is
used for storage of data and help in running of
application.
Structured data is processed in this type of
system.
Both structured and unstructured data is
processed.
It is considered less scalable in comparison to
Hadoop.
Considered as highly scalable
It help in storing and transforming aggregated
data.
Help in storing huge volume of data.
For this software cost is applicable It considered as free of cost because of open
source.
Hadoop is used in different industries as it help in storing huge amount of data and Hadoop
HDFS assist in processing and transforming data into manageable data. Processing of data is
done by the help of Hadoop by different organisation and it help in managing the stored data
effectively.
Compare Hadoop and Relational Database system and provide application scenario for Hadoop
Hadoop is explained as open source java based framework which is used to store and
process big data. In this, data is stored on commodity servers which runs as clusters and it is
inspired by Google Map reducing programming model.
Relational database is explained as the tool used to organise data and use data which can be
linked, related and based on common data. This database helps in storing information in tables
and focuses on utilising stored data effectively (Choi, Wallace and Wang, 2018).
Relational Database System Hadoop
It is considered as row and column based
databases and used for data storage.
Explained as open-source software which is
used for storage of data and help in running of
application.
Structured data is processed in this type of
system.
Both structured and unstructured data is
processed.
It is considered less scalable in comparison to
Hadoop.
Considered as highly scalable
It help in storing and transforming aggregated
data.
Help in storing huge volume of data.
For this software cost is applicable It considered as free of cost because of open
source.
Hadoop is used in different industries as it help in storing huge amount of data and Hadoop
HDFS assist in processing and transforming data into manageable data. Processing of data is
done by the help of Hadoop by different organisation and it help in managing the stored data
effectively.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser

TASK 3
Map reduce importance and working
Map reduce is explained as technique of processing and this model of program is based
on distributive computing based on Java. Its algorithm contains two different tasks by the name
Map and Reduce (Mikalef and et. al., 2018). Map help in converting one set of data into another
set. Map reduce is considered as programming model which belongs to Hadoop framework and
used to access big data which is stored in Hadoop File system. In map reduce execution, the map
function processes a key value pair which then emits a certain number of key value pairs. Reduce
function processes values which are grouped in the same key and after that it again emits the set
of key value pairs which is considered as output (Tiwari, Wee and Daryanto, 2018). There are
various features of map reduce which are mentioned below:
Scalability as it is considered as one of the highly scalable framework.
Flexibility as it help organisation in accessing new sources of data
It considered as one of the cost effective solution.
Considered fast in emitting output by certain key number pairs.
Map reduce is considered as simple model of programming.
Key value pair of Mapreduce focuses on recoding Hadoop execution. The Map output types
should match the input types of the Reduce as shown below:
Map: (K1, V1) -> list (K2, V2)
Reduce: {(K2, list (V2}) -> list (K3, V3)
Map Input is considered as offset as the key and content of line is considered as text.
Map Output and Map focuses on filtering the data and provide environment to group data on
key.
Key– It is field/ text/ object on which the data groups and aggregates on the reducer.
Value– It is the field/ text/ object which each individual reduces method handles.
Map output is input to reduce and reduce output depends on required output.
Map reduce importance and working
Map reduce is explained as technique of processing and this model of program is based
on distributive computing based on Java. Its algorithm contains two different tasks by the name
Map and Reduce (Mikalef and et. al., 2018). Map help in converting one set of data into another
set. Map reduce is considered as programming model which belongs to Hadoop framework and
used to access big data which is stored in Hadoop File system. In map reduce execution, the map
function processes a key value pair which then emits a certain number of key value pairs. Reduce
function processes values which are grouped in the same key and after that it again emits the set
of key value pairs which is considered as output (Tiwari, Wee and Daryanto, 2018). There are
various features of map reduce which are mentioned below:
Scalability as it is considered as one of the highly scalable framework.
Flexibility as it help organisation in accessing new sources of data
It considered as one of the cost effective solution.
Considered fast in emitting output by certain key number pairs.
Map reduce is considered as simple model of programming.
Key value pair of Mapreduce focuses on recoding Hadoop execution. The Map output types
should match the input types of the Reduce as shown below:
Map: (K1, V1) -> list (K2, V2)
Reduce: {(K2, list (V2}) -> list (K3, V3)
Map Input is considered as offset as the key and content of line is considered as text.
Map Output and Map focuses on filtering the data and provide environment to group data on
key.
Key– It is field/ text/ object on which the data groups and aggregates on the reducer.
Value– It is the field/ text/ object which each individual reduces method handles.
Map output is input to reduce and reduce output depends on required output.

TASK 4
Compare relational and NoSQL databases
Relational databases are explained as data base which helps in storing data in form of
tables and it stores data so it can be used with relation other stored datasets. This type of database
is used by many organisation and in comparison to file database.
NoSQL database is explained as design that provides flexible schemas for the storage of data and
retrieval of data beyond table data of relational databases.
Relational Database NoSQL Database
Focuses on using strong query language. Focuses on using simple query language.
It has fixed schema. No fixed schema of this database.
Focuses on using acid properties. Considered as consistent.
It supports transactions. This database doesn't support transactions.
CONCLUSION
This report concludes about use of Big Data and help in concluding about different
databases which can be used in storage of data and using these databases to for benefit of
organisation and keeping their data effective and useful for long duration.
REFERENCES
Books and Journal
Ardito, L., and et. al., 2019. A bibliometric analysis of research on Big Data analytics for
business and management. Management Decision.
Choi, T. M., Wallace, S. W. and Wang, Y., 2018. Big data analytics in operations management.
Production and Operations Management. 27(10). pp.1868-1883.
Mikalef, P., and et. al., 2018. Big data analytics capabilities: a systematic literature review and
research agenda. Information Systems and e-Business Management. 16(3). pp.547-578.
Compare relational and NoSQL databases
Relational databases are explained as data base which helps in storing data in form of
tables and it stores data so it can be used with relation other stored datasets. This type of database
is used by many organisation and in comparison to file database.
NoSQL database is explained as design that provides flexible schemas for the storage of data and
retrieval of data beyond table data of relational databases.
Relational Database NoSQL Database
Focuses on using strong query language. Focuses on using simple query language.
It has fixed schema. No fixed schema of this database.
Focuses on using acid properties. Considered as consistent.
It supports transactions. This database doesn't support transactions.
CONCLUSION
This report concludes about use of Big Data and help in concluding about different
databases which can be used in storage of data and using these databases to for benefit of
organisation and keeping their data effective and useful for long duration.
REFERENCES
Books and Journal
Ardito, L., and et. al., 2019. A bibliometric analysis of research on Big Data analytics for
business and management. Management Decision.
Choi, T. M., Wallace, S. W. and Wang, Y., 2018. Big data analytics in operations management.
Production and Operations Management. 27(10). pp.1868-1883.
Mikalef, P., and et. al., 2018. Big data analytics capabilities: a systematic literature review and
research agenda. Information Systems and e-Business Management. 16(3). pp.547-578.
⊘ This is a preview!⊘
Do you want full access?
Subscribe today to unlock all pages.

Trusted by 1+ million students worldwide

Tiwari, S., Wee, H. M. and Daryanto, Y., 2018. Big data analytics in supply chain management
between 2010 and 2016: Insights to industries. Computers & Industrial Engineering.
115. pp.319-330.
Wang, Y., and et. al., 2018. An integrated big data analytics-enabled transformation model:
Application to health care. Information & Management. 55(1). pp.64-79.
between 2010 and 2016: Insights to industries. Computers & Industrial Engineering.
115. pp.319-330.
Wang, Y., and et. al., 2018. An integrated big data analytics-enabled transformation model:
Application to health care. Information & Management. 55(1). pp.64-79.
1 out of 7
Related Documents
Your All-in-One AI-Powered Toolkit for Academic Success.
+13062052269
info@desklib.com
Available 24*7 on WhatsApp / Email
Unlock your academic potential
Copyright © 2020–2025 A2Z Services. All Rights Reserved. Developed and managed by ZUCOL.




