A Report on the Uses of Big Data in Business Organizations and Storage

Verified

Added on  2020/06/06

|10
|2823
|174
Report
AI Summary
This report delves into the critical role of big data in contemporary business environments. It explores the various methods of data storage employed by organizations, with a focus on technological advancements and the practices of industry leaders like Google and Microsoft. The report examines the challenges associated with managing and maintaining vast datasets, highlighting the need for improved storage capacity, speed, and flexibility. It also investigates the use of technologies like Apache Hadoop and cloud-based services such as Google BigQuery. Through a qualitative research approach, the report addresses key research questions, including how big data is stored, the technological advancements in data storage, and the strategies employed by major companies. The report also acknowledges research limitations, such as data collection challenges and the vast scope of the topic, and concludes with a time schedule for the research plan. The report provides recommendations for effective data management, emphasizing the importance of data protection.
Document Page
Uses of Big Data
in
Business Organizations
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Table of Contents
INTRODUCTION...........................................................................................................................1
Project Objective.........................................................................................................................1
Project Scope...............................................................................................................................1
Literature Review and Research gap...........................................................................................1
Research Questions/Hypothesis..................................................................................................3
Research Design and Methodology............................................................................................3
Research Limitations...................................................................................................................5
Time Schedule (Research plan)..................................................................................................6
CONCLUSION................................................................................................................................7
REFERENCES................................................................................................................................8
Document Page
INTRODUCTION
In today's world as digitalization is increasing there is great requirement of the data
storage. Large business firms like google are facing this issues at more higher rates as every day
millions of people living in different countries are uploading the data (González, Aliagas and
Valero, 2014). Further, maintaining, updating and regulating the data creates many issues to such
companies. In order to improve the performance, operational activity and take intelligent
decision such entities must improve their data storage capacity, speed, reliability and flexibility
of the system. Moreover, these technologies must be small, compact and should be convenient
for the storage. Present report focus on how big data are stored in the business organizations.
Project Objective
Following are the objectives of the report:
To understand the ways by which data are stored in the organisation.
To analyse the technological advancement in data storage.
To identify the ways by which companies like Google, Microsoft are storing and
maintaining their data.
To recommend new ways by which data storage can be easily manage by such entities.
Project Scope
Following project report is based on the understanding related to large data storage in the
firms like Google and Microsoft. In this, focus on the technology and data storage methods are
shown which are helpful to many IT and non-IT companies. Further, ways are suggested by
which data can be easily accessed, monitored and can be easily updated.
Literature Review and Research gap
Different details associated with the research are mentioned. In this context, here brief
details about the topic is provided where research gap is also stated. Further, research question is
formulated as the beginning of the further analysis.
As per the case study taken, in Google, millions of people are uploading trillions of data
every year which they have to maintain, update and process as per the requirement (Dekker,
Marti-Renom and Mirny, 2013). Further, information in the companies are not fixed and existing
organisations are unable to find the solution to maintain and store such amount of information.
These are the issues faced by every firms. Big data is totally associated with the new aspects
where processing, personnel and technological part are combined. In order to accomplish and
1
Document Page
meet the business and strategic objectives. Moreover, in every organization, they assist in
recognizing the possibilities and helps to increase the gains.
The topic started in the year 2001, where industrialization and globalization has started.
Firms were having more than thousands of the employees and their personal data which they
need to keep secure. There are three main requirement in this, volume, velocity and variety
which was defined by the Dong Laney when they encountered the data management issues
(Joseph and Johnson, 2013, Matteson, 2013, 10 Ways to Use Big Data to Get to Know Your
Customers Better. 2017) There are various things which can be involved in this such as customer
transactional histories, online videos, employees personal data, financial data,production
databases, social media interaction and web traffic logs. As the technological up-gradation has
taken place, Rijmenam has stated that in order to keep such high data there is requirement of the
many new things such as veracity, variability, visualization, and value which must be maintained
the previous data storage system.
These big data are used by the firms to improve their organisational capabilities and
performance. However, entity structure and the ways they are maintaining the data (Size) that
define the uniqueness in the entity (Paul, Pennel and Lemeshow, 2013). It provides the
significant information that provides new opportunities to the companies. Earlier the data storage
was limited where 50 MB data was considered as more than enough but due to technologies now
space of 50TB is considered as small. In order to maintain such data, generally software and
hardware requirement is there. One such software technology is Apache Hadoop which is
commonly related to the Big DATA. Using Cluster formation the software make the data storage
in simplified programming languages. It is most based on the data storage, processing and
specifically organizing the data. It is one of the type there are various other software and
hardware which are used by the organization to store the large data. For example, Google has
launched Google BigQuery which is cloud-based services. Amazon, Cloudera, Hortonworks and
MapR. There is another example where big data are used by the organisation such as Netflix uses
such data for the streaming video services (Raghupathi and Raghupathi, 2014).
There are various other illustration which can be shown that big data storage is very
important in the company. The example is about the Starbucks company where they were
launching there new products wanted to know the customer experience. In order to get the
customer viewpoint regarding the coffee they monitored the blogs, feedbacks on personal sites,
2
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
and their reaction. The outcomes came that people enjoyed the coffee but they though it to be
little expensive. When they lowered the prices, positive feedbacks and comments were all over.
Such technologies are so helpful for the companies that they can easily access the information in
more reliable and faster ways. (Wixom and et. al., 2014) Further, these are used by the
organization to target the customers, optimising business process, increase personal
quantification, performance optimisation, improving healthcare, public health, optimising
machine, device performance, improving security, law enforcement and financial trading and
many more. If firms can manage these data in more effective ways they can gain more success in
the market but data protection is an important aspects.
Research Questions/Hypothesis
Primary Question How big data are stored in the business organizations?
Secondary Question
Explain the ways by which data are stored in the organisation?
Analyse the technological advancement in data storage?
Identify the ways by which companies like Google, Microsoft are storing and
maintaining their data.
Explore and find new ways by which data storage can be easily manage by such entities?
Research Design and Methodology
It is the important part of the research project as it details about all the information related
to tools, techniques and methods. Current report is based on the qualitative research approach
and for this following information which are included in it are explained in brief. Following are
the things that will be included in the report in the methodological section:
Research Approaches:
In order to accomplish the research in appropriate and effective manner there is
requirement of the better approach. Effective decision related to the research can be taken by
using the better techniques and tools (Gandomi and Haider, 2015). Such factors helps to reduce
the errors and assist carrying out of methodology being selected. There are two methods by
which research approaches can be applied in the research, inductive and deductive. In first, tools
that are used are the theories and patterns that are selected over topic. However, another one is
based on the results gained from general to specific. In the present report, inductive approach is
3
Document Page
used where themes were made and answer to the question were achieved. The other approach
was not chosen as specific outcomes are not required for the study.
Research Sampling Methods:
It is the way by which participants of the research are selected from range of population
from where data is to be collected in order to complete all the findings. It is required to
accomplish objectives and find answer to the questions of the research. Appropriate sample size
is also required to maintain the reliability of the report. These are of two types probabilistic and
non probabilistic. In the existing report sampling techniques are used where respondents are
selected from members of the big organisation such as google where data storage is more
required. There are various methods of sampling but here probabilistic method is used in which
Systematic sampling is used by the researcher. It is chosen people from such companies are busy
and they provide very little time to make enquiry.
Data Collection Method:
One of the important part of the research where different sources are mentioned. These
are completed so as it aids to the expected findings. Answer to all the research question can be
collected through relevant sources and researches (Raghupathi and Raghupathi, 2014). There are
two ways by which a researcher can collect the data. These are primary and secondary methods,
in which in the first one the data is collected through surveys, interviews and observation and in
second the data is collected through articles, books, journals, online sites that are associated with
the research. In the current report both the methods are used for the data collection methods.
Survey was done with the people associated with companies having large data. Along with this,
secondary data is collected through the above mentioned places. Moreover, the information are
carried out in effective manner.
Variables Specifications:
Variable specification are the part where general information from where details have
been sourced are mentioned. Here relevant key words are mentioned which are helpful in finding
of the research. Following are the key words that are searched in the report, these are mentioned
below: Global data storage, Technology in organisation, Google data storage system, Microsoft
data storage system. Further relevant sources were used so that information can be gained in
effective and efficient way.
4
Document Page
Reliability and Validity: Reliability can be gained in the research through achieving the similar
answer more than one time by using various instruments. In order to have the reliability there is
requirement of the steady and stable results. There are 3 categories which secure the reliability of
the research. These are Internal Consistency, Parallel forms, Inter-rater and Test-retest,
Reliability (Abbasi, Sarker and Chiang 2016). Validity is based on the what is intended to be
measure. There are two types of the validity, internal and external. In the entire research, these
two words are taken care for example data analysis, collection and sampling are the areas where
information are collected in more appropriate ways. To maintain the reliability, secondary data is
collected through the professional authors and researches that are conducted after 2013. Every
aspect to maintain the reliability and validity are accomplish in the report.
Research Limitations
For conducting any research project the researcher must identify all the research
limitation and should try to minimize them in all possible methods. This is because at the time of
conducting the research, there are chances of limitation which can impact directly on the overall
research. In the above report following are the major limitation faced by the author is to select
among the data collection methods as there are various ways by which data can be collected by
which there is increase in the scope and depth analyses. Further, cost is another issues faced as
collecting data from such companies requires acquaintance in the company. Additionally,
companies prevent such private data so that they do not face the security issues in related to
information storing. As the topic is vast so there are chances that all the literature based on the
current topic may not have been covered. Along with this, as research is based on the primary
and secondary data collection methods so in order to accomplish the objectives in efficient
manner, excessive time and cost is required. Hence, these are major limitation that are associated
with the research and by analysing and understanding the issues corrective actions can be taken
easily and in more suitable ways.
5
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Time Schedule (Research plan)
In order to prepare the research plan, Gantt Chart is used that elaborates how all the
activities in the studies will be carried along with time period consumed.
Activity/Week 1 2 3 4 5 6 7 8 9 10 11 12
Research proposal
Composition of Objectives of
research
Writing the Project Scope
Summary of the literature
review
Research Question/ hypothesis
Research Design and
methodology
Research Limitation
Conclusion
6
Document Page
CONCLUSION
The report comprise of the details related to the Big Data in the organisation and various
ways through which its data is collected in the firms. Brief about the topic is provided in the LR
section along with research objectives and scope. Moreover, methodologies, limitation and time
schedule for the report is evaluated in the report.
7
Document Page
REFERENCES
Books and Journals
González, A., Aliagas, C. and Valero, M., 2014, June. A data cache with multiple caching
strategies tuned to different types of locality. In ACM International Conference on
Supercomputing 25th Anniversary Volume (pp. 217-226). ACM.
Dekker, J., Marti-Renom, M. A. and Mirny, L. A., 2013. Exploring the three-dimensional
organization of genomes: interpreting chromatin interaction data. Nature reviews.
Genetics. 14(6). p.390.
Paul, P., Pennell, M. L. and Lemeshow, S., 2013. Standardizing the power of the Hosmer–
Lemeshow goodness of fit test in large data sets. Statistics in medicine. 32(1). pp.67-80.
Raghupathi, W. and Raghupathi, V., 2014. Big data analytics in healthcare: promise and
potential. Health information science and systems. 2(1). p.3.
Wixom, B. and et. al., 2014. The current state of business intelligence in academia: The arrival of
big data. CAIS. 34. p.1.
Gandomi, A. and Haider, M., 2015. Beyond the hype: Big data concepts, methods, and analytics.
International Journal of Information Management. 35(2). pp.137-144.
Raghupathi, W. and Raghupathi, V., 2014. Big data analytics in healthcare: promise and
potential. Health information science and systems. 2(1). p.3.
Abbasi, A., Sarker, S. and Chiang, R. H., 2016. Big Data Research in Information Systems:
Toward an Inclusive Research Agenda. Journal of the Association for Information
Systems. 17(2).
Joseph, R. C. and Johnson, N. A., 2013. Big data and transformational government. IT
Professional. 15(6). pp.43-48.
Online
Matteson, S., 2013. Big Data basic concepts and benefits explained. [Online]. Available
through:<http://www.techrepublic.com/blog/big-data-analytics/big-data-basic-concepts-
and-benefits-explained/>. [Accessed on 28th August 2017].
10 Ways to Use Big Data to Get to Know Your Customers Better. 2017. [Online]. Available
through : <https://www.wired.com/insights/2013/07/10-ways-to-use-big-data-to-get-to-
know-your-customers-better/>. [Accessed on 28th August 2017].
8
chevron_up_icon
1 out of 10
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]