ArticlePDF Available

Abstract

Big Data helps facilitate information visibility and process automation in design and manufacturing engineering. It also helps analyze trends through analytics and predict inventory, manufacturing output and equipment lifespan and cycles, etc. This paper introduces Big Data, its characteristics and a number of issues of Big Data in design and manufacturing engineering. These issues include design and manufacturing data, Big Data benefits and impacts and its applications and opportunities. Methods, technologies and some technology progress around Big Data are presented in this study. General challenges of Big Data and Big Data challenges in design and manufacturing engineering are also discussed.
© 2015 Lidong Wang and Cheryl Ann Alexander. This open access article is distributed under a Creative Commons
Attribution (CC-BY) 3.0 license.
American Journal of Engineering and Applied Sciences
Original Research Paper
Big Data in Design and Manufacturing Engineering
1Lidong Wang and 2Cheryl Ann Alexander
1Department of Engineering Technology, Mississippi Valley State University, USA
2Technology and Healthcare Solutions, Inc., USA
Article history
Received: 27-03-2015
Revised: 25-04-2015
Accepted: 05-05-2015
Corresponding author:
Lidong Wang
Department of Engineering
Technology,
Mississippi Valley State
University, USA
E-mail: lwang22@students.tntech.edu
Abstract: Big Data helps facilitate information visibility and process
automation in design and manufacturing engineering. It also helps
analyze trends through analytics and predict inventory, manufacturing
output and equipment lifespan and cycles, etc. This paper introduces
Big Data, its characteristics and a number of issues of Big Data in
design and manufacturing engineering. These issues include design
and manufacturing data, Big Data benefits and impacts and its
applications and opportunities. Methods, technologies and some
technology progress around Big Data are presented in this study.
General challenges of Big Data and Big Data challenges in design and
manufacturing engineering are also discussed.
Keywords: Big Data, Design, Manufacturing, Sensors, Big Data Analytics,
Machine Learning, Data Mining, Map Reduce Algorithm, Hadoop
Introduction
Big Data and Characteristics
The McKinsey study defines Big Data as “datasets
whose size is beyond the ability of typical database
software tools to capture, store, manage and analyze.
Big Data in many sectors ranges from a few dozen
Terabytes (TB: Approximately 1012 bytes) to multiple
Petabytes (PB: Approximately 1015 bytes) (Minelli et al.,
2013). It is too big, moves too fast, or does not fit the
strictures of conventional database architectures
(Dumbill, 2013). Characteristics (Russom, 2011;
Eaton et al., 2012; ORRT, 2012; Zikopoulos et al.,
2011; Demchenko et al., 2013) of Big Data can be
categorized into “6 Vs”. They are: Volume, Velocity,
Variety, Value, Variability and Veracity.
Volume
It means data size such as Terabytes (TB), Petabytes
(PB), Exabytes (EB: Approximately 1018 bytes),
Zettabytes (ZB: Approximately 1021 bytes) and
Yottabyte (YB: Approximately 1024 bytes).
Velocity
This relates to how frequently the data is generated. It
can be batch, near real time, real time, or streams.
Variety
It represents all types of data such as streamed
video, streamed audio and Radio Frequency
Identification (RFID) sensor readings. The data type
can be structured, unstructured, or semi-structured
(Bellini et al., 2013; IWT, 2014). Structured data has
fixed fields such as spreadsheets or relational
databases; unstructured data does not reside in fixed
fields-text from articles, email messages, untagged
audio or video data, etc.; and semi-structured data
does not reside in fixed fields, but it uses tags or other
markers to capture elements of the data such as
Extensible Markup Language (XML) and Hyper Text
Markup Language (HTML)-tagged text (Nedelcu,
2013). Data in variety or different formats makes data
integration difficulty or very expensive.
Value
It is defined by the added-value that the collected
data can bring. It refers to the value that the data adds
to creating knowledge. There is some valuable
information somewhere within the data. The valuable
information is golden data if it is extracted, although
most of the pieces of data individually may seem
valueless. Big Data consists of hidden gold (high-
valued data) mixed with dirty (noise, erroneous and
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
224
raw) data. Big Data technologies can process massive
amounts of dirty data and extract the gold information
from it. Data value is related to data volume and data
variety. The economic value of different data varies
depending upon both the source and its end use
(Zaslavsky et al., 2012; Megahed and Jones-Farmer,
2013; Rajpathak and Narsingpurkar, 2013).
Variability
This refers to the fact that data can be changed at
times. It also means data unpredictability and how data
may change. Increasing variety and variability also
increases the attractiveness of data and the potentiality in
providing unexpected, hidden and valuable information
(Bellini et al., 2013).
Veracity
Veracity (Dove et al., 2012; Megahed and Jones-
Farmer, 2013; IWT, 2014) includes two aspects: Data
consistency (or certainty) and data trustworthiness. Data
can be in doubt: Uncertainty due to data inconsistency
and incompleteness, ambiguities, latency, deception and
model approximations. The following aspects help
ensure data veracity:
Integrity of data and linked data (e.g., for complex
hierarchical data, distributed data)
Data authenticity and (trusted) origin
Identification of both data and source
Computer and storage platform trustworthiness
Availability and timeliness
Accountability and reputation
Big Data is often dynamic, heterogeneous, inter-
related, noisy and untrustworthy. However, even
noisy Big Data could be more valuable than tiny
sample data because general statistics obtained from
frequent patterns and correlation analysis usually
overpowers individual fluctuations. Interconnected
Big Data forms large heterogeneous information
networks; therefore, information redundancy can be
explored to compensate for missing data, validate
trustworthy relationships, crosscheck conflicting
cases, disclose inherent clusters and uncover hidden
models and relationships (Agrawal et al., 2011).
Big Data means more information, but it also means
more false information. Its focus is on correlations, not
causality. It is about what, not why (Bottles et al., 2014).
In addition, the data we consider big today may not be
considered big tomorrow because of the advances in data
processing, storage and other system capabilities
(Zaslavsky et al., 2012).
Design and Manufacturing Engineering Data
Industrial operations and systems often produce a
continuous stream of sensor data, event data and
contextual data through sensors, smart machines and
instrumentation. In a factory, data sources possibly
include Computer-Aided Design (CAD) models,
Computer-Aided Manufacturing (CAM) models,
Computer-Aided Engineering (CAE) models, sensors,
instruments, Internet transactions and simulations. The
data is often large, fast-moving and complex. The data
is often in large variety, including documents, test data,
product failure data, CAD/CAM/CAE data,
unstructured CAD drawings and specifications and
product and process performance data, etc. The
increasing volume of data with different types needs to
be stored, managed and analyzed. Big Data
technologies, driven by innovative analytics, can
process large sets of heterogeneous data; and help
extract value and hidden knowledge from large and
diverse data streams (Noor, 2013; Rajpathak and
Narsingpurkar, 2013; Dayal et al., 2014).
Some examples of Big Data in manufacturing are
shown in Table 1. Structured data has the advantage of
being easily entered, stored, analyzed and queried.
Examples include manufacturing data stored in relational
databases, data from manufacturing execution systems
and data from enterprise systems. Unstructured data such
as log files and human-operator-generated shift reports
may be in a raw format that requires decoding before
data values can be extracted. Semi-structured data does
not conform to the models of relational databases or
other data tables, but contains tags or other markers to
separate semantic elements and demonstrate hierarchies
of field sand records (IC, 2014).
Table 1. Manufacturing data examples (IC, 2014)
Structured data Unstructured data Real-time, semi-structured data
Spreadsheets Operator shift reports RFID
Relational databases Machine logs, error logs XML
Enterprise data warehouse Texts, images, audio/video Machine builder standards like
Files stored in manufacturing PCs Manufacturing collaboration social platforms Sensors (vibration, pressure, valve and
acoustics), relays
Manufacturing historians (time series data
structures)
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
225
Benefits and Impacts of Big Data in Design
and Manufacturing Engineering
Mining Big Data can offer benefits in design, such as
better detection of defects in design and design
improvement, saving design time and costs, fast
response to market, developing innovative products and
matching customers’ needs and gaining customer
satisfaction through extracting crucial customer
requirements from the customer-generated data to update
and refine existing designs (Wu et al., 2015).
The manufacturing sector generates a great deal of
text and numerical data in product development
processes. Big Data offers the following benefits in
manufacturing (Brown et al., 2011; McGuire et al.,
2012; Nedelcu, 2013; Noor, 2013; Wu et al., 2015):
Defect tracking and product quality:
Perform predictive diagnostics for product/part failure
Monitor product data quality
Early detect quality problems
Better detect product defects
Provide real-time alerts based on analyzing
manufacturing data
Reduces defects during manufacturing processes
by tracking every detail about every part that goes
into a product
Boost quality
Improvements in supply planning:
Unlock significant value and unearth valuable
insights by performing Big Data analytics and
making information transparent
Better forecast products, production and
manufacturing output
Better forecast sales volumes through semantic-
based Big Data analytics
Improve relationship with suppliers and conduct
better contract negotiations according to collected
supplier performance data
Improve decision-making and minimizes risks in
supply
Improved product manufacturing processes:
Provide an infrastructure for transparency in
manufacturing
Analyze sensor data from production lines, creating
self-regulating processes that cut waste, avoid
costly (and sometimes dangerous) human
interventions and ultimately lift output
Better monitor and control manufacturing
processes by tracking every detail about every part
and procedure, better information visibility and Big
Data analytics for data in motion
Perform predictive manufacturing and optimize
manufacturing processes
Better simulate and test new manufacturing
processes
Driven efficiency across the extended enterprise:
Increase the efficiency of the manufacturing
processes
Increase energy efficiency
Enable effective and consistent collaboration
through integrating datasets from multiple systems
and divisions
Facilitate innovative design for manufacturing and
integration of CAD/CAE/CAM; reduce
unnecessary iterations in product development
cycles; and finally reduce production and
development costs
Offer further opportunities to accelerate product
development; increase product innovation and
development of next-generation products
Improved service:
Determine what manufacturing parameters most
influence customer satisfaction
Develop new products and make products better
match customers’ needs through sentiment analysis
and recommendation systems for Big Data
Enable mass-customization in manufacturing
Better correlate manufacturing and business
performance information together
Reduce warranty costs through warranty analysis
based on Big Data analytics
Better perform remote intelligent services
Table 2 (MKGI, 2010) shows the degree of potential
benefits that Big Data could generate in the areas of
greatest benefits for manufacturing/operations.
Applications and Opportunities of Big Data
in Design and Manufacturing Engineering
Big Data in General Electric, General Motors and
the Automotive Industry
Some manufacturing firms, such as General
Electric, view Big Data from sensors in manufactured
products (e.g., locomotives, jet engines and gas
turbines in GE’s case) as key to effective and efficient
servicing strategies. In the same mode, automobile
manufacturers such as General Motors created self-
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
226
driving cars based on the analysis of Big Data from
sensors and machine vision technologies (Davenport,
2013). Big Data has become the key asset for the whole
production and manufacturing cycle, as well as the
provision of services in the automotive and mobility
space. Big Data is actually at heart of how the extracted
sensor data and location data are combined to provide
services (Camilli and Duisberg, 2013).
Big Data in Semiconductor Manufacturing and
Integrated Circuits
Semiconductor companies have seen significant
opportunities for Big Data and analytics to optimize
semiconductor manufacturing (Hattori, 2013). For higher
product quality, semiconductor companies conduct
extensive tests and collect terabytes of data.
Semiconductor vendors mine Big Data for product
quality (Neison, 2014).
With the continuous shrink of integrated circuits in
feature sizes due to nanotechnologies, extracting
manufacturing information and mining valuable
intelligence from automatically collected Big Data in the
wafer fabrication facilities for real-time decisions and a
higher yield has become very important to support
intelligent manufacturing, enhance the service quality
and maintain competitive advantages for high-tech
companies in the global competition (Hsu et al., 2012).
Big Data at Work for a Missile Plant
A missile plant (Fig. 1), for instance, Raytheon Corp.
in Huntsville, Alabama, USA, monitors its assembly
operations down to the turn of ascrew. If a screw in a
missile fails to complete its full count of turns, Raytheon
will know about it immediately and be able to take
corrective steps. Raytheon’s monitoring technology is
often called “Manufacturing Execution Software
(MES),” and several manufacturers have used MES to
collect and analyze factory-floor data. The systems
enable the real-time control of multiple elements of the
production process (Noor, 2013).
Table 2. Greatest benefit areas for manufacturing/operations; (1: No benefits; 3: Moderate benefits; 5: Very high benefits)
Areas of greatest benefit for manufacturing/operations Degree of benefits
Product quality/defect tracking 3.37
Supply planning 3.34
Manufacturing process defect tracking 3.32
Supplier/supplier component/parts defect tracking 3.11
Collecting supplier performance data to inform contract negotiations 3.08
Forecasting manufacturing output 3.03
Increasing energy efficiency 2.97
Simulation and testing of new manufacturing processes 2.88
Enable mass-customization in manufacturing 2.75
Fig. 1. Big Data at a missile plant (Noor, 2013)
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
227
Big Data in Cloud-based Design and
Manufacturing
Cloud-Based Design Manufacturing (CBDM) is a
service-oriented networked product development model.
Based on this model, service consumers can configure,
select and utilize customized product realization
resources and services. CBDM uses the Internet of
Things (IoT) (e.g., RFID), smart sensors and wireless
devices (e.g., smart phone) to collect real-time design-
and manufacturing-related data. IoT allows engineers to
have access to data such as equipment condition,
machine utilization and the percentage of defective
products from any location. Engineers can use Big Data
analytics for forecasting, automation and proactive
maintenance (Wu et al., 2015).
Technology Integration Based on Big Data for
More Value
Manufacturing sector generates data from a multitude
of sources such as instrumented production machinery
(process control). Most manufacturing companies have
Information Technology (IT) systems to manage the
product data generated via CAD, CAM, CAE and
Product Development Management (PDM) systems.
However, the large datasets generated by these systems
often remain trapped within their respective systems.
Manufacturers can create a significant opportunity to
create more value through effective and consistent
collaboration, the integration of datasets from multiple
systems and Big Data analytics for the integrated
datasets (Nedelcu, 2013).
Medical Device Design and Manufacturing
Simulation-based engineering methods involve
Finite Element Analysis (FEA), Finite Difference
Analysis (FDA), Computational Fluid Dynamics
(CFD) and/or multi-physics simulations to work
toward an optimal design. Advancements in medical
device design and manufacturing that is data-driven
and simulation-based have drawn upon and combined
emerging work in the areas of regulatory science and
Big Data. Big Data technologies enable a broader set
of device materials, anatomical configurations,
delivery methods and tissue interactions to be
evaluated. Computational methods and Big Data
technologies can play an important role in medical
device design and manufacturing (Erdman and Keefe,
2013).
Big Data and Additive Manufacturing
Understanding the use and implications of Big Data
and predictive analytics will be very important as
additive manufacturing (also called 3D printing) makes
traditional models of production, distribution and
demand obsolete in some product areas. Some
companies no longer need to invest in dedicated,
expensive milling machines or injection molding
equipment. A single 3D printer can produce a wide range
of parts-without expensive dies and jigs. People can even
modify designs to meet special, one-of-a-kind needs and
then use an affordable 3D printer to “manufacture” parts
(Waller and Fawcett, 2013b).
Production Process Monitoring, Maintenance,
Quality Assurance and Logistics for Manufacturers
Production sensors generate vast volume of data at
sub-second speeds. Big Data and advanced analytics
analyze this vast amount of data to monitor production
processes and identify when an event will affect
production quality and when maintenance is required,
before production quality is actually affected. Big
Data provides the ability to ensure Quality Assurance
(QA) tests are confirmations of high product quality
(Software AG, 2013).
Table 3 (Waller and Fawcett, 2013a) provides some
examples of potential applications of Big Data within
logistics for manufacturers.
Big Data in CAD/CAE/CAM and CAD Educational
Assessment
Big Data working with3D software in
CAD/CAE/CAM can greatly help companies, especially
companies in the aerospace industry (e.g., the areas of
aeronautics and astronautics) because these companies
have struggled to manage the constantly growing volume
of data. The datasets are large, complex and often fast-
moving. It means that these datasets are often in large
volume, velocity, variability and variety. A lot of these
datasets are unstructured, for example, CAD drawings
and CAD/CAE/CAM specifications. Big Data and
analytics are powerful in processing and managing these
kinds of heterogeneous information, improving data
veracity and creating more values.
Table 3. Examples of potential applications of Big Data in logistics for manufacturers
Forecasting Early response to extremely negative or positive customer sentiment
Inventory management Reduction in shrink, efficient consumer response, quick response and vendor managed inventory
Transportation management Improved notification of delivery time and availability; surveillance data for improved yard management
Human resources More effective monitoring of productivity; medical sensors for safety of labor in factories
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
228
A computational method that is based on time series
analysis was proposed to assess engineering design
processes using a CAD tool. Educational data mining
and learning analytics were studied to assess student
performance in learning and designing in a project-based
setting. The time series process data can be as fine-
grained as the ‘atomic’ design steps (meaning that they
cannot be logically divided further). Data at this level of
granularity has all the four characteristics of Big Data
(IBM, 2012; Xie et al., 2014a):
High volume: A large amount of process data in a
complex open-ended project
High velocity: The data can be collected, processed
and visualized in real time to instantaneously
provide to students and teachers
High variety: Many types of information provided
by a rich CAD system such as all the learner
actions and artifact properties
High veracity: The data must be accurate and
comprehensive to ensure fair and trustworthy
assessments of student performance
CAD logs are instructionally sensitive and can serve
as an effective instrument for assessing complex
engineering design processes. High-volume and high-
variety software logs can be used to detect the effects of
what happens outside the computer on individual
students. CAD logs were used in performance
assessment because their fine-grained, temporal nature
can provide more comprehensive, more reliable and
more personalized process data for finding evidence of
deep learning related to design creativity and problem
solving. Deep learning generates a large amount of
datasets. Big Data can be used to analyze and visualize
these large datasets (Xie et al., 2014b).
‘Mobile Access to CAD’ is a growing with above
average importance and usage. ‘Cloud Based CAD’
currently has low average importance and below average
usage. ‘Big Data’ is not yet ‘big’ in CAD-very low
awareness (Turner, 2014).
Methods, Technologies and Technology
Progress around Big Data
Big Data analytics uses analysis algorithms running
on powerful supporting platforms to uncover potentials
concealed in Big Data, such as unknown correlations or
hidden patterns (Hu et al., 2014). Advanced modelling,
analysis, feedback and visualization are the techniques of
Big Data analytics. These techniques help manufacturing
companies eliminate waste and create value in the design
and production of the products (Papanagnou, 2014). In
addition, data mining, text mining, opinion mining,
social network analysis, cluster analysis are Big Data
analytics methods (Cho and Hwang, 2015).
Machine learning has been used in Big Data.
Massive Parallel-Processing (MPP), distributed file
systems and cloud computing, etc. are supporting
technologies of Big Data (Zaslavsky et al., 2012).
Besides general cloud infrastructure services (storage,
compute, infrastructure/Virtual Machine (VM)
management), the following services are also required
to support Big Data (Turk, 2012):
Cluster services
Hadoop related services and tools
Specialist data analytics tools (logs, events, data
mining, etc.)
Databases/Servers SQL, NoSQL
MPP (Massively Parallel Processing) databases
Registries, indexing/search, semantics, namespaces
Security infrastructure (access control, policy
enforcement, confidentiality, trust, availability,
privacy)
Hadoop is an open source framework for writing and
running distributed applications that are capable of batch
processing large sets of data. Hadoop framework is
mainly known for Map Reduce and its distributed file
system. The Map Reduce algorithm consists of two basic
operations: map and reduce. It is a distributed data
processing model that runs on alarge cluster of
machines. Hadoop includes three parts: Hadoop
Distributed File System (HDFS), Hadoop Map Reduce
and Hadoop Common (Chardonnens, 2013).
To process and manage Big Data with parallel and
distributed data mining algorithms, a Cloud-Based
Design Manufacturing (CBDM) system should
employan open-source software/programming
framework that supports data-intensive distributed
applications (Ren et al., 2012). Map Reduce, a
parallel programming model, is one of widely used
programming models in cloud computing. It enables
CBDM systems to process large datasets. Hadoop is
one of the open source implementations of the Map
Reduce model. Hadoop divides computationally
extensive tasks into small fragments of work and each
work unit is processed on a computer node in a
Hadoop cluster (Dean and Ghemawat, 2008).
New methods for compact visualization of data
with ranging variety and veracity are constantly being
developed to present correlations across bases more
effectively. Visualization may be standalone or may
cross-filter with other views across feature bases.
Visualization may be measured in many ways,
including the bases spanned continuously, number of
points drawn, level of over plot and
precision/correctness (Schwartz et al., 2014).
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
229
Challenges of Big Data
General Challenges of Big Data
Traditional Statistical Process Control (SPC) methods
only focus onnumeric datasets. However, most Big
Data applications are required to process non-numeric
data obtained from different databases. The modeling
of this kind of data is often based on disciplines that
are not in the areas of statisticians and quality
engineers. As for text data, models draw from
linguistic sciences, computer science and psychology.
The models maybe integrate data input in different
languages. The arrival rate of the data fluctuates
depending on factors that are often not understood before
analyzing it. This phenomenon is called trending/viral
for online content (Megahed and Jones-Farmer, 2013).
Big Data has challenges in capture, storage, search,
analysis and virtualization (Zaslavsky et al., 2012).The
specific information of some general challenges is
provided as follows:
Challenges in Big Data management can be
categorized into two types: Engineering and
semantic. Engineering challenges lie in performing
data management activities such as query and
storage efficiently. Semantic challenges lie in
extracting the meaning of the information from
massive volumes of unstructured data, even dirty
data (Bizer et al., 2012)
It is difficult to collect and integrate data with
scalability from distributed locations because of the
variety of disparate data sources and the sheer
volume (Hu et al., 2014)
Data quality is a focal point. During the process
of data capture, sources of data are often
heterogeneous, geographically distributed and
unreliable, being susceptible to errors. Therefore,
a number of data preprocessing techniques, such
as data cleaning, data reduction, data integration
and data transformation, are often used to remove
noise and correct inconsistencies (Han and
Kamber, 2006)
Big Data systems manage and store the gathered
heterogeneous and massive datasets, while
providing function and performance guarantee, in
terms of scalability, fast retrieval and privacy
protection (Hu et al., 2014). Privacy and
information security are concerns in Big Data
Exponential growth of data volume is generated
from different instruments and/or collected from
sensors; it is necessary, but not easy to consolidate
e-Infrastructures as persistent platforms to ensure
continuity and cross-disciplinary collaboration
(Demchenko et al., 2012)
Big Data Challenges in Design and Manufacturing
Engineering
It is important and necessary to integrate
CAD/CAE/CAM and cyber-physical systems with Big
Data systems to make design and manufacturing more
competitive. How to fulfil the integration and how to
extract the right information from Big Data in design
and manufacturing for the right purpose at the right
time are major challenges.
Table 4. Big Data challenges for manufacturing; (1: Not at all a challenge; 3: Moderate challenge; 5: Very high challenge)
Areas of greatest challenges for manufacturing/production Degrees of challenges
Building high levels of trust between data scientists who present insights on Big Data and 3.31
functional managers
Determining what data to use for different business decisions 3.29
Being able to handle the large volume, velocity and variety of Big Data 3.25
Getting business units to share information across organizational silos 3.22
Finding the optimal way to organize Big Data activities in a company 3.20
Getting functional managers to make decisions based on Big Data, rather than on intuition 3.14
Putting the analysis of Big Data in a presentable form for making decisions 3.12
Getting top management in a company to approve investments in Big Data and is related investments 3.11
Determining what to do with the insights that are created from Big Data 3.09
Getting the IT function to recognize that Big Data requires new technologies and new skills 3.08
Finding and hiring data scientists who can manage large amounts of structured and 3.02
unstructured data and create insights
Determining which Big Data technologies to use 3.02
Keeping the data in Big Data initiatives secure from external parties 2.98
Understanding where in a company people should focus Big Data investments 2.98
Reskilling the IT function to be able to use new tools and technologies of Big Data 2.95
Other 2.80
Keeping the data in Big Data initiatives secure from internal parties 2.71
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
230
Machine learning is an important method of Big
Data; however, it has challenges in implementation. One
challenge is the availability of the right data from
different operations and processes. Machine systems
such as Programmable Logic Controllers (PLC) and
Supervisory Control And Data Acquisition (SCADA)
often capture a lot of machine data, but this data may not
be relevant. PLC and SCADA do not store all the data
that is required for a Big Data predictive analytics
solution based on machine learning (Joseph et al., 2014).
Other major challenges of Big Data in design and
manufacturing include building high levels of trust
between data scientists and managers; confidence in
analyzing and managing data with large volume, velocity
and variety; deciding what methods and technologies
will be used; and maintaining the consistency of
managing and using Big Data, etc. (Nedelcu, 2013).
Table 4 (MKGI, 2010) shows some areas of greatest
challenge for manufacturing/production and degrees of
these challenges.
Conclusion and Future Research
Big Data is large in volume, velocity, variety, value,
variability and veracity. Big Data helps integrate various
types of datasets in design and manufacturing
engineering; uncover hidden correlation patterns through
analytics; improve design and production processes; and
create more values. Mining Big Data helps improve
design in quality, time, costs and mass-customization.
Big Data also offers greatest benefits for manufacturing
engineering such as detecting product defects, boosting
quality and improving supply planning, etc. It has had a
lot of applications or great opportunities in design and
manufacturing engineering. These applications or
opportunity areas include electricity, automotive, missile
plants, integrated circuits, semiconductor manufacturing,
additive manufacturing, medical device design and
manufacturing and cloud-based design and
manufacturing, etc. Generally, Big Data has challenges
such as data capture, date integration, data visualization,
extracting values from all of heterogeneous data and
privacy and information security, etc. Specifically in
design and manufacturing engineering, besides the above
challenges, other major challenges lie in: Trust between
data scientists and managers; confidence in analyzing big
data, choice of methods and technologies and
consistency of managing and using big data, etc. All the
aspects of these challenges can be future research. The
authors of the paper will focus on Big Data in
CAD/CAE/CAM of medical devices as further research.
Acknowledgment
This study was supported in part by Technology and
Healthcare Solutions, Inc. in Mississippi, USA. No
conflict of interest to disclose.
Funding Information
The authors have no support or funding to report.
Author’s Contributions
All authors equally contributed in this work.
Ethics
This article has not been published elsewhere. The
corresponding author confirms that both authors have
read and approved the manuscript and no ethical
issues involved.
References
Agrawal, D., P. Bernstein and E. Bertino, 2011.
Challenges and opportunities with Big Data. Purdue
University Purdue e-Pubs.
Bellini, P., M. di Claudio, P. Nesi and N. Rauch,
2013.Tassonomy and Review of Big Data
Solutions Navigation. In: Big Data Computing,
Akerkar, R. (Ed.), CRC Press,
ISBN-10: 1466578386, pp: 57-101.
Bizer, C., P. Boncz, M.L. Brodie and O. Erling, 2012.
The meaningful use of Big Data: four perspectives--
four challenges. SIGMOD Rec., 40: 56-60.
DOI: 10.1145/2094114.2094129
Bottles, K., E. Begoli and B. Worley, 2014.
Understanding the pros and cons of Big Data
analytics. Physic. Executive, 40: 6-12.
PMID: 25188972
Brown, B., M. Chui and J. Manyika, 2011. Are you
ready for the era ofBig Data’? McKinsey Global
Institute.
Camilli, R. and A. Duisberg, 2013. Big Data and the
automotive industry. Report, Bird and Bird.
Chardonnens, T., 2013. Big Data analytics on high
velocity streams. MSc Thesis, University of
Fribourg.
Cho, Y. and S. Hwang, 2015. A study on LBS technique
makes for CRETA LBS platform service. Int. J.
Signal Process. Syst., 3: 82-87.
Davenport, T., 2013. The future of the manufacturing
workforce, report three: Contemporary
manufacturing technologies and the talent to use
them. Technical Report, Manpower Inc.
Dayal, U., M. Akatsu, C. Gupta, R. Vennelakanti and M.
Lenardi, 2014. Expanding global Big Data solutions
with innovative analytics. Hitachi Rev., 63: 333-339.
Dean, J. and S. Ghemawat, 2008. MapReduce:
Simplified data processing on large clusters.
Commun. ACM, 51: 107-113.
DOI: 10.1145/1327452.1327492
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
231
Demchenko, Y., Z. Zhao, P. Grosso, A. Wibisono and
C. de Laat, 2012. Addressing Big Data challenges
for scientific data infrastructure. Proceedings of
the IEEE 4th International Conference on Cloud
Computing Technology and Science, Dec. 3-6,
IEEE Xplore Press, Taipei, pp: 614-617.
DOI: 10.1109/CloudCom.2012.6427494
Demchenko, Y., P. Grosso, C. de Laat and P.
Membrey, 2013. Addressing Big Data issues in
Scientific Data Infrastructure. Proceedings of the
International Conference on Collaboration
Technologies and Systems, May 20-24, IEEE
Xplore Press, San Diego, CA., pp: 48-55.
DOI: 10.1109/CTS.2013.6567203
Dove, E.S., S.A. Faraj, E. Kolker and V. Ozdemir,
2012. Designing a post-genomics knowledge
ecosystem to translate pharmacogenomics into
public health action. Genome Med., 4: 91-91.
DOI: 10.1186/gm392
Dumbill, E., 2013. Making sense of Big Data. Big Data.
Eaton, C., D. Deroos, T. Deutsch, G. Lapis and P.
Zikopoulos, 2012. Understanding Big Data.
McGraw-Hill Companies.
Erdman, A.G. and D.F. Keefe, 2013. Grand challenge:
Applying regulatory science and Big Data to
improve medical device innovation. IEEE Trans.
Biomed. Eng., 60: 700-706.
DOI: 10.1109/TBME.2013.2244600
Han, J. and M. Kamber, 2006. Data Mining, Southeast
Asia Edition: Concepts and Techniques. 2nd Edn.,
Morgan Kaufmann, Amsterdam,
ISBN-10: 0080475582, pp: 800.
Hattori, R., 2013. Big Data and analytics for
semiconductor manufacturing. Technical Report,
IBM, pp: 1-25.
Hsu, C.Y., C.F. Chien and P.N. Chenb, 2012.
Manufacturing intelligence for early warning of key
equipment excursion for advanced equipment
control in semiconductor manufacturing. J. Chin.
Inst. Ind. Eng., 29: 303-313.
DOI: 10.1080/10170669.2012.702135
Hu, H., Y. Wen, T.S. Chua and X. Li, 2014. Toward
scalable systems for Big Data analytics: A
technology tutorial. IEEE Access, 2: 652-687.
DOI: 10.1109/ACCESS.2014.2332453
IBM, 2012.What is Big Data? IBM.
IC, 2014. Optimizing manufacturing with the internet of
things. Intel Corporation.
IWT, 2014. From predictive maintenance to predictive
manufacturing: Big Data challenge for industry.
POM Prognostics for Optimal Maintenance.
Joseph, J., O. Sharif, A. Kumar, S. Gadkari and A.
Mohan, 2014. Using Big Data for machine learning
analytics in manufacturing. Tata Consultancy
Services.
McGuire, T., J. Manyika and M. Chui, 2012. Why Big
Data is the new competitive advantage. Ivey Bus. J.
Megahed, F.M. and L.A. Jones-Farmer, 2013. A
Statistical Process Monitoring Perspective on “Big
Data”, Frontiers in Statistical Quality Control. 11th
Edn., Springer, New York.
Minelli, M., M. Chambers and A. Dhiraj, 2013. Big
Data, Big Analytics: Emerging Business Intelligence
and Analytic Trends for Today’s Business. 1st Edn.,
John Wiley and Sons, New Jersey,
ISBN-10: 111814760X, pp: 187.
MKGI, 2010. The internet of things. McKinsey Global
Institute.
Nedelcu, B., 2013. About Big Data and its challenges
and benefits in manufacturing. Database Syst. J., 4:
10-19.
Neison, R., 2014. Semiconductor vendors mine Big Data
for quality. Evaluat. Eng., 53: 24-24.
Noor, A.K., 2013. Putting Big Data to work. Department
of Mechanical Engineering.
ORRT, 2012. Planning for Big Data. O’Reilly Radar
Team.
Papanagnou, C., 2014. The role of Big Data exploration
and cloud-based technologies in manufacturing.
High Speed Sustainable Manufacturing Institute.
Rajpathak, T. and A. Narsingpurkar, 2013. Managing
knowledge from Big Data analytics in product
development. White Paper, Tata Consultancy
Services.
Ren, L., L. Zhang, F. Tao, X. Zhang and Y. Luo et al.,
2012. A methodology towards virtualisation-based
high performance simulation platform supporting
multidisciplinary design of complex products.
Enterprise Inform. Syst., 6: 267-290.
DOI: 10.1080/17517575.2011.592598
Russom, P., 2011. Big Data analytics. TDWI Best
Practices Report, Fourth Quarter.
Schwartz, J., C. Joshua and E. Poore, 2014. Measuring
the value of Big Data exploitation systems:
Quantitative, non-subjective metrics with the user as
a key component. Parsons J. Inform. Mapp., 6: 1-12.
Software AG, 2013. The value of Big Data in the supply
chain. White Paper, Software AG.
Turk, M., 2012. A chart of the Big Data ecosystem,
take 2.
Turner, C., 2014. Business advantage announces results
of worldwide CAD trends survey. Report, Business
Advantage Company.
Waller, M.A. and S.E. Fawcett, 2013a. Data science,
predictive analytics and Big Data: A revolution that
will transform supply chain design and
management. J. Bus. Logist., 34: 77-84.
DOI: 10.1111/jbl.12010
Lidong Wang and Cheryl Ann Alexander / A merican Journal of Engineering and Applied Sciences 2015, 8 (2): 223.232
DOI: 10.3844/ajeassp.2015.223.232
232
Waller, M.A. and S.E. Fawcett, 2013b. Click here for a
data scientist: Big Data, predictive analytics and
theory development in the era of a maker movement
supply chain. J. Bus. Logist., 34: 249-252.
DOI: 10.1111/jbl.12024
Wu, D., D.W. Rosena, L. Wang and D. Schaefer, 2015.
Cloud-based design and manufacturing: A new
paradigm in digital manufacturing and design
innovation. Computer-Aided Des., 59: 1-14.
DOI: 10.1016/j.cad.2014.07.006
Xie, C., Z. Zhang, S. Nourian and E. Hazzard, 2014a.
Time series analysis method for assessing
engineering design processes using a CAD tool. Int.
J. Eng. Educ., 30: 218-230.
Xie, C., Z. Zhang, S. Nourian, A. Pallant and S. Bailey,
2014b. On the instructional sensitivity of CAD logs.
Int. J. Eng. Educ., 30: 760-778.
Zaslavsky, A., C. Perera and D. Georgakopoulos,
2012. Sensing as a service and Big Data.
Proceedings of the International Conference on
Advances in Cloud Computing, (ACC’ 12),
Bangalore, India, pp: 1-8.
Zikopoulos, P.C., C. Eaton and P. Zikopoulos, 2011.
Understanding Big Data: Analytics for Enterprise
Class Hadoop and Streaming Data. 1st Edn.,
McGraw-Hill, New York, ISBN-10: 0071790535,
pp: 176.
... Moreover, Big Data enables firms to improve their internal processes and suppliers' performance, obtaining substantial benefits in terms of costs and time. For instance, by analysing data from machine sensors, production lines can be monitored and controlled to improve the manufacturing process and reduce defects (Wang and Alexander, 2015). BDA also supports firms in making more reliable decisions regarding supplier selection and contract management (Moretto et al., 2017). ...
... Furthermore, IoT is widely employed in this company's machinery connected to livestock, such as milking machines, and data regarding machine parameters are used to monitor and prevent machine failures. The applicability of I4.0 technologies to livestock provides a wider overview of possible uses and benefits, as Wang and Alexander (2015) highlighted in their study on improving production processes. This improves environmental sustainability by employing resources only when needed and more efficiently and fosters the creation of better working conditions since human intervention is often eliminated (He, 2017). ...
... Industries and organizations in the present age of the Internet of everything (IoE) are constantly producing data from sensors, instrumentation, events, computer-aided design (CAD), computer-aided manufacturing (CAM), and computer-aided engineering (CAE) models (Wang and Alexander 2015). Therefore, as technology evolves, manufacturing and business data are generated in addition to real-time process data from the product life cycle. ...
... Big data technologies, i.e., data mining and cloud computing, render management more competitive and improve manufacturing performance by identifying process bottlenecks and finding solutions as well as through MRO (maintenance, repair, and overhaul) . Figure 7 illustrates the benefits offered by big data in manufacturing (Wang and Alexander 2015). ...
Article
Full-text available
Big data is an expression for massive data sets consisting of both structured and unstructured data that are particularly difficult to store, analyze and visualize. Big data analytics has the potential to help companies or organizations improve operations as well as disclose hidden patterns and secret correlations to make faster and intelligent decisions. This article provides useful information on this emerging and promising field for companies, industries, and researchers to gain a richer and deeper insight into advancements. Initially, an overview of big data content, key characteristics, and related topics are presented. The paper also highlights a systematic review of available big data techniques and analytics. The available big data analytics tools and platforms are categorized. Besides, this article discusses recent applications of big data in chemical industries to increase understanding and encourage its implementation in their engineering processes as much as possible. Finally, by emphasizing the adoption of big data analytics in various areas of process engineering, the aim is to provide a practical vision of big data.
... Big Data has become integral to solving complex challenges across Disclaimer/Publisher's Note: The statements, opinions, and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions, or products referred to in the content. 2 various sectors, including research, healthcare, engineering, and environmental science [9][10][11][12][13]. It enables scientists to uncover hidden patterns and relationships, revolutionizing research and practice. ...
Preprint
Full-text available
Student attrition poses significant societal and economic challenges, leading to unemployment, lower earnings, and other adverse outcomes for individuals and communities. To address this, predictive systems leveraging machine learning and Big Data aim to identify at-risk students early and intervene effectively. This project focuses on extracting key parameters from past dropout data to construct a predictive model and alert authorities to intervene promptly. Two preliminary trials refine machine learning models, establish evaluation standards, and optimize hyperparameters. These trials facilitate systematic exploration of model performance and data quality assessment. Achieving 100% accuracy in dropout prediction, the study identifies academic performance as the primary influencer, with early-year subjects like Mechanics and Materials, Design of Machine Elements, and Instrumentation and Control having significant impact. The longitudinal effect of these subjects on attrition underscores the importance of early intervention. Proposed solutions include early engagement and support or restructuring courses to better accommodate novice learners, aiming to reduce attrition rates.
... The decline in STEM enrollments and increasing dropout rates carry far-reaching implications for the economy, innovation, and societal progress. Big data has emerged as a crucial tool in solving complex, multi-faceted challenges across various sectors, including education, healthcare, engineering, and environmental science [9][10][11][12][13]. It allows researchers to uncover hidden patterns, correlations, and relationships within vast datasets, fundamentally revolutionizing research practices. ...
Article
Full-text available
Student attrition poses significant societal and economic challenges, leading to unemployment, lower earnings, and other adverse outcomes for individuals and communities. To address this, predictive systems leveraging machine learning and big data aim to identify at-risk students early and intervene effectively. This study leverages big data and machine learning to identify key parameters influencing student dropout, develop a predictive model, and enable real-time monitoring and timely interventions by educational authorities. Two preliminary trials refined machine learning models, established evaluation standards, and optimized hyperparameters. These trials facilitated the systematic exploration of model performance and data quality assessment. Achieving close to 100% accuracy in dropout prediction, the study identifies academic performance as the primary influencer, with early-year subjects like Mechanics and Materials, Design of Machine Elements, and Instrumentation and Control having a significant impact. The longitudinal effect of these subjects on attrition underscores the importance of early intervention. Proposed solutions include early engagement and support or restructuring courses to better accommodate novice learners, aiming to reduce attrition rates.
... However, large and complex data cannot be directly processed, and commonly used devices also face the problem of insufficient storage space. With the advent of the digital age, high-performance computing clusters [41] as well as cloud computing [42] can handle large volumes of data with the help of powerful multi-core CPUs and GPUs. Moreover, big data analytics [43] and well-established algorithms [44] have been utilized to calculate and analyze large volumes of data. ...
Article
Full-text available
The wide application of welding manufacturing across key significant industries has aroused an increasing concern on weldments’ reliability. Weld residual stress, as a critical factor, is closely related to structural and material-level failures of weldments. Hence, the prediction of residual stress is necessary for ensuring weldments’ reliability. Neural network models have emerged as promising methods to achieve this prediction. In this review, the prediction of residual stress based on neural network models is reviewed to summarize current state and uncover their limitations. In terms of these limitations, this review gives substantial suggestions for future research on neural network modeling.
... The paper analyzes a number of issues related to the use of Big Data in design and manufacturing engineering. These issues include design and manufacturing data, the benefits and impact of Big Data, and its applications and capabilities [9]. The paper applies Big Data analysis technology to make intelligent decisionmaking in enterprise information, and builds an intelligent decision-making model based on Big Data analysis [10,11]. ...
Article
Full-text available
Досліджено ключовий інструментарій Smart-статистики: великі дані, дані штучного інтелекту та Інтернету речей, соціальних медіа та адміністратив-ні дані. У ході дослідження сформульовано концепцію Smart-статистики та виявлено переваги та загрози для офіційної статистики при використанні даних Smart-статистики. Запропоновано низку принципів роботи з великими даними, згруповано джерела вироблення великих даних за категоріями (відповідно до видів економічної діяльності), які нині затребувані як офіційною статистикою, так і суспільством (сільське господарство, охорона здоров'я, гірнича промисловість, машинобудування, освіта, енергети-ка). Розроблено систему розумних даних у розумних містах, яка містить такі елементи: розумний буди-нок, розумне середовище, розумний контроль, розумний трафік, розумне здоров'я та розумний грома-дянин. Визначено, що метою цієї системи є формування розумного середовища та спрощення способу життя шляхом економії часу, енергії і коштів. Визначено складові персоналізованого збирання та розпо-всюдження даних. Розглянуто штучний інтелект (ШІ) як компонент концепції Smart-статистики. У контексті статті автори зазначають, що використання технологій ШІ викликає етичні міркування щодо таких питань, як право власності на дані, їх прозорість і підзвітність. Ці міркування необхідно враховувати, аби перекона-тися, що використання технологій ШІ в офіційній статистиці є відповідальним і етичним. За результатами дослідження сформульовано висновок, що наразі необхідно сформувати логічні та адекватні підходи до розроблення методології збирання, опрацювання, групування й аналізу статис-тичних даних з альтернативних інформаційних джерел. Обґрунтовано, що, незважаючи на усі переваги інструментів Smart-статистики та потенційно позитивні результати їх імплементації в офіційну статис-тику, з огляду на умови війни й відсутність нормативно-правового поля щодо використання цих інстру-ментів Україна наразі не готова зіштовхнутися із тими загрозами, які вони в собі несуть. Подальші дослідження за розглянутою у статті тематикою мають спрямовуватися на поглиблення вивчення Smart-статистики та пошуки оптимальних шляхів упровадження її інструментів в офіційну статистику України. Ключові слова: Smart-статистика, офіційна статистика, великі дані, штучний інтелект, Інтернет речей, соціальні медіа, адміністративні дані. Вступ. Нещодавно Віце-прем'єр-міністр з інновацій, розвитку освіти, науки та технологій України-Міністр цифрової трансформації Укра-їни М. Федоров запропонував бачення трансфор-мації Державної служби статистики, наголосивши при цьому, що ухвалення управлінських рішень і відновлення України повинні базуватися на якіс-них даних. Ядро трансформаційної концепції, на його думку, полягає у такому: "У центрі всіх про-цесів Держстату має бути ІТ-платформа, яка в режимі реального часу збирає інформацію, ана-лізує і поширює" [1]. Таке бачення майбутнього офіційної статистики відкриває нові інформаційні горизонти шляхом залучення новітніх інновацій-них технологій, водночас провокує появу нових професійних і соціальних загроз, у тому числі у вигляді вивільнення великої кількості фахівців-статистиків. Зазначене змушує як науковців, так і практи-ків у сфері статистики до ґрунтовного вивчення й аналізу наявних підходів до трансформації статис-тичної галузі з метою визначення логічних кроків
Thesis
Full-text available
In the oil and gas industry, a huge amount of data is generated through sensory measurements during exploration to production phases of the reservoir. Uncertainties and inexactness are present in all the reservoir measurements due to heterogeneity and stochastic distribution of reservoir characteristics. Conventionally, field data are interpreted by experienced experts to extract useful information. However, with the advent of measurements-while-drilling and smart-well technologies, there is a significant increase in the volume of data generated and to be analyzed. Therefore, processing and analysis of this huge data pose a significant challenge to the prevailing technologies used in the oil and gas industry. In this study, the intelligent modeling approach has been investigated to provide cost-effective solutions for three major problems of petroleum domain viz. (a) Lithofacies identification (b) Drilling optimization (c) Production rate estimation.
Article
Full-text available
This paper proposes a novel computational approach based on time series analysis to assess engineering design processes using a CAD tool. To collect research data without disrupting a design learning process, design actions and artifacts are continuously logged as time series by the CAD tool behind the scenes, while students are working on a design challenge. These fine-grained data can be used to reconstruct and analyze the entire design process of a student with extremely high resolution. Results of a pilot study in a high school engineering class, in which students solved a solar urban design challenge, suggest that these data can be used to measure the level of student engagement, reveal the gender differences in design behaviors, and distinguish the iterative and non-iterative cycles in a design process. From the perspective of engineering education, this paper contributes to the emerging fields of educational data mining and learning analytics that aim to expand evidence approaches for learning in a digital world.
Article
Full-text available
Through the proliferation of sensors, smart machines, and instrumentation, industrial operations are generating ever increasing volumes of data of many different types. At the same time, advances in computing, storage, communications, and big data technologies are making it possible to store, process, and analyze enormous volumes of data at scale and at speed. The convergence of OT and IT, powered by innovative analytics, holds the promise of creating new social innovation businesses. Hitachi's R&D Group has established the Global Big Data Innovation Lab (GBDIL) to coordinate world-wide analytics research activities in support of the global expansion of the social innovation solution businesses by providing innovative analytics to the recently launched Hitachi Global Center for Innovative Analytics (HGC-IA). This article introduces GBDIL and HGC-IA, and describes a common reference architecture for developing, deploying, and operating big data solutions that leverage Hitachi's innovative analytics technologies. Through the use of example use cases, the article explains the strategy to expand the global big data solution business.
Article
Cloud-based design manufacturing (CBDM) refers to a service-oriented networked product development model in which service consumers are enabled to configure, select, and utilize customized product realization resources and services ranging from computer-aided engineering software to reconfigurable manufacturing systems. An ongoing debate on CBDM in the research community revolves around several aspects such as definitions, key characteristics, computing architectures, communication and collaboration processes, crowdsourcing processes, information and communication infrastructure, programming models, data storage, and new business models pertaining to CBDM. One question, in particular, has often been raised: Is cloud-based design and manufacturing actually a new paradigm, or is it just “old wine in new bottles”? To answer this question, we discuss and compare the existing definitions for CBDM, identify the essential characteristics of CBDM, define a systematic requirements checklist that an idealized CBDM system should satisfy, and compare CBDM to other relevant but more traditional collaborative design and distributed manufacturing systems such as web- and agent-based design and manufacturing systems. To justify the conclusion that CBDM can be considered as a new paradigm that is anticipated to drive digital manufacturing and design innovation, we present the development of a smart delivery drone as an idealized CBDM example scenario and propose a corresponding CBDM system architecture that incorporates CBDM-based design processes, integrated manufacturing services, information and supply chain management in a holistic sense.
Article
The aim of this article is to show the importance of Big Data and its growing influence on companies. It also shows what kind of big data is currently generated and how much big data is estimated to be generated. We can also see how much are the companies willing to invest in big data and how much are they currently gaining from their big data. There are also shown some major influences that big data has over one major segment in the industry (manufacturing) and the challenges that appear.
Article
This article reviews the benefits of Big Data in the manufacturing industry as more sophisticated and automated data analytics technologies are being developed. The challenge of Big Data is that it requires management tools to make sense of large sets of heterogeneous information. A new wave of inexpensive electronic sensors, microprocessors, and other components enables more automation in factories, and vast amounts of data to be collected along the way. In automated manufacturing, Big Data can help reduce defects and control costs of products. Smart manufacturing is expected to evolve into the new paradigm of cognitive manufacturing, in which machining and measurements are merged to form more flexible and controlled environments. The article also suggests that the emerging tools being developed to process and manage the Big Data generated by myriads of sensors and other devices can lead to the next scientific, technological, and management revolutions. The revolutions will enable an interconnected, efficient global industrial ecosystem that will fundamentally change how products are invented, manufactured, shipped, and serviced.