Statistical Characterization, Pattern Identification, and Analysis of Big Data

Paper #:
  • 2017-01-0236

Published:
  • 2017-03-28
DOI:
  • 10.4271/2017-01-0236
Citation:
Wei, Z. and Nikbin, K., "Statistical Characterization, Pattern Identification, and Analysis of Big Data," SAE Int. J. Mater. Manf. 10(3):293-305, 2017, https://doi.org/10.4271/2017-01-0236.
Pages:
13
Abstract:
In the Big Data era, the capability in statistical and probabilistic data characterization, data pattern identification, data modeling and analysis is critical to understand the data, to find the trends in the data, and to make better use of the data. In this paper the fundamental probability concepts and several commonly used probabilistic distribution functions, such as the Weibull for spectrum events and the Pareto for extreme/rare events, are described first. An event quadrant is subsequently established based on the commonality/rarity and impact/effect of the probabilistic events. Level of measurement, which is the key for quantitative measurement of the data, is also discussed based on the framework of probability. The damage density function, which is a measure of the relative damage contribution of each constituent is proposed. The new measure demonstrates its capability in distinguishing between the extreme/rare events and the spectrum events. Several case studies including vehicle reliability, vehicle road test score, warranty, salary distribution of an institution, the city population distribution in 3 countries, and the earthquake distribution worldwide and in the USA, are provided to demonstrate the role of the statistical and probabilistic approaches in the characterization and analysis of the big data.
Access
Now
SAE MOBILUS Subscriber? You may already have access.
Buy
Select
Price
List
Download
$27.00
Mail
$27.00
Members save up to 40% off list price.
Share
HTML for Linking to Page
Page URL

Related Items

Training / Education
2016-03-07
Training / Education
2015-07-13
Training / Education
2016-03-10
Technical Paper / Journal Article
2011-04-12
Standard
1987-02-01
Technical Paper / Journal Article
2011-04-12