National Conference on Research Methodology in Library and Information Science
Statistical Analysis Tools: An Overview
Dr. B. Jeyapragash, T. Rajkumar & A.Muthuraj
Abstract
Statistical analysis is the collection, organization, analysis, interpretation and presentation of
data. It includes descriptive statistics, hypothesis testing, linear and nonlinear regression,
graphics, charts, plots, correlation and multivariate analysis. This paper discusses free and open
source statistical analysis tools such as SOFA, R, MaxStat, SimFit and Develve, and their features,
for the benefit of research scholars.
Introduction
Statistics is the study of the collection, organization, examination, summarization,
manipulation, interpretation and presentation of quantitative data. It deals with all aspects of data
including the planning of data collection in terms of the design of surveys and experiments.
Statistical data analysis begins with obtaining data, which can be done in multiple ways: using
built-in curated data sources, importing from a variety of file formats, or connecting to databases.
The analysis then proceeds to processing the data, including computing statistical quantities,
smoothing, testing, and visualizing. Statistical software is a program used for the collection,
organization, analysis, interpretation and presentation of data.
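The workflow described above (import data, compute a statistical quantity, smooth) can be sketched
in a few lines of Python. This is only an illustration: the CSV content, the column names and the
moving_average helper are assumptions made for the example and do not come from any of the tools
discussed in this paper.

```python
import csv
import io
from statistics import mean

# Hypothetical CSV content standing in for a file on disk.
raw = """month,visits
1,120
2,135
3,128
4,150
5,160
6,155
"""

# Import step: parse the CSV text into rows, then extract one numeric column.
rows = list(csv.DictReader(io.StringIO(raw)))
visits = [float(row["visits"]) for row in rows]


def moving_average(values, window=3):
    """Smooth a series with a simple trailing moving average."""
    return [
        mean(values[i - window + 1 : i + 1])
        for i in range(window - 1, len(values))
    ]


# Processing step: one summary quantity and a smoothed version of the series.
smoothed = moving_average(visits)
print("mean visits:", mean(visits))
print("smoothed:", smoothed)
```

A trailing window of three values is an arbitrary choice; a wider window smooths more strongly but
returns fewer points.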
Types of statistics
1. Descriptive statistics
2. Inferential Statistics
Descriptive statistics: A statistic is a number that is derived from data, for example a mean (average)
or a standard deviation. It can be very helpful when examining data to obtain a suitable set of relevant
descriptive statistics. In particular, it can be very interesting to compare statistics obtained from
different (but related) columns, or between levels of a factor. This gives an idea of the similarities or
differences between the data.
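As a minimal sketch of comparing descriptive statistics between the levels of a factor, the
following Python example summarizes two groups with the standard library. The group names, the
values and the describe helper are hypothetical and serve only to illustrate the idea.

```python
from statistics import mean, median, stdev

# Hypothetical measurements grouped by the levels of a factor.
scores = {
    "group_a": [12.0, 15.0, 11.0, 14.0, 13.0],
    "group_b": [18.0, 21.0, 17.0, 20.0, 19.0],
}


def describe(values):
    """Return a small set of descriptive statistics for one column."""
    return {
        "n": len(values),
        "mean": mean(values),
        "median": median(values),
        "stdev": stdev(values),  # sample standard deviation (divides by n - 1)
        "min": min(values),
        "max": max(values),
    }


summary = {level: describe(values) for level, values in scores.items()}

for level, stats in summary.items():
    print(level, stats)
```

In this made-up data the two groups have the same spread but different means, which is the kind of
similarity or difference such a side-by-side summary is meant to reveal.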
Inferential Statistics: After data exploration, aided by visualization and description techniques, one
will need to identify what formal statistical analysis technique (if any) is required to investigate the
data further and to draw general conclusions. A very large number of statistical techniques have been
developed to handle many different types of data and to establish relationships between them.
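One common inferential technique is the two-sample t test. The sketch below computes Welch's t
statistic and its approximate degrees of freedom for two hypothetical samples; the data and the
welch_t helper are assumptions for illustration. Turning the statistic into a p-value requires the
t distribution, which a statistical package or a printed table would supply.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    # Squared standard error of each sample mean.
    se2_a = variance(sample_a) / len(sample_a)
    se2_b = variance(sample_b) / len(sample_b)
    t_stat = (mean(sample_a) - mean(sample_b)) / sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (len(sample_a) - 1) + se2_b ** 2 / (len(sample_b) - 1)
    )
    return t_stat, df


# Hypothetical samples from two groups.
group_a = [12.0, 15.0, 11.0, 14.0, 13.0]
group_b = [18.0, 21.0, 17.0, 20.0, 19.0]

t_stat, df = welch_t(group_a, group_b)
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

A large absolute value of t relative to the critical value for the computed degrees of freedom
suggests that the two group means differ.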
Importance of Statistical Analysis
Statistical analysis is important to research for the following reasons.
Validity
Careful Analysis
Efficiency
Accuracy
React intelligently
Forecast future
Statistical Analysis Tools
A statistical tool is a program used for the collection, organization, analysis, interpretation and
presentation of data. Statistical analysis tools are available both commercially and as free / open
source software. The following are important free / open source software tools available for
statistical analysis.
SOFA
R
MaxStat
SimFit
Develve
SOFA (Statistics Open For All) (http://www.sofastatistics.com/home.php)
SOFA Statistics is an open source statistics, analysis, and reporting package. It emphasizes ease of
use, learning as you go, and beautiful output. SOFA Statistics generates attractive, dynamic charts
that can be shared with others easily. The output is HTML and JavaScript, so it is web-friendly from
the start. At the time of writing, the most recent release is version 1.0.4.
Features
Ongoing Development
Beauty - Attractive Output
Spreadsheet Friendly
Data Entry Friendly
Easy Data Recoding
Output You Can Share Easily
Main Tests and Measures Made Easy
Learn As You Go
Automated Reporting
Speaking your language
Database Friendly
Tabular output is in HTML, which means you can put it directly on your intranet or website, or put it
in a spreadsheet.
SOFA has a wide range of attractive, high quality charts including:
simple bar charts (freq or means)
clustered bar charts (freq or means)
pie charts
single and multiple line charts (freq or means)
area charts (freq or means)
histograms
scatterplots
box and whisker plots
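To illustrate why HTML tabular output is easy to share, the following Python sketch builds a small
frequency table as an HTML string. It is not SOFA's own code or API: the frequency_table_html
function and the survey responses are hypothetical, and the markup is deliberately minimal.

```python
from collections import Counter
from html import escape


def frequency_table_html(values, label="Value"):
    """Render a frequency count of the given values as a minimal HTML table."""
    counts = Counter(values)
    rows = "".join(
        f"<tr><td>{escape(str(value))}</td><td>{count}</td></tr>"
        for value, count in sorted(counts.items())
    )
    return (
        f"<table><tr><th>{escape(label)}</th><th>Freq</th></tr>"
        f"{rows}</table>"
    )


# Hypothetical survey responses.
responses = ["yes", "no", "yes", "maybe", "yes", "no"]
html_table = frequency_table_html(responses, label="Response")
print(html_table)
```

The resulting string can be saved to a file and opened in a browser, or pasted into a web page.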
R (https://www.r-project.org/)
R is a free software environment for statistical computing and graphics. It is similar to the S
language and environment, which was developed at Bell Laboratories (formerly AT&T, now Lucent
Technologies) by John Chambers and colleagues. R provides a wide variety of statistical (linear and
nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering)
and graphical techniques, and is highly extensible. At the time of writing, the most recent release
is version 3.2.4.
Features
R is an integrated suite of software facilities for data manipulation, calculation and
graphical display. It includes:
an effective data handling and storage facility,
a suite of operators for calculations on arrays, in particular matrices,
a large, coherent, integrated collection of intermediate tools for data analysis, and
graphical facilities for data analysis and display either on-screen or on hardcopy.
MaxStat (http://www.maxstat.de/)
MaxStat Statistical Analysis Software supports over 100 commonly used statistical tests and
makes it easy to interpret results and create high-quality graphs. MaxStat includes descriptive
statistics, hypothesis testing, linear and nonlinear regression, correlation, multivariate analysis,
and time series analysis.
MaxStat also helps you design your experiments by computing sample sizes and power. At the time of
writing, the most recent release is version 3.6.
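A sample size computation of the kind mentioned above can be sketched with the standard normal
approximation for comparing two group means. This is not MaxStat's implementation: the function
name, the default significance level of 0.05 and the default power of 0.80 are assumptions chosen
for illustration.

```python
from math import ceil
from statistics import NormalDist


def sample_size_two_means(delta, sigma, alpha=0.05, power=0.80):
    """Approximate per-group sample size for a two-sided, two-sample comparison.

    delta is the smallest difference in means worth detecting and sigma is the
    common standard deviation. Uses the normal approximation
    n = 2 * ((z_(1 - alpha/2) + z_power) * sigma / delta) ** 2.
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)
    z_power = z.inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) * sigma / delta) ** 2)


# Hypothetical design: detect a difference of half a standard deviation.
n_per_group = sample_size_two_means(delta=0.5, sigma=1.0)
print(n_per_group)
```

Because the formula uses normal rather than t quantiles, it can slightly underestimate the sample
size needed for small studies.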
Features
Statistical analysis and scientific graphing
Results that are easy to understand
Display your data in high-quality graphs with over 30 styles
Geometric, and median bar
Geometric, and median line
Box plot, low-high
Box-Whisker
DNA plot
Histogram
Vertical and horizontal bar graphs
Simple linear regression
Multiple linear regression
Residual plots
Nonlinear regression
XY plots
Biplots (multivariate analysis)
Scree plot (PCA)
Z-Score plots (PCA)
Auto correlation and lag plots
Forecast and smoothing plots
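Simple linear regression and its residuals, two of the items listed above, can be sketched with
ordinary least squares in plain Python. The data points and the fit_line helper are hypothetical;
a package such as MaxStat would additionally report standard errors and draw the plots.

```python
def fit_line(xs, ys):
    """Return the least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical observations.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

slope, intercept = fit_line(x, y)

# Residuals are the observed values minus the fitted values.
residuals = [yi - (slope * xi + intercept) for xi, yi in zip(x, y)]

print(f"slope = {slope:.3f}, intercept = {intercept:.3f}")
print("residuals:", [round(r, 3) for r in residuals])
```

With an intercept in the model the residuals sum to zero, so a residual plot is read for patterns
rather than for its overall level.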
Simfit (http://www.simfit.org.uk/)
Simfit is a software package for simulation, curve fitting, statistics, and plotting, using a library
of models or user-defined equations. Simfit has been in continuous development for many years by Bill
Bardsley of the University of Manchester. Simfit has forty programs, each dedicated to an aspect of
simulation, plotting, or data analysis, with a reference manual containing mathematical and
statistical details. Clipboard data and spreadsheet export files can be analysed, and macros to
interface with Microsoft Office are provided.
Features
Grouping of data into histograms (with error bars if appropriate) and cdfs
Extrapolation of best-fit linear and nonlinear curves to arbitrary end points
Automatic transformation of error bars into various axes (Hill, Lineweaver-Burk, Scatchard, log-odds, etc.)
Multiple axes plots
Pie charts with arbitrary displacements, fill-styles, colours
Bar charts with arbitrary positions, sizes, fill-styles, colours and error-bars
Error bars can be added to 2D and 3D bar charts and 3D cylinder plots
Presentation box and whisker plots, and pie or bar charts with 3D perspective effects
Orbits and vector field diagrams for systems of differential equations
Dendrograms and 3D-cluster plots for use in cluster analysis
3D-surfaces and 2D-projections of contours
Curves in space and projections onto planes
Vast array of plotting characters and maths symbols
Standard PostScript fonts, Symbol, Zapf Dingbats and Isolatin1 encoding
Professional quality PostScript files that can easily be edited to change titles, legends, symbols, line-types and thicknesses, etc.
Transformation of .eps files into bit-map and compressed graphics formats (e.g. bmp, pcx, tif, jpg, png, pdf)
Graphical deconvolution of summation models after fitting
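The Lineweaver-Burk axes mentioned in the list above come from a linearizing transformation of the
Michaelis-Menten model v = Vmax * S / (Km + S), which gives 1/v = (Km/Vmax) * (1/S) + 1/Vmax. The
following Python sketch applies that transformation to noise-free, simulated data and recovers the
parameters with a straight-line fit. It does not use Simfit; the parameter values and the fit_line
helper are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Return the least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical "true" parameters used only to simulate exact data.
true_vmax, true_km = 10.0, 2.0
substrate = [0.5, 1.0, 2.0, 4.0, 8.0]
rate = [true_vmax * s / (true_km + s) for s in substrate]

# Lineweaver-Burk (double reciprocal) transformation of both axes.
inv_s = [1.0 / s for s in substrate]
inv_v = [1.0 / v for v in rate]

slope, intercept = fit_line(inv_s, inv_v)
vmax_est = 1.0 / intercept  # the intercept equals 1 / Vmax
km_est = slope * vmax_est   # the slope equals Km / Vmax

print(f"Vmax = {vmax_est:.3f}, Km = {km_est:.3f}")
```

With real, noisy measurements the reciprocal transformation distorts the error structure, which is
why transforming the error bars along with the data matters when such axes are used for display.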
Develve (http://develve.net/)
Features
Mean
Median
Min/Max
Compare with
T test
Variation F test
Regression
Multiple linear Regression
Sample size calculations
Chi Square test
Graphs
Distribution fitting
Measurement System Analysis
Design of experiments
Reliability
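As an illustration of one of the listed tests, the following Python sketch computes the chi-square
statistic for a test of independence on a contingency table. It is not Develve's implementation:
the chi_square_independence function and the observed counts are hypothetical.

```python
def chi_square_independence(observed):
    """Return the chi-square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (obs - expected) ** 2 / expected
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, df


# Hypothetical 2 x 2 table: rows are groups, columns are outcomes.
observed = [
    [20, 30],
    [30, 20],
]

chi2, df = chi_square_independence(observed)
print(f"chi-square = {chi2:.3f}, df = {df}")
```

The statistic is then compared with the chi-square critical value for the computed degrees of
freedom; for one degree of freedom at the 0.05 level the tabulated value is 3.841.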
Conclusion
Statistical analysis plays an important role in any field of research. It describes the nature of
the data to be analysed, explores relationships within the data, helps establish validity, and
informs the future action to be taken. Research scholars should therefore understand statistics,
since it helps them look behind the numbers and know the truth. With sound statistical knowledge,
using statistical analysis tools yields the right results from data to support our research.