Statistical Software - Science topic

R, SAS, SPSS, STATA, Statistica...
Questions related to Statistical Software
  • asked a question related to Statistical Software
Question
4 answers
Hi all, I developed an open source Fuzzy Delphi model module on Jamovi. Jamovi is an open-source statistical software. The link is https://lerlerchan.github.io/FuzzyDelpiJmv/
The user interface and result (output) are very simple.
After trying out this module, please provide your feedback.
Thank you
Relevant answer
Answer
Thank you for your help - there are now 22 data points and free lines! As my deadline is approaching, I decided to conduct the analysis manually last night and came up with a few additional suggestions. The results (e.g. the variable 'd') are rounded to two decimal places, which may not be accurate enough. Also, I was able to manually replicate some of the results (e.g. for item 1), but I couldn't replicate all of them. I have checked my manual analysis several times and even looked through the R code (especially the fuzzy values, Euclidean distance and defuzzification formula), and I still do not know why the results do not match. I'm not sure where the error could be in the analysis I did with Excel.
If you like, I can email or message you the details. I can also send you some of my data and analysis to see if any reconciliation or adjustment is needed. I really appreciate the effort and energy you have put into this add-on for Jamovi. I see a lot of potential in this tool and am looking forward to seeing how it develops.
  • asked a question related to Statistical Software
Question
4 answers
Good day, water and marine biologists, environmental engineers, industrial engineers, chemists, and chemical engineers,
I hope this message finds you well. I am currently serving as a guest researcher at Hof University, pursuing my MSc. in Natural And Applied Sciences (Applied Biology). My research focuses on the Effects of Brine Wastewater from a prospective green hydrogen desalination plant on Coastal Flora and Fauna Biodiversity.
I must confess to having limited familiarity with statistical analysis and the utilization of various statistical software for generating trends, charts, and identifying correlations.
I have raw data from brine chemical analysis and am eager to subject it to statistical analysis. However, I find myself in need of guidance regarding the most appropriate statistical methods for analyzing the specific dataset I have combined. I was wondering if you would be available to help in this regard, or if you might be aware of someone better suited to lend their expertise.
Any assistance you could provide would be immensely appreciated.
Warm regards,
Monika Leevi
Relevant answer
Answer
I don't have knowledge on this topic please
  • asked a question related to Statistical Software
Question
1 answer
I need to use a randomization test for a single-case experimental design. Any suggestions for the best statistical software for this study?
Relevant answer
Answer
Can you say more specifically what you want? Do you want to randomize a bunch of numbers? For example, sample(1:10) puts the numbers 1 to 10 in random order.
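If the question is about a randomization test for an AB single-case design, one common variant randomizes the intervention start point and compares the observed phase difference with the differences obtained for every other admissible start point. The following is only a minimal sketch under that assumption; the function name, the minimum phase length of 3 sessions, and the scores are made up for illustration.

```python
from statistics import mean


def ab_randomization_test(scores, actual_start, min_phase_len=3):
    """One-sided randomization test for an AB single-case design.

    scores: outcome per session, in time order.
    actual_start: index of the first intervention (B) session.
    min_phase_len: fewest sessions allowed in each phase (assumed design rule).
    Returns the observed effect and the randomization p-value.
    """
    def effect(start):
        # Test statistic: mean of phase B minus mean of phase A.
        return mean(scores[start:]) - mean(scores[:start])

    # Every start point the design could have selected.
    candidates = range(min_phase_len, len(scores) - min_phase_len + 1)
    if actual_start not in candidates:
        raise ValueError("actual_start is not an admissible start point")

    observed = effect(actual_start)
    as_extreme = sum(1 for start in candidates if effect(start) >= observed)
    return observed, as_extreme / len(candidates)


# Made-up data: 5 baseline sessions followed by 7 intervention sessions.
scores = [3, 4, 3, 5, 4, 8, 9, 8, 10, 9, 9, 10]
observed, p_value = ab_randomization_test(scores, actual_start=5)
print(f"effect = {observed:.2f}, p = {p_value:.3f}")
```

Note that the smallest attainable p-value is 1 divided by the number of admissible start points (here 1/7), so the design needs enough possible start points for the test to be able to reach significance at all.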
  • asked a question related to Statistical Software
Question
2 answers
I want the advantages and disadvantages of each software
Relevant answer
Answer
When it comes to studying the public budget, several statistical software options can be valuable. Some of them are:
  1. IBM SPSS Statistics: It is widely used proprietary statistical software that offers a range of features for data analysis.
  2. JMP: It is a data analysis software that combines interactive visualization with powerful statistical capabilities.
When selecting statistical software, it's important to consider various factors such as your proficiency in using the software, specific analytical requirements, and budget constraints. These factors play a crucial role in choosing the most suitable statistical tool to effectively analyze public budget data and facilitate informed decision-making.
  • asked a question related to Statistical Software
Question
2 answers
I want to calculate/measure microbial community stability. Which statistical software packages are suitable?
Relevant answer
Answer
Mohammad Eshaq Faiq Thanks. But I want a specific package or statistical tool to measure the stability of the community. This is a very general answer. I hope you understand.
  • asked a question related to Statistical Software
Question
1 answer
Dear colleagues, I want to understand how to present CCA results and interpret them. I also want to know if there is any statistical software that can help. Please share reading materials. Thank you.
Relevant answer
CANONICAL CORRESPONDENCE ANALYSIS (CCA AND PARTIAL CCA)
Canonical correspondence analysis investigates the links between a contingency table and a set of variables. Run CCA in Excel using the XLSTAT software.
Search for this on Google and read about it there.
  • asked a question related to Statistical Software
Question
24 answers
I just received the latest TOC alert for Behavior Research Methods, and this article caught my eye:
I've not had time to read it yet, but judging from a quick glance, I wonder if the main "problem" might be that users do not always take time to RTFM* and therefore, do not understand what their software is doing? In any case, I thought some members of this forum might be interested.
Cheers,
Bruce
* RTFM = Read The Fine Manual ;-)
Relevant answer
Answer
Uzair Essa Kori, I repeat my earlier question, which you have not yet addressed: Why in the world did you not say clearly at the top of your post that it was generated using AI? Do you not think that would be the honest and ethical thing to do?
  • asked a question related to Statistical Software
Question
3 answers
I would like to carry out a PCA analysis for my GC-MS samples comprising cuticular hydrocarbons extracted from fly samples. I want to do the analysis in statistical software like R Studio. Is there anyone who has done this before or perhaps let me know how it can be done?
Relevant answer
Answer
It's okay, Balu, I will be willing to assist you. Best of luck.
  • asked a question related to Statistical Software
Question
5 answers
Modelling Habitat Preferences, Species Correlations and Estimating Species Richness of Mammals from Camera Trap data
Relevant answer
Answer
Hi,
For camera trap data analysis, you can use ordination techniques like PCA or NMDS, and models such as GLMs. Software options include R and SPSS.
  • asked a question related to Statistical Software
Question
2 answers
I have a firm's stock and WTI crude oil. I applied a DCC-GARCH model and then calculated the covariance matrix. My problem now is how, and with which statistical software, I can calculate the optimal portfolio weights, the hedge ratio, and the hedging effectiveness.
With best wishes.
Relevant answer
Answer
Please check if this is helpful- https://rdrr.io/cran/riskR/man/risk.hedge.html
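Once the conditional variances and covariances are available, the three quantities can be computed with a few lines in any language. The sketch below uses the formulas commonly applied in the DCC-GARCH hedging literature (the Kroner-Sultan hedge ratio and the Kroner-Ng minimum-variance weight). It is not the riskR API; the function names and the numbers are hypothetical, and the inputs are assumed to come from your fitted model.

```python
from statistics import pvariance


def hedge_ratio(cov_stock_oil, var_oil):
    """Units of oil to short per unit of stock held long: h_so / h_oo."""
    return cov_stock_oil / var_oil


def optimal_oil_weight(var_stock, var_oil, cov_stock_oil):
    """Minimum-variance weight of oil in a stock/oil portfolio, clipped to [0, 1]."""
    weight = (var_stock - cov_stock_oil) / (var_oil - 2.0 * cov_stock_oil + var_stock)
    return min(1.0, max(0.0, weight))


def hedging_effectiveness(stock_returns, oil_returns, hedge_ratios):
    """1 - Var(hedged portfolio) / Var(unhedged stock position)."""
    hedged = [rs - b * ro for rs, ro, b in zip(stock_returns, oil_returns, hedge_ratios)]
    return 1.0 - pvariance(hedged) / pvariance(stock_returns)


# Made-up conditional moments for a single day.
beta = hedge_ratio(cov_stock_oil=0.5, var_oil=2.0)
weight = optimal_oil_weight(var_stock=4.0, var_oil=1.0, cov_stock_oil=0.5)
print(beta, weight)
```

With a DCC-GARCH model the hedge ratio and the weight are time-varying, so the first two functions would be applied to each day's conditional covariance matrix, and the resulting series of hedge ratios passed to the third.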
  • asked a question related to Statistical Software
Question
5 answers
I need to analyse interview data quantitatively.
Relevant answer
Answer
ATLAS.ti - https://atlasti.com/ is another good one!
  • asked a question related to Statistical Software
Question
22 answers
Especially for cross nested logit (CNL), nested logit (NL), multinomial logit (MNL).
Relevant answer
Answer
Nlogit is by far the best and easiest to use.
  • asked a question related to Statistical Software
Question
3 answers
What are the statistical software packages that deal with the artificial intelligence environment?
Relevant answer
Answer
Thanks a lot for this information.
  • asked a question related to Statistical Software
Question
13 answers
There are many statistical software packages that researchers commonly use in their work. In your opinion, which one is better? Which one do you recommend starting with?
Your opinions and experience can help others, in particular younger researchers, in their selection.
Sincerely
Relevant answer
Answer
SPSS, as my results necessitate analysis using SPSS
  • asked a question related to Statistical Software
Question
4 answers
I'm using statistical software JMP to run several machine learning models viz. SVM, boosted tree and bootstrap forest. By default from the software, RASE is used to evaluate the generated models. Can RASE and RMSE be used interchangeably and considered the same?
Relevant answer
Answer
These are the same, just different nomenclature.
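One caveat worth checking in your software's documentation: RASE divides the squared error by the number of observations n, which matches the usual machine-learning definition of RMSE. In regression output, however, "RMSE" is sometimes computed with the residual degrees of freedom (n minus the number of estimated parameters) in the denominator, and then the two differ. A small sketch with made-up numbers and hypothetical function names:

```python
from math import sqrt


def sum_squared_error(actual, predicted):
    return sum((a - p) ** 2 for a, p in zip(actual, predicted))


def rase(actual, predicted):
    """Root average squared error: sqrt(SSE / n)."""
    return sqrt(sum_squared_error(actual, predicted) / len(actual))


def rmse_df_adjusted(actual, predicted, n_params):
    """Regression-style RMSE: sqrt(SSE / (n - number of estimated parameters))."""
    return sqrt(sum_squared_error(actual, predicted) / (len(actual) - n_params))


actual = [1.0, 2.0, 3.0, 4.0]
predicted = [1.0, 2.0, 3.0, 6.0]
print(rase(actual, predicted))                  # divides by n = 4
print(rmse_df_adjusted(actual, predicted, 2))   # divides by n - 2 = 2
```

For large validation sets the difference is negligible, which is probably why the two names are often treated as synonyms.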
  • asked a question related to Statistical Software
Question
3 answers
Dear all !
I'm working on historical data related to events and like to carry out a survival analysis. In the dataset there are typically some right censored data, when the individuals are still living. But there are also individuals with missing birth dates, which means that they are left censored.
While survival analysis with right or left censoring can be carried out in most professional statistical software, I found no solution for including right and left censoring in one survival analysis.
Does anyone have an idea?
Thanks
Relevant answer
Answer
Yes, there are methods to handle both right and left censoring in survival analysis. Censoring occurs when the event of interest (e.g., death, failure) is not observed exactly for some individuals. Right censoring refers to cases where the event has not yet occurred by the end of the study, while left censoring occurs when the event is only known to have happened before a certain time (for example, before the study started).
In the presence of right and left censoring, you can use a statistical technique called interval-censored survival analysis. This approach takes into account the time intervals within which the event of interest occurred, rather than the precise event times. Interval-censored survival analysis allows for the estimation of survival probabilities and the comparison of survival curves when the event times are only known within certain intervals.
There are several methods available for interval-censored survival analysis, including:
  1. Turnbull's estimator: This nonparametric method extends the Kaplan-Meier estimator to interval-censored data. It estimates the survival probabilities iteratively (with a self-consistency algorithm) and does not assume a specific distribution for the event times.
  2. Parametric models: These models assume a specific distribution for the event times and estimate the parameters using maximum likelihood estimation. Common parametric models for interval-censored data include the Weibull, log-normal, and exponential distributions.
  3. Nonparametric maximum likelihood estimation (NPMLE): This method directly estimates the survival probabilities without making specific distributional assumptions. For interval-censored data the NPMLE is the Turnbull estimator above; the Kaplan-Meier estimator is the NPMLE for the special case of right-censored data only.
  4. Bayesian methods: Bayesian approaches provide a flexible framework for interval-censored survival analysis, allowing for the incorporation of prior information and the estimation of survival probabilities based on posterior distributions.
The choice of method depends on the specific characteristics of your data and the assumptions you are willing to make. It is important to consider the underlying distribution of the event times, the nature of censoring, and the sample size.
Implementing interval-censored survival analysis typically requires specialized software or programming packages that offer appropriate functions or procedures for this type of analysis. Consult the documentation of statistical software packages such as R, SAS, or Stata, as they often provide specific functions for interval-censored survival analysis.
Remember to carefully interpret and report the results, acknowledging the presence of censoring and the methods used to handle it in your analysis.
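To make the parametric option concrete, here is a minimal sketch of how left- and right-censored observations enter one likelihood. It assumes an exponential event-time distribution purely for illustration, uses made-up data, and maximizes the log-likelihood with a simple ternary search; the function names are hypothetical, and in practice a dedicated routine in R, SAS, or Stata would be the better choice.

```python
from math import expm1, log


def log_likelihood(rate, observations):
    """Exponential log-likelihood with exact, right- and left-censored times."""
    total = 0.0
    for time, status in observations:
        if status == "exact":      # density: rate * exp(-rate * t)
            total += log(rate) - rate * time
        elif status == "right":    # survival: P(T > t) = exp(-rate * t)
            total += -rate * time
        elif status == "left":     # distribution: P(T <= t) = 1 - exp(-rate * t)
            total += log(-expm1(-rate * time))
        else:
            raise ValueError(f"unknown status: {status}")
    return total


def fit_rate(observations, low=1e-6, high=10.0, iterations=200):
    """Maximize the (concave) log-likelihood over the rate by ternary search."""
    for _ in range(iterations):
        left = low + (high - low) / 3.0
        right = high - (high - low) / 3.0
        if log_likelihood(left, observations) < log_likelihood(right, observations):
            low = left
        else:
            high = right
    return (low + high) / 2.0


# Made-up data: (time in years, censoring status).
data = [(2.0, "exact"), (5.0, "exact"), (3.0, "right"), (7.0, "right"), (4.0, "left")]
rate_hat = fit_rate(data)
print(f"estimated rate = {rate_hat:.4f}, mean survival time = {1 / rate_hat:.2f} years")
```

The same structure carries over to a Weibull or log-normal model: only the density, survival, and distribution functions inside `log_likelihood` change.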
  • asked a question related to Statistical Software
Question
5 answers
Does anyone have a link to download the Stata statistics software for free?
Relevant answer
Answer
Just to be clear : as Ma'Mon Abu Hammad states plainly, Stata is commercial software, made with care and professionalism by people who do this for a living. Downloading bootleg copies is stealing.
Just saying.
  • asked a question related to Statistical Software
Question
8 answers
Ans?
Relevant answer
Answer
IMO R is the best. To get started with R see the book R for Everyone by Lander which contains research grade code. Best wishes David Booth PS there are many more books but I found this one to be best for me. It included download information too.
  • asked a question related to Statistical Software
Question
4 answers
I am working on panel data measuring the direct and indirect effects of social support on subjective well-being from a retrospective cross-sectional data
Relevant answer
Answer
Any structural equation modeling (SEM) software will be able to also handle longitudinal SEM, for example, lavaan (free package in R), Mplus, AMOS, OpenMx, LISREL.
  • asked a question related to Statistical Software
Question
3 answers
Hi, everyone. I am calculating NRI and IDI using Stata. I want to compare the discrimination ability of two separate models (Model A and Model B). The nri program in Stata seems to calculate only an NRI that reflects the discrimination between a base model and a model in which a new marker is added to the base model. How should I calculate an NRI that reflects the discrimination of two models that include different variables?
Thanks in advance!
Relevant answer
Answer
Would you please tell us how you calculated NRI and IDI in Stata?
  • asked a question related to Statistical Software
Question
19 answers
SPSS or STATA? Python or R? Jamovi or JASP?
Relevant answer
Answer
R is a language, so you will find a lot of your early weeks (months!) spent learning how the language works and trying to remember your vocabulary. If you are also trying to learn data analysis at the same time, this results in constant interference between the task of language learning and the task of learning data analysis.
For these reasons I would recommend jamovi. The interface is transparently simple, and it encourages good data analysis habits. You can perform quite complex (and some very advanced) analyses in jamovi, and the library of modules is growing all the time.
For data manipulation and meta-data, Stata is remarkably powerful. Labelling variables, values, and datasets, merging, cleaning, consistency-checking are unrivalled. I know people who pre-process their data in Stata before moving it to R because of these strengths.
Both jamovi and Stata have excellent videos, and the Stata manuals are comprehensive, with every command illustrated with worked examples. You can learn a lot of stats from them!
One big plus to jamovi, of course, is that it's free!
Given that you are doing a masters, I would not recommend R. By the time you get up to speed, it may be time to go!
  • asked a question related to Statistical Software
Question
22 answers
What statistical software do you prefer for your research works? 💻 (Excel, SPSS, Matlab,...)
Thanks for your answer 😉
Relevant answer
Answer
SPSS has wide choices
  • asked a question related to Statistical Software
Question
1 answer
I would like to use the broad-sense heritability equation (in the attached document) proposed by Cullis et al. (2006) for unbalanced data. Is it possible to compute the vBLUP in Genstat as well as in an R package? vBLUP in the equation is the mean variance of a difference of two BLUPs of genotypic effects (the average squared standard error of differences between BLUPs), and σg2 is the genotypic variance.
Cullis, B., Smith, A., and Coombes, N. (2006). On the design of the early generation
variety trials with correlated data. J. Agric. Biol. Environ. Stat. 11, 381–393.
doi: 10.1198/108571106X154443
Relevant answer
Answer
To calculate the heritability, the genotype term should be fitted as a random term. The VKEEP directive can be used to get the variance component for this term to use in the calculation. The Method section of the help for the VHERITABILITY procedure provides details as to how the heritability is calculated in Genstat, and the Cullis method for calculating heritability is available using the VHERITABILITY procedure. The VPREDICT directive can be used to calculate the vBLUPs. Alternatively, this can be done using the prediction menus within the Linear Mixed Model menu.
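Once both quantities have been extracted from the mixed model, the final step is simple arithmetic. The attached equation is not reproduced in this thread, so the sketch below assumes the form in which the Cullis et al. (2006) heritability is usually written, H² = 1 − vBLUP / (2·σg²); the function name and the numbers are made up.

```python
def cullis_heritability(mean_var_blup_diff, genotypic_variance):
    """Broad-sense heritability, assuming H2 = 1 - vBLUP / (2 * sigma_g^2).

    mean_var_blup_diff: mean variance of a difference of two genotype BLUPs.
    genotypic_variance: estimated genotypic variance component.
    """
    if genotypic_variance <= 0:
        raise ValueError("genotypic variance must be positive")
    return 1.0 - mean_var_blup_diff / (2.0 * genotypic_variance)


# Made-up values, as they might be read off a fitted model.
h2 = cullis_heritability(mean_var_blup_diff=0.4, genotypic_variance=1.0)
print(h2)
```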
  • asked a question related to Statistical Software
Question
4 answers
I am examining lifestyle change in migrant Nepalese and I am using, among other things, G-PAQ. WHO recommends using EpiInfo (from CDC) to analyse G-PAQ data, but the downloaded programs, which work through Microsoft Access, are not stand-alone and seem to need eSTEPS to work. However, the website for downloading eSTEPS is old (2007), and some of the files are obsolete and cannot be downloaded as a result. Can anyone tell me how to get the program for analysing the G-PAQ data to work? Download order? Anything? I am becoming increasingly desperate...
  • asked a question related to Statistical Software
Question
3 answers
There are many statistical software packages nowadays. Which one do you think is the most important? And what about R? Do you recommend it?
Relevant answer
Answer
Yes, it is free and R-Studio is very convenient to use.
  • asked a question related to Statistical Software
Question
4 answers
Most statistical software provides weighted kappa for ordinal outcomes, but only with two raters. For multiple raters, Fleiss' kappa is provided, but it cannot be weighted for an ordinal outcome. My question is: which statistical software should I use for weighted kappa with multiple raters and an ordinal outcome?
Relevant answer
Answer
What statistical software do you use? (Someone may know of a package or add-on for software you already have.)
If you have access to Stata, Daniel Klein's kappaetc package may be of interest. Here is the Stata Journal article introducing it:
HTH.
  • asked a question related to Statistical Software
Question
1 answer
How can I have water-LiBr mixture properties in MATLAB library?
Relevant answer
Answer
One possibility would be to link Matlab with Refprop. More information can be found in the following sources:
GitHub - jowr/librefprop.so: Create a shared library from the Fortran sources provided by Refprop from NIST. This project provides an alternative to the refprop.dll that comes with the software. Please use the official instructions if possible
  • asked a question related to Statistical Software
Question
20 answers
My project is conducted as an augmented design in the field. I am looking for SAS code for the ANOVA. I could not find complete SAS code for ANOVA and means comparison. Can someone help me out?
Relevant answer
Answer
You can use R software.
  • asked a question related to Statistical Software
Question
9 answers
Hello, can you explain to me how to calculate the crude incidence rate of recurrence per 100 person-years (py) and its 95% CI for different time periods? I use SPSS as my statistical software. I tried life tables but was not able to get the 95% CI. Can you give me a suggestion on this topic?
Thank you
Laura
Relevant answer
Answer
Hello Laura. See Method for calculating incidence rate on this page:
Notice especially the advice about using half-units of time in cases where people are lost to follow-up, or where the event of interest has occurred. In your case, where you check every 3 months, someone who has the condition of interest at 6 months would be scored as having contributed 3 + 1.5 = 4.5 months of person-time.
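If SPSS does not give the interval directly, the calculation is easy to do by hand or in a few lines of code. The sketch below applies the half-interval rule described above and then computes the rate per 100 person-years with an approximate 95% CI on the log scale (standard error of the log rate = 1/sqrt(events)). The function names and the counts are made up, and an exact Poisson interval would be preferable when the number of events is small.

```python
from math import exp, sqrt


def person_months(detected_at_month, visit_interval=3.0):
    """Person-time for someone whose event is found at a scheduled visit,
    counting only half of the final interval (the half-unit rule)."""
    return detected_at_month - visit_interval / 2.0


def incidence_rate_per_100py(events, person_years, z=1.96):
    """Crude incidence rate per 100 person-years with a log-scale Wald CI."""
    if events <= 0 or person_years <= 0:
        raise ValueError("events and person_years must be positive")
    rate = 100.0 * events / person_years
    factor = exp(z / sqrt(events))
    return rate, rate / factor, rate * factor


print(person_months(6))  # event found at the 6-month visit

# Made-up totals for one time period: 33 recurrences over 412.5 person-years.
rate, lower, upper = incidence_rate_per_100py(events=33, person_years=412.5)
print(f"{rate:.1f} per 100 py (95% CI {lower:.1f} to {upper:.1f})")
```

To compare time periods, the events and person-time would be summed separately within each period and the function applied once per period.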
  • asked a question related to Statistical Software
Question
3 answers
Without using statistical software.
Relevant answer
Answer
yes it is possible
  • asked a question related to Statistical Software
Question
12 answers
Hi everyone, I would like to run a Cox regression progressively including potential confounding factors in the models (Model 0: no confounding factors; Model 1: 1 confounding factor; Model 2: 2 confounding factors; ...)
Since I never did it on my own, I am wondering if you could suggest a practical statistics software for this purpose.
PS I usually use Stata or GraphPad Prism.
Thank you for your collaboration and time.
Relevant answer
Answer
I have the answer: the approximation methods are different!
"When there are failure time ties (note that censor ties are not a problem), the exact likelihood is very cumbersome. NCSS allows you to select either the approximation proposed by Breslow (1974) or the approximation given by Efron (1977). Breslow's approximation was used by the first Cox regression programs, but Efron's approximation provides results that are usually closer to the results given by the exact algorithm and it is now the preferred approximation (see for example Hosmer and Lemeshow (1999))."
  • asked a question related to Statistical Software
Question
34 answers
I have tried EVIEWS. But I came across many research papers relevant to my research that used STATA. Many upcoming workshops ask for STATA to be installed. How can I get STATA?
Relevant answer
  • asked a question related to Statistical Software
Question
47 answers
Increasingly R is being used in statistical classes. What is the experience elsewhere? I have been a little disappointed with first year students' spreadsheet skills and even their interest in statistics and quantitative skills, despite it giving them an obvious advantage in their future career.
Relevant answer
Answer
SPSS and AMOS
  • asked a question related to Statistical Software
Question
1 answer
I recently re-installed Windows after ransomware severely damaged my files. After re-installation, I also installed the necessary software and drivers. However, my statistical software (ADEL-R, META-R, GEA-R) is not working well. The programs shut down automatically while performing the data analysis and reopen immediately, which interrupts my analysis. The system leaves a crash message in the folder where the software is installed (program/GEA-R on the C drive). The message file is uploaded below.
If any of you have encountered this problem before, please suggest possible solutions. I have tried several approaches but could not resolve it.
Thank you in advance.
Relevant answer
Answer
Reinstall the packages and see if that helps
  • asked a question related to Statistical Software
Question
3 answers
Hi,
I would like to test for a unit root in a panel data series. For this, I want to use the Hadri and Rao (2008) test with a structural break. Is there any way I can perform the test in STATA or any similar statistical software?
thanks,
Sagnik
Relevant answer
Answer
In Stata, there is the xtbunitroot command for panel unit-root tests with structural breaks.
  • asked a question related to Statistical Software
Question
2 answers
For example, I want to get a bias-corrected confidence interval for a product of two coefficients from different regression equations:
The first: PM=a0+a1SL+a2SR
The second:OC=b0+b1SR+b2PM+b3SL+b4SR*SL+b5SR*PM
Then what is the bias-corrected confidence interval for a1*b5 using STATA?
Relevant answer
Answer
use SEM command
  • asked a question related to Statistical Software
Question
4 answers
Hi
As you know, to use nonlinear regression in statistical software such as SPSS or Minitab, or in your own code, you need to specify a starting point (start value or initial guess) for the regression parameters. In practice, you need to choose good start values for the parameters to obtain the best nonlinear equation.
How can we determine the optimum start value?
Is there another way (or other software) to run a nonlinear regression regardless of the parameters' start values?
Relevant answer
Answer
NO. But a numerical analyst would make plots and look for approximate values of the regression coefficients. As my old numerical analysis professor used to say, graphs tell you lots and lots of cool stuff.
David Booth
PS if you had an optimum start you wouldn't need anything else would you?
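A complementary trick to plotting is to linearize the model and use the straight-line fit as the initial guess. The sketch below assumes, purely as an example, an exponential model y = a·exp(b·x) with positive y values: regressing log(y) on x gives rough values of a and b that can then be passed to the nonlinear routine as start values. The function name and the data are made up.

```python
from math import exp, log


def exponential_start_values(x, y):
    """Start values (a, b) for y = a * exp(b * x) from a line fitted to log(y)."""
    log_y = [log(value) for value in y]  # requires every y > 0
    n = len(x)
    mean_x = sum(x) / n
    mean_log_y = sum(log_y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (li - mean_log_y) for xi, li in zip(x, log_y))
    b = sxy / sxx                       # slope of the log-linear fit
    a = exp(mean_log_y - b * mean_x)    # back-transformed intercept
    return a, b


# Made-up measurements that look roughly exponential.
x = [0, 1, 2, 3, 4]
y = [2.1, 3.2, 5.6, 8.9, 14.8]
a0, b0 = exponential_start_values(x, y)
print(f"start values: a = {a0:.2f}, b = {b0:.2f}")
```

The log-linear fit weights the errors differently from the nonlinear least-squares fit, so these values are only a starting point, not the final estimates.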
  • asked a question related to Statistical Software
Question
4 answers
I have a model proposed based on theories (see the figure attached). I am wondering if it is statistically possible to test the model? Specifically, is there any problem if my moderator is affected by the independent variable? If it is possible to test it, what statistics software should I use? Thanks for any suggestions.
Relevant answer
Answer
I think Mr Mukaram is right...
  • asked a question related to Statistical Software
Question
8 answers
I am currently working on a project that addresses educational communication and technology (ECT) barriers in online education and distance learning in the Philippines. However, according to my internet research, SPSS can only handle 1500 cases or respondents. My project requires over 3000 student respondents with more than 150 variables, which includes sub-variables. Is there any statistical software that can meet my requirements?
Relevant answer
Answer
Virtually all statistical software packages can handle voluminous contents.
The question should have been about the quickness and the presentability of each software package.
  • asked a question related to Statistical Software
Question
6 answers
Prism is a bit pricey. Are there any affordable options that are comparable in terms of graphic output and statistical calculation? Thanks.
Relevant answer
Answer
EPI-INFO
  • asked a question related to Statistical Software
Question
16 answers
Any field 
Relevant answer
Arcmap.
  • asked a question related to Statistical Software
Question
3 answers
I have been studying papers that used Interrupted Time Series (ITS) analysis for some time, but unfortunately these papers did not mention which software they used! Even when they mentioned software such as R, Python, or Matlab, they did not mention, for example, which R package they used or what the procedure was. This is odd, because in ML and metaheuristic studies we usually describe the whole algorithm and methodology we applied, so that other researchers can replicate our work easily. With ITS this is not the case, and it is hard to enter the field!
Appreciate the help of ITS experts.
  • asked a question related to Statistical Software
Question
8 answers
What is the difference between the following statistical software: SPSS; Amos; SMART PLS ?
Relevant answer
Answer
thxs
  • asked a question related to Statistical Software
Question
7 answers
I have run a split-plot design experiment evaluating the efficacy of a treatment (with 3 whole plots and 4 sub-plots), and I am planning to analyse the collected data in Stata.
However, I do not find a pre-defined command to perform such analyses in Stata. I found that some statistical software such as GenStat Discovery have already pre-defined designs to analyse results of such experiment. Unfortunately, I do not have a licence for GenStat. I am planning to do my analyses with Stata.
Does anyone know how to analyse such data in Stata ?
  • asked a question related to Statistical Software
Question
5 answers
We have collected drivers' eye movement data (four age groups: '<30', 7 drivers; '30-40', 14 drivers; '40-50', 10 drivers; '>50', 5 drivers) after 2, 3 and 4 h of continuous driving. I have attached a visual variable.
Relevant answer
Answer
Good question
  • asked a question related to Statistical Software
Question
10 answers
What is the name of the statistical software you use most frequently for your publications? What are your priorities when determining the software you are going to use to perform a statistical analysis?
Relevant answer
Answer
You might ask commentators which discipline they are in because this influences choice (e.g., python v R for CS v stats, or STATA v lots for econ/policy folks), and also how much computing and statistical expertise they have. Also, by publications do you mean websites or paper?
  • asked a question related to Statistical Software
Question
14 answers
I want to know about statistical software that is user-friendly and widely used for cluster and path analysis of a data set.
Relevant answer
Answer
Dear Sir,
You can use SPAR 2.0, but it requires Windows 7...
The best statistical software for path and cluster analysis is Windostat 9.1.
Windostat will give you a clear picture of inter- and intra-cluster distances and of direct and indirect path effects, along with analysis of the data at up to the 0.001 probability level.
  • asked a question related to Statistical Software
Question
36 answers
I've been using SPSS for years. I couldn't complain less about it. Lately, I've started reading about R. Theoretically, I got impressed by some of its features. I wonder if anyone tried both of them in any social science discipline and found one of them outsmarts the other in terms of practicality.
Thanks for sharing insights!
Cheers,
Relevant answer
Answer
Your question implies one of these two is the BEST. Why do you make this assumption? Also, as previous people have noted, "it depends" which of these is likely to be better on both your circumstances and what you are specifically working on. I use both, but one about 95% of the time and one about 1% of the time (and 4% on other ones), but I don't know what you are trying to do.
  • asked a question related to Statistical Software
Question
2 answers
I recently just found out about this statistical software and am interested to learn more about how this can be applied in marine fish ecology. Any leads on relevant literature would be most helpful.
Relevant answer
Hi, Jean:
I have been using EstimateS for years, but only in order to estimate species richness and diversity (I mean, alpha diversity). As you probably know, EstimateS now computes 'true measures' of species diversity (i.e., those expressed as the "equivalent number of species", like those promoted by Lou Jost and many others, including myself:
After your question, I really feel curious about what you have found about the use of this excellent piece of software for computing beta diversity. So, if you could share your results with us, that would be great!
Best regards:
Jose
  • asked a question related to Statistical Software
Question
18 answers
The use of computers and research-related software has made research analysis quite easy and saves time and effort. I use MS Excel, AMOS and SPSS. I am not very familiar with EViews or ATLAS.ti. All of you may be using one statistical package or another. May I request you to share your practice?
Sincerely
Bodh
Relevant answer
Answer
I still use SPSS, but most of my work and my students' Ph.D. research now use Atlas.ti. I don't want to have any marketing influence, but the software provided by Atlas.ti was able to work with diverse languages, texts, and images, which is key for our investigations.
  • asked a question related to Statistical Software
Question
18 answers
What are the advantages and disadvantages regarding data analysis, resolution of figures and graphs from STATA and SPSS.
Relevant answer
Answer
spss
  • asked a question related to Statistical Software
Question
8 answers
For example, how would values of R = 0.39 and R = 0.44 be interpreted?
Relevant answer
Answer
ANOSIM is a test of the significance of dissimilarities between/among communities, and when you run the test it gives you the R value. Mathematically, the R value compares the between-group dissimilarities with the within-group dissimilarities: it is the difference between the mean rank of the between-group dissimilarities and the mean rank of the within-group dissimilarities, scaled to lie between -1 and 1. If the between-group variation is high relative to the within-group variation, the R value is also high. So in this case, R = 0.44 indicates that the between-group variation is higher relative to the within-group variation than in a situation where R is lower (0.39). Nonetheless, we cannot conclude that the dissimilarities are significant based on the higher R value alone. To reach such a conclusion we use the p-value, which is part of the ANOSIM output: the smaller the p-value, the stronger the evidence that the compositions of the communities differ.
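To see where the R value comes from, it can help to compute it once by hand. The statistic is R = (mean rank of between-group dissimilarities − mean rank of within-group dissimilarities) / (M/2), where M = n(n−1)/2 is the number of sample pairs. The sketch below uses a made-up dissimilarity matrix and hypothetical function names; it leaves out the permutation step that produces the p-value.

```python
from itertools import combinations
from statistics import mean


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def anosim_r(dissimilarity, groups):
    """ANOSIM R from a symmetric dissimilarity matrix and one group label per sample."""
    pairs = list(combinations(range(len(groups)), 2))
    ranks = average_ranks([dissimilarity[i][j] for i, j in pairs])
    within = [r for r, (i, j) in zip(ranks, pairs) if groups[i] == groups[j]]
    between = [r for r, (i, j) in zip(ranks, pairs) if groups[i] != groups[j]]
    return (mean(between) - mean(within)) / (len(pairs) / 2.0)


# Made-up example: two sites per community, communities clearly separated.
groups = ["A", "A", "B", "B"]
dissimilarity = [
    [0, 1, 3, 4],
    [1, 0, 5, 6],
    [3, 5, 0, 2],
    [4, 6, 2, 0],
]
print(anosim_r(dissimilarity, groups))
```

In this toy example every between-group dissimilarity is larger than every within-group one, so R reaches its maximum; values such as 0.39 or 0.44 indicate partial overlap between the groups.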
  • asked a question related to Statistical Software
Question
2 answers
Hi!
I would like to know which statistical software can calculate the Bang blinding index and the James blinding index, and how to do it.
Thanks
Relevant answer
Answer
Use R
  • asked a question related to Statistical Software
Question
6 answers
I mean, how do I include the outcomes of participants who dropped out (missing values) in my analysis? Although I already know what ITTA (intention-to-treat analysis) means, I still don't know how to do it. Which methods should I use to fill in missing values? Which statistical software can I use to perform ITTA, and how?
If you can give me any explanations or advice, thanks a lot!
Relevant answer
Answer
Multiple imputation and model-based approaches based on all observed data, such as mixed models and weighted generalized estimating equations (GEEs) for repeatedly measured outcomes, can be valid and unbiased methods for data that are missing at random, as long as the models are specified correctly. We should consider using one of the following approaches (in R or SAS), which are valid for missing-at-random data and thought to be more robust than sensitivity analysis: multiple imputation, mixed models, inverse probability weighted GEEs, and Bayesian analysis. I hope this clarifies your query. Good luck!
  • asked a question related to Statistical Software
Question
27 answers
Hello, I'm struggling to find out which non-parametric test I need to use to compare the VO2 Max scores between physically active women and physically inactive men.
Aka, gender and physical activity are two independent(?) variables. I need a non-parametric test because I ran the data through normality tests and it says it's not normally distributed. I've looked at a Mann-Whitney U test but I don't think it's appropriate as it only lets me select one grouping variable?
Sorry I'm still really new at all this, any help would be appreciated.
Relevant answer
Answer
Simeon Stoynov, if a new variable combining gender and activity level is created, then we will have four groups, namely active men, inactive men, active women and inactive women. The four groups can then be compared using the Kruskal-Wallis test.
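As a rough illustration of that suggestion, the Python sketch below builds the combined gender-by-activity grouping and computes the Kruskal-Wallis H statistic with the standard tie correction. The VO2 max values are invented for the example, and the p-value uses the chi-square approximation with 3 degrees of freedom (four groups), which is only approximate for small samples.

```python
import math


def average_ranks(values):
    """Rank values from 1..len(values), giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def kruskal_wallis_h(samples):
    """Kruskal-Wallis H statistic, corrected for ties."""
    pooled = [v for sample in samples for v in sample]
    n = len(pooled)
    ranks = average_ranks(pooled)
    total, start = 0.0, 0
    for sample in samples:
        rank_sum = sum(ranks[start:start + len(sample)])
        total += rank_sum ** 2 / len(sample)
        start += len(sample)
    h = 12.0 / (n * (n + 1)) * total - 3 * (n + 1)
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in counts.values()) / (n ** 3 - n)
    return h / correction


def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variable with 3 degrees of freedom."""
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)


# Hypothetical records: (gender, activity, VO2 max).
records = [
    ("M", "active", 52), ("M", "active", 55), ("M", "active", 58), ("M", "active", 60),
    ("M", "inactive", 38), ("M", "inactive", 40), ("M", "inactive", 42), ("M", "inactive", 44),
    ("F", "active", 45), ("F", "active", 47), ("F", "active", 49), ("F", "active", 51),
    ("F", "inactive", 30), ("F", "inactive", 32), ("F", "inactive", 34), ("F", "inactive", 36),
]

# The new grouping variable is simply the (gender, activity) pair.
by_group = {}
for gender, activity, vo2 in records:
    by_group.setdefault((gender, activity), []).append(vo2)

h_stat = kruskal_wallis_h(list(by_group.values()))
p_value = chi2_sf_df3(h_stat)
print(sorted(by_group), h_stat, p_value)
```

A significant overall result only says that at least one group differs; pairwise follow-up comparisons (with a multiplicity correction) would still be needed to compare, for example, active women with inactive men.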
  • asked a question related to Statistical Software
Question
7 answers
Hello, I am looking for an Origin-like software package to use on my MacBook, preferably one that can be downloaded...
Relevant answer
Answer
Igor Pro is similar to Origin but a little less intuitive (https://www.wavemetrics.com/)
  • asked a question related to Statistical Software
Question
22 answers
There are many statistical software tools available on the market for quantitative and qualitative analysis. Some are very expensive, while others provide a free student version. Which statistical software are you using to analyze and interpret qualitative research?
Relevant answer
Answer
Somehow the qualitative topic for this question has gotten lost, since programs like SPSS, AMOS, and Minitab only apply to quantitative analysis.
But I'm not sure what the value is of people simply giving one-line statements about which is their favorite program. All of these programs do essentially the same thing, so the best approach is to look at the extensive online tutorials for each and decide which one matches your own preferences.
  • asked a question related to Statistical Software
Question
3 answers
Hi,
I want to maximize the following function and also find the optimal values of X1 to X6 such that Y is maximized. The function contains six linear terms and eleven interactions. It would be helpful if I could get some idea of algorithms, available in statistical software, that would achieve my goal.
Y = -083 - 0.12X1 - .37X3 + 5.3X2 + .0029log(X4)+1.85X5 + 6:2X6 + .186X1X2 + .22X1X4+ .035X1X5+.39X1X6 + .0073X2X3 + .023X3X4 + .006X3X5+.036X3X6 - .22X2X4 - .2X2X5 - .1X5X4 - .41X4X6.
Relevant answer
Answer
In Matlab/Scilab there is a toolbox that deals with solving optimization problems involving several decision variables. There may also be free online software where you can enter your objective function and constraints and obtain the optimal set of solutions.
You can also look into the "TORA" software, which is extensively used for solving operations research problems in general.
  • asked a question related to Statistical Software
Question
9 answers
Can you recommend the best statistical package that is easy to use?
Relevant answer
Answer
If you have an intermediate level of coding, you can use R or MATLAB, but if you prefer a more user-friendly option you can try SPSS. Microsoft Excel is a good friend, too.
  • asked a question related to Statistical Software
Question
3 answers
Greetings!
Currently, I am conducting a meta-analysis on the association between BMI and prognosis. I discovered that the studies used various cut-offs. However, only two studies are eligible for dose-response meta-analysis (DRMA) as only these studies reported >1 cut-off values. Regarding this study:
1. Is it possible to perform a dose-response meta-analysis when only 2 studies are included?
2. Is there any statistical software that is able to perform dose-response meta-analysis other than STATA and R?
3. Regarding the preliminary analysis, we would like to perform a two-class analysis by comparing studies reporting 1 cut-off values (i.e. >=30 vs <30, >=25 vs <25). Can you pool all the cut-offs in the analysis? Or do you have to group the analyses based on the cut-offs?
Any help will be much appreciated. Thank you very much
Relevant answer
Answer
1. This would be similar to trying to establish linearity with only two data points. You are not able to conclude anything about a dose-response relationship, with two studies with different doses.
However, if you mean that the two studies have each investigated a dose-response relationship, and you want to pool these data, then you are one step closer to saying something about dose-response. Still, two studies is very little, and I would not do a meta-analysis.
2. My suggestion would be SAS, apart from STATA and R.
3. You could do both an overall analysis of high vs low BMI and a stratified analysis by the two different cut-offs, and present it in a forest plot.
Of course it should be discussed as a limitation, if you conclude on the analysis pooling results from different cut-off values.
I hope that this was helpful.
  • asked a question related to Statistical Software
Question
3 answers
Hello,
I have microRNA data sets from controls and patients, and I am interested in analyzing microRNA as a biomarker for a particular disease. I already have delta-delta CT values for miRNA expression. To do this I want to build a ROC curve. If possible, would you please explain step by step how to calculate the ROC curve and AUC, and which statistical software you used for the analysis? This will be very helpful for my future research.
Thanking you in advance.
Relevant answer
Answer
Dear Dr. Mayur Doke,
In my experience with microarray and RNA-Seq data, I was able to build a ROC curve based on the strength prediction of my Bayesian network. For example, to calculate a ROC curve from a Bayesian network I would use the R package bnlearn. Here are my steps:
1st) Calculate a Bayesian Network in bnlearn using the hill climbing (hc) greedy search code and a data matrix of your microRNA data that is discretized while the scoring you would like to use is bde.
Example code: j = hc(PMID3, R = 200, m =30, score = "bde")
2nd) Model your network titled "j" into a true directed acyclic graph.
Example code: true.dag = model2network (j)
3rd) Calculate the strength of your network from step 2 using the boot.strength function and the hill-climbing algorithm (this allows you to predict the true positive and false positive rates used to graph your ROC curve). R stands for the number of repetitions (bootstrap replicates) over which the algorithm is run to determine the positive and negative result rates.
Example code: strength1 = boot.strength(PMID3, R = 200, m = 30, algorithm = "hc"), where the first argument is where you put in your data matrix (PMID3 in step 1).
4th) Perform prediction in bnlearn. In this step you calculate the predictive rates compiled from the strength code in step 3 and also from the BN created in step 1.
Example Code: pred = as.prediction(strength1, j)
5th) In this step you calculate the performance of your predictive model in step 4 also calculating the scores for true positive rate "tpr" and false positive rate "fpr" used in the next step.
Example Code: perf = performance(pred, "tpr", "fpr")
6th) Plot the ROC curve and calculate the area under the curve. After entering this code your ROC curve should appear on the plot panel of R.
Example Code: plot(perf, main = "Arc Detection")
7th) Calculate the area under the curve score for model validation. In this final step you will generate the AUC score that explains the strength of your model and how best your results are predicted accurately.
Example Code: performance(pred, "auc")
Other R packages that can be useful for microRNA data would be pROC and ROCR
I've attached a screenshot of the code I generated for a ROC curve using bnlearn and the ROC curve plot that was also generated. If you need any further help please do not hesitate to ask me.
My Best,
Christian
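For readers who want to see what a ROC curve and AUC are numerically, independent of bnlearn, here is a minimal Python sketch that computes them directly from one marker score per subject. The scores and labels are invented, and the sketch assumes that higher scores indicate disease; for delta-delta CT values the direction may need to be reversed. It is not the code from the screenshot mentioned above.

```python
def roc_points(scores, labels):
    """Return (false positive rate, true positive rate) pairs, one per threshold.

    labels: 1 = patient, 0 = control. A subject is called positive
    when its score is greater than or equal to the threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc_trapezoid(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical marker values: four patients followed by four controls.
scores = [3.1, 2.4, 2.9, 1.8, 1.2, 2.0, 0.9, 1.5]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

curve = roc_points(scores, labels)
auc = auc_trapezoid(curve)
print(curve)
print(auc)
```

The AUC equals the probability that a randomly chosen patient has a higher score than a randomly chosen control; here 15 of the 16 patient-control pairs are ordered correctly. Packages such as pROC and ROCR add confidence intervals and plotting on top of the same calculation.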
  • asked a question related to Statistical Software
Question
11 answers
I am not a statistics person. For one of my problems, I need to perform a linear regression analysis, which I am doing in Minitab 17. It gives R-squared and adjusted R-squared. I know that a higher R-squared indicates a better model, and in my case the adjusted R-squared is close to it. I have read that adjusted R-squared is the better measure, but I cannot understand why. Please help me understand why adjusted R-squared is better than R-squared.
Relevant answer
Answer
When you add an extra variable to your model, the R-squared value never decreases, even if the variable is just a spurious one. That is not the case for adjusted R-squared, whose formula applies a penalty for every added variable. Thus, if the new variable contributes little (its t-statistic is below 1 in absolute value), R-squared will still go up slightly but adjusted R-squared will go down. Hence adjusted R-squared helps you to judge whether the new variable adds anything to the model, and in that sense it is better than R-squared.
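The standard formula is adjusted R² = 1 - (1 - R²)(n - 1)/(n - p - 1), with n observations and p predictors. The short Python sketch below applies it to made-up R-squared values (they are assumptions chosen for illustration, not Minitab output) to show the penalty at work.

```python
def adjusted_r2(r2, n, p):
    """Adjusted R-squared for n observations and p predictors (intercept not counted)."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


n = 30
base = adjusted_r2(0.600, n, 3)    # model with 3 predictors
noise = adjusted_r2(0.602, n, 4)   # 4th predictor barely raises R-squared
useful = adjusted_r2(0.680, n, 4)  # 4th predictor raises R-squared a lot

print(round(base, 4), round(noise, 4), round(useful, 4))
```

In this example R-squared rises in both extended models, but adjusted R-squared falls when the extra predictor adds almost nothing and rises only when the gain in fit outweighs the lost degree of freedom.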
  • asked a question related to Statistical Software
Question
6 answers
I have been using Aabel (Gigawhiz) plotting and statistical software. However, it is Mac OS centric and cannot be installed on a PC. I like this software because it enables me to directly select a data point in a graph which then highlights the point in the linked spreadsheet. This is very useful when exploring data and anomalous values. However, it is only available for Mac and I am now mainly PC based. As such, can anyone recommend a PC plotting and statistical software package that provides similar functionality? Thanks in advance : -)
Relevant answer
Answer
You can find this option in the "Tecplot" software. If you use it, look for the "Probe At" option.
Good luck.
  • asked a question related to Statistical Software
Question
3 answers
If you use different tools for different steps (sampling selection, weighting, estimation, etc.), please specify the breakdown.
Relevant answer
Answer
Dear Mamadou S. Diallo ,
Hi,
You can use SPSS, MATLAB, Stata for any purpose.
Best,
Saeed
  • asked a question related to Statistical Software
Question
6 answers
Could someone please let me know what syntax should be used for calculating Hardy-Weinberg equilibrium in case-control studies?
Relevant answer
Answer
Dear Rubina,
I am working on analyzing polymorphisms of the CYP3A5 gene. In the population (50 patients), the percentage of the AA genotype (CYP3A5*1/*1) is 72%, of the Ab genotype (CYP3A5*1/*3) is 28%, and of the bb genotype (CYP3A5*3/*3) is 0%.
I am trying to calculate Hardy-Weinberg equilibrium in STATA, but I don't know how I have to enter the data. Do I have to generate a variable named, for example, "Genotype" and enter the results of the genotyping analysis as categories, i.e. "AA", "Ab" and "bb" (Patient 1: AA, Patient 2: Ab, and so on)? Which is the second variable against which I have to make the comparison in order to obtain a p-value? I am very confused.
Please, if anyone could help me, I would really appreciate it! Thanks!
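As a language-neutral illustration of what the test does (this is not STATA syntax), the Python sketch below runs the usual chi-square goodness-of-fit test for Hardy-Weinberg equilibrium on genotype counts. The counts 36, 14 and 0 are derived from the percentages quoted above for 50 patients; no second variable is needed, because the observed genotype counts are compared with the counts expected from the allele frequencies.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 degree of freedom) for Hardy-Weinberg equilibrium."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1 - p                        # frequency of allele b
    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    p_value = math.erfc(math.sqrt(chi2 / 2))  # chi-square upper tail, 1 df
    return expected, chi2, p_value


# 72% AA and 28% Ab of 50 patients -> 36 AA, 14 Ab, 0 bb.
expected, chi2, p_value = hwe_chi_square(36, 14, 0)
print([round(e, 2) for e in expected], round(chi2, 3), round(p_value, 3))
```

With an expected count below 1 for the bb genotype, the chi-square approximation is unreliable and an exact test would be preferable; in case-control studies the check is also usually run in the control group rather than in patients.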
  • asked a question related to Statistical Software
Question
4 answers
I am running a regression in Stata.
As the dependent variable, I have the market share of smartphones (quarterly) for Apple and Samsung, and independent variables are Functional improvements and Design innovation (scored also quarterly).
My supervisor suggested that I include time fixed effects in order to account for the Christmas boost, and I do not really understand how to do it.
And the second question is, am I capturing the interaction effect correctly?
So far I did...
xtset idcompany qdate
reg marketshare design function
and for interaction effect I did
gen designfunction=design*function
reg marketshare design function designfunction
and I got really good p-values and R^2, but my coefficient for design*function is -.09, and I am very curious how I should interpret it.
Does this all make sense? I am really new to Stata. I would really appreciate any help.
Relevant answer
Answer
Hello Alexa Drk. Generally speaking, Statalist (https://www.statalist.org/forums/forum/general-stata-discussion/general) is a better place to post questions about how to do X using Stata.
Second, I would think that if you are using -xtset-, you would want to use -xtreg- rather than -regress-.
Third, if you use the # or ## operators to include interactions (as in the file Aymen Ammari linked to), you'll be able to use -margins- and -marginsplot- to explore the nature of the interactions. See section 11.4.3 here:
HTH.
  • asked a question related to Statistical Software
Question
6 answers
Many of us are doing ongoing prospective research, yet COVID-19 has paused our work for a while or even months because of the city shut-down. There is an unexpected huge increase in lost to follow-up in our research clinics.
How should we deal with these cases, and the associated data?
The selection bias caused by the lost to follow-up cannot be adjusted by study design, as it is started already. What methods can we use to adjust instead?
Can anyone explain in simple terms how the inverse probability-of-censoring weighted estimation technique works for this issue?
How can it be run in practice, e.g. in SPSS? Or is more advanced statistical software needed?
How about stratification-based methods or weighted methods? How are they working actually?
Any practical guide available online?
Great thanks in advance with all your help!
Relevant answer
Answer
The missingness is likely random, as it was caused by COVID-19. However, surgery groups tend to have less loss to follow-up than medical treatment groups.
  • asked a question related to Statistical Software
Question
12 answers
I am preparing a research article and need to produce a Bray-Curtis Similarity Index (%) graph. I have been using PAST, a portable statistical package, for this, but it did not meet my needs, so please suggest some good, open-source and handy software (if any).
Thanks
Relevant answer
Answer
In some situations this web tool can be handy too: ClustVis, a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmaps - https://biit.cs.ut.ee/clustvis/
  • asked a question related to Statistical Software
Question
16 answers
How reliable is PAST (V 3.25) statistical software in diversity analysis (Ecology)?
Relevant answer
Answer
I’ve been using PAST for a few paleoecological studies/articles and it worked superbly.
  • asked a question related to Statistical Software
Question
3 answers
I am writing a systematic review paper to demonstrate the global increase in drug-resistant E. coli over the past 20 years. I would like to show that in a line graph for the six WHO regions. I would like also to create a map to show the current prevalence worldwide. Which software is easy to use and good ? Thank you
Relevant answer
Answer
Hi! Do try your hand at R. It is open-source software and you don't need to pay for it. Code and syntax examples are easily accessible on GitHub, and the UI/UX is pretty simple and efficient. You can download it here:
After downloading R, download RStudio, which acts as the interface. You can download it here:
  • asked a question related to Statistical Software
Question
15 answers
This question was put to me by one of my students. It concerns the selection of appropriate statistical packages for appropriate tests of data. How much better is MATLAB than MS Excel in the accuracy of complex calculations? What other factors should one consider in selecting software for data analysis?
Relevant answer
Answer
I think SPSS and MINITAB are the best software. They are the easiest to learn.
  • asked a question related to Statistical Software
Question
4 answers
I am looking for a software that can establish inflection points in a cumulative probability plot, where these inflection points can be used as class boundaries for geochemical anomalies.
  • asked a question related to Statistical Software
Question
3 answers
I am working on a binomial regression analysis for a retrospective study. The outcome has two options: positive or negative. The best regression model I have found contains a continuous variable and an ordinal variable (1, 2, 3, ..., 10). I have used Jamovi and SPSS for the analysis, and the model fits. The problem is that my SPSS subscription is about to expire, so in a few weeks Jamovi will be my only option for my research. I wonder whether there is any option or alternative for the Hosmer-Lemeshow test in Jamovi, JASP, or another free statistical package?
Relevant answer
Answer
Have a look at this discussion
It says that the test you seek is no longer recommended! It also points you to other tests and a model-based approach, all implemented in R.
  • asked a question related to Statistical Software
Question
5 answers
We used SPSS to conduct a mixed model linear analysis of our data. How do we report our findings in APA format? If you can direct us to a source that explains how to format our results, we would greatly appreciate it. Thank you. 
Relevant answer
Answer
Whether a standard error is missing depends on your software, and even then it only applies to the variance terms. The reason is that a variance cannot be negative, so its sampling distribution can often be expected to be skewed rather than asymptotically normal. So just explain this in your results table.
  • asked a question related to Statistical Software
Question
15 answers
What are the various statistical software systems available which will allow you to enter regression weight, w, in weighted least squares (WLS) regression, in addition to SAS?
.
Thank you to Guillermo Enrique Ramos for confirming, in another thread/question, that SAS can be used for quadratic linear regression, in addition to straight line linear and multiple regression, to implement weighted least squares in the format y = y* + (e0)(w^(-0.5)), where w is the regression weight as shown in section 2 of "Estimating the Coefficient of Heteroscedasticity," https://www.researchgate.net/publication/333642828_Estimating_the_Coefficient_of_Heteroscedasticity. That is the format for which Ken Brewer made a convincing argument when describing the range of heteroscedasticity one should expect, as discussed in "Essential Heteroscedasticity," https://www.researchgate.net/publication/320853387_Essential_Heteroscedasticity.  In section 3 of "Estimating the Coefficient of Heteroscedasticity," one can see that another way of handling heteroscedasticity is sometimes used, but it is not consistent with the Brewer explanation of the root cause of naturally occurring heteroscedasticity.  Perhaps that alternative technique, or something related to it, might be used in some other software systems. 
Here, the two attached images show the format for the regression I think best to use, and the regression weight, w, both presented in terms of the coefficient of heteroscedasticity, gamma.  y* is the WLS prediction.  In most applications, the best practical/obtainable function values for the size measure, z, are the OLS predicted y values.  That is what is suggested for input to the spreadsheet in https://www.researchgate.net/publication/333659087_Tool_for_estimating_coefficient_of_heteroscedasticityxlsx.
(Note that the method here, using the attached images, is in completely closed form in the case where we have one regressor and a zero intercept, so z = bx, and x can be used for a (relative) size measure.  Otherwise, we first obtain OLS predictions, and then pick a reasonable coefficient of heteroscedasticity, and go back to find the improved WLS predictions.  I think that the other method appears to be more ad hoc.) 
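A minimal Python sketch of that closed-form case (one regressor, zero intercept), assuming the weight w = x^(-2*gamma) with x used as the relative size measure: minimizing the weighted sum of squared residuals gives b = sum(w*x*y) / sum(w*x^2). The data values and the function name are illustrative assumptions only.

```python
def wls_slope_through_origin(x, y, gamma):
    """WLS slope for y = b*x + e, with regression weight w = x**(-2*gamma)."""
    w = [xi ** (-2 * gamma) for xi in x]
    numerator = sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
    denominator = sum(wi * xi * xi for wi, xi in zip(w, x))
    return numerator / denominator


# Made-up data in which the spread of y grows with x.
x = [1.0, 2.0, 4.0, 8.0]
y = [2.1, 3.9, 8.4, 15.2]

b_ols = wls_slope_through_origin(x, y, 0.0)    # gamma = 0: ordinary least squares
b_ratio = wls_slope_through_origin(x, y, 0.5)  # gamma = 0.5: reduces to sum(y)/sum(x)
b_mean = wls_slope_through_origin(x, y, 1.0)   # gamma = 1: mean of the ratios y/x

print(b_ols, b_ratio, b_mean)
```

The three special cases show how the coefficient of heteroscedasticity shifts influence away from the largest observations: gamma = 0 gives OLS, gamma = 0.5 gives the classical ratio estimator, and gamma = 1 gives the unweighted mean of the individual ratios.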
So what other software, in addition to SAS, will allow the input of w for weighted least squares regression?  Is it for linear and multiple linear regression?  Note that such software would then minimize the sum of weighted squared estimated residuals with respect to each regression coefficient, simultaneously, to obtain estimates of regression coefficients.  What about polynomial linear regression?  Multiple regression with interaction terms?  Will it handle those too? 
The other question I wrote regarding polynomial regression is still open, but here I'd like to know about other software for any regression fitting the format in the images attached here. 
.
.
The question then is What other software, besides SAS, will accept regression weight input, w, for WLS regression? 
Thank you. 
Relevant answer
Answer
Data Desk allows you to define a "variance" variable (the reciprocal of the weight) and introduce it into the analysis. That variable can be defined with a formula involving any other variables available. However, it will choke if the definition is circular. Of course, you can evaluate the variance expression into numbers and then use it.
  • asked a question related to Statistical Software
Question
3 answers
Dear Scientists,
Greetings
Please, could anyone give me an alternative to analyse data generated from an augmented Block design layout?
The following well-known software packages are not working! Does anyone know the reasons? I urgently need your help!
Here are the software packages/links:
Indian Agricultural Research Institute, New Delhi
•Statistical Package for Augmented Designs (SPAD)
•SAS macro called augment.sas
CIMMYT – SAS macro called UNREPLICATE
•Developed in 2000 – uses some older SAS syntax
Thanks in advance for your help
Regards
Relevant answer
Answer
None of your links worked, so maybe explain what you are trying to achieve. Have you thought of using R, which is freely available and a well-supported open-source program?
There are augmented block designs in the R package agricolae.
These are designs for two types of treatments: the control treatments (common) and the increased treatments. The common treatments are applied in complete randomized blocks, and the increased treatments, at random. Each treatment should be applied in any block once only. It is understood that the common treatments are of a greater interest; the standard error of the difference is much smaller than when between two increased ones in different blocks.
  • asked a question related to Statistical Software
Question
21 answers
Which statistical software will allow entry of a regression weight in polynomial regression?  
If we factor heteroscedastic estimated residuals into random and nonrandom factors, we can use a nonrandom factor that is the predicted y, say y*, raised to gamma, the coefficient of heteroscedasticity.  This gives us a regression weight of the predicted y raised to twice the negative of gamma.  These two expressions are attached. 
The following provides information on estimating gamma, the coefficient of heteroscedasticity: 
See Brewer, K.R.W.(2002), Combined survey sampling inference: Weighing Basu's elephants, Arnold: London and Oxford University Press, especially pages 111, and 87, 130, 137, 142, and 203.
Here is a spreadsheet to use in selecting a coefficient of heteroscedasticity:
This is found under the following project:
So, for example, if the coefficient of heteroscedasticity is 0.5, then using a preliminary estimate of y*, say y_hat, the two choices of G.S. Maddala, then the estimated regression weight is 1/y_hat.
SAS PROC REG allows the regression weight here to be entered as "w" (no quotes) for most linear regressions, but I think you might have to go to SAS PROC GLM for quadratic regression. 
I do not know which software packages, or better yet which procedures within them, will perform polynomial regression, or whether they will handle entry of a regression weight w.
 
I would appreciate hearing what you know about which statistical software include weighted least squares polynomial regression.  I would like to be able to tell people where they can find it and use the above. 
Thank you. 
Relevant answer
Answer
Dear James
I computed with GLM the "Standard error of the individual predicted value", without weighting (gamma=0), and weighting with gamma=.7.
I attach the graphs, which, as you said, are very different. But which is better, and why?