Science topic

Statistical Software - Science topic

R, SAS, SPSS, STATA, Statistica...
Questions related to Statistical Software
  • asked a question related to Statistical Software
Question
21 answers
What statistical software do you prefer for your research works? 💻 (Excel, SPSS, Matlab,...)
Thanks for your answer 😉
Relevant answer
Answer
Depende!!
que es lo que se quiere realizar ya que hay ventajas y desventajas en todos.
Por ejemplo Excel Funciona bien para revisar y armar bases de datos (de menos de 1.000.000 de datos en el teórico, en el real se vuelve difícil, sobre 100.000 ), para hacer análisis simples y algunos mas avanzados con el modulo de análisis de datos. Para análisis mas complejos, es mejor agregar el plug de Xlsat y con eso se potencia como loco, compitiendo con el SPSS en sus versiones básicas.
Ahora no funciona para bases muy grandes, o se pone muy lento para hacer cosas simples.
El SPSS funciona para hacer casi todo y es simple, fácil de ocupar y los análisis simples y de mediana complejidad están ya programados, donde se pone mas complejo es el análisis mas complejos o con condiciones especiales (autocorrelaciones espaciales o temporales), Personalmente, ya que es opinable, el modulo de Series de tiempo, el de análisis de contingencia, y el de redes neuronales, no me gusta.
El minitab es genial para hacer análisis multivariados, y algunos análisis de calidad, quedan super bien (bacanes como diríamos en Chile)
La recomendación!!!! es depende de que quieras hacer!!!
Ahora si son cosas más complejas, siempre esta R y sus versiones amigables como el R commander o el R Studio, que son extraordinariamente variadas para hacer lo que uno casi quiera. y el siempre bien ponderado Grtel que si bien es de econometría, tiene buenas aplicaciones de análisis de regresión, series de tiempo etc. ídem el JMP, que tiene un motor de grafico de aquellos inolvidables y hermosos. Hay varios mas como el PSPP, que es una version libre y basica del SPSS en formato libre..
  • asked a question related to Statistical Software
Question
19 answers
SPSS or STATA? Python or R? Jamovi or JASP?
Relevant answer
Answer
R is a language, so you will find a lot of your early weeks (months!) spent learning how the language works and trying to remember your vocabulary. If you are also trying to learn data analysis at the same time, this results in constant interference between the task of language learning and the task of learning data analysis.
For these reasons I would recommend jamovi. The interface is transparently simple, and it encourages good data analysis habits. You can perform quite complex (and some very advanced) analyses in jamovi, and the library of modules is growing all the time.
For data manipulation and meta-data, Stata is remarkably powerful. Labelling variables, values, and datasets, merging, cleaning, consistency-checking are unrivalled. I know people who pre-process their data in Stata before moving it to R because of these strengths.
Both jamovi and Stata have excellent videos, and the Stata manuals are comprehensive, with every command illustrated with worked examples. You can learn a lot of stats from them!
One big plus to jamovi, of course, is that it's free!
Given that you are doing a masters, I would not recommend R. By the time you get up to speed, it may be time to go!
  • asked a question related to Statistical Software
Question
3 answers
Am Doing My Research Work in "Growth Potential Of A Forest Cover"
Relevant answer
Answer
Dear Sangram Sahoo . The best famous one is SPSS.
  • asked a question related to Statistical Software
Question
1 answer
I would like to use the Broad-Sense heritability equation (in the attached document) proposed by Cullis et al. (2006) for an unbalanced data. Is it possible to compute the vBLUP in Genstat beside R package? vBLUP in the equation is the mean variance of a difference of two BLUPs of genotypic effects/the average standard error of differences between BLUPs squared and σg2 is the genotypic variance.
Cullis, B., Smith, A., and Coombes, N. (2006). On the design of the early generation
variety trials with correlated data. J. Agric. Biol. Environ. Stat. 11, 381–393.
doi: 10.1198/108571106X154443
Relevant answer
Answer
To calculate the heritability the genotype term should be fitted as a random term. The VKEEP directive can be used to get the variance component for this term to use in the calculation. The Method section of the help for the VHERITABILITY procedure provides details as to how the heritability is calculated in Genstat and the Cullis method for calculating heritability is available using the VHERITABILITY procedure.The VPREDICT directive can be used to calculate the vBLUPs. Alternatively, this can be done using the prediction menus within the Linear Mixed Model menu.
  • asked a question related to Statistical Software
Question
3 answers
I am examining lifestyle change in migrant Nepalese and I am using, among other things, G-PAQ. WHO recommend using EpiInfo (from CDC) to analyse G-PAQ data, but their downloaded programs, which work through Microsoft Access, are not stand alone, and seem to need eSTEPS to work. However, the website for downloading eSTEPS is old (2007), and some are obsolete and cannot be downloaded as a result. Can anyone tell me how to get the program for anlysing the G-Paq data to work? Download order? Anything? I am becoming increasingly desperate...
  • asked a question related to Statistical Software
Question
3 answers
More statistical software and packages existing nowadays, which one you thought will be much important from your view. And what about R? Do you recommend it?
Relevant answer
Answer
Yes, it is free and R-Studio is very convenient to use.
  • asked a question related to Statistical Software
Question
3 answers
Most of the statistical software provides weighted kappa for ordinal outcomes but only with two rater. For multi-rater the fleiss kappa is provided where we can not apply weighting due to ordinal outcome. My question is which statistical software should I use for weighted kappa for multi-rater ordinal outcome?
Relevant answer
Answer
What statistical software do you use? (Someone may know of a package or add-on for software you already have.)
If you have access to Stata, Daniel Klein's kappaetc package may be of interest. Here is the Stata Journal article introducing it:
HTH.
  • asked a question related to Statistical Software
Question
11 answers
there are many statistics software that researchers usually use in their works. In your opinion, which one is better? which one do you offer to start?
Your opinions and experience can help others in particular younger researchers in selection.
Sincerely
Relevant answer
Answer
Shahin Behdarvand , in my case I simply love MatLab. I have been using R, Python, Eviews, Stata and MatLab for the last 15 years, and I could say that when one wants to estimate somebody else models, R, Python, Eviews, Stata come handy; but in my case when I want to build and estimate my own models (this is, models that I create myself), MatLab is my tool of choice. MatLab is losing popularity in econometrics, and because it is not open as R or Python, most data science laboratories and statistical units that used to apply MatLab are migrating to open tools. Stata keeps being popular despite the fact that you have to pay for use it; Stata has improved a lot true to be said.
  • asked a question related to Statistical Software
Question
1 answer
How can I have water-LiBr mixture properties in MATLAB library?
Relevant answer
Answer
One possibility would be to link Matlab with Refprop. More information can be found in the following sources:
GitHub - jowr/librefprop.so: Create a shared library from the Fortran sources provided by Refprop from NIST. This project provides an alternative to the refprop.dll that comes with the software. Please use the official instructions if possible
  • asked a question related to Statistical Software
Question
20 answers
My project is conducted as Augmented Design at filed. For doing ANOVA I am looking for SAS software code. I could not find a complete SAS code for ANOVA and means comparison. Can someone help me out?
Relevant answer
Answer
You can use R software.
  • asked a question related to Statistical Software
Question
9 answers
Hallo, can you explain me how to calculate crude incidence rate of a recurrence per 100 py and 95%CI according to different time period? I use SPSS as a statistical software. I tried with life tables but not able to get the 95%CI. Can you give me a suggestion on this topic?
Thank you
Laura
Relevant answer
Answer
Hello Laura. See Method for calculating incidence rate on this page:
Notice especially the advice about using half-units of time in cases where people are lost to follow-up, or where the event of interest has occurred. In your case, where you check every 3 months, someone who has the condition of interest at 6 months would be scored as having contributed 3 + 1.5 = 4.5 months of person-time.
  • asked a question related to Statistical Software
Question
3 answers
Without using statistical software.
Relevant answer
Answer
yes it is possible
  • asked a question related to Statistical Software
Question
12 answers
Hi everyone, I would like to run a Cox regression progressively including potential confounding factors in the models (Model 0: no confounding factors; Model 1: 1 confounding factor; Model 2: 2 confounding factors; ...)
Since I never did it on my own, I am wondering if you could suggest a practical statistics software for this purpose.
PS I usually use Stata or GraphPad Prism.
Thank you for your collaboration and time.
Relevant answer
Answer
I have the answer : the approximation methods are different!
"When there are failure time ties (note that censor ties are not a problem), the exact likelihood is very cumbersome.
NCSS allows you to select either the approximation proposed by Breslow (1974) or the approximation given by
Efron (1977). Breslow’s approximation was used by the first Cox regression programs, but Efron’s approximation
provides results that are usually closer to the results given by the exact algorithm and it is now the preferred approximation (see for example Homer and Lemeshow (1999)."
  • asked a question related to Statistical Software
Question
34 answers
I have tried EVIEWS. But I came across of many Research Papers relevant to my Research who have used STATA. Many workshops to be held are asking for installed STATA Software. How can I get STATA?
Relevant answer
  • asked a question related to Statistical Software
Question
47 answers
Increasingly R is being used in statistical classes. What is the experience elsewhere? I have been a little disappointed with first year students' spreadsheet skills and even their  interest in statistic and quantitative skills, despite it giving them an obvious advantage in their future career.
Relevant answer
Answer
SPSS and AMOS
  • asked a question related to Statistical Software
Question
1 answer
I have recently re-installed my window after severe damage to my files by ransomware. After re-installation, necessary software and drivers were also installed. But, my statistical software ADEL-R, META-R, GEA-R are not working well. They shut down automatically while performing the data analysis and reopen immediately. My analysis is interrupted. The system leaves a crash message in the location where software is installed (program/GEA-R in c drive). The message file is uploaded below.
If anyone of you happened to strike with this problem before, please suggest possible solutions. I have tried several ways but could not resolve my problem.
Thank you in advance.......
Relevant answer
Answer
Reinstall the packages and see if that helps
  • asked a question related to Statistical Software
Question
3 answers
Hi,
I am looking forward to test unit root for a panel data series. In this regard, I would want to use the Hadri and Rao (2008) test with structural break. Is there any way, I can perform the test in STATA or any other like statistical software.
thanks,
Sagnik
Relevant answer
Answer
In stata software there is xtbunitroot command for break point unitroot test.
  • asked a question related to Statistical Software
Question
2 answers
For example, I want to get a bias-corrected confidence interval for a product of two coefficients from different regression equations:
The first: PM=a0+a1SL+a2SR
The second:OC=b0+b1SR+b2PM+b3SL+b4SR*SL+b5SR*PM
Then what is the bias-corrected confidence interval for a1*b5 using STATA?
Relevant answer
Answer
use SEM command
  • asked a question related to Statistical Software
Question
4 answers
Hi
As you know, for using nonlinear regressions in statistical software like SPSS or Minitab or codes, you need to determine a start point(start value or initial guess) for regression's parameters. Actually you need to choose an optimum start value for parameters to achieve the best nonlinear equation.
How can we determine the optimum start value?
Is there an other way (the other software) to use nonlinear regression regardless to parameter's start value?
Relevant answer
Answer
NO. But a numerical analyst would do plots and look for approximate values of the regression coefficients. As my old numerical analysis professor used to say graphs tell you lots and lots of cool stuff.
David Booth
PS if you had an optimum start you wouldn't need anything else would you?
  • asked a question related to Statistical Software
Question
4 answers
I have a model proposed based on theories (see the figure attached). I am wondering if it is statistically possible to test the model? Specifically, is there any problem if my moderator is affected by the independent variable? If it is possible to test it, what statistics software should I use? Thanks for any suggestions.
Relevant answer
Answer
I think Mr Mukaram is right...
  • asked a question related to Statistical Software
Question
8 answers
I am currently working on a project that addresses educational communication and technology (ECT) barriers in online education and distance learning in the Philippines. However, according to my internet research, SPSS can only handle 1500 cases or respondents. My project requires over 3000 student respondents with more than 150 variables, which includes sub-variables. Is there any statistical software that can meet my requirements?
Relevant answer
Answer
Virtually all statistical software packages can handle voluminous contents.
The question should have been about the quickness and the presentability of each software package.
  • asked a question related to Statistical Software
Question
6 answers
Prism is bit pricey, are there any affordable options but comparable in terms of graphic output, statistical calculation? Thanks.
Relevant answer
Answer
EPI-INFO
  • asked a question related to Statistical Software
Question
14 answers
Any field 
Relevant answer
Answer
Arcmap.
  • asked a question related to Statistical Software
Question
3 answers
It has been a time, I'm studying papers that used Interrupted Time Series (ITS) for their analysis, but unfortunately these papers did not mention which software they used! Even if they mentioned software like R, Python, Matlab, they did not mention for example which R package they used, what is the procedure. It is weird because on ML and Metaheuristic studies mostly we mention the whole algorithm and methodology we applied, so other researchers can replicate our work easily. However, about ITS is not like that and it is hard to enter the field!
Appreciate the help of ITS experts.
  • asked a question related to Statistical Software
Question
4 answers
I am working on 10 days trial version as a student. I need to know if there is a correlation meta-analysis, in particular correlation meta analysis. Which method software is used for calculating Z value, and is this z value is different from fisher z value? Also, how to interpret funnel plot?
Relevant answer
Answer
Does anybody have any suggestions on how to do a correlation multilevel meta-analysis in CMA?
  • asked a question related to Statistical Software
Question
7 answers
I have run an split-plot design experiment evaluating the efficacy of a treatment (with 3 whole plots and 4 sub-plots), and I am planning to analyse the collected data in Stata.
However, I do not find a pre-defined command to perform such analyses in Stata. I found that some statistical software such as GenStat Discovery have already pre-defined designs to analyse results of such experiment. Unfortunately, I do not have a licence for GenStat. I am planning to do my analyses with Stata.
Does anyone know how to analyse such data in Stata ?
  • asked a question related to Statistical Software
Question
5 answers
We have collected drivers eye moment data (four groups: ‘<30’, 7drivers; ’30-40’, 14 drivers; ’40-50’, 10drivers; ‘>50’, 5 drivers) after 2, 3 and 4h continuous driving. I have attached a visual variable.
Relevant answer
Answer
Good question
  • asked a question related to Statistical Software
Question
10 answers
What is the name of the statistical software you use most frequently for your publications? What are your priorities when determining the software you are going to use to perform a statistical analysis?
Relevant answer
Answer
You might ask commentators which discipline they are in because this influences choice (e.g., python v R for CS v stats, or STATA v lots for econ/policy folks), and also how much computing and statistical expertise they have. Also, by publications do you mean websites or paper?
  • asked a question related to Statistical Software
Question
8 answers
What is the difference between the following statistical software: SPSS; Amos; SMART PLS ?
Relevant answer
Answer
Thank you for your contribution
  • asked a question related to Statistical Software
Question
14 answers
I want to know about the statistical software which is user friendly and widely use for cluster and path analysis of the data set.
Relevant answer
Answer
Dear Sir,
You can use SPAR 2.0 but it requires windows 7 for access...
The best statistical software for path and clucter analysis is Windostat 9.1.
Windostat software will give you wonderful picture of cluster inter and intro distance, path direct and indirect effect along with analysis of data at upto 0.001 probability level.
  • asked a question related to Statistical Software
Question
36 answers
I've been using SPSS for years. I couldn't complain less about it. Lately, I've started reading about R. Theoretically, I got impressed by some of its features. I wonder if anyone tried both of them in any social science discipline and found one of them outsmarts the other in terms of practicality.
Thanks for sharing insights!
Cheers,
Relevant answer
Answer
Your question implies one of these two is the BEST. Why do you make this assumption? Also, as previous people have noted, "it depends" which of these is likely to be better on both your circumstances and what you are specifically working on. I use both, but one about 95% of the time and one about 1% of the time (and 4% on other ones), but I don't know what you are trying to do.
  • asked a question related to Statistical Software
Question
2 answers
I recently just found out about this statistical software and am interested to learn more about how this can be applied in marine fish ecology. Any leads on relevant literature would be most helpful.
Relevant answer
Hi, Jean:
I have been using EstimateS for years, but only in order to estimate species richness and diversity (I mean, alpha diversity). As you probably know, EstimateS now computes 'true measures' of species diversity (i.e., those expressed as the "equivalent number of species", like those promoted by Lou Jost and many others, including myself:
After your question, I really feel curious about what you have found about the use of this excellent piece of software for computing beta diversity. So, if you could share your results with us, that would be great!
Best regards:
Jose
  • asked a question related to Statistical Software
Question
18 answers
The use of computer and research related softwares have made research analysis quite easy and saves time and efforts. I apply MS Excel, AMOS and SPSS. Not much familiar with EViews, ATLAS-ti. All of you may be using one or another statistical softwares. May I request you to share your practice.
Sincerely
Bodh
Relevant answer
Answer
I still use SPSS, but most of my work and my students' Ph.D. research are now using Atlas.ti. I don't want to have any marketing influence, but the software provided by Atlas.ti was able to work with diverse languages, texts, and images what is key for our investigations.
  • asked a question related to Statistical Software
Question
17 answers
What are the advantages and disadvantages regarding data analysis, resolution of figures and graphs from STATA and SPSS.
Relevant answer
Answer
spss
  • asked a question related to Statistical Software
Question
2 answers
Hi!
I would like to know with which statistical software the bang blinding index and the James blinding index and how to do it.
Thanks
Relevant answer
Answer
Use R
  • asked a question related to Statistical Software
Question
6 answers
I mean, how do I put the dropped out participants' outcome (Missing value) in my analysis? Although I have already known what's ITTA meaning, I still don't know how to do it. Which methods should I use to fill missing values? Which statistical software I can use to perform ITTA & how etc.
If you can give me any explanations or advice, thanks a lot!
Relevant answer
Answer
There are multiple imputation and model-based approaches, such as mixed models and weighted generalized estimating equations (GEEs) for repeatedly measured outcomes, based on all observed data can be valid and unbiased methods for missing not at random data, as long as the models are specified correctly. We should consider using one the following approaches (in R or SAS) that are valid for missing at random data and thought to be more robust than sensitive analysis. Those include multiple imputation, mixed models, inverse probability weighted GEEs, and Bayesian analysis. I hope this clarify your query. Good luck!
  • asked a question related to Statistical Software
Question
27 answers
Hello, I'm struggling to find out which non-parametric test I need to use to compare the VO2 Max scores between physically active women and physically inactive men.
Aka, gender and physical activity are two independent(?) variables. I need a non-parametric test because I ran the data through normality tests and it says it's not normally distributed. I've looked at a Mann-Whitney U test but I don't think it's appropriate as it only lets me select one grouping variable?
Sorry I'm still really new at all this, any help would be appreciated.
Relevant answer
Answer
Simeon Stoynov , if new variable gender and activity level is created then we will have four groups namely; active men, inactive men, active women and inactive women. Afterwards, the four groups can be compared through Kruskal Wallis test.
  • asked a question related to Statistical Software
Question
7 answers
Hello, I am looking for an Origin like Soft to use in my Macbook, preferably some that can be downloaded...
Relevant answer
Answer
Igor pro is similar to origin but a little less intuitive (https://www.wavemetrics.com/)
  • asked a question related to Statistical Software
Question
22 answers
There are many statistical software tools available in the market for quantitative and qualitative analysis. Some are very expensive while some provides student version freely. Which statistical software are you using to analyze and interpret the qualitative research?
Relevant answer
Answer
Somehow the qualitative topic for this question has gotten lost, since programs like SPSS, AMOS, and Minitab only apply to quantitative analysis.
But I'm not sure what the value is of people simply giving one-line statements about which is their favorite program. All of these programs do essentially the same thing, so the best approach is to look at the extensive online tutorials for each and decide which one matches your own preferences.
  • asked a question related to Statistical Software
Question
3 answers
Hi,
I want to maximize the following function AND also want to find the optimal values from X1 to X6 such that Y is maximized. The function contains six linear terms and eleven interactions. It would be helpful for me if I get some idea about some algorithms using statistical software to achieve my goal.
Y = -083 - 0.12X1 - .37X3 + 5.3X2 + .0029log(X4)+1.85X5 + 6:2X6 + .186X1X2 + .22X1X4+ .035X1X5+.39X1X6 + .0073X2X3 + .023X3X4 + .006X3X5+.036X3X6 - .22X2X4 - .2X2X5 - .1X5X4 - .41X4X6.
Relevant answer
Answer
In Matlab/Scilab there is a toolbox that deal with solving optimization problems involving several decision variables, also there may be online free softwares where you can feed in your objective function and the constraints and obtain the optimal set of solutions.
You can also look into the "TORA" software, which is extensively used for solving operations research problems in general.
  • asked a question related to Statistical Software
Question
9 answers
can you recommend me the best statistical package that is easy to use. 
Relevant answer
Answer
If you have an intermediate level of coding you can use R or MATLAB. but if you prefer a more user-friendly one you can try SPSS. And Microsoft Excel is a good friend.
  • asked a question related to Statistical Software
Question
3 answers
Greetings!
Currently, I am conducting a meta-analysis on the association between BMI and prognosis. I discovered that the studies used various cut-offs. However, only two studies are eligible for dose-response meta-analysis (DRMA) as only these studies reported >1 cut-off values. Regarding this study:
1. Is it possible to perform a dose-response meta-analysis when only 2 studies are included?
2. Is there any statistical software that is able to perform dose-response meta-analysis other than STATA and R?
3. Regarding the preliminary analysis, we would like to perform a two-class analysis by comparing studies reporting 1 cut-off values (i.e. >=30 vs <30, >=25 vs <25). Can you pool all the cut-offs in the analysis? Or do you have to group the analyses based on the cut-offs?
Any help will be much appreciated. Thank you very much
Relevant answer
Answer
1. This would be similar to trying to establish linearity with only two data points. You are not able to conclude anything about a dose-response relationship, with two studies with different doses.
However, if you mean that the two studies have investigated a dose-response relationship, and you want to pool these data, then you are one step closer to say something about dose-response, but two studies is very little, and I would not do a meta-analysis.
2. My suggestion would be SAS, apart from STATA and R.
3. You could do both an overall analysis with high vs low BMI, and a stratified analysis with the two different cuttoffs, and present it in a forrest plot.
Of course it should be discussed as a limitation, if you conclude on the analysis pooling results from different cut-off values.
I hope that this was helpful.
  • asked a question related to Statistical Software
Question
3 answers
Hello,
I am having micro RNA data sets of control and patients now I am interested into analyzing micro RNA as a biomarker for a particular disease. I have already delta delta CT values for miRNA expression. In order to do so I want to develop ROC curve, if possible would you please explain step wise how to calculate ROC and AUC/ what statistical software you used for analysis?. This will be very helpful for me in my future research.
Thanking you in advance.
Relevant answer
Answer
Dear Dr. Mayur Doke,
In my experience using microarray data and RNA-Seq data I was able to use the ROC curve based on a strength prediction of my Bayesian network data. For example, if I were using a Bayesian network and attempting to calculate a Roc curve from the data I would utilize the R Package bnlearn. Here were my steps:
1st) Calculate a Bayesian Network in bnlearn using the hill climbing (hc) greedy search code and a data matrix of your microRNA data that is discretized while the scoring you would like to use is bde.
Example code: j = hc(PMID3, R = 200, m =30, score = "bde")
2nd) Model your network titled "j" into a true directed acyclic graph.
Example code: true.dag = model2network (j)
3rd) Calculate the strength of your network from step 2 using the boot.strength code and hill-climbing algorithm (this allows your to perform the prediction of the true positive and false positive rates used to graph your ROC curve). R stands for repetitions your model was run in the algorithm to determine the positive and negative result rates.
strength1 = boot.strength (true.dag (Here is where you put in your mRNA data matrx) , R = 200, m =30, algorithm = "hc")
4th) Perform prediction in bnlearn. In this step you calculate the predictive rates compiled from the strength code in step 3 and also from the BN created in step 1.
Example Code: pred = as. prediction(strength1, j)
5th) In this step you calculate the performance of your predictive model in step 4 also calculating the scores for true positive rate "tpr" and false positive rate "fpr" used in the next step.
Example Code: perf = performance(pred, "tpr", "fpr")
6th) Plot the ROC curve and calculate the area under the curve. After entering this code your ROC curve should appear on the plot panel of R.
Example Code: plot(perf, main = "Arc Detection")
7th) Calculate the area under the curve score for model validation. In this final step you will generate the AUC score that explains the strength of your model and how best your results are predicted accurately.
Example Code: performance(pred, "auc")
Other R packages that can be useful for microRNA data would be pROC and ROCR
I've attached a screenshot of the code I generated for a ROC curve using bnlearn and the ROC curve plot that was also generated. If you need any further help please do not hesitate to ask me.
My Best,
Christian
  • asked a question related to Statistical Software
Question
11 answers
I am no statistics person.For one of my problems, I need to perform linear regression analysis. I am performing it in Minitab 17 statistical software. It gives R-square and optimised R-square. I know higher the value of R-square directly proportionate to good model and Adjusted R-square, model is better. However, adjusted-R-square is close to it. I read that this adjusted -R-square is a better parameter. But I can not understand why. Please help me to understand why this adjusted R square is better than R square?
Relevant answer
Answer
It can be seen that even if you add an extra variable in your model the R-squared value increases, even if the variable is just a spurious variable. However it is not the case for adjusted R-squared value. Adjusted R-square value has a formula which gets adjusted for any extra added variable. Thus if the new variable is not significant, even though the R-squared value will increase but the adjusted R-square will not increase. Hence adjusted R-square will help you to know if the new variable is significant at all or not, and thus adjusted R-square is better than R-square.
  • asked a question related to Statistical Software
Question
6 answers
I have been using Aabel (Gigawhiz) plotting and statistical software. However, it is Mac OS centric and cannot be installed on a PC. I like this software because it enables me to directly select a data point in a graph which then highlights the point in the linked spreadsheet. This is very useful when exploring data and anomalous values. However, it is only available for Mac and I am now mainly PC based. As such, can anyone recommend a PC plotting and statistical software package that provides similar functionality? Thanks in advance : -)
Relevant answer
Answer
You can find this option in "Tecplot" software. If you should use that, you would use "probe At" option.
Goodluck.
  • asked a question related to Statistical Software
Question
3 answers
If you use different tools for different steps (sampling selection, weighting, estimation, etc.), please specify the breakdown.
Relevant answer
Answer
Dear Mamadou S. Diallo ,
Hi,
You can use SPSS, MATLAB, Stata for any purpose.
Best,
Saeed
  • asked a question related to Statistical Software
Question
6 answers
May some one please let me know what syntax should be used for calculating the Hardy Weinberg equilibrium in case control studies?
Relevant answer
Answer
Dear Rubina,
I am working on analyzing polymorphisms on CYP3A5 gene. The percentaje of the AA (CYP3A51*/1*) genotype in the population (50 patients) is 72%, of Ab (CYP3A51*/3*) genotype is 28% and of bb genotype (CYP3A53*/3*)is 0.
I am trying to calculate Hardy Weinberg equilibrium on STATA but I don't know how do I have to introduce the data....¿Do I have to generate a variable named, for example, "Genotype" and put the results of the genotyping analysis as categorical results, I mean: "AA", "Ab" and "bb"? (Patient 1: AA, Patient 2: Ab, and so on.....)¿Which is the second variable to which I have to do the comparison in order to obtain a p value? I am very confused.
Please If anyone could help me, I would really appreciate it! Thanks!
  • asked a question related to Statistical Software
Question
4 answers
I am running a regression in Stata.
As the dependent variable, I have the market share of smartphones (quarterly) for Apple and Samsung, and independent variables are Functional improvements and Design innovation (scored also quarterly).
My supervisor suggested that I have time fixed in order to account for the Christmas boost and I do not really understand how to do it.
And the second question is, am I capturing the interaction effect correctly?
So far I did...
xtset idcompany qdate
reg marketshare design function
and for interaction effect I did
gen designfunction=design*function
reg marketshare design function designfunction
and I got really good P values and R^2, but my coefficient for design*function is ( -.09) I am very curious how should I interpret it.
Does this all make sense? I am really new to Stata. I would really appreciate any help.
Relevant answer
Answer
Hello Alexa Drk. Generally speaking, Statalist (https://www.statalist.org/forums/forum/general-stata-discussion/general) is a better place to post questions about how to do X using Stata.
Second, I would think that if you are using -xtset-, you would want to use -xtreg- rather than -regress-.
Third, if you use the # or ## operators to include interactions (as in the file Aymen Ammari linked to), you'll be able to use -margins- and -marginsplot- to explore the nature of the interactions. See section 11.4.3 here:
HTH.
  • asked a question related to Statistical Software
Question
6 answers
Many of us are doing ongoing prospective research, yet COVID-19 has paused our work for a while or even months because of the city shut-down. There is an unexpected huge increase in lost to follow-up in our research clinics.
How should we deal with these cases, and the associated data?
The selection bias caused by the lost to follow-up cannot be adjusted by study design, as it is started already. What methods can we use to adjust instead?
Can anyone simply explain how inverse probability-of-censoring weighted estimation technique work on this issue?
How to run it practically, e.g. by SPSS software? Or other higher level of statistical software is needed?
How about stratification-based methods or weighted methods? How are they working actually?
Any practical guide available online?
Great thanks in advance with all your help!
Relevant answer
Answer
The missing is likely random as affected by COVID-19. However, surgery groups tends to have less lost to follow up than medical treatment group
  • asked a question related to Statistical Software
Question
12 answers
Actually I am preparing a research article and need to prepare a Bray-Curtis Similarity Index (%) graph. I have been using PAST, portable statistical software for this but it didn't satisfy my need, So suggest me some good, open-source and handy software (if any).
Thanks
Relevant answer
Answer
In some situation this web tool can be handy too: Clustvis - a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap - https://biit.cs.ut.ee/clustvis/
  • asked a question related to Statistical Software
Question
4 answers
I am writing a systematic review paper to demonstrate the global increase in drug-resistant E. coli over the past 20 years. I would like to show that in a line graph for the six WHO regions. I would like also to create a map to show the current prevalence worldwide. Which software is easy to use and good ? Thank you
Relevant answer
Answer
Hi! Do try your hand at R. It is an open software and you dont need to pay for it. Codes and syntax are easily accessible on github and the UI/UX is pretty simple and efficient. You can download it here:
After downloading R, download R Studio which acts as the interface. You can download the same here:
  • asked a question related to Statistical Software
Question
16 answers
How reliable is PAST (V 3.25) statistical software in diversity analysis (Ecology)?
Relevant answer
Answer
I have used PAST for years, and have published articles using it. This program is not as famous as R but it is good. but will provide good results.
  • asked a question related to Statistical Software
Question
15 answers
This question was asked at me by one of my students. The question is regarding the selection of appropriate statistical packages for appropriate tests of data. How much MATLAB is superior to MSEXCEL in the accuracy of complex calculations? What are the other factors one should consider in selecting a software for data analysis?
Relevant answer
Answer
I think SPSS and MINITAB are the best software the best .They are easiest to learn.
… Read more
  • asked a question related to Statistical Software
Question
4 answers
I am looking for a software that can establish inflection points in a cumulative probability plot, where these inflection points can be used as class boundaries for geochemical anomalies.
  • asked a question related to Statistical Software
Question
3 answers
İ am working on a binominal regression analysis for a retrospective study. Outcome has two option. Positive or negative. İn my Best regression model that i have found there are a continuous and an ordinal variable (1,2,3...10 ). And İ have used Jamovi and Spss for analysis and the model is fit. The problem is my spss subcription is almost out of date so a few weeks later jamovi Will be my only option for my research. İ Wonder is there any option or alternative test for hosmer le.. Test in jamovi, jasp, or another free statistical software?
Relevant answer
Answer
Have a look at this discussion
It says that that test you seek is no longer recommended! And points you to others and a model-based approach , all implemented in R.
  • asked a question related to Statistical Software
Question
5 answers
We used SPSS to conduct a mixed model linear analysis of our data. How do we report our findings in APA format? If you can direct us to a source that explains how to format our results, we would greatly appreciate it. Thank you. 
Relevant answer
Answer
The lack of standard error depends on your software, and even then it only applies to the variance terms. The reason for this is the variance cannot go negative and the sampling distribution can often be expected not to be asymptotically normal but skewed. So just explain this in you results table.
  • asked a question related to Statistical Software
Question
15 answers
What are the various statistical software systems available which will allow you to enter regression weight, w, in weighted least squares (WLS) regression, in addition to SAS?
.
Thank you to Guillermo Enrique Ramos for confirming, in another thread/question, that SAS can be used for quadratic linear regression, in addition to straight line linear and multiple regression, to implement weighted least squares in the format y = y* + (e0)(w^(-0.5)), where w is the regression weight as shown in section 2 of "Estimating the Coefficient of Heteroscedasticity," https://www.researchgate.net/publication/333642828_Estimating_the_Coefficient_of_Heteroscedasticity. That is the format for which Ken Brewer made a convincing argument when describing the range of heteroscedasticity one should expect, as discussed in "Essential Heteroscedasticity," https://www.researchgate.net/publication/320853387_Essential_Heteroscedasticity.  In section 3 of "Estimating the Coefficient of Heteroscedasticity," one can see that another way of handling heteroscedasticity is sometimes used, but it is not consistent with the Brewer explanation of the root cause of naturally occurring heteroscedasticity.  Perhaps that alternative technique, or something related to it, might be used in some other software systems. 
Here, the two attached images show the format for the regression I think best to use, and the regression weight, w, both presented in terms of the coefficient of heteroscedasticity, gamma.  y* is the WLS prediction.  In most applications, the best practical/obtainable function values for the size measure, z, are the OLS predicted y values.  That is what is suggested for input to the spreadsheet in https://www.researchgate.net/publication/333659087_Tool_for_estimating_coefficient_of_heteroscedasticityxlsx.
(Note that the method here, using the attached images, is in completely closed form in the case where we have one regressor and a zero intercept, so z = bx, and x can be used for a (relative) size measure.  Otherwise, we first obtain OLS predictions, and then pick a reasonable coefficient of heteroscedasticity, and go back to find the improved WLS predictions.  I think that the other method appears to be more ad hoc.) 
So what other software, in addition to SAS, will allow the input of w for weighted least squares regression?  Is it for linear and multiple linear regression?  Note that such software would then minimize the sum of weighted squared estimated residuals with respect to each regression coefficient, simultaneously, to obtain estimates of regression coefficients.  What about polynomial linear regression?  Multiple regression with interaction terms?  Will it handle those too? 
The other question I wrote regarding polynomial regression is still open, but here I'd like to know about other software for any regression fitting the format in the images attached here. 
.
.
The question then is What other software, besides SAS, will accept regression weight input, w, for WLS regression? 
Thank you. 
Relevant answer
Answer
Data Desk allows you to define a "variance" variable (the reciprocal of the weight) and introduce it into the analysis. That variable can be defined with a formula involving any other variables available. However, it will choke if the definition is circular. Of course, you can evaluate the variance expression into numbers and then use it.
  • asked a question related to Statistical Software
Question
3 answers
Dear Scientists,
Greetings
Please, could anyone give me an alternative to analyse data generated from an augmented Block design layout?
The Following known softwares are not working! Could anyone know the reasons? I urgently need your help!
Here are the softwares/links
Indian Agricultural Research Institute, New Delhi
•Statistical Package for Augmented Designs (SPAD)
•SAS macro called augment.sas
CIMMYT – SAS macro called UNREPLICATE
•Developed in 2000 – uses some older SAS syntax
Thanks in advance for your help
Regards
Relevant answer
Answer
None of your links worked, so maybe explain what you are trying to achieve. have you thought of using R which is freely available and a supported Open Access Program.
There are augmented block designs in the R package agricolae .
These are designs for two types of treatments: the control treatments (common) and the increased treatments. The common treatments are applied in complete randomized blocks, and the increased treatments, at random. Each treatment should be applied in any block once only. It is understood that the common treatments are of a greater interest; the standard error of the difference is much smaller than when between two increased ones in different blocks.
  • asked a question related to Statistical Software
Question
21 answers
Which statistical software will allow entry of a regression weight in polynomial regression?  
If we factor heteroscedastic estimated residuals into random and nonrandom factors, we can use a nonrandom factor that is the predicted y, say y*, raised to gamma, the coefficient of heteroscedasticity.  This gives us a regression weight of the predicted y raised to twice the negative of gamma.  These two expressions are attached. 
The following provides information on estimating gamma, the coefficient of heteroscedasticity: 
See Brewer, K.R.W.(2002), Combined survey sampling inference: Weighing Basu's elephants, Arnold: London and Oxford University Press, especially pages 111, and 87, 130, 137, 142, and 203.
Here is a spreadsheet to use in selecting a coefficient of heteroscedasticity:
This is found under the following project:
So, for example, if the coefficient of heteroscedasticity is 0.5, then using a preliminary estimate of y*, say y_hat, the two choices of G.S. Maddala, then the estimated regression weight is 1/y_hat.
SAS PROC REG allows the regression weight here to be entered as "w" (no quotes) for most linear regressions, but I think you might have to go to SAS PROC GLM for quadratic regression. 
I do not know which software, and better to know, but I don't, which programs within them, will perform polynomial regression, and will they handle entry of regression weight w. 
 
I would appreciate hearing what you know about which statistical software include weighted least squares polynomial regression.  I would like to be able to tell people where they can find it and use the above. 
Thank you. 
Relevant answer
Answer
Dear James
I computed with GLM the "Standard error of the individual predicted value", without weighting (gamma=0), and weighting with gamma=.7.
I adjunt the graphs that as you said are very different but which is better and why?
  • asked a question related to Statistical Software
Question
7 answers
i'm looking for a book with a combination of easy to understand theory and some examples with statistical softwares about bayesian structural equation modeling.
Relevant answer
Answer
Benyamin Askari Hello, I send 3 Books for you. Check you're RG message.
  • asked a question related to Statistical Software
Question
4 answers
I'm aware of a package in R called 'productivity' that does malmquist however, I'm looking to conduct Luenberger productivity and Luenberger-Hicks-Moorsten productivity, which this package does not have.
Thanks in advance!
Relevant answer
Answer
Nathan Walker Have a look at Nathan W Centre for Efficiency and Productivity Analysis
  • asked a question related to Statistical Software
Question
13 answers
like in stata how one may replace for missin values ussually given by a (.) into something like 0
Relevant answer
Answer
Using the [Recode into Same Variable] function.
From Transform Menu --> Recode into Same Variable --> Old and New Variables --> System Missing --> in value space add the value you want to replace the missing data with --> continue --> Ok.
Done.
Good luck Roy Naburuki however i know for sure you already resolved the problem.
  • asked a question related to Statistical Software
Question
2 answers
HI,
I am analyzing data related to a project entitled "Assessment of stigma, fear and discrimination among healthcare workers towards PLWHIV". We are planning to associate socio-demographic characteristics to develop stigma, fear and discrimination.
Now outcome variable is dichotomous in nature like presence/absence.Where as if we want to measure the odds in professional category by keeping doctors as reference. How i can do this in the Epi- Info 7.
Kindly give your suggestions to handle this problem.
Thank you,
Relevant answer
Answer
Dear Narayana
I hope following information may help you.
Download Epi Info user guide and go through 8 chapter. I hope you will get useful information.
Epi Info™ 7 User Guide – Chapter 8 - Visual Dashboard.
  • asked a question related to Statistical Software
Question
8 answers
Markov chain analysis in possible in which any statistical software ?
Thanks
Relevant answer
Answer
My packages known as SHARPE and SPNP can do that for you. Not just steady state but transient, cumulative transient and derivatives of these can be computed for very large Markov chains. Also Markov chains can be generated starting with a high level description.
  • asked a question related to Statistical Software
Question
9 answers
Dear All,
Kindly suggest a good statistical software to analyse data collected from an augmented field designed experiment ?
Relevant answer
Answer
Francesco Desiderio is right - R Studio is good. However, it is not statistical software. It is an Integrated Development Environment (IDE) that works well with R, among other scripting languages. Think about an IDE being like a workbench on which you construct your scripted statistical operations out of R. The combination of R and R Studio is very good. Nonetheless, you would still be writing scripts (programs) instead of using a package such as SPSS, SAS etc.
Worth mentioning that R Studio is not alone in this space. If you are interested in looking at IDEs to go with R, you should also look at Jupyter notebooks.
  • asked a question related to Statistical Software
Question
4 answers
I am looking to explore my alignments and tree data more carefully and I was wondering if there was any genomic statistical packages that may give me more inside into the data I generated in the alignments. I'm not sure if this I need the additional analysis, but I am interested in learning new ways in how to analyze my data. I have heard that there is a JMP Genomics add-on but I am not failure with it. Does any one have any advice or experience with JMP Genomics or any other software that might be able to help me understand what all I could do?
Relevant answer
Answer
I use R, too, and it runs great on a Mac. However, I still use JMP for a lot of analyses and in my teaching for the introductory grad stats - much more accessible and flat out cool.
  • asked a question related to Statistical Software
Question
1 answer
Hello all,
I am running a Geographically Weighted Regression and calibrating the bandwidth of my models using GW.model R functions. After some tests and playing around with the data, I encountered the following curiosity:
Often, when the number of regressors is higher than the number of observations, we say that there is no unique solution to the system. In fact, any statistical software would throw you an error.
However, I do not know how this occurs in GWR. My bandwidth selection, k-nearest, is lower than the number of regressors, meaning that each local regression has more regressors than observations (i.e., k-nearest neighbors < p regressors). I am able to estimate a solution of which I am very skeptical of because of this issue. But I am able to estimate a solution, which my institution says I should not.
Could somebody clarify what is going on here? Should I be concerned?
Noé
Relevant answer
Answer
I assume that one of these issues are happening: i) either your initial sample size is very small to use GWR in a robust manner in the first place, ii) you are using an unnecessary large number of explanatory variables or iii) both of the above.
In general, you should have at least a few hundred samples to consider doing GWR to justify its use. Additionally, you may consider running some sort of feature selection (ie., stepwise regression) to remove some variables prior to building GWR. Also, you should pay attention to local colinearity that may arise in the GWR coefficients (check local VIF or condition indexes) as the estimates can be very unrealistic due to that. Finally, what matters is if the model solution is interpretable. A model where most of the local coefficients are not significant would not make much sense to use even if the over R squared might be very high. In fact it is very easy to create GWR models with very high R squared values but with little to no meaning. If you are looking into more refined analysis perhaps you can check Multiscale GWR which offers more tools to deal with such issues.
  • asked a question related to Statistical Software
Question
4 answers
For example, when using statistical software or with Excel.
Relevant answer
Answer
I am sorry but most of these answers are way out of date. Please see the two references attached. Best, David Booth
  • asked a question related to Statistical Software
Question
4 answers
Dear all,
Does anyone know how can I estimate the NHPP reliability function through Non-parametric method?!
I know the Kernel density estimation is widely used in this area, but seems it has very complicated theory.
I was wondering if you could suggest an example or statistical software directly.
Also, I am just attaching the needed formulas of Kernel model.
Thanks for your attention.
Best/Hamzeh
Relevant answer
Answer
Dear Marcello Fera
I know maybe it is difullcut :) but generally its possible.
Thanks for your answer anyway.
Best.
  • asked a question related to Statistical Software
Question
14 answers
I want to know whether what is the best software that can be used for statistical analysis, time series analysis and drawing surface plots in chemical, biological and environmental engineering and science research projects.
Relevant answer
Answer
In my opinion the best statistical software R and SPSS.
  • asked a question related to Statistical Software
Question
4 answers
I tried PAST3 but I want to deal with other software, preferably free software
Relevant answer
Answer
PAST or PC-ORD should work. As mentioned, R is always a safe option as well.
  • asked a question related to Statistical Software
Question
2 answers
Hi, everyone. i am calculating NRI and IDI using STATA. I want to compare the discrimination ability of two seperate models (Model A and Model B). I think the nri program of stata seems only to calculate NRI that reflects the discrimination between a base model and the  model in which a new marker  is added to the base model. Now, how should i calculate a NRI that reflects the discrimination of two models which includes different variates.
Thanks in advance!
Relevant answer
Answer
Hello, you can use the incrisk ado procedure (Fred Hutchinson Cancer Research Center).
  • asked a question related to Statistical Software
Question
2 answers
I want to know how we deal fractional cointegration in R or any statistical software. I can't find any material regarding the analysis of fractional cointegration. Kindly help me to understand the concept.
Relevant answer
Answer
Thanks
  • asked a question related to Statistical Software
Question
6 answers
Is there any tool other than QMOOD++ to measure the metrics stated by BANSIYA(QMOOD metrics) for class diagram?
Relevant answer
Answer
You can use ptidej (www.ptidej.net), and if you are only interested in the quality attributes functions, try our tool at : https://github.com/moar82/RefGen and https://github.com/moar82/RefGen/wiki/Simulation-Mode-of-RefGen
  • asked a question related to Statistical Software
Question
13 answers
There are different software to carry out confirmatory analyzes, such as SPSS AMOS, which sometimes are insufficient for the publication requirements of some journals. What would be the most recommended software today?
Relevant answer