Science topic

PRIMER-E - Science topic

Explore the latest questions and answers in PRIMER-E, and find PRIMER-E experts.
Questions related to PRIMER-E
  • asked a question related to PRIMER-E
Question
1 answer
I ENCOUNTERED A PROBLEM WITH Primer 6 - PERMANOVA since I have a dataset with about 10000 records and Permanova doesn't run. The dataset has 8 variables, 1 factor with 2 levels. Do you have idea on how to solve the problem?....my PC is new, quite powerful and I tried also on other PCs
Relevant answer
Answer
Hello! I hope you've found a solution already, but if not, here are a few tips:
PRIMER can be quite demanding with large datasets, so try running it in priority mode with as much memory and CPU as possible. Close other programs, disable energy saving mode, and consider running it overnight.. (with Windows updates disabled).
Check your data for missing values or redundant rows, as these can slow down processing significantly.
If it’s an option, you could subsample or average your data to reduce size but keep in mind this may reduce variability, so consider if that fits your needs.
Finally, you might try the vegan package in R to run PERMANOVA. With one factor, it’s straightforward and you can adapt permutations and residuals options to be similar to PRIMER.
Best from Ulm, Katja
  • asked a question related to PRIMER-E
Question
1 answer
Is it possible to draw convex hulls or ellipses on PCA and dbRDA plots on PRIMER v7 software?
Relevant answer
Answer
Yes. Usually this is done in conjunction with a cluster analysis to define groups (with or without SIMPROF). It's all under the Graph>Special>Overlays menu
  • asked a question related to PRIMER-E
Question
11 answers
Dear Community
I have a dataset consisting of a matrix of diet samples as rows and prey species as columns (in gram). I am interested in investigating any differences in diet between years, seasons and areas. The data is very skewed, and the majority of samples comes from one of the seasons and from a few of the years in the time series.
Does anyone have any hints of how I could look at this using ANOSIM and/or PERMANOVA in the Primer software?
Kind regards,
Karl
Relevant answer
ANOSIM is good for skewed data. But you must look at all samples and factors at the same time because of interactions between factors and to avoid bias. If there are only slight differences among sample sizes between treatment combinations, you could impute missing values using the mean or median of the treatment levels. However, this approach should be used with caution and should only be used when sample sizes are nearly equal to begin with. If the sample sizes are not equal and the assumption of equal variances is violated, you could instead perform a non-parametric equivalent to an ANOVA such as the Kruskal-Wallis test :)
  • asked a question related to PRIMER-E
Question
5 answers
Some datasets in ecology (e.g. CPUE) inherently contain abundant zero values, which may need to be adjusted and/or fitted. Apart from common solutions, I am asking how to work with such kind of data in PRIMER-e (i.e. possible pretreatments, adjustments or other post-treatment functions).
Relevant answer
Answer
Bit late to the party, but effectively using the right resemblance measure (for biological samples generally one that ignores joint absences) is how we deal with the inflated zero problem. That is why we use a measure like Bray-Curtis to define resemblance, rather than a measure like Euclidean distance. The zero-adjusted measure is for specific circumstances where there are few or no occurrences in samples, which is not the same thing as many zeros overall.
  • asked a question related to PRIMER-E
Question
1 answer
I want to investigate the relationship between differences in coral physiological variables based on euclidean distances and seawater environmental variables using DISTLM and dbRDA in PRIMER, but I am not sure if this analysis is suitable given the lack of replication I have in my predictor variable (environmental) matrix.
I have attached an excel file illustrating the structure of my data set (the response and predictor variables). Briefly, I have a multivariate data set of measured physiological variables (e.g. lipid concentration, protein concentration, tissue biomass etc.) for corals collected from five different locations (A-E), where each site is very unique in its seawater physico-chemical parameters. I collected 12 corals per site (total of 60 samples). I have constructed a resemblance matrix of the physiological data in PRIMER based on Euclidean distances, and there is clear grouping of data points in the NMDS, which coincides with the different collection sites for each coral. I want to investigate the proportion of the observed variation in the multivariate data cloud that can be explained by the environmental characteristics of each collection site (e.g. mean annual sea surface temperature, seawater chlorophyll concentration, salinity etc.). However, the dataset of environmental variables does not have replication. i.e. for each site (A-E), I only have one value for mean annual sea surface temp, one value of salinity etc.
All of the case-study examples I have read about distance-based redundancy analysis in R or PRIMER have two resemblance matrices (predictor and response) both of which have replication. However, in my case, my response variables have replication (i.e. 12 samples per site), whereas my environmental variables do not have replication (i.e. one measurement per variable per site).
Can someone advise me whether or not dbRDA is suitable in this instance? If as I predict, it is not suitable, can you recommend a better approach? I am not an expert in multivariate statistics, but I want to make sure that the approach I take is sound.
Any and all advice is welcome. Thanks
Relevant answer
Answer
Hi Rowan, I am in a similar situation. What I did I used an average of the response variables. But I do not know if it the optimal solution. Did you solve this riddle at the end?
  • asked a question related to PRIMER-E
Question
4 answers
We have a dataset consisting of six years of fish detection data from an open ocean acoustic tracking array that was deployed to record the presence of acoustically-tagged fishes. The array consists of 50 permanently moored and widely spaced tracking stations divided equally between a “deep” and “shallow” stratum. Our core question is “Does the ‘community’ of detected fish (16 species) differ across depth strata and seasons?” Secondarily, “What habitat covariates help explain differences in the community?” We are not especially interested in year or station effects.
I’m working in PRIMER v7 that allows PERMANOVA models with both fixed factors and continuous covariates. If I considered a model without covariates, the design might be a repeated measures approach (since stations never move) with: Season = fixed (4 levels), Stratum = fixed (2 levels), Station = random (50 levels) nested within Stratum.
Things get more tricky when we consider adding covariates. Some covariates (e.g., distance from shore, seafloor slope, sediment type) are always linked to station and will not change though time while others such as remotely-sensed water temperature and chlorophyll vary on rather shorter timescales. One thought would be to include smaller time blocks as a random effect, maybe in one week or one month increments (so 321 or 60 blocks, respectively, over 6 years) and use mean temperate and chlorophyll values for each time block.
So my questions are:
1) Is it a PERMANOVA ‘felony’ to have some habitat covariate values that repeat many times while others do not?
2) Should Station even be a random effect when habitat covariates are tied to them?
We also considered using PERMANOVA just for fixed factor Season x Stratum x Interaction tests and the DISTLM routine for the continuous covariates but the same problem of static covariates due to repeated 'sampling' of the same stations would seem to remain.
We welcome any insights or criticisms on this approach
- Eric
Relevant answer
Answer
To me there appear to be two questions. One concerns differences in the detected community in space and time, which can simply be addressed using Permanova (or Anosim). The other concerns which of the measured variables 'best explain' the observed differences in the fish community. I wouldn't try to address this using Permanova at all, but some of the other tools in the box such as BEST, which can look for the subset of measured 'environmental variables' (I wouldn't call them covariates) best explainingg the community pattern, Linktree which looks for thresholds in individual variables explaining divisions in the community data, or even just bubble plots in MDS as a first mode of exploration. I'm never comfortable with trying to set up a single test that ticks all boxes, and prefer a piecemeal approach where one builds an explanation using a variety of tests and approaches.
  • asked a question related to PRIMER-E
Question
4 answers
Hello all,
This is a real dbRDA plot using real invertebrate abundance data (taxa-station matrix) with environmental data (substrate characteristics-station matrix) as predictor variables. The plot is produced in PRIMER v.7. Invertebrate data is 4th root transformed, Bray-Curtis similarity was used. Environmental data is normalized, Euclidean distance was used.
My question is: why is the vector overlay not centered at 0,0 in the plot? Interpreting this plot, one would conclude that every sampling station within the study area has values below the mean for predictor variables 2 and 13, which is impossible. Why would the center of the vector overlay be displaced -40 units? How can this be? Why is the plot centered on the dbRDA2 axis but the dbRDA1 axis?
Please let me know if anyone needs more information. Thank you!
Relevant answer
Answer
The analysis is fine. The position of the vector diagram relative to the ordination is arbitrary - it could just as easily be in a separate key. The diagram indicates the direction across the ordination plane in which values of the selected variables increase. The length of the lines indicates the amount of total variation in each variable is explained in the chosen ordination plane. If all of the variation is explained, the line reaches the circle.
  • asked a question related to PRIMER-E
Question
4 answers
I have a dataset with 96 individuals and 1496 transcripts of interest. I have analyzed the experiment by multiway ANOVA in the Permanova platform of Primer-E. I am satisfied that I have figured out how to do this and test my experimental hypotheses. However, it occurs to me that I can also run some kind of cluster analysis to identify and examine clusters of similar individuals. What might be the best Primer-E method to accomplish this?
Relevant answer
Answer
LDM
  • asked a question related to PRIMER-E
Question
2 answers
I have a large dataset of fish abundance as well as some environmental variables covering around 795 sampling sites. I have tried to find the relationship of my environmental variables with the biological data with the RELATE function in PRIMER-E. The results indicate that using Spearman rank correlation, the sample statistic (Rho) is 0.11. Now the significance level of sample statistic is 0.1% (much less than 5%), so according to the manual, this is significant result! I used 999 permutations to get this result. I am unable to interpret this result as I would usually expect that if the p value is significant, the corresponding degree of association should be high also. So, I would expect the sample statistic to be much higher than 0.11! (above 0.7 or so). Smilar situation with the distLM procedure, here the p value suggests that each of the variable has significant effect on the model but again the overall R^2 of fit is only 0.13! How is this possible that with such a poor R^2, individual variables are all significant.
I have used the square root transformation and Bray-Curtis similarity on the biological data and have normalized the environmental variable and Euclidean measure. I haven't transformed the environmental variables.
I would really appreciate it if someone can help me to interpret these results.
Relevant answer
Answer
Yes, the null hypothesis in RELATE is that the correlation is zero. As you have a lot of samples you have a lot of statistical power to detect departure from the null. The significance tells you how likely it is that you would observe a value as high (or higher) than the value you have observed if the true correlation was actually zero.
There are many other tools in Primer you could also try (RELATE for 2-way designs, BIO-ENV, etc.), depending on your survey design. It might be that you get more meaningful results if you combine samples, maybe pooling nearby samples for example. It is likely, given that you have so many samples, that the signal (from the environment) is being lost in the noise (many samples each dominated by one or two species of fish).
  • asked a question related to PRIMER-E
Question
7 answers
Hi everyone,
I have an experiment investigating the effect of sediment condition (factor 1 - 2 levels) and the benthic macroinvertebrate community (factor 2 - 2 levels [faunated vs defaunated]) on ecosystem fluxes.
Some animals remained after defaunation, however, so there is a potentially confounding variable in the experiment. Adding a covariate (unremoved biomass of inverts) to PERMANOVA could help to account for this and avoid making a Type 1 error.
There are two opposing thoughts coming out:
1. The covariate should only be added if it has a significant effect
2. The covariate should be added regardless of significance, to account for its effect
Two similar questions online say they should be included regardless of significance:
So, should the addition of a covariate representing a potentially confounding effect be based on its significance? Your thoughts and reasoning would be greatly appreciated.
All the best,
Sorcha
Relevant answer
Answer
There are several ways to select variables: significance criterion (based on p-values), information criterion, penalized likelihood, change-in-estimate criterion, and background knowledge. As Dr. Booth suggested, variable selection should not be based only on p-values. Check this article for details:
  • asked a question related to PRIMER-E
Question
1 answer
I would like to know whether the BIOENV routine in PRIMER could be used to identify a subset of biological data (gut content items in my case) best explaining environmental data (biomarkers and condition indices in my case).
Thanks
Relevant answer
Answer
The short answer is yes, it could. The routine finds a subset of variables that, in combination, most closely match a predetermined pattern (represented as a resemblance matrix). So you could make resemblance matrices based on the condition indices and biomarkers (singly or in combination) and then ask which set of gut content items maximises the match. How you treat your data, and which resemblance measures would be appropriate, will depend on the data you have and the specific questions you want to address.
  • asked a question related to PRIMER-E
Question
9 answers
I analysed some of my data from Caspian Sea basin. these data comprised form hard substrate of macrobenthic communities. I attached the results. in the attached file Time (1,2,3, and 4) represent season and Site (1 to 8) represent sampling sites.
anyone can help me to understand the results?
Thank you
Relevant answer
Answer
SIMPER is used after the differences between factors are confirmed using an ANOSIM or PERMANOVA test, the factors sometimes or frequently are defined a priori (before to start the research or analysis) according to some geographical, physics or chemistry characteristics that show an effect on the community as a driver of its structure. If there is not a clear effect for any factor selected a priori SIMPROF test allows to observe groups with significant differences, without significant differences SIMPER analysis has no sense.
  • asked a question related to PRIMER-E
Question
4 answers
I have a data set comparing the accumulated biomass on two types of substrate. The number of samples from each substrate is different, however low (n<10).
What would be the most appropriate test to show significant differences between the two substrates?
I've tried PERMANOVA v7 on euclidean distance resemblance matrix, but it seems a bit too much for such a small sample size.
Suggestions anyone?
Relevant answer
Answer
It really depends what the nature of the data is, and what hypothesis you want to test. For example, a t-test may appropriate, or Wilcoxon-Mann-Whitney, or Mood's median test, and so on.
  • asked a question related to PRIMER-E
Question
6 answers
Hello,
I've done a field study investigating the community composition (18S and 16S) along a contamination gradient. I'm creating graphs of my environmental variables and community composition using PCA and DistLM in PERMANOVA in PRIMER 7. The labels for the variables are overlapping making it hard to read them. Is there a way to make them spread out more? If you have a way to solve this it would be greatly appreciated.
Kind regards
Megan
Relevant answer
Answer
I've just been doing this for a paper. Primer and Permanova+ are not designed to produce final publication quality graphics and the figures often need a little work, although generally they are pretty good. The easiest way I find is to copy the figure (ctrl+C) and paste it into Powerpoint. You can then use drawing tools>Group>ungroup (you need to do this twice, the first time it tells you that it's not a drawing, asking if you want to convert it, so say yes, then do it again to ungroup). You can then tidy the figure, move labels around, change fonts, add information, and so on. The final step (for me) is to use Adobe Illustrator to produce a final tiff, but first get the figures the way you want them.
  • asked a question related to PRIMER-E
Question
3 answers
Relevant answer
Answer
Enrique, i am very grateful for your answer, we finally used PERMANOVA and included Anderson et al. 2008 as follows: " Possible correlation structure among samples, derived from the stratified-random sampling methodology, would be ignored by PERMANOVA under permutation (Anderson et al. 2008).". Yet I found that PERMANOVA is unaffected by differences in correlation structure for balanced designs in Anderson & Walsh (2013) - http://coralreefdiagnostics.com/s/Anderson_et_al-2013-ANOSIM-vs-PERMANOVA.pdf
Thus i interpreted that differences found by PERMANOVA reflect changes in community structure and carried out SIMPER to detect the variables that contributed the most.
Best regards
  • asked a question related to PRIMER-E
Question
2 answers
Hi everyone,
I'm looking for a little help in the analysis of my qPCR raw results. I hope you'll be able to help me. Thanks in advance.
Here is my example:
I have 3 genes: 1 target gene, and 2 housekeeping genes.
I have estimated the PCR efficiency for these genes and their associated primers: E(target gene) = 2.1, E(HK gene 1) = 1.9, E(HK gene 2) = 2.2
I have Ct raw results for 2 technical replicates each time
I have a Treatment condition, and a Control condition, for all of the 3 genes (target gene and the 2 housekeeping genes)
Finally I have these data for 3 biological replicates
(see the attach Excel document).
From these data I have calculated the normalized expression of my target gene (using the control condition as a calibrator, and the housekeeping genes for final normalization). All the calculations are in the Excel document.
In the end I have a mean of the normalized expression for my target gene, which is great. However, I don't know yet how to obtain the standard error for this mean. I can calculate the standard error from the 3 normalized expressions from my 3 biological replicate (SE of 1.39, 1.55 and 1.26, which is easy in R or Excel), but from I what I read on internet (but I didn't understand it yet) there's apparently a way to calculate the SE value from the different SE obtained all along the calculation process (SE from the mean of the technical replicate values of CT, then SE from the relative quantity calculation, and finally SE from the normalization process).
Does anyone is familiar with that and could provide a little help or at least a little look at the attached document to confirm that my calculations are correct. From then you'll probably be able to lead me towards the correct calculations for my SE value.
Thanks a lot in advance
Regards
Marc
Relevant answer
Answer
Thank you Amy i'll have a look at it ASAP.
I agree with you on the SE thing which, in my mind also, has to be calculated from the expression values obtained for each of the biological replicates. just a classical SE calculation, relevant of the biological diversity, and not a SE based on the qPCR technique itself.
I've been in touch with other researchers since my last message. If I get anything new I'll post here some explanations of what I've been doing and what solutions I found.
Thanks again
  • asked a question related to PRIMER-E
Question
12 answers
Hello, 
I am using ANOSIM in PRIMER-E for my statistics. i need few clarifications in pairewise test ANOSIM. 
If i am correct, the more closer the R towards 1, the two samples are dissimilar and are significantly different from each other. P value also less than 0.05. (P=4.8%).
But for some of my other samples the R =1, and P=16.7 (0.167). How do i interpret this data? for eg, In general people should always consider the samples are significant when the P <0.05. but here For my sample R=1 and is my P value is 0.167. So Can some one clarify how to interpret this data?
Thanks
Venkat
Relevant answer
Answer
A couple of additional notes: R is a scaled measure of the separation between groups of samples, so if the Global test is significant (there are differences between groups) you can interpret the relative values (close to 0, not different, close to 1, very different) even if you don't have sufficient power (samples) to run the pairwise tests.  Simprof tests a different hypothesis, so be careful that you understand what each test is actually testing.
The package is called Primer, not Primer-e (which is the company that sells it).
  • asked a question related to PRIMER-E
Question
9 answers
How can I select variables for PCA analysis from huge set of environmental data?
Relevant answer
Answer
Jayachandran,
You may use PC scores to reduce variables (i.e. the correlated ones). firstly, you should do a PCA on your parameters using predetermined number of PCs (lets say N= 10 PCs). Then check to see how many PCs are needed to describe 100% cumulative %variation (for example the first 5 out of 10 PCs). In each PC (1st to 5th) choose the variable with the highest score (irrespective of its positive or negative sign) as the most important variable. Since PCs are orthogonal in the PCA, selected variables will be completely independent (non-correlated).
Hope this helps you
Amir
  • asked a question related to PRIMER-E
Question
6 answers
While I can easily assign factors to my samples to compare within sites or days of sampling, I got curious about "Edit-> Indicators". It seems that you may assign traits to the species in your dataset.
Does anyone here have experience with Indicators and is able to explain how to use them?
Thank you in advance for answers!
Relevant answer
Answer
To obtain your "indicator" data matrix:
1º Edit -> Indicators.. etc, the same way than factors
2º and then go to tools -> sum (or maybe average) -> and in variable square click on "sums for indicator" and select your indicator -> OK.
You will obtaain your new raw data for indicator levels.
  • asked a question related to PRIMER-E
Question
8 answers
I have 4 experimental treatments. From each treatment I have area measurements of bryozoan colonies over time (n>60 per treatment). Each colony has a unique identifier, thus can be traced through time and given a growth rate at any particular interval. If I plot the successive areas on a line plot, the rates can also be interpreted from the slopes of the graph. I am interested to know if there is a significant effect of treatment on growth rate.
Is it appropriate to use Anosim (performed in Primer) to compare these line plots? Is it actually comparing the rates, or just the difference between the areas? E.g. if a colony was small at the start, it may be growing at the same rate, but always be smaller than a colony that was larger at the start. Thus the areas would be different but the growth rates would not.
If I am interested in growth rate rather than size, is it appropriate to use Anosim to compare the growth rates over time, rather than the areas?- presumably this would be looking at the change in rate over time, rather than the change in size.
Thanks for any advice!
Gail
Relevant answer
Answer
I am not familiar with the specifics of ANOSIM, but I think a GLMM might also be able to provide your answer, by using the colony as a random effect to compensate for  differences in initial colony size.
  • asked a question related to PRIMER-E
Question
3 answers
Looking at presence and absence of algae species above and below a dam, the MDS is perfect with a stress of 0.11, when I did the ANOSIM I expected great results but the Global R is 0.029 and the significance is 36.4%. Why has this happened and can I do any thing about it?
How do I discuss this result?
Relevant answer
Answer
The nMDS can appear like showing there is 2 separate groups, but the way you read a nMDS, is that 2 samples close to each other show high similarity, and 2 samples far from each other show low similarity. If you look carefully to your nMDS, you will see that L7 and G5 are really close (highly similar), while L6 and L5 are apart showing low similarity. Overall, most of your samples show similarity between group and high variability within, so it is not surprising to have a non significant ANOSIM even if the green look to be on he left and the blue on the right.
Why do you have more green site than blue? Do the numbers correspond to specific location, time..?
As 2D stress of 0.24 only mean that your nMDS is not good to represent your data in 2D. 3D might improve this. It is not surprising that ANOSIM might reveal things you can't see with nMDS, because nMDS loose information by working in 2D, while ANOSIM will work will all the dimension and complexity of your data. You could try other methods than nMDS, such cluster, CAP, PCoA.
  • asked a question related to PRIMER-E
Question
6 answers
Hi, 
got a data set of three variables set as presence/absence (ie 8 possibilities) among 1200 stations and I would like to run a DISTLM model on Permanova (Primer) but I am not sure if we can run a such model on presence/absence data. Thanks for your help!
Relevant answer
Answer
Merci Phil, je pense avoir la solution! Merci !
  • asked a question related to PRIMER-E
Question
1 answer
The help of PRIMER-e advices to do it if there are a large number of variables, but i don't know exactly what it does mean with large number. 
Relevant answer
Answer
BIOENV and BVSTEP have the same aim: finding correlations between 2 matrices but BIOENV carries out a full search of all possible combinations, while BIOSTEP carries out a stepwise search where the best variables is first selected, then the second one is added and so on for up to 5 variables usually, which the forward selection. There is also the backward selection, removing one by one the variables that affect the least the rho value when omitted. Usually the limit of variables to use BVSTEP instead of BIOENV is 16 variables, although this is not an absolute cut off and is more a practical cut off. The main reason behind using one instead of the other is because of the number of combinations.
When you actually have lot of variables, you should first have a look if some variables are inter-correlated, which is likely to be the case with environmental variables. So you can remove one of the inter-correlated variables to reduce the total number of variables. BIOSTEP may not be valuable when you have lot of inter-correlated variables as you are likely to have many equally good solutions. But in other situation, when you have not lot of variables which are not highly inter-correlated, BIOSTEP would be a good option.
  • asked a question related to PRIMER-E
Question
2 answers
I've attached the Draftsman-plot that I've obtained.
I'm using 04 variables: Temperature, Dissolved oxygen, Current speed and Depth ('Profundidad' in the attached figure).
I see no necessity of transforming it, but I just want to be sure if I'm correct or not.
Thanks and cheers 
Relevant answer
Answer
to determine if you need to transform specific variables, you need to have a look on 2 things (Clarke and Gorley, 2006):
1) “are the samples roughly symmetrically distributed across the range of each variables” (i.e. are your samples skewed to one side)?
2) “if there are strong relationship between some pairs of variables, are these roughly linear rather than strongly curvilinear?”
Based on your draftsman plot, your samples are not really skewed on one side. So you do not necessarily need to transform your data, although you do have some outliers stuck to one side. So you could try a mild transformation (e.g. square root), and see if it improve or not the draftsman plot.
Sometimes if you have a doubt about transformation of your environmental variables, perform a mild (square root) or strong (log) transformation and run again the draftsman plots and see if you improve or not the plots. If not, do not bother with transformation. Sometimes just trying will help you to make your choice. Otherwise, as I mentioned, you mainly need to check if your samples are not skewed to one side or if you do not have outliers that skewed your data.
Hope that help,
Aimeric
  • asked a question related to PRIMER-E
Question
9 answers
Or are ECVs just an indice of how important each term is in the model at explaining the overall variation. In the PERMANOVA output the reported ECVs are calculated from the square root of  (the terms Mean square - residual mean square)/ n. However in a lot of papers the ECV is reported as the percentage of variation explained by each term.
Thanks
Relevant answer
Answer
Actually, the variation explained of factors can be obtained directly by dividing the sum of squares of the factor by the total sum of squares. In adonis function of vegan package in R, the R2 is part of the output. Hope I could help...
  • asked a question related to PRIMER-E
Question
66 answers
I used PRIMER-E software to perform ANOSIM and SIMPER analysis. But when I write the discussion section I faced problem in interpreting the results. Please help me by providing appropriate reference where all the basics of these analysis are described?
Thanks in advance.
Relevant answer
Answer
Hi Nayan,
As César said, ANOSIM give you the P value (i.e. significance levels) and a R value (i.e. the strength of the factors on the samples). R value is supposed to vary between 0 and 1 (not between -1 and +1) but you can obtained negative values but they are always close to 0. R value close to 1 indicates high separation between levels of your factor (e.g. control vs treatment samples), while R value close to 0 indicate no separation between levels of your factor. Be careful with the significance level as it is express as %, so if you want the actual P value you have to divide by 100. ANOSIM (one-way) gives you 2 windows: one with the detail results and one with a graph (or 2 graphs if you perform two-way ANOSIM). The result window show you “TESTS FOR DIFFERENCES BETWEEN xx GROUP” which give you the overall P value and R value for your factor. Then you get the “Pairwise Tests” which give you the P values and R values for each level of your factors. For example if you have 1 control and 2 treatments in your factor, the pairwise tests will tell you if the samples from control and treatment 1 are significantly different (P value) and how strongly they are different from each other (R value), and will give you the same for control vs treatment 2, and treatment 1 vs treatment 2. The number of permutations you perform (Actual permutation) and the number you could perform (Possible permutations) are also given and are important. It help you to know if the ANOSIM is relevant in your case, by indicating if you could perform more permutations (you obtain more accurate P value) and if the number of possible permutations is low (e.g. <100) you should not trust the results of the pairwise ANOSIM. The graph window shows you the R value for your factor/matrix (doted black line) and the R value obtained by permutation of the similarity matrix. If the black line (i.e. R value of your similarity matrix) is not overlaying the R values obtained by permutation of the similarity matrix (blue bars) you have significant difference.
The SIMPER analysis gives you the percentage of similarity and dissimilarity or your factors, between levels of your factors and for specific levels of your factors. Then it gives you which variables in your data explain the similarities or dissimilarity: the percentage of contribution of your variables (Contrib%) that explain this similarity. The variables are classified from the highest to the lowest contribution. It also show the cumulative contribution (Cum.%) so that you know how many variables explain for example 90% of the similarity... Do not forget that the % similarity = 100 - % dissimilarity and inversely.
For your results and discussion:
- for ANOSIM you need to first say if there is significance difference and if yes, you need to give the R value. You could have a significance difference, but if the R value is low (e.g. 0.2) this mean your factor have small effect on your variables and is not really important. So with ANOSIM the R value is more important to some extent than the P value.
- You can use SIMPER to support your ANOSIM as you can say how similar are the different levels of your factor and more importantly, which variables explain the difference or similarities.
Hope that help,
Aimeric
PS: César A. Cárdenas thanks for putting the pdf of PRIMER. Do you have also the pdf of the manual/tutorial? I would be grateful if you have it to send it.