Science method

SNP Genotyping - Science method

Explore the latest questions and answers in SNP Genotyping, and find SNP Genotyping experts.
Questions related to SNP Genotyping
  • asked a question related to SNP Genotyping
Question
6 answers
Hi everyone, 
I am working on case control genetic data. I have data for SNP with genotypes AA, AG and GG. I want to check the effect of these individual genotypes on the disease outcome, which in my case is diabetes. Now I want to calculate unadjusted and adjusted (for age and gender) Odds ratio in SPSS. I calculated unadjusted Odds ratio by using multinomial regression (is this suitable for my data?). But do not know how to calculate Odds ratio after adjustment for age and geneder. 
Secondly, How can I  apply dominant, recessive and co-dominant model to check which fits more. 
I would be highly obliged if you provide me the flowchart of commands like 
Spss--> analyze--> regression--> .........
Thanks, 
Misbah
Relevant answer
Answer
I also want to do same analysis coded genotypes as 1,2,3 for TT,TC and CC . And want to estimate risk for infection as dicomatous yes or no, but can not get significant OR. But when i compated percentages of infection rate among three genotype found significant p value by chi sequare. I found higest infection among TC carriers but on logistic regression lowest insignificant OR for TC . I am confused.its phd work
  • asked a question related to SNP Genotyping
Question
1 answer
Two genes are PNPLA3 and TM6SF2
Relevant answer
Answer
Hi Kumar
from naive and blind way, I would say to select theme on the effects of these SNP in the protein production (SIFT), implication in the disease or metabolism (OMIM and KEGG) and their frequencies among each group.
all the best
fred
  • asked a question related to SNP Genotyping
Question
2 answers
Hi. We have some trio pedigrees and performed whole exome sequencing. We wanted to confirm the relationship between the parents and the child. Is there any recommendations about programs for haplotype phasing and paternity test using WES data.  Thanks!
Relevant answer
Answer
Reading a bit late in the game, but I'm actively using "Hapl-o-mat" - it's pretty good, easy to use and open to a range of applications: https://github.com/DKMS/Hapl-o-Mat
  • asked a question related to SNP Genotyping
Question
8 answers
Hi Everyone, 
I am working with data of SNPs, I want to do logistic regression analysis. 
In multinomial logistic regression, is it compulsory to choose most common genotype as reference? or I can choose any genotype as reference? 
In my one SNP (Genotypes: II, ID, DD), when I choose most common II genotype as reference than Odds ratio come out like 0.57, but on choosing ID genotype Odds ratio change to 1.67 with p <0.05. Is it fine to choose heterozygous genotype as reference?
Thanks,  
Relevant answer
Answer
Malik Olatunde Oduoye , it requires some level of abstraction to wrap your head around.
Consider this simple case: you have two groups in which you measured the value of some variable. Say you measured the weight of sample of male and of female bugs. You are interested to analyze the expected (mean) difference between the weights of males and females.
A statistical model tries to express the expected (mean) weight as a function of sex.
Now "sex" is a binary (categorical, nominal) variable. How to use this in a mathematical formula that operates on numbers? This is where we need to encode the different possible values of the variable "sex" by different values of a numerical variable that we can use in a formula.
There are no restrictions in how to encode names/groups/categories by numbers. However, some ways will give you results that are meaningful for your research question and that you can readily interpret. We will now first take a small detour to demonstrate this, before coming to the "usual solution".
You have two sexes, and you may think of using an indicator function for each of the sexes that take the values 1 or 0, depending on whether the sex is male or female. These indicator variables can be used to multiply the expected weights of the respective sex, and these products can be added to give the expected weight of the desired sex.
This is the mapping from sex to the values 0 or 1 based on two separate indicator functions:
Imale(sex) = {if sex = "male" then 1, otherwise 0}
Ifemale(sex) = {if sex = "female" then 1, otherwise 0}
The function of the expected value is then
µ(sex)= bmale*Imale(sex) + bfemale*Ifemale(sex)
Hence, µ(male) gives bmale*1 + bfemale*0 = bmale and µ(female) gives bmale*0 + bfemale*1 = bfemale
Although this function works, it is somewhat unwieldy. It is about two coefficients, bmale and bfemale, and a statistical model based on this function can estimate their values along with standard errors, confidence intervals, and p-values, if you wish. The problem with this formula is that it does not give you direct access to what you actually wanted to investigate: the difference between males and females.
There is, of course, a more convenient function to address this directly. This function needs only one indicator variable for one of the sexes and includes an intercept term that will represent the expected value of the other sex. You are free to chose which sex should be represented by the intercept term. This is the "reference". Let's say you decide using females as the reference, then the intercept is bfemale and indicator function would be Imale which is multiplied with a coefficient d which will then represent the mean difference between females and males:
µ(sex)= bfemale + d*Imale(sex)
Here, µ(female) gives bfemale+d*0 = bfemale and µ(male) gives bfemale + d*1 = bfemale+d, so
µ(male) - µ(female) = d
As is obvious here, the coefficient d represents the difference in the mean weight between males and females, and a statistical model will provide an estimate for this difference with corresponding standard error, confidence interval, and p-value.
Of course you can choose to let the intercept represent the mean weight of males as well. The indicator function must then be Ifemale and the coefficient d then represents the difference in the mean weight between females and males. You may convince yourself by actually calculating the results of the formula for the two possible sexes:
µ(female) gives bmale+d*1 = bmale + d and µ(male) gives bmale + d*0 = bmale, so µ(female) - µ(male) = d
This is it. Practically, statistical software will automatically encode categorical variables into so-called dummy variable by applying the respective indicator functions and use these dummy variables in the statistical models. But these are technicalities.
This interpretation of the coefficient d depends on what sex you have chosen as the reference. If you have chosen males as reference and d > 0, then female bugs are expected to be heavier than male bugs. If you have chosen females as reference and d > 0, then female bugs are expected to be lighter than male bugs.
  • asked a question related to SNP Genotyping
Question
3 answers
I need CEL1 enzyme to use it in my laboratory work to complete my PhD thesis. Unfortunately, I didn't find it anywhere. So I'm waiting for any request about companies name where is available or anyone who sell it for me.
thank you.
Relevant answer
Answer
Unfortunately I couldnt find any company to sell it online, which is a bit weird, but as you probably know, CEL I Endonuclease is naturally exists in Celery juice, which normally utilized for detection of heteroduplex and SNPs, specifically in confirmation of genome editing by CRISPR-Cas. In this link you will find a protocol which has been described an extraction method of CEL1 from Celery juice which could be done by your own.
Also, based on your target in working with such a Endonuclease, you may be able to substitute it by T7E1 which is more common than CEL1 and could be purchased from NEB.
Good luck
  • asked a question related to SNP Genotyping
Question
1 answer
Hi,
Other than randomForest, how do you go about analyzing by GWAS the SNPs genotyping data on categorical phenotypes (say, host species for a pathogen)?
Any pointers would be great!
-Marcin
Relevant answer
Answer
I use Tassel to analyse SNPs panel. It can provide the similar result compared with Plink
  • asked a question related to SNP Genotyping
Question
1 answer
Explain
Relevant answer
Answer
How long is the DNA strand or how many SNPs genotype you are looking for?
Based on the probability SNPs occur almost once in every 1,000 nucleotides on average, which means there are roughly 4 to 5 million SNPs in a person's genome. So, if you want to have a general idea how many SNPs are there in your desired DNA sequence then you can use above information for calculating the probability. Otherwise, there are certain databases where you can provide the genomic coordinates/loci to get the number of SNPs and their detailed information.
  • asked a question related to SNP Genotyping
Question
3 answers
Hi!
I'm currently looking into setting up a protocol for in situ mutation detection by rolling circle amplification (RCA) in our lab.
I've read lots of publications that have successfully done this in either DNA or transcripts. My doubt comes when designing the the padlock probes around my SNP of interest.
Most articles don't specify how this is done, so I assume this is done manually based on the target sequence?
I have downloaded the ProbeMaker java application, which was developed for the design of padlock probes. However, the software hasn't been updated in about a decade, has limited tutorials and I haven't found it very intuitive. Since I'll only be working with a few probes to start with I guess manual design may be a better option?
I'm aware of previous work (Larsson et al, 2004. Nat Methods) that have shown the optimal length for binding regions and design of cDNA primers (if working from mRNA), but I would appreciate if anyone has any technical recommendations for padlock probe design.
In your experience, has manual design worked out? What are some things to look out for in the design? (I assume standard observations in primer design, like similar Tm, avoiding crossmatching, etc).
Thanks!
Relevant answer
Answer
I did have some success with this, mostly based on the advice in the published literature. Happy to give you some pointers if you need further advice.
Carlos
  • asked a question related to SNP Genotyping
Question
3 answers
Dear community,
I am planning on conducting a GWAS analysis with two groups of patients differing in binary characteristics. As this cohort naturally is very rare, our sample size is limited to a total of approximately 1500 participants (low number for GWAS). Therefore, we were thinking on studying associations between pre-selected genes that might be phenotypically relevant to our outcome. As there exist no pre-data/arrays that studied similiar outcomes in a different patient cohort, we need to identify regions of interest bioinformatically.
1) Do you know any tools that might help me harvest genetic information for known pathways involved in relevant cell-functions and allow me to downscale my number of SNPs whilst still preserving the exploratory character of the study design? e.g. overall thrombocyte function, endothelial cell function, immune function etc.
2) Alternatively: are there bioinformatic ways (AI etc.) that circumvent the problem of multiple testing in GWAS studies and would allow me to robustly explore my dataset for associations even at lower sample sizes (n < 1500 participants)?
Thank you very much in advance!
Kind regards,
Michael Eigenschink
Relevant answer
Answer
for the second part of your problem, you can try vcf2gwas pipeline, that is very easy to run as a docker image
  • asked a question related to SNP Genotyping
Question
1 answer
SNP
Genotyping
Allele
Relevant answer
Answer
Gerald Zvobgo that would depend upon the type of technique you want to use. Taqman, ARMS-PCR, RFLP all assays require different type of primers.
  • asked a question related to SNP Genotyping
Question
3 answers
What is the meaning of the string (*) in CYP3A4*22, knowing that this SNP has the ID rs35599367?
Relevant answer
  • asked a question related to SNP Genotyping
Question
3 answers
Does anyone have recommendations (design and/or protocols) for carrying out SNP genotyping with HRM on a CFX96 without the precision melt analysis package? Is it possible?
My understanding is the sensitivity is gated by the machine (and possibly the calibration kit), so I'm not sure why the software is even needed when open source analysis packages are available.
Thanks in advance!
Relevant answer
Answer
I am interested in following up on this topic..
  • asked a question related to SNP Genotyping
Question
3 answers
I have genotyped my lines using Infinium 6k SNIP CHIP and have received variants in hapmap format. I want to convert this into VCF format so that i can annotate variants using some online tools. 
Relevant answer
Answer
I have VCF file and want to filter only SNP using Ms Excel. How should I do it
  • asked a question related to SNP Genotyping
Question
1 answer
We wish to explore pairwise relatedness in haplodiploids in a way that allows asymmetrical coefficients (eg. relatedness of mothers to sons is not the same as sons to mothers, ditto for brother-sister relationships). Queller & Goodnight's Relatedness program will do this but won't accept large SNP genotype datasets. Any recommendations for software that will do this would be fantastic.
Relevant answer
Answer
Hi Mike,
Sorry this is such a late reply, but we also struggled to figure this out. In the end we used the 'related' package in R to obtain relatedness values using the Wang 2002 method, which basically runs COANCESTRY but in R (details in Scarparo et al. 2021 Evol. Appl.). Q & G is great for microsats, but I don't think it will work for biallelic data like SNPs, plus there's the limit on number of markers. I am not sure if this method will work with haploid males, but i think so.
'Polyrelatedness' package is another option in R. We used it initially (with the Huang MOM estimator), but COANCESTRY produced better average R values matching theoretical expectations for our all-female data (e.g. full sib females peak averages 0.75). I believe Polyrelatedness is designed to work specifically for non-diplodiploid scenarios, including haplodiploidy, and might be worth a look.
Best,
Kevin
  • asked a question related to SNP Genotyping
Question
3 answers
How would one associate a single phenotype factor (binary presence / absence of trait) to SNPs in Tassel (v5.2.58) GUI?
I generated VCF file using GATK and am able to import the VCF and phenotype data into Tassel. The phenotype data is just two columns the 'Taxa' and the 'Factor' (as Y and N; where Y = has the phenotype and N = does not).
The desired end result is a Manhattan plot with any SNPs associated with the trait, and ideally, an output file which contains the SNP locations.
Relevant answer
Answer
I recommend you to use GAPIT, which is much more easier and also its possible to do multiple model comaprision in same time.
  • asked a question related to SNP Genotyping
Question
4 answers
I have to find out restriction enzymes and design primers for SNP genotyping, but unable to access the SNP cutter and SNP - RFLPing site. Please give me a solution.
Relevant answer
Answer
http://rebase.neb.com/rebase/rebtools.html for all Type II and commercially available Type III restriction enzymes that cut the sequence just once
  • asked a question related to SNP Genotyping
Question
6 answers
'm doing SNP genotyping in the chickpea association panel, I have fetched 1KB up and downstream sequences flanking SNPs. m trying to find out restriction sites at the particular position i.e. the restriction sites within SNPs. Please provide the details except for the NEB cutter.
Relevant answer
Answer
  • asked a question related to SNP Genotyping
Question
1 answer
Hello people,
I did the SNP genotyping of two different plant species through TASSEL5-GBS pipeline. In one of the species, I am getting significantly high number of missing genotype information (N) (Attached figure 1) but in other species I am not getting high number of missing genotype (Figure 2) information.
What could be the reason of getting high number of missing genotype information (N), how to filter them before performing downstream analyses like GWAS, LD and so on.
Any suggestion?
Relevant answer
Answer
  • asked a question related to SNP Genotyping
Question
4 answers
I did genotyping for a 92 patients, then I calculated the allele frequency for them. I get chi2 = 29, which is high, I have to reject the null hypothesis and indicate that my population is out of Hardy-Weinberg equilibrium. Is this is normal, if it is not, what I can do?
Relevant answer
Answer
Dear Suzanne,
If you are confident about your statistics, one or more assumptions behind the Hardy-Weinberg equilibrium must be false. It is most likely that your sampling of patients is not random and does not reflect the real population structure that they belong to.
There are two interesting possibilities.
First, their might be something unique about your group of patients which is linked to the alleles you have been investigating; i.e., the patients, because they all have the same type of medical condition may mean that the alleles you measured (which somehow underlies the medical condition), will be in disequilibrium. You could test this (perhaps) by looking at a second set of alleles that have nothing to do with the medical condition, and these we would expect to be in equilibrium.
Second, all of your patients may be from a population of people in which the alleles you are investigating are in disequilibrium (and the alleles are not directly linked to the medical condition they may all have). This might be the case if all of the patients were in a regional hospital serving an area which had little migration and mixing, but less likely if they were in a metropolitan hospital that serves a diverse city. You could test this by looking at other alleles not linked to the medical condition, and in this case we would expect them to be in disequilibrium too.
Regards, Andrew
  • asked a question related to SNP Genotyping
Question
6 answers
Hi,
we obtained our data from SNP genotyping from external lab. We found out, that there are letters "'D" and "I" in some positions. Do you know what those means? It is also in the reference fields.
example:
1       13901895        chr1_13901894   D       I
1       13903334        chr1_13903333   I       D
1       13903422        chr1_13903421   I       .
Thank you very much for your help.
Relevant answer
Answer
Have you asked that "external lab" that gave you these data?
  • asked a question related to SNP Genotyping
Question
2 answers
"The development and validation of a medium density SNP genotyping assay in Shrimp" is a research proposal I'm currently working on. Given the restricted budget allotted (9,600 USD) to the project, I'd like to know ahead of time how much it might probably cost me.
Relevant answer
Answer
sorry
outside my area of expertise
  • asked a question related to SNP Genotyping
Question
3 answers
I have performed high-resolution melt analysis using KAPA HRM FAST mastermix; PCR conditions were the same for both experiments - Image 1 and Image 2. However, there is a lot of variability in the two consecutive experiments. I haven't changed any parameter between the two experiments. (Image 2 contains additional samples).
Can anyone point out what I could be the possible reasons for variation in results.
Thank you.
Relevant answer
Answer
Hi
In addition to the above suggestions, the observed problem may also be related to the quality of the RNA in different samples, or cDNA synthesis efficiency. So, compare the integrity of the RNA samples, and the profiles of the reference genes between samples.
  • asked a question related to SNP Genotyping
Question
3 answers
Hello!
I recently received many *.CEL files from a recent UK biobank genotyping. According to the SNPolisher guide, I have to conduct certain metrics on these SNPs to keep just the ones fulfilling essential criteria. It is not clear (at least for me, first time using it) how these CEL files are using in the SNPolisher inputs. The first input is: Ps_Metrics(posteriorFile, callFile, output.metricsFile, pidFile).
Is there a previous step where these cell files are converted into these posterior or cal files? Both should be in *.txt format but all I have are *.CEL files.
Hope for some guidance from any expert!
Kind regards,
B
Relevant answer
Answer
Thanks a lot, Anna! Are the Axiom Analysis Suite or Axiom Power Tools equivalents? I'm planning to use the command line prompt for my work and I found Axiom Analysis Suite is a Windows-based tool. Which are you commonly using?
  • asked a question related to SNP Genotyping
Question
3 answers
I have cDNA samples extracted from 90+ individuals which I intend to utilise in a series of qPCR experiments to assess the influence of a specific SNP genotype/s on target gene/s expression. I was unaware at the time that no-RT controls were needed, and it is only now as I prepare to perform qPCR that I have realised that this is recommended.
I am able to synthesise these controls, however was hoping whether someone could clarify:
(1) if a no-RT control is recommended for each cDNA sample prepared?, and if so
(2) whether a no-RT control is recommended for every sample in the qPCR run (i.e., three technical replicates + no-RT control)? I'm assuming no-RT controls would not be required in replicates?
Other QC parameters have been applied: All RNA samples were purified prior to cDNA synthesis, and NTCs were included in every batch. Primers have been designed across two exons, etc.
Many thanks in advance!
Relevant answer
Answer
Hi Kealan,
in principle, if a no-RT control is required it has to be done for every RNA sample and PCR target. However, if your target gene allows a proper primer design targeting cDNA from spliced RNA only and no pseudogenes are present I would recommend to test your primer on cDNA and noRT, gDNA controls of some individuals. If you get the specific PCR product only with the cDNA samples but not the controls I would suggest that you don´t need to run no-RT controls for all of your samples. If you are not lucky and your primers amplify DNA you should do no-RT controls. However, I would suggest that you can put RNA equivalent to the amount of cDNA per PCR reation directly into the qPCR to check for contaminating DNA.
Best,
Norman
  • asked a question related to SNP Genotyping
Question
4 answers
Why my genotyping results for a SNP is T>C, but the NCBI reported is A>G? How should I describe this issue in the article?
Relevant answer
Answer
NCBI provides us information of a particular loci in any one strand (i.e., either forward or reverse). To elaborate, for example you have a variant change from T/C based on your sequence retrieved from NCBI/Ensembl (the sequence is given in reverse strand), but when you search for the variant in NCBI it shows A/G (forward strand). Both are correct highlighting the complimentarity. Minor and major allele remains the same. Yes. As previously mentioned you can refer to it as reference allele/mutant allele.
  • asked a question related to SNP Genotyping
Question
5 answers
I have data for SNP with genotypes TT, TC and CC. I want to check the effect of these individual genotypes on the disease outcome, which is iron deficiency. How can I apply dominant, recessive and additive binary logistic regression model to check which fits more in SPSS ??
Relevant answer
Answer
Do you analyses your data? If you interest, i will help you. please inform me by text Dr Halinne Lokuge Thilakshi Chamanika Abeywickrama
  • asked a question related to SNP Genotyping
Question
2 answers
Hello. I'm a beginner in bioinformatics.
I'm handling Illumina SNP microarray data and got genotyping results of several SNPs. But in some SNP sites, for example, in the case of the SNP site of one individual is "BICF2P1141058", the genotyping result is [A/T] and the other SNP site "BICF2G630601486", the genotyping result is [T/A].
I really do not know the difference between [A/T] and [T/A]. For getting more information, I searched the Illumina SNP genotyping technical note (https://www.illumina.com/documents/products/technotes/technote_topbot.pdf) and found some information that to provide accurate SNP strand and orientation, Illumina offers strand information based on SNP genotype and nucleotide sequences surrounding the SNP.
However, I still do not clearly know why the strand information is needed. The SNP genotype result is already determined by a specific nucleotide pair. Then what kind of additional information does the strand information provide? In the case of my example [A/T] and [T/A], aren't the SNP sites just composed of A and T? Could you please tell me how considering stand information helps in the interpretation of SNP array data?
Thank you!
Relevant answer
Answer
The reason is that "+" strand (sense) is considered as the default DNA strand in the most cases, whereas targets for probes for SNP array can be located on any strand, according to convenience for array design. So, you need to take into account strand information in order to avoid mistakes in interpretation of data.
Also, it seems like you confuse complementary base pairs with a couple of allelic variants. In your example ([A/T] and [T/A]) both bases are allelic variants on the same strand (check if the workflow included strand conversion). Their order in the genotype record depends on what allele is considered to be the reference, and what is the alternative. This information is contained in the manifest file of corresponding microarray.
  • asked a question related to SNP Genotyping
Question
6 answers
I have a database of snp genotypes analyzed with an Affimetryx platform I want to analyze with a reference genome, however I find obvious differences between Btau8 / UMD3.1.1 or ARS-UCD1.2, I must work with the latter that has been updated or keep me with UMD3.1.1?
Relevant answer
Answer
I would choose the last updated one, ARS-UCD1.2. There are quite important differences sometimes with previous assemblies, e.g. position of DGAT1 in ARS-UCD1.2 and in UMD3.1.1. If I would be the Reviewer I would ask for the most accurate/recent reference genome
  • asked a question related to SNP Genotyping
Question
2 answers
Good day. I am working on collecting larvae samples of a saturnid moth (Imbrasia belina), of which i would like to use for DNA barcoding and SNP genotyping. How do i prepare the specimens to avoid contamination from the gut contents? do i have to degut the larvae prior to preserving in absolute ethanol? so as to extract good grade DNA?
Many thanks in advance..
Relevant answer
Answer
Thanks very much Sven Thatje ..that is so helpful. the larvae is about 5cm length, on average....so i believe i will get enough muscle tirrue from it.
  • asked a question related to SNP Genotyping
Question
4 answers
I am using TaqMan probes for SNP genotyping on LightCycler 480. I am facing some difficulties in resolution of heterozygous genotypes. The software can not identify it as " both alleles" automatically. I am using Universal TaqMan master mix with UNG.
Can anyone suggest me if including UNG activation in cycling conditions will make any difference in fluorescence call of FAM or VIC dye?
Other suggestions?
Relevant answer
Answer
I am planning to genotype few SNPs using Taqman SNP genotyping. So this discussion would be very much useful for me.
  • asked a question related to SNP Genotyping
Question
1 answer
Hello All,
Greetings,
I have 2 cohort datasets with 500 sample size each and 200 SNPS of genotype files as (example1.gen and example2.gen) as input files and trying to generate other formats from the .gen files such as (example1.bgen and example2.bgen), sample ID and phenotype for the cohort data set (example1.sample and example2.sample) and VCF format (example1.vcf & example2.vcf). I am currently using SNPTEST (latest version) for GWAS analysis . Is there any possible ways to look into it ???
Thanks in advance
Relevant answer
Answer
  • asked a question related to SNP Genotyping
Question
4 answers
We are using AppliedBiosystems TaqMan® SNP Genotyping Assays. Is it ok not to run the samples in duplicate to increase sample size?
Relevant answer
Answer
Hi, I would like to add that when we work with TaqMan for genotyping - always 1 sample / 1 well. When we do repeats (duplicates, triplicates, even 5 wells per 1 sample) - always the same result.
When we work with Sybr Green based melt curve analysis for insertions and deletions, or even use HRM - triplicates.
good luck
  • asked a question related to SNP Genotyping
Question
11 answers
To perform HRM analysis for genotyping on CFX96 machine for the first time, BioRad recommends to use HRM calibration kit along with Precision melt analysis software. I am just wondering whether simply using any good quality DNA binding dye along with standard PCR reagents will work or not?
Relevant answer
Answer
Hi all,
I followed the Lukas Beule logic here:
And made a shiny app with quite a simple GUI for HRM analysis. The app needs no installation and can be accessed via the link:
The GitHub project page is here (the quick guide is in README):
Please feel free to contribute, post any issues and suggestions!
Main app features:
1. Data manipulations (increase the resolution, normalize, trim the temps)
2. HRM analysis (melting curves, difference curves, denaturation curves)
3. Clustering (4 algorithms, 2 modes (on melting temps (Default mode), or on a whole data range (HRM mode)))
4. Find the optimal number of clusters for k-means and hierarchical algorithms
  • asked a question related to SNP Genotyping
Question
1 answer
I am working with Amaranth accessions which are green to red in color and contain high phenolic compounds. Using the CTAB method with the slight modification I am trying to get high-quality DNA so that it can be used for SNP genotyping.
So, please anyone can solve this problem? And help me to get high-quality DNA which is required for the SNP chip.
Relevant answer
Answer
Hi,
Have you tried to use non water soluble PVP during homogenisation?
  • asked a question related to SNP Genotyping
Question
3 answers
I am working with Amaranth accessions which are green to red in color and contain high phenolic compounds. Using the CTAB method with the slight modification I am trying to get high-quality DNA so that it can be used for SNP genotyping. So, please anyone can solve this problem? And help me to get high-quality DNA which is required for the SNP chip.
Relevant answer
Answer
Shantanu Das Can you write all the protocol steps? Then we can discuss.
  • asked a question related to SNP Genotyping
Question
4 answers
Hi all,
I am trying to calculate homozygosity by loci from my SNPs data according to Aparicio et al., 2006 https://pubmed.ncbi.nlm.nih.gov/17107491/. I have a vcf file of 38,140 SNPs with 60 individuals. Does it sounds sensible to use Cernicalin? It seems that the current Cernicalin version can only handle 30 loci, 200 alleles per locus, and 1200 individuals. There are also two R packages developed for calculating homozygosity indexes,
rhh (Alho et al., 2010. https://onlinelibrary.wiley.com/doi/full/10.1111/j.1755-0998.2010.02830.x) and Genhet (Coulon 2010. https://onlinelibrary.wiley.com/doi/pdf/10.1111/j.1755-0998.2009.02731.x). However, it's not clear how the input files were prepared. The genotype information looks different from different datasets. Can the genotype information be represented as GT format extracted directly from the vcf file?
Thanks,
Jia
Relevant answer
Answer
Maybe you could give a try to Genepop (https://genepop.curtin.edu.au/) or Arlequin (http://cmpg.unibe.ch/software/arlequin35/) and do the file conversion with PGDSpider (http://www.cmpg.unibe.ch/software/PGDSpider/)
  • asked a question related to SNP Genotyping
Question
1 answer
Hi, I have two data sets from the illumina omniexpress snp array platform. The first data set was mapped using the GRCh37 build and the second one was more recently read using the GRCh38 build. Not surprisingly when I've tried to merge the files in PLINK for a larger analysis it comes up with the warning snp rs... is in a different genetic position. Is there any way to update the build of the first data set? Or suggestions for how best to proceed, I haven't done much genetic analysis before so any help would be welcome :)
Best,
Mari
Relevant answer
Answer
Hi,
Between GRCh37 and GRCh38 some of the items should be the same. For those that differ, I recommend the website:
Unfortunately, I'm afraid that with a large amount of data, the conversion may be a time-consuming process, but it's always a solution.
Hope this helps! Feel free to ask if you have questions about how to use the program to convert.
  • asked a question related to SNP Genotyping
Question
3 answers
Dears,
I need help with the interpretation of why not obtain results in real-time PCR when I use Taq man SNP genotyping assay ID: rs 22756913 (IL17A) from Thermo fisher scientific
where the final volume reaction was10 microliter
Taq man SNP genotyping assay (20X) working was 0.3 microliter
DNA sample volume was 3 microliter with a concentration of 0.6 ng
the master mix was 5 microliter
nuclease-free water added to complete the total volume of 10 microliter
but when I use another SNP ID: rs 40401 from the same company (Thermo fisher ) with the same concentration and the volumes, I obtained results in real-time PCR
Regards.
Relevant answer
Hi Noor Nihad Baqer , probable explanations are one or more of the following:
1. Your samples only contain just C or T alleles (emsembl reports A/G). Your probes are designed to detect just A or G. Different dog breed maybe?
2. DNA is degraded or contaminated with a PCR inhibitor.
3. Not enough sample, if you are using an standard master mix is better to use from 10-100 ng. Fast master mix 1-10 ng. In practice 33 ng always works well for both master mixes.
4. TaqMan assay must be 0.5- 1 uL for your 10 uL of reaction volume (at least for that particular assay)
5. Degraded probes due to light exposure or more than 3 thaw cycles.
6. Suboptimal probe design, CCC or more in a row, etc.
7. Primer dimerization due to high affinity or previous F & R mixture.
8. Change in the PCR cycle.
I hope this may be useful.
  • asked a question related to SNP Genotyping
Question
6 answers
i want to find a specific SNP in a fASTA sequence for a gene?
Relevant answer
Answer
Pairwise with dots for identities under alignment in BLAST gives SNPs
  • asked a question related to SNP Genotyping
Question
3 answers
I am trying to look for positive selection in the genome of a non-model specie, therefore I do not have chromosomes structure, I just have scaffolds and positions of SNPs in a VCF file.
I used different commands to create the input file, but the result is the same error message “Please specify a chromosome name. Conversion stopped”.
Could someone please suggest to me how do it?
Thanks in advance.
Relevant answer
Answer
Change sequence identifiers from scaffolds to chromosomes
  • asked a question related to SNP Genotyping
Question
3 answers
Can anyone suggest a reason why the association between my SNPs (90 thousand) and two quantitative traits (BMI and age) has a FDR-adjusted P-value over 0.9 for all my SNPs? I think it is very unlikely that not a single SNP is associated... They come from ichip genotyping and I did an imputation with the Michigan Imputation Server.
I had my files in PLINK vcf format and I transformed them to HapMap using TASSEL, and my phenotypes are in a tab separated file just as in the GAPIT manual.
My code was:
myG_tAPIT <- GAPIT(
Y=myY_t,
G=myG_t,
PCA.total=3,
model=c("CMLM")
)
Any ideas?
Thanks!
Relevant answer
Answer
Hi, Dr Sanchez, maybe you can try the "GLM" model.
Sometimes, the CMLM model is not suite for your data.
For instance,
If the observered point set showed lower than the expected, it means that there maight be a overcorrection by K-matrix or Q-matrix.
At the moment, you should change the model according to the QQ plot, namely remove the parameter of K or Q.
I hope I can help you.
  • asked a question related to SNP Genotyping
Question
3 answers
Hi,
I am doing a GWAS analysis of european bison's a dog's SNPs. Right now I am doing multiple data clean up, according to previous studies and what I have learned.
I was thinking, if it is ok to test HWE and discard data that does not fit if I am working with animals, where mating is controlled and inbreeding coefficient is high?
And what about tests for population structure etc.? It is obvious that there is going to be some structure in reintroduced and domestic animals. I don't want to loose any important data and I want to be sure, that I have done everything correctly.
Thank you for your help.
Relevant answer
Answer
Lenka Ungrová If your population is large enough, it is recommended to perform HWE test.
I feel these papers give you some insight into which decision might be right for you:
  • asked a question related to SNP Genotyping
Question
2 answers
I want to design BaseScope probes for human samples of a certain gene in his Wild-type and his mutated form (single nucleotide point mutation). To do that I need to provide stable cell lines that overexpress the WT or mutated form. Does anyone know a way to circumvent the hassle to generate a stable cell line from scratch? Is there any database of cell lines commercially available with SNP’s information?
thanks!
Relevant answer
Answer
The COSMIC database will likely be useful for you:
HeLa:
Many other cell lines:
Of course, these cell lines might not be homogeneous and any clone you pick or the strain you have might have some differences (even ploidy differences) from the strain used to generate these sequencing data.
  • asked a question related to SNP Genotyping
Question
3 answers
I will be using Axiom Genome-wide Human Origins 1 array. Is there a way to tell whether a sample is contaminated with DNA from another sample during the extraction stage (for example, excessive heterozygous calls)? Is there a way to eliminate alleles of the contaminant from the data? What is the minimimum proportion of contaminating DNA which can be detected by most SNP arrays?
Relevant answer
Answer
Hi, dear
The best to take a look at the allele frequencies, because contamination would give low allelic frequencies.
  • asked a question related to SNP Genotyping
Question
6 answers
I would like to separately perform Genotyping-by-Sequencing (GBS) on individuals from different non-model species. These species have no close phylogenetically species with complete genome sequenced. Therefore, using GBS I will have a set of SNPs genotypes from genomic DNA for each species. Is currently there a method that can allow me to detect and isolate only loci, among those I have retrieved, that are shared and genotyped in both the two species?
Many thanks!
Paolo
Relevant answer
Answer
How fortune i am finding this great question and answer.
I am a junior researcher and recently working with rice, just want to know how to retrieve SNP data from one specific gene in many rice cultivar.
What kind of database i should go for this purpose?
Thank you for your responds
  • asked a question related to SNP Genotyping
Question
2 answers
HI,
I am carrying out a project on association of DC SIGN CD209 polymorphism (rs4804803) with severe malaria. I need journal publications with specification on PCR-RFLP cycling condition and incubation temperature (for the restriction enzyme) for the SNP genotyping.
thank you.
Vincent.
Relevant answer
Answer
Follow
  • asked a question related to SNP Genotyping
Question
2 answers
What would be the best model/algorithm for Genetic Risk Score calculation (for risk prediction) based on genotype information of multiple SNPs.
For example, I have identified 10 SNPs that are reported to increase the risk of Atrial Fibrillation in several GWAS studies. I've curated data, like OR/Risk allele frequency, for these 10 SNPs. Now, if I get a random sample from a person, in whom all these 10 SNPs are genotyped simultaneously. I would like to calculate cumulative risk score of this person for atrial fibrillation using this genotype information from all 10  SNPs (like additive model). Would it be feasible to calculate such risk score? If yes, can any one suggest/share views/algorithms?
Relevant answer
Answer
Good luck ...
  • asked a question related to SNP Genotyping
Question
6 answers
I have a DNA sample which was ran as a duplicate on a real-time qPCR experiment for the purposes of SNP genotyping. Bearing in mind that only one of its two technical replicates amplified, can I consider the genotyping result valid, or do I need to omit it from any further analysis e.g. qPCR call rate and deviation from Hardy Weinberg equilibrium?
Relevant answer
Answer
Thank you Mustafa A Adhab , negative controls performed as expected indicating no nucleic acid contamination for the reagents. Sadly there was no capacity to test the sample in a normal PCR, however it was ran on Nanodrop prior to amplification and it bore a yield of 357.5ng/μl, with a 260/280 ratio of 0.99. Samples with even lower 260/280 ratios amplified successfully nonetheless.
  • asked a question related to SNP Genotyping
Question
13 answers
I'm trying to understand epigenetic variability but the study I'm performing involves measuring the gene expression in a given region according to the SNP's alleles associated with the increase of that gene's expression. But epigenetic is related to heritable variations that affect the phenotype without affecting the DNA sequence, so can SNPs be used in this study?
I'm new in the genetics field and really confused in this matter, any help would be appreciated! Thank you
Relevant answer
Answer
Parham has given a correct answer, SNP is not an epigenetic modification, it's a genetic modification, it's a mutation. But SNP could be an excellent marker to correlate different levels of allele expression and allele structure. Even though the SNP could not be directly responsible of this variation in expression level, it can be linked to other mutations on the same allele sequence so giving the opportunity to establish the causal relation.
  • asked a question related to SNP Genotyping
Question
1 answer
Which of the following SNP Tping Chemistries for large scale genotyping of plants?
Thermo - TaqMan®
Fluidigm - SNP Type
rhAmp SNP Genotyping
LGC - KASP
rhPCR
high resolution melt (HRM)
DASH-2
ARMS PCR
cleaved amplified polymorphic sequence (CAPS)
SNP Array
Qbead system
Can any body provide insights for the above assays in terms of cost and applicability, advantages and disadvantages of the assays for crop breeding.
Relevant answer
KASP is the leading technology in crop breeding. Flexibility and accuracy is the main reasons. You can genotype hundred of SNPs in few samples and the other way around. It is cost effective, but LGC is increasing the price every year. However there are cheaper alternatives for the same technology:
rhAmp SNP genotyping, very good theoretically, but never tested for me. However I will come back to you in some weeks because we are planning to test this technology.
One advantage of KASP and rhAmp is that they are able to genotype indels. Indels are more and more important in crop breeding. Several trait markers are indels.
Taqman and HRM are good technologies, but expensive and similar to KASP, no great advantages. Fluidigm has a fixed number of SNPs, not useful for trait markers. Just to quality control or something like this.
The others are quite old, forget them.
Final conclusion: try Kasp and rhAmp and decide according to your interest. Fluidigm can be an option if you don't mind fixed number of SNPs.
I hope it helps.
  • asked a question related to SNP Genotyping
Question
4 answers
Hi everyone,
I have a computer background without any biological knowledge. For one part of my PhD I have been given 44 samples of data (I attach a part of one sample to this question). I know it is a vcf file but wanted to know the exact type of data ( SNPS, RNA Seq ,...) to find suitable tools to work on them. I should calculate the similarity between each pairs ( two patients) and build a similarity matrix based on them to give it to a classifier and ...
I want to know which similarity measurement method is used for this type of data ( for example jaccard) and is there any tools to calculate it.
your help and biological knowledge is highly appreciated.
Relevant answer
Answer
It's SNP's data the Allelic Depth (DP) must be atleast above 30 if you need highly filtered SNP's.and to calculate the similarity beween the pairs do a plast in ncbi you will have a matrix and the percentage of similarity with other genes
For manipulation and analysis check out the website https://blast.ncbi.nlm.nih.gov/Blast.cgi
  • asked a question related to SNP Genotyping
Question
11 answers
Dear my professors and colleagues,
I prepare a review about use biotechnology and molecular marker at poultry breeding.
I want your help.
Relevant answer
Answer
Hello Dr. Esteftah
I recommend for you the following chapter
"Selection Methods in Poultry Breeding: From Genetics to Genomics"
Good Luck
  • asked a question related to SNP Genotyping
Question
22 answers
I can't seem to identify conclusively if these are homo or heterozygous. How should I best do this? How high should the second peak be for me to take it as a heterozygous SNP? These are from human samples.
Relevant answer
Answer
Following...
  • asked a question related to SNP Genotyping
Question
3 answers
I am planning to standardize a TaqMan duplex PCR assay for allelic discrimination. I intend to use FAM and TAMRA for one probe. Which dye and quencher would be best for the second probe that can work efficiently in combination with FAM?
Relevant answer
Answer
Yes is the same of FAM. When you order your probe, you need to label at 5' with JOE or VIC or FAM and at 3' with the quecer (this depend of which type of dye you use). You need to chose the dye in function of the spectra emission and in function of the dye is possible to detect from your qPCR.
  • asked a question related to SNP Genotyping
Question
3 answers
I have data for which differential gene expression was found. Now I am interested in finding SNPs only in the genes which are differentially expressed (gene name is known).
Could someone suggest me how to predict SNPs for the specific gene alone?
Relevant answer
Answer
Dear Sathya
What kind of data are you working on? Is it WTCCC data or something else?
  • asked a question related to SNP Genotyping
Question
4 answers
I am working on building a linkage map based on SNP genotypes from a single full sib family. Most other studies I've seen have used JoinMap. I wanted to know if anyone had used JoinMap or any good alternatives (maybe in R?) that could still be used with full sib data. Is there a large computational bottleneck with any of these softwares?
Relevant answer
Answer
Thanks for answering everyone! I have decided to use Lep-MAP3 (thanks Rishi De-Kayne !). It is free software that seems to have fairly high accuracy with efficient computing times, and can deal with multiple full sib families.
  • asked a question related to SNP Genotyping
Question
21 answers
Which type of PCR techniques is the best in your opinion?
There are several different types of PCR techniques. Any technique is used as the best for most reliable, accurate diagnosis, fast, and lower cost.. in your opinion? Why?
Please, Share your experience in this field.
All appreciation for all contributions.
Relevant answer
Answer
Dear Dr.
Shaban A. A. Abdel-Raheem
, Thanks for your participation in this discussion.
Regards.
  • asked a question related to SNP Genotyping
Question
2 answers
Hi all,
My research related to identifying SNP combinations associated to disease. I found that there are many methods to measure the association between SNP combination and disease. Could you show me some good measure methods for this issue?
Thank you in advance!
Best wishes,
Relevant answer
Answer
Hi there,
I understand that your question is about evaluating the association between a set of SNPs, in relationship to a disease. I thus imagine that you have at your disposal SNPs data for cases (individual with the disease) and controls (comparable individuals without the disease).
There are different scales to evaluate this association. A key information that could hold for all of them: SNPs are associated with a disease if they bring you information on the risk that an individual has the disease. Therefore, any classifier that tries, from SNP(s), to predict if an individual has the disease is susceptible to be used to assess association.
1. At the individual SNP level
a) Indepentently
Here you consider each SNP without considering the other. The most used association measure for that are Chi-Square and Fisher's Exact test. Statistics may have to be controlled for eventual population substructure (see ).
Some more complex method also exist (see Paragraph "Deriving a gene-wide test statistic for each gene" in ).
b) In relationship with the other SNPs
Any classifier that is able to give you information of the feature's importance can be used to assess the level of predictivity given by a SNP in relationship to the other. For instance, the weight of a logistic model trained to predict cases can be used. For more complex models, the loss of accuracy occurring when you permutate the values of a SNP can give you a sense of its importance in the prediction.
2. At the set of SNP level
To compare multiple set of SNPs that may be good predictors of a disease, you can use the accuracy (or any metric that evaluate predictive quality) of a classifier (e.g. Random Forest, Naive Bayes Classifier, etc.).
For instance, if you ranked all SNPs based on their individual association, you can evaluate the top-N SNPs by training a model and computing its accuracy, to identify at which point adding a SNP does not improve the model anymore.
A few additional words:
Consider Cross-Validation in order to avoid overfitting when finding best parameters of a model.
Do not forget to evaluate your association measures (e.g. accuracy of a set of SNP or their individual association) on an independant validation set, in order to have a real sense of the replicability of your results. See for instance [1] to understand a bit more about overfitting.
  • asked a question related to SNP Genotyping
Question
4 answers
Hi,
I have just a list of SNPs located in a specific gene, and I want to select Tag SNPs for genotyping. It was possible to do it using Haploview but Hapmap data are not updated anymore.. It is possible to do it using PLINK but I don't have a .ped file because I still don't genotype any sample. So, I really don't know how can I select TagSNPs with only a list of ID (RS) SNPs. Can anyone know a way to do it?
Thanks in advance,
Relevant answer
Answer
Hi David,
A tag-SNP is a SNP that represent the LD pattern of a region. Therefore, without this information (Ld), it is not possible to select tag-SNPs. You can use the LDLink web-page (from NIH) to check the LD pattern around your candidate regions or between your candidate SNPs. You can use the LDHap option to check the LD between markers using only the rs. However, I guess that there is no option to extract tag-SNPs from the results. You will have to do it manually or to extract the LD results and to apply in another software that performs the selction of the tag-SNP. Additionally, you can also select your SNPs based on the functional evidence.
  • asked a question related to SNP Genotyping
Question
3 answers
I will using tetra-primer ARMS PCR for genotyping specific SNP in adiponectin gene. I have taken the  4 primers designed in a previous paper. but I noticed that the paper used one annealing temp for the 4 primer in one pcr reaction. I don't know will this give accurate result or not. I want to know is it better to use 2 annealing temp (one for outer primers and one for inner primers) but I don't want how many cycles for the Ta of outer primers and how many for Ta of inner primers
Relevant answer
Answer
I used one annealing temp, one tube, and so that the same cycle number.
  • asked a question related to SNP Genotyping
Question
3 answers
we are using arms pcr method for snp genotyping so i want to know the principle behind this pcr and how it differs from normal pcr
Relevant answer
Answer
I hope this article will help you
PRIMER1: A Network Service for Tetra-Arms PCR Primer Design Based on Well-Known dbSNP
  • asked a question related to SNP Genotyping
Question
7 answers
I'm attempting to use a Biomeme two3 portable qPCR machine for SNP genotyping as part of an MSc project. I am using DNA from lizard blood samples purified with the Biomeme M1 purification kit and PrimeTime primers and probes designed and manufactured by IDT for our SNP of interest.
The amplification curves we've obtained so far look extremely strange. Our most recent runs consistently show a small, early increase in fluorescence around cycle 10, which almost immediately levels off (after 2-3 cycles). Two previous qPCR runs (with the same reagents) show patterns of increasing and decreasing fluorescence which seem almost random - at the time we assumed this represented background noise, but the relative changes in fluorescence were actually much higher than for more recent runs, suggesting that perhaps something more was going on?
Any help interpreting this somewhat perplexing data would be much appreciated!
Relevant answer
Answer
Hi John,
Please send me an email and I will be happy to trouble shoot with you. I am a senior molecular biologist at Biomeme and will definitely be able to help you further.
My email is: mieke@biomeme.com
Thanks
  • asked a question related to SNP Genotyping
Question
4 answers
Generally, my data will be falling into a 3 x 3 contingency tables.
               AA  AB  BB
Low
Mid
High
We are looking at adrenal suppression by measuring post-metyrapone test ACTH values which are grouped as Low-range(<106pg/ml), Mid-range(106-319pg/ml) and High-range(> 319pg/ml). We have 5 SNPs data from TaqMan Assay. We want to look at which SNP genotypes/alleles are associated with our grouped ranges. One of the SNPs did not have the BB genotype observed in our data.
So we have:
                       AA            AB
Low                 33            3
Mid                  24            5
High                26             5
1. Could this locus be informative without the BB genotypes?
2. Can I still do HWE analysis on these?
3. How do I calculate the risk of A or B?
4. How do I check the association of these genotypes with the Phenotypes (what will be the measure of association)?
5. What association test will be appropriate for the general 3x 3 data?
6. Which software or tool or publication can be worthy of use or reading?
I will very much appreciate your kind responses.
Regards
Wisdom
Relevant answer
Answer
I am not much into it , but if you have genotypes and categorical phenotypes you may use multinomial regression in SPSS putting genotypes in factors and each trait in categorical format (1 as low , 2 as mid....etc) in dependent variable. The pairwise estimation may give you an idea of particular genotype associating with particular category of response. You may also use Popgen for population genetic data (HWE, Het, Hom etc).
PS- I am not much into this sort of data, so i am just giving a guess, please wait for much better answers and we will see how it works out.
  • asked a question related to SNP Genotyping
Question
3 answers
I purchased Custom Taqman SNP genotype assay mix from ThermoFisher around April 2018. All the assay mix was aliquoted into small volumes upon arrival. So I use up an entire tube per experiment instead of freeze-thaw, and they are light sensitive so I cover them with aluminum foil. The basic principle of the assay mix works like this: Primer 1 has base A and is attached to VIC fluorophore, primer 2 has base G and is attached to FAM fluorophore. Depending on which SNP is in my DNA samples, the primer will be conjugated and the corresponding fluorophore will emit fluorescence at a certain wavelength. The signal amplifies as rxn goes.
I recently extracted rat genomic DNA using the Qiagen DNeasy Blood and Tissue kit. The concentrations, 260/280, 260/230 were measured using a nanophotometer. The concentrations were above 200ng/ul and ratios are all above 1.8. I furthered confirmed the integrity by running a DNA gel to make sure there is genomic DNA.
However, when I tried to genotype the SNP in my genomic DNA, the reaction doesn't work. I've tried with 10ul rxn/ well instead of the suggested 25ul because I'm just testing, I did this by dividing every component in the well by 2.5. So in each reaction I have: 5ul of EagleTaq Universal Mastermix, 0.5ul of 10X or 20X SNP assay mix, and 4.5ul of genomic DNA at 5ng/ul, 10ng/ul and 20ng/ul. None of these works.
The program was adapted from the Taqman SNP genotype manual. In the protocol, I added a 50C 2min pre-incubation according to one of the troubleshooting technicians. Then it's 10min of 90C activation, then 40 cycles of 15secs of 90C and 1min of 60C. Then just a 1 minute 40C cooling step. The result is that there is no amplification of fluorescence.
Can someone please help me and suggest possible reasons for why there is no fluorescence amplification? Could it be not enough SNP assay mix? mastermix? or DNA? Please help!
Relevant answer
Answer
It looks to me like your samples are not amplifying at all.
Did you test out your DNA samples with regular PCR to make sure that they amplify? That way you know for sure that your DNA has the region you want to amplify.
Second, double-check the settings on your real-time machine to make sure it's reading at the correct wavelength for your fluorephores and that the cycling conditions are correct.
Good luck!
  • asked a question related to SNP Genotyping
Question
2 answers
I have 50 patients genotyped using Illumina Infinium Global Screening Array and I wonder whether it is possible to type the HLA allele in each patient using the SNP data I have. GSA data sheet say it has 400+ bio markers in HLA genes. Do I need to impute the SNPs or can I directly identify whether the Tag SNPs are present in SNP genotype data and infer the HLA allele quickly?
Your help is much appreciated.
Relevant answer
Answer
Hi,
I suggest using one of these approaches:
Good luck!
  • asked a question related to SNP Genotyping
Question
10 answers
I’ve extracted dna from animal tissue using Qiagen dnaeasy blood and tissue kits. The tissues are frozen at -80 before I took pieces from them and extracted dna. The ratios of 260/280 and 260/230 are all between 1.8 to 2.2. The concentrations vary a bit between 100 to 700ng/ul. I usEd Qiagen elution Buffer to elute , but used water to dilute dna because I know that edta will affect downstream SN genotyping. The dna samples are stored at -20C unless I take them out for dilutions. However, when ran on a dna agarose gel of 1% at 75V for 1.5hr, with dna concentrations between 15-40 ng/ul loaded at 5 ul per well, there is visible degradation. I am planning on doing SNP genotypinG after extraction and am not sure if this will be successful with partially degraded DNA. Ideally undegraded genomic DNA is needed. Can someone please suggest what else I can do to Improve this or rescue this? Thank you!
Relevant answer
Answer
it looks like high molecular weight dna mainly so I would expect it to digest and amplify well especially as the OD ratios look good. I would go ahead with these dna samples
  • asked a question related to SNP Genotyping
Question
5 answers
I have been working with SNP markers and I want to calculate Jaccard distance using NTSYSpc 2.2. I tried to prepare my SNPs data as 0, 2 for homozygotes and 1 for heterozygote but it did not work out. Could anyone tell me how to format input data. Any suggestion would be appreciate.
Relevant answer
Answer
Dear Nguyen,
No problems. I have described you the base of the way to arrange the data, after which you could use any distance calculation, Nei as well I used it was fine. and any R package or software.
Still you did the right way - searched for the tool which showed you how to structure your data and perform the calculations.
Good luck!!!
  • asked a question related to SNP Genotyping
Question
5 answers
I am on the periphery of several projects looking at developing SNP genotyping arrays for relatively small numbers of markers, and to be run on relatively small numbers of individuals: potentially hundreds of individuals total, but with runs capable of doing few individuals at a time.
The SNP discovery phase has already been done through GBS.
I am wondering if anyone has any opinions on the best method to do this?
The questions to be answered with the data are things like: 1) matching an individual predator to predation marks, 2) relatedness of invading individuals, 3) landscape genetics.
I am guessing something like SNaPshot would be most cost effective for when just tens of markers are required? But what about when more markers are needed?
Obviously just doing more GBS would be best when thousands of markers are needed - probably landscape genetics questions would be best served by doing this and waiting for individuals to fill the runs. But is there a cost effective option somewhere in the middle in terms of marker number?
Relevant answer
Answer
We've had success with MassArray/iPlex chemistry (Agena, formerly Sequenom), and this continues to be our go to platform for pilot studies (fee hundred samples) as well as few thousand when samples are limited (saliva), and when the number of polymorphisms is small to medium. Our designs cover in average 29 SNPs at a time and the PCRs and extensions are done in 384 plates, using small volumes (cost saving). Another advantage is that depe ding on the surrounding sequence one can also test small indels. If you have a set of target polymorphisms, this is great, otherwise sequencing or pyrosequencing, as you get to take a peek at the surrounding nucleotides.
  • asked a question related to SNP Genotyping
Question
5 answers
Halo, I would like to design probes for SNP genotyping. Did anyone design such probes and can give me a few tips how to handle?
Thank you,
Joanna
Relevant answer
Answer
OK,
thermo fisher offers more than 10 million predesigned assays for 25 species. take some time in their website and see if your targets are waiting for you in their lists (http://www.thermofisher.com/fr/fr/home/life-science/pcr/real-time-pcr/real-time-pcr-assays.html?gclid=Cj0KCQjw9LPYBRDSARIsAHL7J5msnI54CPU9Jz71cou1ygsCuVi3V1wnpgk3k5p3XFxSn-m5JMmzdpwaAk4hEALw_wcB&s_kwcid=AL!3652!3!245695868609!b!!g!!taqman&ef_id=Wh@vxgAABFfpjRHK:20180530064023:s).
good luck
fred
  • asked a question related to SNP Genotyping
Question
5 answers
I am doing SNP genotyping on Lightcycler 480 using taqman probes and Taqman Universal Master Mix with UNG. Sometime the software get a low signal for heterozygous samples and the software is not able to call it as "both alleles". Will there be any differences if I include the UNG activation step in my protocol? Is there any way to increase the heterozygous call of FAM and vic?
Relevant answer
Answer
I am in GMO analysis, if I use the TaqMan Universal Master Mix, without UNG, should I suppress the incubation step of the PCR-RT cycle? Hell me please.
  • asked a question related to SNP Genotyping
Question
2 answers
Hello,
I would like your comments/suggestions for my strategy.
I have F0 samples with two different phenotype. I have F2 samples with unknown phenotype. I would like to create a library with homozygous F0 genotypes.
Then i would like to genotype my F2 samples using this previously created library.
I already pre-processed BAM files (I have all raw data if required) : bowtie2 and samtools.
STRATEGY: 1) Create genotype library with F0 samples:
  • GATK HaplotypeCaller for both F0 phenotype samples : java -Xmx30g -jar GenomeAnalysisTK_3-8.jar -nct 16 -T HaplotypeCaller -R GENOME --emitRefConfidence GVCF -I INPUT.bam -o OUTPUT.g.vcf
  • Merge the results: java -Xmx16g -jar GenomeAnalysisTK_3-8.jar -nt 16 -T GenotypeGVCFs -R GENOME --variant F0Variant1.g.vcf --variant F0Variant2.g.vcf -o Results_Merge_F0.vcf
  • then i used a homemade script to select only position with homozygous genotype and different genotype between both F0 phenotype samples (like 1/1 for a F0 sample and 0/0 for the other one): Results_Merge_F0_filtered.vcf
2) Genotype F2 sample with the library:
  • GATK HaplotypeCaller : java -Xmx30g -jar GenomeAnalysisTK_3-8.jar -nct 16 -T HaplotypeCaller -R GENOME --emitRefConfidence GVCF -I INPUT.bam -o OUTPUT.g.vcf -L Results_Merge_F0_filtered.vcf
  • then i used a homemade script to identify genotype related to one (or the other) F0 phenotype.
BUUUUUUT ... At this last step i mostly got homozygous SNP for my F2 samples... I should get around 25% phenotype1 -- 25% phenotype2 -- 50% phenotype 1/2 I miss something but I don't know where.
Relevant answer
Answer
Thank you for your advices.
I will try theses steps using homemade Perl scripts.
  • asked a question related to SNP Genotyping
Question
6 answers
I have used the following procedures but am still having strong primer dimer bands while the desired product is absent.
The cycling conditions for PCR program that i used were 5 min at 95C for activation followed by 35 cycles of 95oC for 30 s for denaturation, 55oC for 30 s for annealing, 72C for 30s for elongation and a final cycle 72C for 10min for final elongation
the primer used are
F: CAGCCATACAGGGCATCCAG
R: ACAGATGGGTGTGTGGGGAT
and expected pcr product size is 250
please i need a help about what is wrong
Relevant answer
Answer
Hi
I hope you are fine
- First, make sure of the nucleotide sequence from Blast Program online
- it is possible to give the time instead of 30 seconds to be 1 minute in all cycles (denaturation 1 min, annealing 1 min and extension 1 min) to give chance of primers for catch in gene target
- using dilut concentration of DNA
- Try different annealing temperatures of 50 to 65 (gradient PCR)
Good luck
  • asked a question related to SNP Genotyping
Question
7 answers
I want to get data about different SNPs i am working on. especially their role in change in structure and function of gene and regulatory regions. What are different tools available for this type of analysis?
Relevant answer
Answer
Hi,
If you are talking about the sequencing technology, PacBio is the best for the SNP detection. Oxford nanopore is also offering fast and high-throughput sequencing for the SNP genotyping. Another advanced method in SNP genotyping is the SNP microarray, which uses thousands of probes. In comparison to the sequncing, other methods may or may not be well established, since SNP analysis is largely dependent on the suitable probes/primers/troubleshooting/previous studies available. There are a bunch of methods available which can be used as per the resources available.
Since SNP detection from the known SNPs and SNP discovery are two different things, there is some brainstorming needed before choosing for the method of analysis.
  • asked a question related to SNP Genotyping
Question
7 answers
Could anybody please explain how to translate the output from GENELAND of population memberships to use in the program DISTRUCT or if there is some software/R code to convert it please? 
Relevant answer
Answer
Hi Simone. In case you remember (was already a year ago), how did you manage to run DISTRUCT from Geneland results ? I changed by hand the proba.pop.membership.indiv.txt into the .indivq file needed. However, geneland does not provide a population file and Distruct is not running without this .popq file...
  • asked a question related to SNP Genotyping
Question
4 answers
I am reporting some cross-phenotype genetic correlations (LDSC method) that are statistically significant after multiple test correction. Peer reviewers have asked us to report top SNPs that might contribute to these correlations. I understand that genetic correlations reflect a genome-wide effect, but I want to satisfy reviewers/readers. Has the field arrived at a best practice for identifying/prioritizing SNPs/regions that mediate a significant genetic correlation? I have a few ideas below, but hope to get feedback from the community of experts.
1) It would be simple to threshold the 2 sets of GWAS results at some p-value (arbitrary threshold?) and compare the overlap of the two results. The surviving results could be clumped to LD-independent markers and nearby genomic features could be reported. Simple, but arbitrary selection.
2) Meta-analysis has been a common approach for combining data across phenotypes. We might see some signals persist or become more significant if a SNP is associated with both phenotypes. We also might see some associations driven by only one phenotype, so this method isn't exactly specific to SNPs contributing to correlation. Overall, this approach doesn't seem to answer the exact question I'm asking. From my cursory understanding, an MTAG analysis also wouldn’t exactly answer my question.
3) pHESS is a new method that seems promising for addressing my question. https://www.biorxiv.org/content/biorxiv/early/2016/12/08/092668.full.pdf; however, this approach does not control for ancestry stratification as nicely as the LDSC method.
As an extension on this question: Have we arrived at methods for assessing whether the SNPs/regions mediating a genetic correlation are significantly enriched for biological pathways/functions? The approach laid out here seems relevant, but I haven't had a chance to read the article (https://www.biorxiv.org/content/early/2017/03/07/114561)
Relevant answer
Answer
Hi Daniel,
you mentioned only the tools to list important SNP/variants. I think you should add some pathways analysis as IPA or its free counterpart DAVID to select SNP and of course neighboring genes to go further the analysis and get closer to the pathology.
fred
  • asked a question related to SNP Genotyping
Question
11 answers
Hi! We've recently purchased Fisher TaqMan SNP APOE Genotyping Assays and have been running them on brain samples with the BioRad CFX96 thermocycler. Test brain samples are supposed to be homozygous E3/E3 and we don't seem to be getting that. Has anyone had experience with this assay and have any tips on analyzing these results in terms of CTs or data interpretation? The protocol was less than helpful, and we're having difficulty finding help online. Thanks!
Relevant answer
Answer
If you're using brain tissue it may be that you have to perform a clean-up on your template. Brain tissue is very fatty and can often require adjuvant addition to the PCR reaction (DMSO or Betaine) to ensure reactions are successful or require you to first clean-up your sample.
Try running your PCR product on a gel before performing allelic discrimination to ensure you are getting amplification as well. If you don't see bands it means the PCR isn't working and it's not a probe issue. You need to optimize. This could mean (1) diluting your template, (2) adding an adjuvant, (3) cleaning up your template first, (4) MgCl titrations, or (5) gradient PCR to find the optimum annealing temperature for you experimental conditions
  • asked a question related to SNP Genotyping
Question
5 answers
SNP Genotyping using T-ARMS PCR.
Relevant answer
Answer
Hi, this technique need more optimization to give good result, but in general the primer concentration outer primer to inner primer forward is 1:10 more acceptable with some exception to less than this concentration
  • asked a question related to SNP Genotyping
Question
1 answer
I want to use HRM analysis for SNP genotyping and I need to amplify just the fragment of target gene that include these two SNPs.
Relevant answer
You should follow NCBI.
  • asked a question related to SNP Genotyping
Question
5 answers
Hello,
I am working with SNP genotyping by PCR-RFLP method. When I put the FASTA sequence of my target region in the NEB cutter to know the relevant cut site of the restriction enzymes, it shows cleavage affected by cpg methylation or cleavage affected by other methylation in case of some enzymes. Should I use those enzyme for genotyping my SNP locus?? I think if there is any methylation in DNA then restriction enzymes will not cut that site. Then how can I determine the polymorphic allele?? Is there any in silico method to check DNA methylation before doing RFLP??? Please suggest.
Relevant answer
Answer
  • asked a question related to SNP Genotyping
Question
2 answers
In 23&Me files for personal genetics services, I found some genetic variants named like (i6033918) and the genotype is (II). What does this mean?
Relevant answer
Answer
I'm sorry, but I don't know this website.
  • asked a question related to SNP Genotyping
Question
7 answers
I am looking at genotypes caused by allele A and allele G. The minor allele is allele G with frequency 0.01. Consequently I did not get recessive homozygotes during genotyping of patient samples. So, is it necessary to test whether these genotypes are in HWE ?
Relevant answer
Answer
Yes HWE is necessary to test. If the alleles you are testing are in HWE, it represents the real population and your results are justified and correct.
  • asked a question related to SNP Genotyping
Question
22 answers
I am going to inspect positive selection for a list of SNPs (more accurately loci which contain my SNPs). I am trying to use the rehh package in R for working on 1000 genomes project phased data. 
rehh requires as input :
  1. Standard haplotype format. Each line represents a haplotype (the rst element being the haplotype id.) with SNP genotype in columns.
  2. A SNP information file: This data file should contain SNP information. Each line correspond to the SNP name, its chromosome of origin, its position on the chromosome, its ancestral and derived alleles (as coded in the haplotype input file).
I am novice in this field and really need your help to make such input files.
Any suggestion would be appreciated greatly. 
Relevant answer
Answer
Let me develop the ideas provided by Mohammed Ali Al Abri.
1. fastPHASE input files can be generated directly by plink. Just use `--recode fastphase-1chr ` option. One input file should include one chromosome. For example,
plink --file chr22 --recode fastphase-1chr --out chr22
2. Ancestry alleles are required for iHS metric and are not required for cross population one, e.g., XP-EHH.
3. fastPHASE is out of date. It was fast indeed a few years ago. Now it is slowPHASE :) I advice to use SHAPEIT or BEAGLE.
  • asked a question related to SNP Genotyping
Question
8 answers
Anyone using BFDstar and trying to get their SNP alignment in binary format?
They have a pretty good tutorial, but I am having trouble using the R package phrynomics to convert my data.
I keep getting this message:
> TranslateBases(snpdata, translateMissing = FALSE, ordered = FALSE)
Error in `[<-.data.frame`(`*tmp*`, j, , value = c("B", "E", "G", "I", :
replacement has 12 items, need 37
And since I am not a R expert (way far from that) I can not figure out what that means, not even with all the google. Even in the phrynomics google groups they are not answering me. 
Anyone can help? That will be much appreciated. 
Thank you!
Relevant answer
Answer
Thank you Justin!
I figured out what I was doing wrong... But I will look you Gist up!
  • asked a question related to SNP Genotyping
Question
8 answers
Dear all,
I am trying to design primers for a AS-PCR and I am facing some problems with one of my target SNPs. As for the others, I have been considering the following when designing the AS primer:
-          The target SNP [A/G] will be the base at the 3’end of the primer
-          Introduced an extra mismatch at the 2nd (or 3rd) base closest to the 3’end of the primer
It worked great for two of my target SNPs. For the third one, the method kept on failing. I had only a couple of sequences available, and thus I have been sequencing the possible fragment around this target SNP (amplicon of around 100 bp only) to be able to check for other sources of variability. And yes, they are there:
GCAGCAAATCGGRMGMGCAGTG[A/G]GCGGATCAYTYYTTCACCTGCC
Flanking my target SNP, I have identified 3 other SNPs. My idea was to try and design degenerated primers that would still be allele specific. Made a clumsy attempt, without success. Does anyone have tips or suggestions on how to work this out? Thank you for your feedback.
Relevant answer
Answer
Hi Tania
now I see the limits of your request. as I imagine that you have some degenerated genomes, and guess your difficulties to design primers, next to or away form your target, I would design an ARMS protocol which is efficient, cheap and rapid (one reverse primer, common for the 2 separated PCR with forward primers specific for both alleles).
hope this help
fred
  • asked a question related to SNP Genotyping
Question
4 answers
Hi Everyone, 
I am working with data of SNPs, I want to do logistic regression analysis. 
In multinomial logistic regression, is it compulsory to choose most common genotype as reference? or I can choose any genotype as reference? 
In my one SNP (Genotypes: II, ID, DD), when I choose most common II genotype as reference than Odds ratio come out like 0.57, but on choosing ID genotype Odds ratio change to 1.67 with p <0.05. Is it fine to choose heterozygous genotype as reference?
Thanks,  
Relevant answer
Answer
Hi Misbah,
I think statistically, yes you can pick any category as your reference so long as you interpret your results accordingly. Genetically, I'm not sure it can be interpreted. Usually either homozygous group is used, or by alleles carried (eg. all carriers of "I" or "D").
By using heterozygotes as your reference, are you saying that the odds are higher in both homozygous genotype groups compared to the het group? Biologically, this is not possible.
I would first check how many individuals are in your homozygous rare group (DD). If the number is very low you might want to club all rare allele carriers together (ID and DD), for power or because your effect might be dominant. 
Hope this helps. 
  • asked a question related to SNP Genotyping
Question
5 answers
I am working with several immortalized cell lines and need a protocol to uniquely ID each, perhaps by genotype?  SNP array?  Anyone have a good SOP for this?  or know of a company that provides such a service? 
Thank-you in advance!
Relevant answer
Answer
Yes I think so, presuming you have unique variants in each line that are traceable (i.e. expressed variants for RNA-seq data) - However your mileage may vary :)  We have done variant calling from mRNA seq data using UNCqeR (https://www.ncbi.nlm.nih.gov/pubmed/26076459)
-Benjamin
  • asked a question related to SNP Genotyping
Question
2 answers
I have to do SNP gentyping assay, so I am planing to do Genotyping by sequencing (GBS)assay. However my problem is that I have reference genome of one parent but do not have reference genome of other two parents and moreover my material is also allotetraploid. In GBS I might be in trouble during assembling the reads of samples that do not have reference genome because its polyploidy nature will impose hurdles. Please suggest me something
Relevant answer
Answer
Create your own reference genome for the other parents from the assay.  It's pretty cheap to do and you can use the sequence of the known parent as a reference for the assembly.
  • asked a question related to SNP Genotyping
Question
8 answers
I have done the Real Time PCR for SNP genotyping with TaqMan probe genotyping assays for 1000 patients.  In this method, we have detected the SNP with VIC and FAM due to TaqMan. It is characterized by the green and yellow colors.
Now, how do we distinguish that whether VIC or FAM detected the SNP?
Thank you there.
Relevant answer
in this link:
tipiyng of rs1051730., after click en search.
after click in  assay ID. and this is your information:
Context Sequence [VIC/FAM]:
AGCAGTTGTACTTGATGTCGTGTTT[A/G]TAGCCTGGGGCTTTGATGATGGCCC.
VIC is allele A and FAM is allele G.
Best regards
Leonardo
  • asked a question related to SNP Genotyping
Question
6 answers
I want to genotype 3 specific SNPs from a single gene in about 150 human patients. I want the cheapest method, I'm a university student and I'm going to pay it by my own.
Thanks for your time and help.
Relevant answer
Answer
Definitely, PCR-RFLP or ARMS PCR would be the best... But I would like to mention if all the three SNPs are in vicinity enough to amplify in a single PCR, then PCR-Sequencing is also a cheap option. This will also give you the added information about any other polymorphisms located in the amplicon.
  • asked a question related to SNP Genotyping
Question
5 answers
In our research, we have three groups of genotypes: 1) Resistant, 2) Susceptible and 3) Unknown (maybe resistant, maybe susceptible).  We performed NBS (Nucleotide Binding Site) profiling in all genotypes and the results are ready as 0/1 scores. Thus, we want to find a way to make a correlation between the presence/absence of bands and the resistance/susceptibility of genotypes to guess the resistance status in unknown genotypes. I would be grateful if you could kindly share your experiences in this context.  
Relevant answer
Answer
Dear Dr. Levitchi,
Thank you for your helpful discussion.
I used biological replications and the NBS bands are reproducible.
For phenotypic observations, I have only  resistant/susceptible traits.
Do you recommend the Mantel test?
  • asked a question related to SNP Genotyping
Question
9 answers
As in my study I used set of 18 microsatellite markers for the genetic diversity study but I am not sure this number is relevant for my research.
Relevant answer
Answer
 You can use Multilocus program ... It does exactly as Hernando Rodriguez-Correa suggested.
If you find it difficult to get, I have a share link to the source I have...
Best wishes
  • asked a question related to SNP Genotyping
Question
22 answers
How can I calculate haplotype frequency from genotype frequency of two polymorphisms in the gene?
For example
Variant 1      cases         controls
CC                   20                40
CT                    50                 40
TT                     30               20
Variant 2          cases        controls
GG                       22               30
GA                       28                 40
AA                        50               30
I suggest haplotype as 
CG 
CA
TG
TA
Thanks
Relevant answer
Answer
Hi Rami,
Please see attached the instructions and an example input file I wrote a decade ago for Haploview input file generation:
1) PDF file: Instructions for generating Haploview input files
2) XLS file: an example of raw data and input files (check out all tabs)
I hope this helps.
Best,
Naser
  • asked a question related to SNP Genotyping
Question
3 answers
a project in our group investigating the association between specific and drug resistance. One of these SNPs has been reported that the wild type alleles GG have been altered into TT. Using Sequencing, we found that the wild type allele GG have been altered into CC. How could we explain that regardless of ethnicity variations.
Thank you in advance.
Relevant answer
Answer
Dear Paul Rutland and Shale Dames, 
Thank you so much for your responses. They were very heplful and I am trying to solve the problem.
  • asked a question related to SNP Genotyping
Question
3 answers
can anybody suggest a friendly software for the analysis of LD and population structure with large SNP data in polyploids?
Relevant answer
Answer
You can use Haploview software
  • asked a question related to SNP Genotyping
Question
5 answers
in the HELP of the Genpop online of how you can input your data 
genepop.curtin.edu. /help_input.html
they input the allele size of the locus as in the attached file 
I ask about the homozygous of allele size 90 
it was written as 090090
this means that it has only one band  of 90 bp ?
but another one was written as  000230 
this means that it has one band at 230 bp 
why it wasn't written as 230230?
is this not considered homozygous  
Relevant answer
Answer
Hi Ergy,
I belive that should be only an example of input file with microsatellite alleles. However when you see clearly and unequivocally one band, you can considere as homozigosous locus (230230). But if in your gel you have one band not clear or with other frangments in your sample, then it could be an missing allele (000230). Anyhow, in this case I suggest you to remake your PCR, modifing the concentration of DNA template and other PCR reagents.
  • asked a question related to SNP Genotyping
Question
2 answers
FBAT software provides several different models to run family-based association test, such as dominant, recessive and additive. However, when I used PLINK software, I just can select --tdt, --dfam, or --qfam-total to do family-based association test. I don't know which model PLINK applied. I looked at the document and instruction, but I can't find answer. Does anyone know?
Thanks
Daniel
Relevant answer
Answer
Daniel Weng,
The standard option (and default) is the basic allelic test, but if you use model option (-model option) you can choose Dominant and Recessive models
Section: 
"Alternate / full model association tests
It is possible to perform tests of association between a disease and a variant other than the basic allelic test (which compares frequencies of alleles in cases versus controls), by using the --model option. The tests offered here are (in addition to the basic allelic test):Cochran-Armitage trend test
Genotypic (2 df) test
Dominant gene action (1df) test
Recessive gene action (1df) test"
Kind regards
Hugo
  • asked a question related to SNP Genotyping
Question
8 answers
I have a list of selected SNPs in which I am interested for genotyping. The conducting company needs the flanking sequence of each SNP ? How do I get this or how can I download it from NCBI Gene (it links me directly to the whole gene?)
I am looking forward for your answers
Relevant answer
Answer
I agree with Kush Shrivastava
You can get flanking sequence depending SNP ID (rsXXXX ) for  aspecific SNP and mostly involve 50bp in addition to allele
  • asked a question related to SNP Genotyping
Question
2 answers
The materials are durum wheat accessions (allotetraploids with A and B genomes; 85% landraces + 15% improved varieties) and the software used to analyse the data is Axiom Analysis suit. v2.
Relevant answer
Answer
It depende on your genotype file, anyway you could use PGDSpider, a automated data conversion tool ( http://cmpg.unibe.ch/software/PGDSpider/). it converts the most of biological data formats.
  • asked a question related to SNP Genotyping
Question
4 answers
Hi everybody,
I would want to describe my figure showing pairwise genetic differentiation between Vipera berus individuals against logarithm of distance.
More precisely, i would want to know how interpret pairwises with a high rousset's distance, but i can not find any explanation on what is a high or low value. And if this is explain in Rousset 2000, i don't understand.
So, is there a consensus on what constitutes a high and low value of â rousset's distance ?
Thanks in advance
Relevant answer
Answer
Hi Juan José,
Thank you very much for your help. I believe i follow your recommendations and I will propose a range of D in order to compute σ, Nb and several linear models based on.
Thank you again
Best regards