Science topic

Phylogenetics - Science topic

Explore the latest questions and answers in Phylogenetics, and find Phylogenetics experts.
Questions related to Phylogenetics
  • asked a question related to Phylogenetics
Question
2 answers
How can large-scale phylogenomic datasets be improved to reduce uncertainty in tree resolution?
Relevant answer
Answer
Hi David,
I haven't operated many whole-genome phylogenies but the followings may be of some help:
The error of genomic phylogeny may be caused by the mistakes of alignments, the introgression/ILS phenomenon of some genes, and the search methods of phylogenetic reconstructions.
In the first case, perhaps it is wise to apply a strong trimming method to the sequence alignment, or use non-alignment ways. I've heard an invention of Zhejiang University - a phylogenetic strategy to construct trees using the cluster of gene blocks, not that of whole sequences. Otherwise you may map your dataset to a reference genome, extracting SNP sites of each gene and use sequences straightly constructed by SNPs to do tree search. In that way, you can also lighten your computation burden because each chromosome can be reduced to SNP sites below about 30000 bp before tree reconstrcution.
In the second case, do coalescent phylogenies, or only keep highly conservative regions can be a good idea. You can select those CDS regions ahead by analysing reference genomes and use RNA-seq to extract those regions.
In the third case, applying rapid and accurate tree search methods, or switch to advabced hardwares, so that you can do more times of search to find the best tree.
  • asked a question related to Phylogenetics
Question
1 answer
I study plants and I've never come across a paper where the outgroup is a selfer. Is it okay if that is the case?
Relevant answer
Answer
Yes, no problem at all: selfing taxa can perfectly act as outgroup in phylogenetic analyses. After all, selfing taxa evolve just as any other group and are just as well part of the network of life.
With kind regards,
Thierry Backeljau
  • asked a question related to Phylogenetics
Question
6 answers
I used Beast to analyse COI sequence (741 bp) in a bat species, and the analysis clearly indicated that instead of one species, we had two different ones. I'd like to estimate the divergance date between these two genetic clades. I decided to use Beast. However, I have no fossil Datong information, so in BEAUti I fixed mean substitution rate to '1'. I received the tree with ages of nodes estimated in units: substitution per site (as stated in the manual). Is it possible to 'translate' this information into years, for example using the substitution rate for a particular gene?
Relevant answer
Answer
I have a dataset of 6 species Baboon cytochrome b aligned, and want to find out estimates of divergence time. I have a mutation rate of 0.159. How do I calculate divergence time with no fossil priors?
  • asked a question related to Phylogenetics
Question
7 answers
Dear All
I am trying to applying a phylogenetic correction to an MCMC model, but I have problems in making the inverse matrix. I can visualise the treeplot very well, but when I use the script:
inv.phylo<-inverseA(phylo_ultra,nodes="TIPS",scale=TRUE)
R tells me that there is an error:
Error in pedigree[, 2] : incorrect number of dimensions
In addition: Warning message:
In if (attr(pedigree, "class") == "phylo") { :
Do you have any experience with this? I couldn't find a solution so far
Thanks in advance
David
Relevant answer
Answer
I've had the same issue sometimes. Usually it's because the names in both data files aren't exactly the same and therefore it fails matching them. Check every latin names or ID column used for the match is exactly the same (just a space character might be an issue).
  • asked a question related to Phylogenetics
Question
7 answers
I'm currently working on resolving a phylogenetic tree, and I'm wondering if it's possible to combine both mitochondrial and nuclear genes to achieve a more accurate result.
Relevant answer
Answer
Here is a great tutorial (with references) for estimating species trees with *BEAST: https://taming-the-beast.org/tutorials/StarBeast-Tutorial/
  • asked a question related to Phylogenetics
Question
3 answers
I've been battling with codes and it's no use. There must be a way to select a group of taxa as the outgroup, but I couldn't find how.
I suppose that the alternative path is to make the whole analysis and constraint that group as a family.
Any thoughts on how to resolve this?
Relevant answer
Answer
To the best of my knowledge, that's the way to go.
You could force the monophyly of the ingroup, and then select the outgroup taxon (which will be with the rest of the excluded taxa). Remember that in the search settings window, you must activate the enforce constraints for them to apply.
  • asked a question related to Phylogenetics
Question
10 answers
I am using MEGA to build a phylogenetic tree using two genes. They were already aligned separatelly using MUSCLE, but now I don't know how to continue...
Relevant answer
Answer
Haven't tried it myself yet but R package "apex" seems to do just that.
  • asked a question related to Phylogenetics
Question
1 answer
What is the explanation of this change which has affected the well-known Cupressus lusitanica (I assume some phylogenetic studies)
Is this a definitive change and should we use this new scientific name, or continue using Cupressus as a synonym?
Relevant answer
Answer
Hola Profesor.
Según la literatura taxonómica, solo cambión una sección y fue publicada en el siguiente artículo: Phytologia 91: 179 (2009). Con base en esto y a la nomenclatura de estos grupos de coníferas, la especie C. lusitanica no entra dentro de esta sección y lo puede corroborar en el siguiente link: https://wfoplantlist.org/plant-list/taxon/wfo-0000630722-2022-12?page=1
Saludos.
  • asked a question related to Phylogenetics
Question
5 answers
I am working on a dataset with 65 individuals that belong to 6 species. I am trying to use a linear model that corrects for phylogenetic relatedness amongst individuals, however I am concerned that this is reducing the power of my models given that my effective sample size is small, due to having only 6 species. IS 6 species too few to incorporate phylogeny?
Relevant answer
Answer
You could argue for not correcting for phylogeny with very few species (e.g. <20, see ) because of difficulties estimating the phylogenetic parameters. Reporting and/or comparing with non-phylogenetic regression is my recommendation when dealing with small sample sizes. But be aware of the phylogenetic distribution of your species, e.g. having 3 mammals and 3 fish species could create spurious associations. For simple phylogenetic sensitivity tests, see:
  • asked a question related to Phylogenetics
Question
1 answer
Hello,
I am facing the following situation:
I have two species phylogenies of groups very distantly related to each other. For each tip (species) I have the value for a continuous trait. I would like to check whether or not there is a significant difference between the two groups, while accounting for within-group relatedness.
AFAIK, this type of analysis is usually done with a phylogenetic ANOVA test, e.g. as implemented in the R package phytools. However, the function expects a single phylogeny, not two.
Can anyone suggest a software that can do what I need, or help me think of a procedure for performing the analysis?
Thanks!
Relevant answer
Answer
Hello professor Glick
I’m not sure about your problem but if you want to detect a value that is different between your species and the value is a continuous trait, you probably can use this.
You can set one data frame for each species. For example, in data frame A, the column will include the samples(name or ID or ...) and the row will include their unique value for each continuous trait
data frame A (color blue)
[sample 1, value A1 value B1 and …
sample 2, value A2 value B2 and …
....
data frame B ( color red)
[sample 1, Value A1, value B1 and ...
...
then you should display each value with a specific color Blue or Red
For example, in value A, most of the blue colors are …. (Type of value) And the reds are ….
Then you can check and understand what is the type of each value in each species and detect the significant differences between them. Moreover, you can apply different conditions to your data set. Here is an example, you can only display a value that has more than 80% similarity in each species but is different compared to other types. Also, if you don't need to display it as a graph, you can use a simple code with the Pandas library in python to detect the significant values and their types. I don’t know what are the types of your value but I hope I understand your problem correctly and this will help you
  • asked a question related to Phylogenetics
Question
1 answer
Dear colleagues,
I need a laboratory for the phylogenetic study of fishes, I have a new species that has never been identified before, so please could anyone of you guide me ?
By the way, it's a new fish in northern Algeria.
I hope I could find help from you!
Thank you in advance
Relevant answer
Answer
There are many laboratories around the world that specialize in the phylogenetic study of fishes. Here are a few options to consider:
  1. Museum of Natural History in Paris, France: The museum's Ichthyology department is known for its expertise in fish taxonomy and systematics. They have a large collection of fish specimens from around the world, and offer molecular and morphological analyses for phylogenetic studies.
  2. University of California, Los Angeles (UCLA), USA: The university's Department of Ecology and Evolutionary Biology has a fish systematics research group that focuses on phylogenetics and molecular evolution of fishes. They offer molecular and morphological analyses for phylogenetic studies.
  3. Natural History Museum in London, UK: The museum's Ichthyology department has a long history of fish taxonomy and systematics research. They have a large collection of fish specimens from around the world, and offer molecular and morphological analyses for phylogenetic studies.
  4. University of Bergen, Norway: The university's Department of Biology has a fish systematics research group that focuses on phylogenetics, systematics, and biogeography of fishes. They offer molecular and morphological analyses for phylogenetic studies.
  5. University of Sao Paulo, Brazil: The university's Department of Zoology has a fish systematics research group that focuses on phylogenetics, taxonomy, and biogeography of fishes. They offer molecular and morphological analyses for phylogenetic studies.
These are just a few examples of laboratories that specialize in the phylogenetic study of fishes. I recommend reaching out to these or other institutions directly to inquire about their services and availability. Good luck with your research!
  • asked a question related to Phylogenetics
Question
3 answers
Please provide answer as we have got burkholderia cepacia n purified water which is identified as herbaspirillum huttiense during further analysis.
Relevant answer
Answer
Both in phylum Proteobacteria but think morphology is part of phylogenetics - they are not identical in morphology
  • asked a question related to Phylogenetics
Question
2 answers
Thank you for your suggestions.
Relevant answer
Answer
Dear Mr. Pchelin
thanks for your consideration,
The problem is that the mentioned site doesn't have the gene sequence we want.
We are looking for a way to find the desired gene sequence by using species that are both close to the target animal and have the desired gene sequence.
  • asked a question related to Phylogenetics
Question
6 answers
I am trying to compare 2 population using GenAlex (haploid), but I would like to add the region factor, the problem is that both populations are distributed in the same regions and I can't find a way to insert this type of data...
Relevant answer
Answer
Poppr package in R is the best for this type of amova analysis.
  • asked a question related to Phylogenetics
Question
1 answer
Hi,
I am trying to do phylogenetic independent contrasts for my honours project however, when using the code
pic.bl<-pic(female.body.length,cent.tree)
pic.gw<-pic(avg.egg.diameter..mm.,cent.tree)
fit.pic<-lm(pic.gw~pic.bl+0)
fit.pic
I keep getting the error - 'phy' is not rooted and fully dichotomous
Would some be able to explain why this is/ have any way to resolve it?
Thank you in advance
Emily
Relevant answer
Answer
If the only problem with your tree is that it is not fully bifurcating, you can use a function from the ape package, multi2di, to resolve it into a series of dichotomies.
  • asked a question related to Phylogenetics
Question
3 answers
I do not find clear to which specie the aforementioned organism belongs. On some sites, I read that it references to a group of unclassified species instead ofjust one. I would appreciate if anyone may enlighten me with a taxonomy/phylogenetic approach.
Thanks in advance!
Relevant answer
Answer
Hello Juan; Here is a link with a little discussion and some citations. Maybe they will be useful. Best regards, Jim Des Lauriers
  • asked a question related to Phylogenetics
Question
5 answers
I would like to know what are the differences between phylogenetic conflicts in the trees and also the meaning of lack of resolution (solved and unsolved trees) and the presence of conflicts. Can you tell me how we can evaluate the resolution and conflict of a phylogenetic analysis?
Thank you for your responses.
Best,
Houtan
Relevant answer
Conflict is a contradiction of individual or collective ideological, intellectual, realistic and economic wills. If there are common denominators in a specific field, then it is possible to approach a solution to this conflict by employing diplomatic maneuvers, economic pressures, security alliances, etc. Otherwise, the available solution remains deterrence or war.
  • asked a question related to Phylogenetics
Question
4 answers
I am working with an enzyme with high substrate specificity and only a few bacteria have it. I recently discovered that my protein has extremely high structural similarity with a protein present in all bacteria. The substrates are somewhat similar but definitely not the same. The sequence identity is ~30%, and the active site residues are lining up pretty well. I just don't know what kind of phylogenetic analyses I can perform to explore this relationship further. Is there any way to determine if this other protein is in fact an ancestral version of my protein of interest?
Relevant answer
Answer
Dominik is quite right. In particular long branch attraction give misleading phylogenies if any of the sequences have long branches (lots of substitutions) and are joined by relatively few substitutions (the so-called Felsenstein Zone) . Algorithms will generally group these long branches that are in fact not closely related. This is where using an optimal maximum likelihood model of sequence evolution is really critical.
Astonishly perhaps, phylogenetics can provide information on the branching pattern and amount of evolution that has happened throughout the evolutionary process. However, the tree that is inferred is, in all cases, an unrooted tree. Finding the root adds such a degree of mathematical complexity that computers are not able to evaluate the vast number of topologies and the placement of the root on these trees.
So we root a tree in order to find the ancestral state using data that is from other sources. Sometimes that is biogeographical or morphological or perhaps from some outside molecular information that gives clues to who is the closet to the so-called 'most recent common ancestor". This requires some general knowledge of the biology of the organisms and is ofter ignored with the result that the root is misleadingly placed.
  • asked a question related to Phylogenetics
Question
1 answer
Dear all,
I have been testing a dataset of sequences for the best substitution models to use in phylogenetic reconstruction. Each dataset used was tested for models independently three times (using jModelTest 2.1.10), yet each of the three runs gave different results when it comes to BIC value (I refer to it because next step is employing BI) - therefore proposing a different model for each run.
Can anyone offer some insight into what model to select in such cicrumstances and/or why that matters?
Thank You for Your replies
Relevant answer
Answer
First I would look at the models which are being chosen -- they may be very similar, and could even involve parameterizations which are not implemented in your downstream software you are using to build the tree (so, there might be a practical constraint influencing your selection).
Then, barring any other information regarding the fit of each model (which you haven't provided; I personally like hLRT, see Posada and Crandall 2001 attached), you have two different philosophies you could use: (1) take the most conservative (i.e., least parameter rich) model; or (2) take the most generalized (i.e., most parameter-rich) model. I would fall into the latter camp, in that I think if the signal from your data is very strongly in favor of e.g., a single transition and transvehrsion rate or not, then there is nothing about a more generalized (GTR etc) parameterization which excludes that. On the other hand, using a more conservative model when a more parameter-rich one may have been a better fit in theory, does constrain rates to those assumed by the model. Attached Abadi et al. 2019 for recommended reading.
  • asked a question related to Phylogenetics
Question
2 answers
Hello, I am researching the phylogenetic tree of snakes. (bayesian Inference tree and molecular clock..)
My sequences are 1300 bp, but sequences of outgroups are about 600 bp.
Can I change the difference of about 700bp to a missing data and analyze it?
especially phylogenetic tree(ML, BI tree) and Relaxed molecular clock(using BEAST2).
Relevant answer
Answer
I recommend using a different outgroup with a longer sequence.
  • asked a question related to Phylogenetics
Question
1 answer
I have a published phylogenetic tree but no access to the rough data used to generate the tree. I want to compare among clades in terms of morphological characters. How do I include the phylogenetic correction? Should I just measure the distance between clades and use this as my phylogenetic correction? The phytools package in R assumes I have the rough tree data, and I don´t. Some suggestions will be most welcomed.
Relevant answer
Answer
The publication supplementary materials might have the tree file. Or, you could contact the authors and request the .tre or nexus file.
  • asked a question related to Phylogenetics
Question
1 answer
Does it vary with the abundance of homologues and phylogenetic relationship of source organism with the organism (either within the same genus or across different kingdom) from which we want to discover/mine the homologue?
Relevant answer
Answer
Dear Lai,
Here I am attaching a research paper for your information, in which a sequence alignment strategy has been used to infer the correct sequence homologue of a protein, and the threshold to assign the function most accurately. This paper clearly emphasises the importance of pairwise sequence identity of alignment to assign a particular function to a putative sequence homologue. Personally, I don't think this method is absolute; however, to date, it is indeed the best-suited method to search for the correct matches. There are various tools available such as the BLAST algorithm based on this methodology.
Moreover, in sequence alignment-based methods either local or global, the most accurate consensus sequences are predicted with least Expect value irrespective of the abundance of homologues, and the phylogenetic relationship between query and subject sequences.
I think it would be helpful, though up to a certain extent.
Thank you
Nitish
  • asked a question related to Phylogenetics
Question
3 answers
Any details (Books , courses, slides, articles) can help me. what are the difference between botanical and phylogenetic classification? Thank for your response
Relevant answer
Answer
Botanical classification was designed by Linnaeus. Phylogenetic classification deals with the relationship amongst the organisms. Based on the characteristic features of the ancestors, species can be classified. There are many books, journal, power point presentation available online related to your query.
To enhance skill: (i) Regular interaction with plant species (ii) Information of different plant groups (iii) Knowing plant species of your surrounding (common name, botanical names, habit & their importance) (iv) Kitchen gardening (v) Field visits to documentation and photography of plant species (vi) Visit to the herbaria (vii) Reading updated literature (viii) Botany related short projects (ix) Participation in botany related seminar/ symposia/ workshop/ training.
  • asked a question related to Phylogenetics
Question
2 answers
Hello, I got a question with the units of sequencing products. Most of the phylogenetic study that using ddRAD or other NGS method provided the data size of their analyzed dataset using the unit "loci" or "SNP". While for research using Sanger sequencing and PCR, they used "base pairs" or "nts". If I would like to compare the molecular data size between these researches, is there a transforming algorithms to do it? Or could I just simply use these number to compare (e.g. 1400 bps v.s. 7000 SNPs)? Thanks!
Relevant answer
Answer
Why do you want to "compare" the size of markers? I think the absolute size of bp is incomparable between a sequence and SNPs among genome, cause all nucleotides in a sequence are linked and not all of them are polymorphic; while SNPs are polymorphic loci scattered on different positions of a genome, which may be less linked to other SNPs. Also, sequence and SNP provide different aspects of evolutionary information, so I think comparing a single sequence to a RAD-seq data could not provide you some useful information
  • asked a question related to Phylogenetics
Question
11 answers
I will appreciate any of the professors who can shed additional lights on two basic problems, I have recently faced with:
1. why two accessions belonging to one species with exact bases show a low posterior probabilities in phylogenetic analyses (bayesian, maximum likelihood, etc.). Alignments are the same, then we predict the high clade credibility, but why do they show the poor supports. What is the reason?
2. I am studying on two close species. I am sure that they are not the same due to distinct morphological differences and difference in their alignments. Which software (s) can help me to indicate that they are still separated not the same? I have tested SplitTree, RDP, Haplotype and different phylogenetic analyses, but still I need to find much precise software. Any suggestion will deeply appreciated.
Best Regards
Relevant answer
Answer
Dear Atena,
at best we have a ZOOM meeting and have a direct look on your alignment and data.
Low node support is of course due to poor (relative) support of low numbers of SNPs and/or homoplastic features of those.
But finally, a genetic distance based on a cladogram (and maybe few genes ?) is very often not a good argument to separate closely related species. Microevolutionary process have different dimensions compared to macroevolutionary (often cladogenetic) perspectives.
Cheers and best
Marcus
  • asked a question related to Phylogenetics
Question
4 answers
Dear all, in the lab I currently work in we intented to sequence entire mtDNA of selected mammalian species, that had not yet been annotated or deposited into GenBank. However, one stumbles upon a question, whether or not this is trully a prudent thing to do. The advent of NGS in the last decade has enabled mtDNA to be sequenced in entirety as a by-product of WGS/shotgun sequencing. Judging by complex evolutionary patterns (eg. reticulate evolution), using only mtDNA can also hardly be justified as a base for larger and well-supported phylogenetic studies, since nDNA is a must in such casses. The question therefore is, whether or not annotating single mitochondrial DNA's from a single individual of a species is still a sound final objective of a research project?
I look forward to receiving Your opinions.
Relevant answer
Answer
Mitochondrial DNA is marvelous for phylogenies. For one thing it is not subject to the confusion of recombination so what you see is similarity by descent alone.The protein coding genes are really quite tidy and phylogenetically informative for a range of ancient (COI, 1st and 2nd positions) and recent divergences (e.g. ND2, COI 3rd position) and very fast changes (control region). mtDNA alone might be somewhat biased since it is maternally inherited and the genes are all involved with respiration and metabolism. So one must also use a few messier nuclear gene to offset the bias that could result from using only mtDNA. Also the mtDNA is inherited as a single marker so represents a very nice long seuquence compared to any nuclear gene which Is subject to recombination. The best analyses will estimate a species tree from mitochondrial and well-chosen nuclear genes that will likely have slightly different gene trees from one another. Model the mitochondrial genes with a different model of sequence evolution from the the nuclear genes with using coalescent methods to coax the gene trees into a single species tree.
  • asked a question related to Phylogenetics
Question
1 answer
While working with some stratigraphically significant extinct larger foraminiferal species, I noticed signatures of ‘plastogamy’ and other ‘sexual reproduction’ in some specimens (Journal of Foraminiferal Research, v. 37(1), pp. 41-45, 2007). Since ‘interbreeding’ and ‘sexual reproduction’ are some essential parameters of a biological species, the specimens representing the morphospecies are different in possessing crucial traits of a biological species. Being part of phylogenetic species, these may serve as a bond between paleontological and biological species and help in establishing gene flow and genetic classification in paleontology. Such forms may be grouped separately in paleontological species.
Relevant answer
Answer
I am sorry, I am a geomorphologist, not the least skill in foraminifera!
  • asked a question related to Phylogenetics
Question
3 answers
Hello,
This is for an undergrad paper I need to complete.
I am comparing the R^2 from a regression of 3 biodiversity measures: phylogenetic, functional and species richness to see which one is the best predictor of primary production.. Phylogenetic had a r2 of 0.28 and species richness had a r2 of 0.21, while functional diversity 0.077.
I was wondering if I could state from the r2 values that phylogenetic diversity was the best predictor? Since species richness had a r2 of 0.21, is it too similar to say phylogenetic is better than species richness?
Thanks
Relevant answer
Answer
Kindly go through the following link, I hope all doubts regarding R^2 shall be cleared.
  • asked a question related to Phylogenetics
Question
6 answers
Does anyone know of a recent review or meta-analysis of molecular vs. morphological phylogenies? That is, has there been a recent analysis of how often the two lead to statistically indistinguishable phylogenetic hypotheses for the same sets of taxa (across many clades) and how often they significantly differ?
Relevant answer
Answer
Dear Matthew, many thanks for asking this very interesting technical question on RG. Apparently the problem is finding a recent review article on this topic. The latest review I found dates back to the year 2016 which is not really recent any more:
Morphological and molecular convergences in mammalian phylogenetics
This paper has been published Open Access so that it is freely accessible as full text (please see attached pdf file).
You can find and access other potentially useful articles when you search the "Publications" section of RG for the term "Molecular versus morphological phylogeny":
I hope this helps. Good luck with your work and best wishes, Frank Edelmann
P.S. Lucky you live Hawaii! 🤟
  • asked a question related to Phylogenetics
Question
5 answers
Experiencing slight difference in the results of tress obtained from BEAST and Maximum Likelihood...
Relevant answer
Answer
I will suggest to delete the insertion deletion from the alignments. The analysis based on the alignment may provide similar results.
  • asked a question related to Phylogenetics
Question
2 answers
Hello everyone !!! Currently, I am working on the structure, engineering, and bioinformatics of industrially important enzymes. I am looking for collaborators, having an experience of working in protein bioinformatics, are welcome to working with enzyme bioinformatics projects from phylogenetics to molecular docking, molecular dynamics, simulation, intermolecular interaction, etc. Interested scientists are welcome to contact me on:
Relevant answer
Answer
Please have look on our(Eminent Biosciences (EMBS)) collaborations.. and let me know if interested to associate with us
Our recent publications In collaborations with industries and academia in India and world wide. EMBS publication In association with Universidad Tecnológica Metropolitana, Santiago, Chile. Publication Link: https://pubmed.ncbi.nlm.nih.gov/33397265/ EMBS publication In association with Moscow State University , Russia. Publication Link: https://pubmed.ncbi.nlm.nih.gov/32967475/ EMBS publication In association with  Icahn Institute of Genomics and Multiscale Biology,, Mount Sinai Health System, Manhattan, NY, USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29199918 EMBS publication In association with  University of Missouri, St. Louis, MO, USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30457050 EMBS publication In association with  Virginia Commonwealth University, Richmond, Virginia, USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27852211 EMBS publication In association with  ICMR- NIN(National Institute of Nutrition), Hyderabad Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/23030611 EMBS publication In association with  University of Minnesota Duluth, Duluth MN 55811 USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27852211 EMBS publication In association with  University of Yaounde I, PO Box 812, Yaoundé, Cameroon. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30950335
EMBS publication In association with  Federal University of Paraíba, João Pessoa, PB, Brazil. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30693065 Eminent Biosciences(EMBS) and  University of Yaoundé I, Yaoundé, Cameroon. Publication Link: https://pubmed.ncbi.nlm.nih.gov/31210847/ Eminent Biosciences(EMBS) and  University of the Basque Country  UPV/EHU, 48080, Leioa, Spain. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27852204 Eminent Biosciences(EMBS) and  King Saud University, Riyadh, Saudi Arabia. Publication Link: http://www.eurekaselect.com/135585 Eminent Biosciences(EMBS) and  NIPER , Hyderabad, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29053759 Eminent Biosciences(EMBS) and  Alagappa University, Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30950335 Eminent Biosciences(EMBS) and  Jawaharlal Nehru Technological University,  Hyderabad , India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/28472910 Eminent Biosciences(EMBS) and  C.S.I.R – CRISAT, Karaikudi, Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30237676 Eminent Biosciences(EMBS) and  Karpagam academy of higher education, Eachinary, Coimbatore , Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30237672 Eminent Biosciences(EMBS) and  Ballets Olaeta Kalea, 4, 48014 Bilbao, Bizkaia, Spain. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29199918 Eminent Biosciences(EMBS) and  Hospital for Genetic Diseases, Osmania University, Hyderabad - 500 016, Telangana, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/28472910 Eminent Biosciences(EMBS) and  School of Ocean Science and Technology, Kerala University of Fisheries and Ocean Studies, Panangad-682 506, Cochin, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27964704 Eminent Biosciences(EMBS) and  CODEWEL Nireekshana-ACET, Hyderabad, Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/26770024 Eminent Biosciences(EMBS) and  Bharathiyar University, Coimbatore-641046, Tamilnadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27919211 Eminent Biosciences(EMBS) and  LPU University, Phagwara, Punjab, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/31030499 Eminent Biosciences(EMBS) and  Department of Bioinformatics, Kerala University, Kerala. Publication Link: http://www.eurekaselect.com/135585 Eminent Biosciences(EMBS) and  Gandhi Medical College and Osmania Medical College, Hyderabad 500 038, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27450915 Eminent Biosciences(EMBS) and  National College (Affiliated to Bharathidasan University), Tiruchirapalli, 620 001 Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27266485 Eminent Biosciences(EMBS) and  University of Calicut - 673635, Kerala, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/23030611 Eminent Biosciences(EMBS) and  NIPER, Hyderabad, India. ) Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29053759 Eminent Biosciences(EMBS) and  King George's Medical University, (Erstwhile C.S.M. Medical University), Lucknow-226 003, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/25579575 Eminent Biosciences(EMBS) and  School of Chemical & Biotechnology, SASTRA University, Thanjavur, India Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/25579569 Eminent Biosciences(EMBS) and  Safi center for scientific research, Malappuram, Kerala, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30237672 Eminent Biosciences(EMBS) and  Dept of Genetics, Osmania University, Hyderabad Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/25248957 EMBS publication In association with  Institute of Genetics and Hospital for Genetic Diseases, Osmania University, Hyderabad Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/26229292
Sincerely, Dr. Anuraj Nayarisseri Principal Scientist & Director, Eminent Biosciences. anuraj@eminentbio.com www.eminentbio.com Mob :+91 97522 95342
  • asked a question related to Phylogenetics
Question
7 answers
Hi everyone,
I am looking for phylogeny of pollinators and ants to construct a phylogenetic correlation matrix. Something similar to what can be found in verlife.
Any ideas?
Thanks in advance!
Relevant answer
  • asked a question related to Phylogenetics
Question
28 answers
I have worked on a phylogenetic analysis based on morphological and molecular characters and the review of a genus of Cicadas. As the research results, we will describe a new genus and 20 new species. I am facing trouble searching for a zoological journal that accepts papers with this amount of taxonomic information. The MS has around 80 pages.
Do you have suggestions of biological/zoological journals that commonly publish large papers that include taxonomy?
For now, I have in mind Zootaxa and Zoological Journal of Linnean Society.
I really appreciate any help you can provide.
Relevant answer
Answer
the European Journal of Taxonomy is a 'diamond' open access journal, meaning that you don't pay to publish and nobody pays for reading.
  • asked a question related to Phylogenetics
Question
5 answers
I would like to add a data matrix of morphological data, assembled in the software Mesquite, to a manuscript. I would either add an electronic supplement (MS Excel format) or a table as *.txt or *.dic file. Anyone with experience around? I find Mesquite to be a bit user-unfriendly with this regard.
Relevant answer
Answer
you can just copy paste the selected range using ctrl+v and ctrl+v as long as the selected range is selected on the mesquite matrix. So for example, if you want to copy a range of 3 rows and 3 columns from the excel, you have to copy that range and go to the mesquite matrix and select 3 rows and 3 columns then paste it.
  • asked a question related to Phylogenetics
Question
4 answers
As the title says, I have performed multiple ILD tests on PAUP with different settings and all of them reflect congruence between chloroplastic genes and ITS sequences (P > 0.1). I was wondering what would the implications be for these sequences or the taxa involved? Does this mean that the rate of change in nuclear and plastidial sequences are comparable? What does the congruence tell me about the history of the genes?
I was not able to find many papers in plants where they have performed phylogenetic analyses with plastidial and nuclear sequences concatenated, so any suggestion would be highly appreciated. Thanks in advance for your response
Relevant answer
Answer
Si el análisis te da congruencia entre particiones (nuclear y plastidial) significa que dichas particiones están contando la misma historia, reflejan la misma hipótesis filogenética. El ILD es para todo el árbol, pero vos visualmente podes verificar si los clados que se forman en ambas particiones son los mismos, y si agrupan los mismos taxones. Si las particiones fueran incongruentes, y además los clados están bien soportados, entonces podrías sospechar que ocurrieron eventos de hibridación e introgresión, captura de cloroplasto, o sorteo incompleto de linajes, considerando que los genes nucleares se heredan por ambas vías, y los plastidiales por una sola vía. La congruencia entre particiones te permite poder concatenar tu dataset plastidial y nuclear y analizarlo como uno, para encontrar el árbol de especies; si hubiera incongruencia, es mejor analizar los árboles de genes por separado. Espero que esto te haya servido un poco!
  • asked a question related to Phylogenetics
Question
4 answers
Hello! I'm a new observer on the iNaturalist system, and I mostly post insect macro photos ( https://www.inaturalist.org/observations?place_id=any&user_id=cback&verifiable=any ).
I find that Mecoptera, in spite of their relative abundance, diversity and phylogenetic interest, are an orphan group in terms of systematics references, at least for North America. I read your comment on BugGuide about the non-availability of Penny's monograph. Has anyone managed to secure a copy of this document? It would be a starting point in order to provide high quality photos of specimens with a verifiable ID. Grateful if you can spare the time for a reply,
Regards,
Christian Back
Relevant answer
Answer
Christian Back (This is not about the monograph, but may be of use, please ignore if it is off-top or repeats what you already have). The Internet Archive Wayback Machine saved the site Penny's curated collection site entitled: "The Mecoptera of North America" seven times between March 29, 2013 and February 9, 2021. [In my location behind the Great Firewall the images do not load, but you may have more luck if they are linked and archived.] The archied site also list some contact information for (deceased) Norm Penny which you may be able to use to contact the current collection manager or find out the disposition of the collection:
Cheers,
Leo
  • asked a question related to Phylogenetics
Question
1 answer
I am doing a phylogenetic analyses of two sets of proteins (A and B) that are functionally very closely related and share a "large degree" of sequence similarity. I have identified from various species protein sequences I want to include in the analyses (for both proteins). Each separate sequence was included based on their similarity / BLAST results to the known and characterized (functionally) proteins (A and B) in Arabidopsis. I am worried that some of the species included might however represent paralogs and not orthologs. Is there any analyses where I can "plug and play" the data that I have and see whether it comes out as orthologs (hypothetically then an orthologous group for protein A and one for protein B). I do not want to do an analyses where I search a database for orthologs, I want to ID it in the sequences I already have in my dataset (which were included obviously based on certain pre selected criteria). Not all the species / sequences we include might be from completed sequenced and annotated genomes.
Relevant answer
Answer
Hi Rouvay,
If I understand this question correctly, one way to proceed could be to evaluate, by mutual alignments of your n protein datasets, on your n assembled genomes.
Synteny could possibly be used to identify/differentiate orthologs from paralogs. However, tandem duplication could be a problem...
Another way could be to use dedicated tools: orthomcl, OMA...
Romain.
  • asked a question related to Phylogenetics
Question
4 answers
CIPRES is charging non-US users, and I need to find another external server to run some phylogenetic analyses.
It is a large dataset for some Andean plants, NTAX=214 NCHAR=5849, 40 million of generation.
Relevant answer
  • asked a question related to Phylogenetics
Question
2 answers
I saw that some papers use both cpDNA and nrDNA in a single phylogenetic study. Why not just choose one? What are the functions of each type of DNA? What is the benefit of using both?
Relevant answer
Answer
In the most of green plants, chloroplast (DNA) transmits , usually, to the progenies through the maternal gamete. Then, we can determine the (cytoplasmic) maternal plant (by the cp DNA haplotype) in the hybrids when both parents have the different cpDNA haplotypes. Nuclear DNA transmits to the progenies from both parents. We can determine, luckily, the both parents of the hybrids and the direction of crossings when you examine both the cpDNA & nrDNA.
  • asked a question related to Phylogenetics
Question
6 answers
Most of the literature used partial cytochrome b gene sequences for animal phylogenetic studies instead of the complete sequence. What is the reason behind this where full or complete cytochrome b gene sequence could explain more details?
Thank you very much!
Relevant answer
Answer
Hi!
You got the answer.
Simply to show/prove populations are phylogenetically distant from other species three regions of mtDNA viz. cytochrome b, 16S RNA and and D-loop regions will be used.
  • asked a question related to Phylogenetics
Question
7 answers
Better means it gives faster results and higher likelihood!
Relevant answer
Answer
Well...I guess depends of your type of data and the type of analysis you want to do. For example, for bayesian inference analyses the softwareBEAST is well known: https://www.beast2.org/
And RaxmL for Maximum likelihood you can use RAxML
Also, if you have a problem with the time of the analyses, I suggest to use a web server like CIPRES https://www.phylo.org/portal2/login!input.action
  • asked a question related to Phylogenetics
Question
4 answers
Can anyone help e to interpret my phylogenetic network?
Relevant answer
Answer
Dear Reem,
Could you explain your problem? Exactly what kind of help do you need?
  • asked a question related to Phylogenetics
Question
2 answers
I have a set of trees and am using Blomberg's K to detect phylogenetic signal. I am finding that for several of them, my values for K are close to 0 (in the region of K = 0.001 - 0.004) which would suggest no phylogenetic signal. However, the P values are high significant (p = 0.001), which suggests these results are different from a random distribution of traits.
Does this suggest that there is significant signal, but it is extremely weak?
Relevant answer
Answer
Well, Blomberg's K is a two-tailed test. Your result indicates an opposite of trait conservatism. It is called a significant trait convergence. Closely related species are the most distinct. See for instance what is called 'anti-signal' in this paper: Pei et al. (2011; https://doi.org/10.1371/journal.pone.0021273). I will give you another example: Take a look in this paper: Jombart et al. (2010; doi:10.1016/j.jtbi.2010.03.038), what is called "local structure", or "negative phylogenetic autocorrelation". They say: "A local structure would be observed whenever closely related taxa tend to be more different with respect to a given trait than randomly chosen taxa. (...)". OK, but you have more tools to investigate this, like the "Root/tips skewness test (two sided)" of Pavoine et al. (2010; Ecological Monographs, 80(3): 485–507).
  • asked a question related to Phylogenetics
Question
3 answers
In case of cyanobacterial taxonomy, what is the final status of the organism which forms phylogenetically distinct clade but are not distinguishable by any of the morphological feature? Can we conclude it as a novel taxon? If no, then how can we report them in a publication?
Relevant answer
Answer
If it were a taxon of the animal or plant kingdom, the answer is that it is a new taxon. In Monera I don't know that. I attach an example of a work on a cryptic species in sea urchins just in case it is of interest to you:
Bronstein, O., Kroh, A., Tautscher, B., Liggins, L & Haring, E. 2017. Cryptic speciation in pan-tropical sea urchins: a case study of an edge-of-range population of Tripneustes from the Kermadec Islands. Scientific Reports, 7: 5948. DOI:10.1038/s41598-017-06183-2.
  • asked a question related to Phylogenetics
Question
9 answers
Rather than using sequence alignment data, I wanted to have phylogenetic tree from distance matrix and bootstrap as part of statistical analysis. Anyone to tell me how to execute this analysis?
Relevant answer
Answer
Dear all,
I have been working with MEGA and distance matrix. I am attching a mini-tutorial to compute UPGMA dendrograms.
I have tried with MEGA-X and it perfectly works.
#############################
1-Calculate your distance matrix with your favourite software (e.g. in ".xls" format)
2-Prepare the matrix as explained by MEGA developers (Figure 1) in a ".txt" file. I show my own datafile (Figure 2). Warning: Keep the structure, including blanks.
3-Save this file as ".meg" file. It will be ready to use in MEGA.
4-Open ".meg" file with MEGA, select Pairwise distance > Lower left matrix > OK
5-Check data in the file read by MEGA. The matrix should be identical that your previous one (".xls" or similar; Step 1)
6-Select data and click "Phylogeny". You could compute UPGMA tree
7-MEGA offers a good variety of options to customize your tree
Good luck!
  • asked a question related to Phylogenetics
Question
3 answers
Dear experts,
In technical articles, we see lots of statements telling us "indels" are useful in phylogenetic reconstruction because they play a major role in genomic evolution.
** I am dealing with 14bp deletion at several sequences of a non-coding gene, and I have two ambiguities as follows in my interpretations:
1- Taking into account that "non-synonymous" refer to frame-shifted mutations which alter amino-acid or protein in "protein-coding" genes, is it correct to name 14bp deletion in a "non-coding" gene as non-synonymous in my dataset?
2- In routine phylogenetic reconstruction using RaxML or MrBayes (when deletions are considered as gaps), strains with the same deletion did not cluster closely, as I expected, do I have to do something specific while making phylogeny with these data?
Thanks a lot for your answers,
Ahmad
Relevant answer
Answer
Insertions or deletions in non-coding regions should not be considered "nonsynonymous" they are not coding for any protein. Even in known structural RNA genes such as ribosomal RNA genes or transfer RNA genes, the terms synonymous and nonsnonymous really do not apply.
A major problem with insertions and deletions in estimating phylogenies, is that there are many types of insertions and deletions and the size of an insertion or deletion does not correlate with genetic "distance" or the importance of the in/del. For example gain or loss of an intron, or gain or loss of a transposable element is a large change but of lesser phylogenetic importance than many other types of smaller insertions/deletions. Gain or loss of a number of repeat elements in repetitive sequences is also of low importance. So "scoring" insertions and deletions is problematic.
  • asked a question related to Phylogenetics
Question
2 answers
For my master thesis I am conducting a phylogenetic research and for this I'm using tools like MEGA, IQTREE, FigTree, etc to make a phylogenetic tree. I wanted to know if IQTREE integrates the evolutionary placement algorithm (EPA) when it calculates a phylogenetic tree
Relevant answer
Answer
The EPA is used by raxml. I don't think iqtree uses it
  • asked a question related to Phylogenetics
Question
4 answers
Hi all,
Considering that one has the whole genome sequence (annotated) of a bacterium and wants to know if it has the potential to cause human diseases, what steps would you recommend (insilico)?
I noticed that one of my novel species falls into the phylogenetic cluster of Pseudomonas aeruginosa and is closely related to P. alcaligenes (which was isolated from human blood samples and is also responsible for bioremediation purposes) based on genome tree.
Does this affiliation can be trusted with respect to the pathogenecity tendency?
Thanks,
Timsy
Relevant answer
Answer
  • asked a question related to Phylogenetics
Question
6 answers
I have been working on a study trying to estimate body mass in mammals using a skeletal proxy in R. My dataset has high phylogenetic signal (Pagel's Lambda = 0.9) and I am interested in removing the effects of phylogenetic signal to produce more accurate estimates of body mass.
I had tried to do this using PGLS, but PGLS ended up producing higher error rates than the same data under OLS despite the high phylogenetic signal. Part of this appears to be because there are several particular clades with long branches (e.g., Monotremata) that form regression lines above or below the line of best fit, and their phylogenetic position distorts the resulting best-fit line. I had thought PGLS took phylogenetic covariance into account when estimating values, but it looks as though it only uses phylogenetic signal into account to minimize the residuals of the best fit line. It does not put the phylogenetic covariance back into the estimation of the y-value.
I.e., the function does not go: "Taxon X is positioned as sister to Monotremata. Therefore it should deviate above or below the regression line to a degree similar to its phylogenetic distance and the mass estimate should be adjusted accordingly?"
Given this, is there any way to use a topology to estimate a value that considers phylogenetic covariance and the phylogenetic postion of the unknown taxon when estimating this value, rather than just using it to minimize the residuals when calculating the best fit line?
Relevant answer
Answer
Agreed. PVR R-package is all you need to run your analysis.
  • asked a question related to Phylogenetics
Question
10 answers
I have a question, when I used the picante function to analyze the phylogenetic community structure and phylogenetic signals in R, it appeared an error:"'phylo' is not rooted and fully dichotomous", I don't understand what's the problem, the attached file is my phylogenetic information, please check it, I am sorry to trouble you all, but I really want to solve this problem, thanks a lot.
Relevant answer
Answer
Yes, i think too
*The unrooted tree should be transformed into rooted tree. Try this code, phy=multi2di(phy1), where phy1 is the unrooted tree.
Thanks Wenjie Wan
  • asked a question related to Phylogenetics
Question
2 answers
I am an undergraduate and I would like to broaden my knowledge on this subject, beginning by its principles. Maybe it would help that purpose to broaden my knowledge on Phylogenetics as well. Thanks all of you in advance.
Relevant answer
Answer
The official rules for animal nomenclature are available at https://www.iczn.org/the-code/the-international-code-of-zoological-nomenclature/the-code-online/ This gives the rules for using names, not the systematic and phylogenetic question of which names are biologically useful. The Treatise on Invertebrate Paleontology volumes (http://paleo.ku.edu/treatise2/treatise.html) have some useful guidelines on taxonomy and taxonomic usage.
  • asked a question related to Phylogenetics
Question
2 answers
I have a regression equation in which I am curious if the amount of scatter around the regression line is driven by phylogeny. I know there is a lot of phylogenetic signal because Pascal's lambda is very high, but I am more interested in seeing how much it affects the strength of the relationship between X and Y. I had thought about trying to calculate phylogenetic independent contrasts and using those to see if the spread about the regression line was less under phylogenetic independent contrasts than data that does not take phylogeny into account, but in thinking about it I realized that I haven't seen phylogenetic contrasts done in biology in what seems like ages. Everyone seems to be doing Phylogenetic Generalized Least Squares now, and if I recall right I remember someone like Revell (2010) mentioning that PGLS and independent contrasts are functionally identical.
My question is, do people still use phylogenetic independent contrasts in biology and if so would this be an appropriate use of them to investigate my data.
Relevant answer
Answer
Yes Sir .
  • asked a question related to Phylogenetics
Question
3 answers
Hi all,
I am interested to do a comparative genome analysis using EDGAR comparitive genomic tool. I have a novel bacteria whose genome sequence was obtained by us and now I was to do compare this novel strain´s genome to it´s phylogenetically closely related neighjbours. This bacterium belongs to the genus of Paracoccus.
I notice that EDGAR allowed to select the genomes which are already provided in their database but does not allow to add a sequence of any genome (may be I missed?). Could you please suggest me how can I do a genome comparison using thsi tool or if you can suggest some other tools (in case EDGAR does not allow to add new WGS to their database for comparison)?
I know about BRIG but that is not as cool as EDGAR since it provides Venn diagrams and many other results for better interpretation.
Thank you and Regards,
Timsy
Relevant answer
Answer
Hello, I think you need to register to EDGAR and create a new project.
  • asked a question related to Phylogenetics
Question
1 answer
Hi all,
A tree is binary if every interior vertex has degree three. So, I just want to find some methods like recursive algorithm to construct all phylogenetic binary trees. The running result is like these:
n=3
Input:Sklu,Scer,Spar
Output:(Sklu,Scer,Spar);
n=4
Input:Sklu,Scer,Spar,Calb
Output:((Sklu,Calb),Scer,Spar);
Output:(Sklu,(Scer,Calb),Spar);
Output:(Sklu,Scer,(Spar,Calb));
n=5
Input:Sklu,Scer,Spar,Calb,Smik
Output:(((Sklu,Smik),Calb),Scer,Spar);
Output:((Sklu,(Smik,Calb)),Scer,Spar);
Output:(((Sklu,Calb),Smik),Scer,Spar);
Output:((Sklu,Calb),(Smik,Scer),Spar);
Output:((Sklu,Calb),Scer,(Smik,Spar));
Output:((Sklu,Smik),(Scer,Calb),Spar);
Output:(Sklu,((Scer,Smik),Calb),Spar);
Output:(Sklu,(Scer,(Calb,Smik)),Spar);
Output:(Sklu,((Scer,Calb),Smik),Spar);
Output:(Sklu,(Scer,Calb),(Spar,Smik));
Output:((Sklu,Smik),Scer,(Spar,Calb));
Output:(Sklu,(Scer,Smik),(Spar,Calb));
Output:(Sklu,Scer,((Spar,Smik),Calb));
Output:(Sklu,Scer,(Spar,(Calb,Smik)));
Output:(Sklu,Scer,((Spar,Calb),Smik));
b(n)=(2*n-5)!! b(3)=1!! b(4)=3!!=1×3 b(5)=5!!=1×3×5
Relevant answer
Answer
Hi Qiyun,
The function allFurcTrees in the R package phytools can generate all bi- and multifurcating unrooted trees.
  • asked a question related to Phylogenetics
Question
6 answers
Need a detailed tutorial on XLSTAT ( Input type format, output file, drawing conclusions etc) Unable to find online.
Relevant answer
Answer
As Csaba mentions, I believe there isn't any tool for this. However, if you want to analyze genetic data via Excel, I recommend you GenAlEx: https://biology-assets.anu.edu.au/GenAlEx/Welcome.html Another user-friendly tool that can parse with Excel look-alike genetic data is Genodive: http://www.patrickmeirmans.com/software/GenoDive.html And here are some online tools specifically for phylogenetics: https://ngphylogeny.fr/tools/ www.atgc-montpellier.fr/phyml/
  • asked a question related to Phylogenetics
Question
7 answers
I have a set of 16 species of nematodes, i build a mitochondrial and nuclear phylogeny with them, and i found discordance between the two phylogenetic hypothesis. One of the common reasons for this discordance is incomplete lineage sorting and introgression, i want to test for this phenomena as the cause of the discordance in my set of species. Is there a software or way you could recommend to test for this two causes?
Relevant answer
Answer
Hi Guillermo. I'm not sure if you're still looking for answers to this question, but if you are then MSCquartets might be of interest to you. MSCquartets is an R package developed by myself and coauthors that tests the MSC. It tests either a specific species tree topology specified by the user or any species tree topology, with gene trees arising under the MSC as the null hypothesis. The tests are exhaustive, so rejection of the null hypothesis could be due to introgression or other factors. To use MSCquartets you need gene tree ``data'' (technically not data because they will be inferred gene trees). Let me know if you want more information.
  • asked a question related to Phylogenetics
Question
10 answers
Dear fellow researchers,
Our team is still developing SARS CoV-2 culture for a foundation of various testing, however as we known that besides established culture it is needed to precisely identified the virus that are present in our culture. PCR based methods is our choice based on availability to confirm SARS Cov-2 existence qualitatively.
Besides confirmation, we think that phylogenetic study is still needed. However our facility does not have next gene sequencer for whole genome sequencing, therefore we think that partial genome sequence could offer a rapid results and information about virus origin based on phylogenetic study.
We would ask, is there any primer for specific sequence in SARS CoV-2 genome that could be used for phylogenetic study (partial sequence)? And what software or tools that fellow researchers could suggest for us?
Thank you
Best Regards
Andi YW
Faculty of Medicine
Airlangga University, Surabaya, Indonesia
Relevant answer
Answer
Interesting question
  • asked a question related to Phylogenetics
Question
3 answers
Dear community,
I am rather new to phylogenetic comparative methods and seek your advice.
I want to compare a continuous trait (dependent) between species belonging to either groupA and groupB (groupA and groupB are two states of a categorical binary trait in this case). Using traditional statistics, a Mann-Whitney U test lets me reject the null hypothesis, as clearly the trait values of groupA tend to be greater than the trait values of groupB (see attached plot).
However, how can I implement a phylogenetic correction into the above analysis? I have already performed a PGLS and calculated phyogenetic signal for the regression using Pagel's lambda in ape/nlme (lambda-0.98). I have found that the traits evolve according to the phylogeny, as after performing a likelihood ratio test I found that the calculated lambda is not different than lambda=1.
My question is how do i go on from that point to test whether the trait values are different between groupA and groupB, taking into account the phylogenetic nonindepednence of the traits?
I understand that if the lambda of the regression would not be significantly different compared to lambda = 0, then I could just treat the trait values of each species as independent data points, but I do not understand how to compare trait values between the two groups if 0<lambda<1 or if lambda=1 (which is the case in this example).
Thank you for your time and your patience!
P.S. What do I do if instead of just groupA and groupB I have an additional 2 groups (groupC and groupD).
I have understood that using software like mcmcPGLMM might be a good idea for doing what I want, but I seem to hardly understand anything from the documentation
Relevant answer
Answer
Interesting question. Following the discussion.
  • asked a question related to Phylogenetics
Question
36 answers
In this case we have two phylogenetically species, but they do not differ morphologically. We are treating it as a cryptic species and we want to formally describe it, using only molecular data (ITS nucleotide bases differences).
Relevant answer
Answer
It seems most of the new researchers have forgotten that the molecular phylogenetic data are secondary and morphological characteristics are primary.
  • asked a question related to Phylogenetics
Question
11 answers
Normally dN/dS ratios are calculated and interpreted as below one negative selection, above 1 is positive selection and 1 means neutral selection. How to interpret dS/dN ratios? Programs like SNAP provides dS/dN graphs and ratios.
For example:   
Averages of all pairwise comparisons: ds = 0.1678, dn = 0.4090, ds/dn = 0.4072, ps/pn = 0.4707
Please see image as well.
Can somebody explains it in simple words as I am not much familiar with this?
Relevant answer
Answer
Can anyone interpret the dN/dS=.099, which is defined as defualt in PAML when dS=0 or both dN=dS=0?
  • asked a question related to Phylogenetics
Question
1 answer
I would like to analyse fish gut bacteria by the Functional analysis by Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt). How is this analysis done?
Relevant answer
Answer
see
The complete mitochondrial genome of the mantid shrimp Harpiosquilla harpax, and a phylogenetic investigation of the Decapoda using mitochondrial sequences
A.D. Miller, C.M. AustinJournal:Molecular Phylogenetics and Evolution
  • asked a question related to Phylogenetics
Question
8 answers
I am trying to infer both maximum likelihood and bayesian phylogenetic trees using RADseq reads. I have already created my sets of radtags using Ipyrad which also I used to export the consesus sequence for all of my samples both in a concatenated sequences phylip file and as a SNPs-only phylip file.
Should I be using the whole sequences for my phylogenetic reconstructions or only the snps?
Also, I know that a best-fit nucleotide substitution model should be chosen before trying to build a phylogenetic tree. However, I have not seen any consensus on published papers of how this is done with RADseq data. Some articles choose a nucleotide substitution model without explaining how they chose it, others use a software as IQ_Tree to estimate a single model for their whole data set, while others use software as PartitionFinder to split their data and estimate a substitution model for each partition.
Which would it be the best approach? Specially considering that my data sets have high amounts of missing data (around 60%). And, if a model-test should be done, which is a good program to work with these large files?
Any help will be kindly appreciated!
Relevant answer
Answer
¡Hola Rodrigo!
Espero te encuentres muy bien,
¿Tienes un outgroup para enraizar tu árbol? ¿Has usado CIPRES?
Te recomiendo construir tu árbol con RAxML. Considerando el origen de tus datos, yo utilizaría jModelTest2 para encontrar el modelo de sustitución más adecuado (que en general va a ser GTR+G+I, aunque también puedes considerar correrlo solamente como GTR+I, porque RAxML corre mejor con ese). Utiliza solamente los sitios variables.
Una opción muy versatil para correr análisis preliminares, es subir tu archivo en .phy a http://www.atgc-montpellier.fr/phyml/, poner todo en automático y con base en los resultados ahora sí correrlo largo en RAxML.
Te comparto un consejo que me salvó la vida: a veces las señales de las relaciones filogenéticas entre individuos es más clara si solamente analizas los árboles con marcadores neutrales. Puede que no pase en todos los casos, solamente que en la experiencia personal, fue hasta que removí los outliers que mis muestras encontraron su sitio en una monofilia congruente con los análisis de estructura.
Ya por último, no te preocupes por el missing data. Te comparto un artículo que habla de eso:
  • asked a question related to Phylogenetics
Question
5 answers
I am working on phylogenetic beta diversity in microbial assemblages across biomes, and interested in calculating betaNTI through the null distribution of betaMNTD by randomizing OTUs across the phylogeny and recalculating bMNTD 999 times. Does anyone have a script for this?
Relevant answer
Answer
The easiest way is to follow Stegen et al. 2013 ISMEJ:
An Updated Raup-Crick script is here
If you are to use the script by Franz Krah, please cite (sorry for the self -advertisment)
  • asked a question related to Phylogenetics
Question
3 answers
Which is one is better in between 16S rRNA and microsatellite phylogentic study?
Relevant answer
Answer
16s rRNA study
  • asked a question related to Phylogenetics
Question
5 answers
I am doing a study on phylogeography. An average pairwise difference and Nei's distance matrix was generated in Arlequin. I am having a hard time interpreting the results. As far as I know, higher values would mean higher pairwise difference.
(1) What if I get a negative Nei's distance? (2) What do they mean by "corrected" average pairwise difference? 
Hope someone could help. Thank you.
Relevant answer
Answer
Nei's DA is only applicable to frequencies, which inherently have values 0 < p <= 1.
For example, if the value x=12 occurs 30 times among all the loci values in a dataset of 120 profiles, then x has a Loci frequency of 30/120.
  • asked a question related to Phylogenetics
Question
11 answers
I have a small data set consisting of 16 sequences, 919bp each. I am trying to determine the phylogenetic relationship among the individuals. I am wondering if the neighbor-joining method, maximum likelihood or bayesian analysis works best for small data sets like mine. Thank you.
Relevant answer
Answer
MrBayes will give you robust results, it's just about finding the best running parameters. And for ML, researches tend to go for RaxML, maybe you could try it and see what happens with the bootstrap values of the output.
Cipres server has various online versions: http://www.phylo.org/
  • asked a question related to Phylogenetics
Question
5 answers
I have four whole genomes sequencing of E. coli isolates , and I wanna to classify in phylogenetics groups.
Whats is the best ways?
Thanks.
Relevant answer
Answer
Dear Andres,
The pan-genome of each phylogroup procured from the pan-genome of the species is noteworthy to be read. Moreover, Identification of the nodes corresponding to the most recent common ancestor (MRCA) for each phylogroup from the rooted species tree using the findMRCA function from the phytools v.0.6.44 R package.
I hope the attached article will be helpful.
  • asked a question related to Phylogenetics
Question
14 answers
I want to learn R Programming for phylogenetics, phylogeography, and biodiversity studies. I hope it will be free and online! Or Introduce the best books for self studying
Relevant answer
Answer
I may suggest you to look for "Cookbooks" for the specific objectives you are trying to solve. As the name says, it's more like a recipe, where you can read workflows that will guide you through every step. (Other tip may be googling for "subject/software tutorial".
Here are some pages that I believe can help you:
  • asked a question related to Phylogenetics
Question
10 answers
The recent official publication of PhyloCode has caught my attention, not for new arguments being presented for a system that has already been extensively discussed and, as I see it, whose downsides and problems surpass the envisioned advantages, but surprisingly because of the mediatic coverage it is currently receiving. So many of the major newspapers, which are generally regarded as part of the "mainstream media", are publishing articles and stories praising the "new" PhyloCode as a much-welcome, even necessary, advance to the "terribly outdated" taxonomic classification system currently in use, which "ignores decades of advances in molecular phylogenetics and does not reflect evolutive relationships of organisms". Anyone familiar with the current practices in Systematics is aware that this is malicious disinformation.
In talking with colleagues some have resonated with my impression that, instead of an academic debate, the PhyloCode is becoming a mediatic thing, so that systematists in disagreement with its adoption are branded as "reactionary" and "outdated", not only by the colleagues with whom they disagree but also by the "mainstream media" and possibly the public influenced by it. This type of political interference is negative to Science, and certainly very much unwanted by those endeavouring to advance, improve and refine human understanding of Earth's biodiversity and Natural History.
I think this debate should be kept academic, free as much as possible from this type of political interference. Your thoughts on this topic are very much welcome.
Relevant answer
Answer
The success of the Linnean taxonomic system has been its stability. The proof is that the system has persisted for more than two centuries and it is still useful.
Stability was granted because -among other reasons- any change that wanted to be performed on the nomenclatural structure had to be done formally within the academic world.
Paradoxically, it resulted to be a double-edged sword:
1- On the one hand, it enabled communication between researchers, as everybody has to attach to a same nomenclatural framework.
2- But on the other, it really hampered any change, making the Linnean system to look old and rigid.
Turning the ordering of biodiversity in a "matter of opinion" as presented in media is clearly counterproductive. We all must be aware that embracing an unstable phylocode system that tends to name any sister-clades relationship, may have deleterious effect on knowledge transference and mutual understanding.
Now there are a series of journals specialized in fostering formal taxonomic changes. These publications ease taxonomic rearrangements in our raping-changing world, transferring phylogenetic results into manageable taxonomic units, and helping taxonomy to keep up to date.
Let's do not defenestrate what has been achieved over so many years!
  • asked a question related to Phylogenetics
Question
5 answers
Using IQTREE I received the most optimum models for my data to analyse phylogenetic tree-reconstruction. IQTREE is able to use, aside from the Gamma distribution for heterogeneity across sites also the FreeRate model:
"FreeRate model (Yang, 1995; Soubrier et al., 2012) that generalizes the +G model by relaxing the assumption of Gamma-distributed rates. The number of categories can be specified with e.g. +R6 (default 4 categories if not specified). The FreeRate model typically fits data better than the +G model and is recommended for analysis of large data sets."
Does anyone know if it is possible to use this FreeRate model in MrBayes (or in RevBayes), and if yes, how to I put that in the script?
Relevant answer
Answer
Hey Pepijn! I came across this problem too. It seems it's not possible to include FreeRate model in mrbayes. Here's a link to another answer:
There's a cool parameter in IQTRee '-mset mrbayes' that will only include models also available in mrbayes when doing model testing. It only work with nucleotide data though so if you have something like PICS-Ord regions in your alignment you'll get an error.
  • asked a question related to Phylogenetics
Question
1 answer
I am looking for an abundance weighted phylogenetic diversity index. Barker (2002; Biological Journal of the Linnean Society, 2002, 76, 165-194) adapted the faith index to incorporate also absolute or relative abundance. However, in my opinion, this index results in some weird artifacts when relative abundance is used.
Does anybody have some tips + preferential suggestions for r-packages to implement other indices?
Relevant answer
Answer
Hill number (familly), a = 2; similar to the Simpson diversity, also named effective number of variants.
  • asked a question related to Phylogenetics
Question
3 answers
I want to determine the phylogenetic age of the members of the Lepidoptera (Sphingidae) based on mtDNA, using BEAST software; But I do not have a good indicator. Please what is mutation rate in the mitochondrial genomes of the Lepidoptera (in million years)?
Relevant answer
Answer
I would highly recommend to anyone interested in tectonic calibration to read 'Molecular Panbiogeography of the Tropics" (2012) and 'Biogeography of Australasia' (2014) as starting points. These books include hundreds of examples of tectonic correlation with extensive integration of phylogeny for an immense range of animal and plant groups, and also presents a critical account of the issues in molecular calibration. This is just the tip of the iceberg. As for molecular rates of divergence, there seem to be all sorts of models so you take your pick - assuming there is a molecular clock in any meaningful sense. Some have objected to tectonic calibration where it provides ages for a subclade that would then push the larger clade back into e.g. the pre-Cambrian. Of course they never consider that perhaps their clock model is erroneous. Primates provide an excellent example for tectonic correlation with group after group of distributions corresponding to major tectonic events of Mesozoic and Cenozoic time. Some molecular researchers fall back on the mystical 'I can't believe it' when tectonic ages considerably predate the oldest fossil, but other molecular researchers are quite happy when their fossil calibrated estimates considerably predate the oldest fossils. There is a lot of waffle thinking out there, but the fact is that the vast array of modern life shows tectonic correspondence with Mesozoic tectonics. That is the great reality. Fossil-calibrated molecular clocks were widely proclaimed to falsify that, but it was all smoke and mirrors.
  • asked a question related to Phylogenetics
Question
6 answers
Greetings from Brazil
I've been searching for the most appropriate gene to determine the phylogenetic relationship among Gram-negative bacteriophages and I've found several options, such as "major capsid protein", "helicase" and "terminase large subunit". Does anyone has some personal experience they'd like to share?
Thank you in advance.
Relevant answer
Answer
Abhijeet Singh I'll try it right away, thank you very much.
  • asked a question related to Phylogenetics
Question
5 answers
I have a 16s rRNA amplicon sequences (next generation sequence), I want to find out the phylogenetic diversity of a geographical region, How can I do that? please tell me the different approaches. thanking in anticipation.
… 
Relevant answer