Science topic
Ensemble - Science topic
Explore the latest questions and answers in Ensemble, and find Ensemble experts.
Questions related to Ensemble
Hello everyone!
I'm calculating protein dimer structure in CNS-solve v1.21 using distance restraints obtained from solution NMR experiments.
There is an issue during calculation that most structures (not all) in the ensemble have two specific amino acids: one tyrosine and one phenilalanine broken like shown in the picture. The problem reproduces even after I remove all restraints associated with this amino acids.
I tried to review topology file, but did not find anything suspicious about these residues.
I would greatly appreciate if you could give me any hints on how to solve this problem.
I've set up an emcee EnsembleSampler() with 50 walkers, 500 iterations. However, looking at the resulting traces, I don't think my walkers are fully exploring the parameter space. The walkers don't converge around what the true value of the parameter is.
The prior I'm giving to whole parameters is uniform and the p0 distribution is a random uniform distribution. The reason I believe this is wrong is because it appears that these walkers are exploring all of the parameter space and for the majority of their iterations without converging to the parameter
Graphs below:
Hi,
I need to quantify the expression of CD44v6 in my RNAseq data. For this, I would need the ensembl number. Does anyone know what the ensemble number for CD44v6 is?
Thank you.
Hi everybody,
I'm looking for an SNP (rs497692), but when I insert the rs in the databases, it returns the alleles as T>A / T>C . But there is frequency for T and C . Not for A . Why?
can anyone help me?
I am currently working on a financial project and require an expert in machine learning. Specifically, I need expertise in tree boosting, neural networks, random forests, linear and nonlinear frailty models, as well as ensemble methods. In addition to discussing the work, I will need the source code files of the models and analyses we develop. I hope this clarifies my requirements.
I want to compare ensemble mean and member mean with era5 data as time serise
Une image d'un caractère manuscrit est une distribution de pixels dans un vecteur chaque pixel est une valeur comprise entre O et 255. Cette distribution peut être utilisée pour comprendre la répartition des données et en tirer des conclusions. La distribution de valeurs dans un vecteur peut être représentée en général sous forme de diagramme, d'histogramme ou d'autres types de graphiques. Ces graphiques permettent de visualiser la fréquence de chaque valeur dans le vecteur et d'identifier les valeurs qui se produisent le plus ou le moins souvent. Il existe plusieurs mesures statistiques qui peuvent être utilisées pour décrire la distribution de valeurs dans un vecteur notamment la moyenne, la médiane, l'écart-type et la variance. Ces mesures permettent de caractèriser la distribution de manière plus précise et de comparer les distributions de différents vecteurs. La méthode à utiliser dans une distribution de valeurs dans un vecteur peut être influencée par plusieurs facteurs, tels que la taille de l'échantilion, la nature des données ou la présence de valeurs aberrantes. C'est ce demier point qui consiste la problématique de cet artice.
.
Les méthodes statistiques sont généralement utilisées pour comparer la variance de plusieurs vecteurs de même type. L'explorations de ces nombreuses méthodes anciennes et récentes m'a conduit à tester l'utilisation de Principal Components Analesis combiné avec la distance de Mahalanobis
PCA (analyse en composantes principales) est une technique qui permet de réduire la dimensionnalité d'un ensemble de données en transformant les variables d'origine en un nombre plus petit de variables non corrélées appelées composantes principales.
La distance de Mahalanobis est une mesure de la distance entre un point et une distribution multidimensionnelle, qui tient compte des covariances entre les différentes variables. Autrement dit la distance de Mahalanobis est comme une seuil de distance entre un point et un groupe d'autres points pour identifier les points les plus éloignés ou inhabituels (données aberrantes (outliers)) ou pour déterminer si un point de données appartient à une distribution particulière (classification).
Théoriquement l'application de PCA sur un ensemble d'images constituées de 28x28 (vecteur de 784 valeurs de pixels) réduira ces données à un nombre plus petit de variables, la distance Mahalanobis est appliquée sur les composantes principales pour identifier les images qui sont éloignées de la moyenne. si l'on trouve que l'image a une distance très élevée par rapport au centre de la distribution, cela peut indiquer que l'image est aberrante. Plus la distance de est grande, plus l'image est différente de l'ensemble de données. Le réajustement du seuil de distance utilisé est nécessaire en fonction des résultats pour identifier le seuil approprié.
l'utilisation de ces méthodes statistiques sur des images de caractères manuscrit peut-elle donner de bons résultats en pratique?
Background
MD simulation is often used to investigate conformational changes caused by mutation. Consider a case where one has two ensembles of trajectories. We wish to see if there is a statistically significant difference between the ensembles by comparing the mean Calpha distance between specific residues. We require observations to be independent from each other.
Question
Within a single trajectory if the observations are taken far enough apart are they considered independent. For instance if we track the distance between Ser55 and Thr101 can we treat the distances at time point 10ns, 20ns, 30ns etc as independent observations.
Hello,
I want to use AMBER to run a simulation of a membrane protein, small molecule and POPC bilayer membrane complex system, and the complex parameter files .inpcrd and .prmtop have been generated by tleap.
But I do not know how to optimize and balance the system. I searched some posts on the Internet and summed up the following optimization and balancing steps:
(1) Energy minimization of water and ions;
(2) Energy minimization of protein, small molecule and POPC;
(3) Heating system to 310K;
(4) Balancing water and ions using NVT ensemble;
(5) Balancing POPC using NPT ensemble;
(6) Balance protein and small molecule using NPT ensemble;
(7) Conventional molecular dynamics simulation under the NPT ensemble.
Whether this optimization and balancing process suitable for membrane protein, small molecules and POPC system? If not, which step needs to be improved?
In addition, if a residue of small molecule in .prmtop file is named SW2, how can I use the 'restraintmask' instruction in the .in file to restrict the small molecule?
Thank you!
Best wishes!
Does anyone know a method for generating a disordered protein ensemble with AlphaFold-Multimer? The protein I am interested in is disordered and is bound to a protein that is mostly folded. I know of some methods that have been described in the literature (for instance, varying the random seed), but if someone has implementation details on such an approach I would be extremely grateful.
To induce Notch signalling in Huh 7 cells, I plan on cloning the NICD and then using it to activate the Notch Pathway. I need the region in the DNA that specifically codes for the Notch 1 intracellular DNA.
I need to know how one can calulate the Wasserstein distance for probability measures of Gaussian Unitary Ensemble (GUE) and Gaussian Orthogonal Ensemble (GOE). These are two famous ensembles in Random Matrix Theory (RMT). Is there any analytical way or programming code you can help me with?
Does some one know some examples or tutorial how to use the biomaRt R package to automate the gene name recognition of the Ensembl database?
Best wishes,
Hi everyone,
I have a doubt regarding the high negative average pressure values in MD.
The prepared system was minimized (NVT ensemble) with 10ns using DESMOND software. While doing Simulation Quality Analysis, I found high negative average pressures after minimization. Further, when I performed MD (NPT ensemble) with 200ns, I observed the average pressure was around 4 bars. Is it okay to get high negative average pressure? What does the change from highly negative to positive values tell about the system?
I appreciate it if you provide me with an answer. I thank you in advance.
With regards,
I'm planning to do some molecular dynamic simulation with Gaussian, (trying to simulate the alignment of an ensemble of water molecules in the presence of strong field), can anyone please explain if that is possible with Gaussian?
how i can assign weights at the output of the class so that i can ensemble them
all of the models are already fitted.
Overfitting is a type of modeling error that results in the failure to predict future observations effectively or fit additional data in the existing model. It occurs when a function is too closely fit a limited set of data points and usually ends with more parameters than the data can accommodate. It is common for huge data sets to have some anomalies, so when this data is used for any kind of modeling, it can result in inaccuracies in the analysis.
Overfitting can be prevented by following a few methods namely-
- Cross-validation: Where the initial training data is split into several mini-test sets and each mini-data set is used to tune the model.
- Remove features: Remove irrelevant features manually from the algorithms and use feature selection heuristics to identify the important features
- Regularisation: This involves various ways of making your model simpler so that there’s little room for error due to obscurity. Adding penalty parameters and pruning your decision tree are ways of doing that.
- Ensembling: These are machine learning techniques for combining multiple separate predictions. The most popular methods of ensembling are bagging and boosting.
i have a large nanoparticle (diameter of 16 nm) and I want to study its entrance through a lipid bilayer. i am running in NPT ensemble using GROMACS. However, for pressure coupling there are isotropic and semiisotropic options. The mdp documentation says that for semiisotropic coupling, the z is decoupled from the and y directions and is useful when simulating membranes. however, I tried both options.
the one with semiisotropic coupling, a huge change in the box dimensions in all directions occurred and this was because the nanoparticle penetrated the upper leaflet so the lipids moved away in both x and y directions increasing these dimensions which resulted in a decrease in the z direction.
the one with the isotropic coupling did not have a huge differences in the box dimensions.
attached are the images of both trials
Hi,
I would like to generate background error using ensemble perturbation method. I am using WRF's genbe module. I am fairly new to modeling and I have no idea how to generate the ensemble outputs. Could someone please tell me the steps I have to follow to generate the ensemble outputs?
Thankyou.
I am running a cooling simulation of stainless steel alloy using EAM potential. The initial configuration at 5000[K] was well equilibrated with a density plateau at around 7.9g/cc simulated in the NPT ensemble followed by additional NVT ensemble simulation to achieve a well-equilibrated structure. However, during cooling, I notice that the density of the system continue to decrease, indicating box expansion while the temperature was decreasing! I am using an NPT ensemble for the simulation and pressure is kept at iso 0 0 (gauge value). I do not understand why the box keeps enlarging while kinetic energy is being withdrawn from the system (as the temperature is being reduced).
I have raster images from three models and I want to create their ensemble output. The correlation among rasters is near 70%. Is it a scientifically true approach? Can we combine 3 model outputs like that?
If yes, what statistical method should apply
We know that aqueous electrolyte solutions have a lower heat capacity compared to pure water. For example, the heat capacity of a saturated CaCl2 solution at 20℃ (74.5 g/100 g H2O) has a specific heat capacity (Cp) of ~2.4 kJ/kg∙K, much lower than that of water (4.18 kJ/kg∙K).
My question is, how can we explain this phenomenon on a molecular or even quantum perspective?
I understand that, at such a high concentration, there are very few "free" water molecules. The majority of them are "trapped" in the hydration shells of the Ca2+ and Cl- ions. These water molecules from dative covalent bonds with the ions, thus unable to have free translational or rotational movement (i.e. their degrees of freedom are decreased). The water-ion ensemble must now move together.
But how does that explain the lower heat capacity?
How can I combine three classifiers of deep learning in python language ?
hello everyone
my question is how i can find a standard deviation of a mode before and after optimization. i am working on supervised learning model. i am applying an ensemble technique to this. can anyone help me please.
thank you
Hi,
I wonder if anyone knows how to create ensembles using the pdb-tools (https://wenmr.science.uu.nl/pdbtools/reference)? The protein-protein docking have been completed by InterEvDock2 where I set different constraints and did several docking runs. Because the InterEvDock2 only performs rigid docking, I would like to normalize the score of different runs using pdb-tools. For each docking run, I need to create an ensemble with a few best docking poses. However, I don't know how to implement it in pdb-tools (what specific pipeline needs to be loaded).
Any feedback is welcome. Thanks.
Ensemble docking allow to dock a single ligand or a ligand library against multiple conformations of a single receptor.
Now, imagine we have a group of proteins which are functionally conserved and share similar ligand/s. Moreover, they are highly similar in the structures (Identity rate in AA level is more than 90%) and almost a perfect superimposition of 3D structure can be made by different tools.
Docking analysis was performed for each protein solely and as expected the binding pocket and residues are similar.
Now here is the question: Can we perform Ensembled docking for this situation?
Hello All, I am just a beginner on the CRISPR world. I need to extract a gene locus sequence from zebrafish. do you have a step-by-step guide to ensembl or any other software for me to do this?
hi
please guide
how to make multi model ensemble of regional climate model?
I am using south asia domain of cordex and my variables are precipitation, tmax and tmin.
there exists 153 different combinations for these three variables of historical, rcp 4.5 and rcp8.5 scenarios.
how to shortlist models and then how to proceed?
How we can find different independent configurations sampled from the NVT ensemble in gromacs?
Hi all,
I attempt to use the long short-term memory (LSTM) of a deep learning method to generate the precipitation ensemble of 20 CMIP6 model simulations for SSP scenarios. Dear all, could anybody provide some memo or specification about LSTM in ensemble use? Anyway, thank you!
Anyone did the GO term enrichment analysis for the non-model organism or plants Differentially expressed genes recently? I tried to use the agriGOv2 analysis toolkit to get the GO term but couldn't access it. Is it down permanently? ShinyGO is another option but the genes need to be in a specific format like panther, ensemble etc. Could you suggest a better option of doing the analysis in R or using other software?
hello guys!
Can anyone tell me how I can ensemble a neural network. I use the patternet type. if some one know please help me. I am doing my code using Matlab. Can anyone please help with my code?
hope to get a reply from you guys.
thank you
Can anyone suggest any ensembling methods for the output of pre-trained models? Suppose, there is a dataset containing cats and dogs. Three pre-trained models are applied i.e., VGG16, VGG19, and ResNet50. How will you apply ensembling techniques? Bagging, boosting, voting etc.
Hi all,
Besides CESM LENS2, which modeling groups provide large ensemble experiments in the CMIP6 era?
Thanks
Tl;dr: I’m trying to convert gene IDs of an obscure MRSA strain from Ensembl Bacteria to KEGG.
Hello,
I’m trying to do a pathway enrichment analysis of MRSA strain 107 using GSEA. I have gene expression data that are associated with the gene IDs from Ensembl Bacteria. I plan to use KEGG as my pathway database.
GSEA requires a .gmt file of the gene IDs/enrichment data (of which the gene IDs are from Ensembl), then requires a pathway file (from KEGG). If I try to do the analysis with both of these files, the gene IDs don’t match up, so GSEA can’t do it.
My question is whether there’s a way to convert these gene IDs specifically with these strains of MRSA from Ensembl Bacteria to a site like KEGG. Here are the resources I’ve already tried:
DAVID
Dbtodb
Syngoportal
G:convert
MetaScape
BioMart from Ensembl
Annotationdbi
All these are tools that work, but they don’t include my strain. How should I convert these Ensembl Bacteria gene IDs? Is there another option I don’t know about?
PS. I don’t need to use KEGG; if a different pathway database works, that would also be acceptable.
I was wondering if training a neural network in the deep ensemble setting can lead to a network with a posterior vs. a point estimate architecture?
Recently there have been discussions over the interpretability of Deep ensembles as Bayesian models. This led me to this thought that whether or not we can learn a posterior at the end of training in such a scenario?
I would like to know why the system is equilibrated at 10 K with NVT and followed with 300 K with the NPT method while performing MD simulations. Please, provide available references.
Why do I get high negative pressure values at the NVT ensemble step with 10 ns at 10 K temperature, and the pressure increases to single-digit positive pressure values at the NPT ensemble step with 200 ns at 300 K temperature?
Please, provide the answer with a reference. I appreciate any help you can provide.
hi. I am working on ensemble learning algorithm and how i can implement this using an ANN. please help me on this. and how a simple ensemble looks like in matlab can anyone help me on this
Could you explain to me more about the cubist model?
is it ensemble or individual?
Thanks
Also on NCBI, these Ensembl IDs match to one gene only
I want to develop an ensemble approach where the final layer of a CNN model(Flatten layer in this case) will be followed by a K-Means Clustering algorithm where I want to cluster inputs into a number of categories same as required number of categories in a task. I want help regarding how to apply K-Means Clustering with a CNN.
Hi everyone,
I am currently developing a Gibbs Ensemble Monte Carlo algorithm. I am trying to implement a Widom Insertion Method to calculate the chemical potential of the liquid-phase and gas-phase boxes; however, I haven't been able to successfully do it. My inter-particle potential is that of a hard sphere (i.e. equal to infinity when particles overlap). I suspect the issue with my implementation has to do with how I've been treating the instances where the inserted particles overlap with any of the particles already present in the box I'm trying to determine the chemical potential of. I've been guiding myself by the work of Frenkel & Smit; more specifically, the article attached. Can anyone with experience in this topic help me figure this out?
Thank you beforehand for any assistance anyone may provide!
Hi!
I`ve downloaded the "Supplementary_files_format_and_content" of one deposited dataset from GEO (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE129718 ). The excel file includes RPKM values, genes and annotation for each Sample. I am not interested in re-analyzing everything from scratch but I would like just to see the trend (either UP or DOWN regulation) of specific genes I am interested. My plan then was to convert the gene_id column into gene_symbol column so it would be easier for me to identify my gene of interest but I have noticed that a lot of gene has multiple transcript_id for the same gene (and ensembl id). How am I supposed to deal with this multiple "transcript_id"? which one am I suppose to look at?
I can have for example same gene_id (ENSMUSG00000028943), same locus (4:152120313-152152454) but different length (1375, 1593 etc) and different transcript_id (ENSMUST00000105657, ENSMUST00000105656). the transcrpt_id a lots of them for multiple genes (not only 2 as shown brefly above)!
Thank you in advance!
Camilla
Hi. I'm dealing with spatial transcriptomic data and find the gene of interest. Now we need to know what transcript isoform of the RNA was expressed in our sample. However, NCBI shows this gene has 3 isoforms while ENSEMBL only shows one. Thus we want to run spaceranger with the reference of NCBI, but 10X only provides the mice reference of ENSEMBL. So I downloaded the gff and fna file from NCBI, transfered the gff into gtf, then generated the reference directory as taught in the spaceranger tutorial. But spaceraneger can not work with this reference directory. It just crashes in the middle of the process. Did I do something wrong when generating the reference? Or does anyone have the mice NCBI reference for spaceranger?
Hi, everyone,
I just calculated a pure water-box(32 molecules pre-equilibrated by LAMMPS) to learn how to simulate a NVT ensemble by VASP, but unfortunately, I cannot get a converged energy profile(shown as the figure). It keeps increasing! Could anyone provide some suggestions?
Besides, I noticed that it is the potential energy of the Nosé thermostat keeps increasing, while the F or E0 converges well.
Here is my INCAR file:
SYSTEM = Test
LSCALAPACK = .FALSE.
#Start parameters
NPAR = 6
PREC = Normal
LREAL = Auto
ISTART = 0
ICHARG = 2
#Electronic relaxation
ENCUT = 600
ALGO = Fast
NELM = 300
EDIFF = 1E-5
NELMIN = 5
#MD parameters
ISYM = 0
IBRION = 0
POTIM = 0.5
NSW = 30000
TEBEG = 300
IWAVPR = 11
#NVT canonical model
ISIF = 2
MDALGO = 2
SMASS = 0
#DOS related
ISMEAR = 0
SIGMA = 0.05
#Switches
LWAVE = .FALSE.
LCHARG = .FALSE.
IVDW = 11
Thanks a lot.
i was making a classification model ( 3 class ) for early detection of cracks in ball bearings, the data set is limited 120 rows and 14 features. the classifiers and their parameters is listed below can you please suggest which model will be the best (not simply accuracy also consider model complexity )
I am working on the future stream flow of my study area by using a single GCM. Before this, I did accuracy assessment of all the available GCMs on the basis of available data and selected the top most model for further use. Is this a good approach? I do not want to use the ensemble data of 4-5 models.
We need to prepare a weighted average multi-model ensemble of projected future daily precipitation by assigning weights to individual CMIP6 models based on past performance. For this purpose, We want to use Bayesian Model Averaging. Since the distribution of precipitation is highly skewed with large number of zeros in it, a mixed (discrete-gamma) distribution is preferred as the conditional PDF as per Sloughter et al., (2007).
Considering 'y' as the reference (observed ) data and 'fk' as the modelled data of kth model,
The conditional PDF consists of two parts. The first part estimates P(y=0|fk) using a logistic regression model. The second part consists the following the term P(y>0|fk)*g(y|fk).
Since the computation of P(y>0|fk) is not mentioned in the referred manuscript, If I can compute P(y=0|fk), Can I compute P(y>0|fk) as 1-P(y=0|fk) in this case?
If not, Can someone help in computing P(y>0|fk)?
You can find the the referred paper here https://doi.org/10.1175/MWR3441.1
Thanks
Dear researchers
Objective:
We've applied machine learning methods such as artificial neural networks, random forest, and support vector machines to predict stroke patient's recovery.
Materials and methods:
We have stroke patients' clinical data from EMRs(electronic medical records) and their kinematic data obtained by the exoskeleton robot's sensor system(from gait training).
The clinical data are ordinal and categorical, and the kinematic data are time-series data.
Clinical data and kinematic data have been integrated into tabular data by applying moving windows to time-series data (obtained mean, std, median, max, and min).
Limitations:
In our experience, it was not easy to use all the data for training at once because the types and characteristics of clinical data and kinematic data were different.
Thus, we are applying the ensembling method to various neural network models.
(We've tried conventional bagging or stacking algorithms to the outputs of the neural networks.)
Question:
At this point, we would like to know some reasonable, preferred, recommended methods for ensembling the neural network models with different data learned separately. (i.e., how to combine a neural network model trained by clinical data and another model trained by kinematic data)
Hello everyone
I am working on a sparsely gauged mountainous watershed.
I want to use RCM for precipitation and temperatures.
Please help me how to select RCMs and please also share source from they can be obtained.
I have a VASP MD simulation of a 2x2x1 supercell of Al2O3 totaling 120 atoms. The supercell was initially relaxed and then run for 1500 time steps (0.1 fs time step, 1e-7 EDIFF) in the NVE ensemble (MDALGO=1, ANDERSEN_PROB=0). Velocities were initialized to 500 K (TEBEG=500). As a sanity check, I ran the same MD simulation with TEBEG=0 and the energy does remain constant. I'm struggling to understand why there is an initial jump in the energy. My intuition is that the energy should be more or less constant as in classical MD. Is there a reason for this?
Is there any simple code to perform the training of ensemble classifier of SVM and ANN on a set of data (available in Matlab like wine, fisheriris, etc... )
Thanks
Python code for forecasting using ensemble model is needed to study.
I have identified 12 transcript variants of my gene of interest from ensembl and I want to find the expression of these transcripts in body tissues using GTEx. I think to do this however, I need to have the rs number for the transcript variants. I was wondering if anyone can suggest the best way to go about finding this information out as I am struggling?
The main result of decoherence theory is that the non-diagonal elements of a quantum object's density matrix become zero due to uncontrolled interactions with the environment. For me, that only means that there will we no more interference effects between the superposed states. But there still remain the diagonal elements of the density matrix. So there is still a superposition of classical alternatives left. How does that solve the measurement problem ?
Moreover, doesn't the mathematical derivation of the decoherence effect involve an ensemble average over all possible environmental disturbances ? How does this help when we are interested in the behavior of a specific system in a specific environment ?
I'm working on the impact of climate change on water resources. how to choose the best ensemble from RCM projected rainfall? what method I should use to compare different RCM and choose the best out of that?
In addition to the experimental data, various thermodynamic models are used to evaluate defect concentration in materials. How to understand those thermodynamic models such as Wagner Schottky and Bragg- Williams?
Regards
Subha Sanket Panda
I have generated denoised images using several models and would like to ensemble at the prediction level to achieve superior denoising results. What would be the best way to combine (averaging, max voting, weighted averaging, etc.) these denoised images to achieve superior denoising performance?
Hello everybody
I ran 50 ns MD simulation in NPT ensemble, using Desmond, on tyrosinase a metallo-enzyme containing Cu 2+. I am doing this to evaluate the stability of the complex obtained from molecular docking of the protein with an active ligand. Before running the production stage, I used the default relaxation protocol provided by Desmond. The system was parametrized by employing the OPLS3e force field.
Cu 2+ chelation by the catalytic histidines remains stable during the entire simulation. However, the ligand, which do not chelate the ions, already at the first frame of the simulation, tend to leave the active site. So, I would like to figure out what could be the reason. Furthermore, I would like to have some suggestion about some specific relaxation protocols different from default one, to deal with the problem just mentioned.
hello,
in order to find methylated regions of murine promoter we are currently integrating data from Ensembl, DBTSS and EPD. Any suggestion for further databases?
I am a beginner at Molecular dynamics. I am trying to gather snapshots of a given material at different temperatures. For each temperature, I have thought to increase the system temperature to "T" K, and equilibriate at that temperature. For the first part(heating) I am using an NVE ensemble. But somehow the temperature is not raising beyond 0K (start temperature). Is this because there is no thermostat at NVE? What can be the alternative route? Using something like the Berendsen Thermostat(SMASS=-1)?
Input file :
PREC = Normal ! standard precision
ENMAX = 400 ! cutoff should be set manually
ISMEAR = 0 ; SIGMA = 0.1
ISYM = 0
IBRION = 0 ! molecular dynamics
IALGO=48
ISIF = 0
NSW = 380 ! 1000 steps
POTIM = 0.5 ! timestep 0.5 fs
MDALGO = 0
SMASS = -3
TEBEG = 0; TEEND = 190 ! temperature
Hi there, Is there any software tool or database to identify the entire 5' and 3' UTR regions of bacterial gene? I am aware that eukaryotic genes are clearly annotated in Ensembl and Genbank with these details. But unfortunately I couldnt able to find this information for bacterial genes. Your help on this would be very much appreciated. Many thanks in advance.
Hi
Dear researchers
In ensemble-based architectural design
Which algorithms are more useful for classification?
What is the difference between parallel and ensemble architecture?
Thanks
I am waiting for your answer
I am new to Docking and MD, and currently try to catch on it. I try one of the webserver for docking, but unfortunately not working. it says " Your PDB contains multiple forms of the same residue VAL 134. This is not supported in the current form. If you would like to supply multiple conformations, please create an ensemble". Then, when I checked manually, I found many atom that had different version (attached in file). Anyone has suggestion to fix the problem? Thanks!
While teaching Quantum mechanics to beginners, do you feel that the traditonal historical development of the subject followed by wave mechanics approach be replaced by axiomatic introduction to the subject, followed by discussion on Quantum mechanics of spin ensembles approach ?
which would be better mode of exposition option for such studentship level ?
On what base could it be possible to determine the number of models? does using two or three models during a given application could it imply multi-model approach?
Do you have excellent knowledge of both SAS and Matlab programming, and would you be interested in collaborating on a manuscript that deals with Methodologies for Ensemble Forecasting, with application to fisheries population dynamics? You are preferably a MSc/PhD student with strong quantitative background.
Hello,
I have a list of Ensembl protein Ids ("ENSP...", got them from PAXdb) and I wish to find their matching dna sequences.
It seems trivial but I didn't find a way to do it...
I could find the appropriate gene Id for each protein and then get the cds nucleotide sequence but it seems inaccurate (because of alternative splicing).
Any thoughts?
Thank you!
Hello All,
I am working on MD simulation study using DESMOND for a protein-ligand complex (size of my protein is around 500 a.a). Can anyone please tell me on what basis I need to set the different parameters for the same, like -
1. Simulation time
2. Recording interval for energy and Trajectory
3. Ensemble Class (NPT, NVT, etc)
Thank you all
Regards
Does anybody know? The site http://pedb.vib.be/ seems to do not work. Maybe, it is located now in different site?
I am trying to calculate some hydrodynamic properties from MD results. Part of the process in to calculate the transvers current correlation function which is formulated as C(q,t)=⟨J∗(0)J(t)⟩
. The issue is that this formula is regarded as canonical ensemble average in literature which should be calculated based on parameter $\Beta$. However my intuition is that this should be a form of autocorrelation or cross correlation of a rolling window. This is confusing to me and I would like to ask if anyone can provide me a pseudo code example for this calculation.