MBEToolbox: a MATLAB toolbox for sequence data analysis in molecular biology and evolution.

Department of Microbiology, University of Hong Kong, Pokfulam, Hong Kong, China.
BMC Bioinformatics (Impact Factor: 2.67). 02/2005; 6:64. DOI: 10.1186/1471-2105-6-64
Source: PubMed

ABSTRACT MATLAB is a high-performance language for technical computing, integrating computation, visualization, and programming in an easy-to-use environment. It has been widely used in many areas, such as mathematics and computation, algorithm development, data acquisition, modeling, simulation, and scientific and engineering graphics. However, few functions are freely available in MATLAB to perform the sequence data analyses specifically required for molecular biology and evolution.
We have developed a MATLAB toolbox, called MBEToolbox, aimed at filling this gap by offering efficient implementations of the most needed functions in molecular biology and evolution. It can be used to manipulate aligned sequences, calculate evolutionary distances, estimate synonymous and nonsynonymous substitution rates, and infer phylogenetic trees. Moreover, it provides an extensible, functional framework for users with more specialized requirements to explore and analyze aligned nucleotide or protein sequences from an evolutionary perspective. The full functions in the toolbox are accessible through the command-line for seasoned MATLAB users. A graphical user interface, that may be especially useful for non-specialist end users, is also provided.
MBEToolbox is a useful tool that can aid in the exploration, interpretation and visualization of data in molecular biology and evolution. The software is publicly available at and


Available from: Xuhua Xia, Mar 31, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Bioinformatics is a field that requires its professionals to have a level of expertise in multiple disciplines in order to contribute in practice and research. In recognition of this fact, the College of Engineering at UPRM has set out to train a new generation of Engineers capable of tackling bioinformatics problems with mathematical and statistical proficiency, computational efficiency, and biological know-how. This work describes two courses, one in Quantitative Biology and the other in Statistics for Bioinformatics that will be part of a curricular sequence proposed to this end.
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In this study, a biosensor was developed to rapidly diagnose the swine sarcoptic mange (Sarcoptes scabies var. suis). The ELISA was modified to reduce the processing time for rapid diagnosis. The biosensor consists of a biological reaction part, and a measurement and control part. The biological reaction part was designed for using micro-pumps and valves for fluid transportation, and the measurement control part composed of a photodiode, a light-emitting diode fur light measurement, and a microcomputer to implement assay A polystyrene covet was used as a reaction chamber. Signal output was read as the rate of change in optical density at 645nm. Eighteen pigs diagnosed with sacroptic mange and 19 control pigs were tested. Fifteen sacbies-infested pigs showed positive results ( sensitivity). Sixteen control pigs showed negative results ( specificity). The system could execute a diagnosis cycle in about 45 min. The results suggest that this biosensor is useful for the rapid diagnosis of swine sacroptic mange.
    01/2005; 30(5). DOI:10.5307/JBE.2005.30.5.306
  • [Show abstract] [Hide abstract]
    ABSTRACT: Feature selection on mass spectrometry (MS) data is essential for improving classification performance and biomarker discovery. The number of MS samples is typically very small compared with the high dimensionality of the samples, which makes the problem of biomarker discovery very hard. In this paper, we propose the use of genetic programming for biomarker detection and classification of MS data. The proposed approach is composed of two phases: in the first phase, feature selection and ranking are performed. In the second phase, classification is performed. The results show that the proposed method can achieve better classification performance and biomarker detection rate than the information gain- (IG) based and the RELIEF feature selection methods. Meanwhile, four classifiers, Naive Bayes, J48 decision tree, random forest and support vector machines, are also used to further test the performance of the top ranked features. The results show that the four classifiers using the top ranked features from the proposed method achieve better performance than the IG and the RELIEF methods. Furthermore, GP also outperforms a genetic algorithm approach on most of the used data sets.
    Connection Science 04/2014; 26(3):215-243. DOI:10.1080/09540091.2014.906388 · 0.77 Impact Factor