Volodya Vovk’s research while affiliated with Royal Holloway, University of London and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (56)


Table 2 .1: Classification of algorithms according to their output 
Figure 3.3: Forced accuracy of VM-1NN (dashed line), VM-RF1/VM-RF2A (solid lines) and VM-SVM2 (dotted lines) applied to the UKCTOCS OC data with the different number of features 
Table 3 .1: Data sets used in algorithmic testing 
Figure 3.2: Validity of Venn machine VM-RF2A with 5 categories applied to the Sonar data in the leave-one-out mode: the number of errors lies between the cumulative lower error probabilities and cumulative upper error probabilities up to statistical fluctuations 
Table 3 .7: Venn taxonomies applied to data sets other than Sonar, their para- meters K and the corresponding number of categories K 

+21

Multiprobabilistic prediction in early medical diagnoses
  • Article
  • Full-text available

June 2013

·

192 Reads

·

14 Citations

Annals of Mathematics and Artificial Intelligence

·

Dmitry Devetyarov

·

Volodya Vovk

·

[...]

·

This paper describes the methodology of providing multiprobability predictions for proteomic mass spectrometry data. The methodology is based on a newly developed machine learning framework called Venn machines. Is allows to output a valid probability interval. The methodology is designed for mass spectrometry data. For demonstrative purposes, we applied this methodology to MALDI-TOF data sets in order to predict the diagnosis of heart disease and early diagnoses of ovarian cancer and breast cancer. The experiments showed that probability intervals are narrow, that is, the output of the multiprobability predictor is similar to a single probability distribution. In addition, probability intervals produced for heart disease and ovarian cancer data were more accurate than the output of corresponding probability predictor. When Venn machines were forced to make point predictions, the accuracy of such predictions is for the most data better than the accuracy of the underlying algorithm that outputs single probability distribution of a label. Application of this methodology to MALDI-TOF data sets empirically demonstrates the validity. The accuracy of the proposed method on ovarian cancer data rises from 66.7 % 11 months in advance of the moment of diagnosis to up to 90.2 % at the moment of diagnosis. The same approach has been applied to heart disease data without time dependency, although the achieved accuracy was not as high (up to 69.9 %). The methodology allowed us to confirm mass spectrometry peaks previously identified as carrying statistically significant information for discrimination between controls and cases.

Download

Learning by Transduction

January 2013

·

154 Reads

·

227 Citations

We describe a method for predicting a classification of an object given classifications of the objects in the training set, assuming that the pairs object/classification are generated by an i.i.d. process from a continuous probability distribution. Our method is a modification of Vapnik's support-vector machine; its main novelty is that it gives not only the prediction itself but also a practicable measure of the evidence found in support of that prediction. We also describe a procedure for assigning degrees of confidence to predictions made by the support vector machine. Some experimental results are presented, and possible extensions of the algorithms are discussed.


Multiprobabilistic Venn Predictors with Logistic Regression

September 2012

·

110 Reads

·

8 Citations

IFIP Advances in Information and Communication Technology

This paper describes the methodology of providing multiprobability predictions for proteomic mass spectrometry data. The methodology is based on a newly developed machine learning framework called Venn machines. They allow us to output a valid probability interval. We apply this methodology to mass spectrometry data sets in order to predict the diagnosis of heart disease and early diagnoses of ovarian cancer. The experiments show that probability intervals are valid and narrow. In addition, probability intervals were compared with the output of a corresponding probability predictor. © 2012 IFIP International Federation for Information Processing.


Fig. 1 Example of a spectrum with identified peaks
Table 3 Dynamics of confidence and credibility for measurements taken for two ovarian-cancer cases
Fig. 4 Cumulative logistic regression predictions for the OC data (all samples); for the BC data (samples in the 5-17 month time slot)
of Mondrian predictors applied to the ovarian-cancer data set (CA125 and 5 most frequent peaks) in the leave-one-out mode in different time slots
Conformal predictors in early diagnostics of ovarian and breast cancers

September 2012

·

151 Reads

·

20 Citations

Progress in Artificial Intelligence

The work describes an application of a recently developed machine-learning technique called Mondrian predictors to risk assessment of ovarian and breast cancers. The analysis is based on mass spectrometry profiling of human serum samples that were collected in the United Kingdom Collaborative Trial of Ovarian Cancer Screening. The work describes the technique and presents the results of classification (diagnosis) and the corresponding measures of confidence of the diagnostics. The main advantage of this approach is a proven validity of prediction. The work also describes an approach to improve early diagnosis of ovarian and breast cancers since the data in the United Kingdom Collaborative Trial of Ovarian Cancer Screening were collected over a period of 7 years and do allow to make observations of changes in human serum over that period of time. Significance of improvement is confirmed statistically (for up to 11 months for ovarian cancer and 9 months for breast cancer). In addition, the methodology allowed us to pinpoint the same mass spectrometry peaks as previously detected as carrying statistically significant information for discrimination between healthy and diseased patients. The results are discussed.


Early detection of ovarian cancer in samples pre-diagnosis using CA125 and MALDI-MS peaks

November 2011

·

59 Reads

·

22 Citations

Cancer Genomics & Proteomics

A nested case-control discovery study was undertaken to test whether information within the serum peptidome can improve on the utility of CA125 for early ovarian cancer detection. High-throughput matrix-assisted laser desorption ionisation mass spectrometry (MALDI-MS) was used to profile 295 serum samples from women pre-dating their ovarian cancer diagnosis and from 585 matched control samples. Classification rules incorporating CA125 and MS peak intensities were tested for discriminating ability. Two peaks were found which in combination with CA125 discriminated cases from controls up to 15 and 11 months before diagnosis, respectively, and earlier than using CA125 alone. One peak was identified as connective tissue-activating peptide III (CTAPIII), whilst the other was putatively identified as platelet factor 4 (PF4). ELISA data supported the down-regulation of PF4 in early cancer cases. Serum peptide information with CA125 improves lead time for early detection of ovarian cancer. The candidate markers are platelet-derived chemokines, suggesting a link between platelet function and tumour development.


Figure 1. Calibration plots for the Gaussian prediction intervals on Gaussian (top left) and Laplace (bottom left) data and for the Laplace prediction intervals on Gaussian (top right) and Laplace (bottom right) data. The horizontal axis is ∈ (0, 0.2], and the range of the vertical axis is also [0, 0.2].  
Figure 2. The median widths of prediction intervals for various ∈ (0, 0.2], with the same layout as Figure 1. The range of the vertical axis is always [0, 40].  
Figure 5. Median widths of the prediction intervals for the ChickWeight data set.  
Conditional Prediction Intervals for Linear Regression

January 2010

·

158 Reads

·

5 Citations

We construct prediction intervals for the linear regression model with IID errors with a known distribution, not necessarily Gaussian. The coverage probability of our prediction intervals is equal to the nominal confidence level not only unconditionally but also conditionally given a natural sigma-algebra of invariant events. This implies, in particular, the perfect calibration of our prediction intervals in the on-line mode of prediction.


Table 1 Data distribution. APP DIV PPU NAP CHO INO PAN RCO DYS Total 
Table 2 Predictive Accuracy of NN-ICP Compared to Other Methods. 
Confidence Predictions for the Diagnosis of Acute Abdominal Pain

July 2009

·

80 Reads

·

24 Citations

IFIP International Federation for Information Processing

Most current machine learning systems for medical decision support do not produce any indication of how reliable each of their predictions is. However, an indication of this kind is highly desirable especially in the medical field. This paper deals with this problem by applying a recently developed technique for assigning confidence measures to predictions, called conformal prediction, to the problem of acute abdominal pain diagnosis. The data used consist of a large number of hospital records of patients who suffered acute abdominal pain. Each record is described by 33 symptoms and is assigned to one of nine diagnostic groups. The proposed method is based on Neural Networks and for each patient it can produce either the most likely diagnosis together with an associated confidence measure, or the set of all possible diagnoses needed to satisfy a given level of confidence.


Table 3 Predictive Accuracy of NN-ICP Compared to Other Methods.
Table 4 NN-ICP Set Prediction Results.
Table 5 NN-ICP Set Prediction Results by Diagnostic Group with Non-conformity Measure (9) for a Confidence Level of 90%.
Reliable diagnosis of acute abdominal pain with conformal prediction

June 2009

·

664 Reads

·

48 Citations

International Journal of Engineering Intelligent Systems for Electrical Engineering and Communications

Medical decision support is an area in which a lot of machine learning research has been conducted and several diagnostic and prognostic systems have been developed. The majority of these systems only produce bare predictions, without any indication of how reliable each of these predictions is. An indication of this kind however, is highly desirable especially in the medical field. In this paper we address this problem with the use of a recently developed technique, called conformal prediction, for accompanying the predictions of traditional machine learning algorithms with measures of their accuracy and reliability. We apply conformal prediction based on a Neural Network classifier to the problem of acute abdominal pain diagnosis and obtain predictions which have a high level of accuracy and are complemented with well-calibrated and practically useful confidence measures.


Serum Proteomic Abnormality Predating Screen Detection of Ovarian Cancer

May 2009

·

27 Reads

·

23 Citations

The Computer Journal

Ovarian cancer is characterized by vague, non-specific symptoms, advanced stage at diagnosis and poor overall survival. A nested case control study was undertaken on stored serial serum samples from women who developed ovarian cancer and healthy controls (matched for serum processing and storage conditions as well as attributes such as age) in a pilot randomized controlled trial of ovarian cancer screening. The unique feature of this study is that the women were screened for up to 7 years. The serum samples underwent prefractionation using a reversed-phase batch extraction protocol prior to MALDI-TOF MS data acquisition. Our exploratory analysis shows that combining a single MS peak with CA125 allows statistically significant discrimination at the 5% level between cases and controls up to 12 months in advance of the original diagnosis of ovarian cancer. Such combinations work much better than a single peak or CA125 alone. This paper demonstrates that mass spectra from the low molecular weight serum proteome carry information useful for early detection of ovarian cancer. The next step is to identify the specific biomarkers that make early detection possible.



Citations (49)


... Our work investigates the impact of various data characteristics-such as dataset size, noise, and dimensionality-on CP performance in regression settings using Inductive Conformal Prediction (ICP) [7,9]. ICP is one of the two major categories of conformal predictors; the other being Transductive CP (TCP) [10]. Specifically, we will simulate conditions with significantly smaller datasets compared to other studies. ...

Reference:

Inductive Conformal Prediction under Data Scarcity: Exploring the Impacts of Nonconformity Measures
Transduction with Confidence and Credibility
  • Citing Conference Paper
  • January 1999

... The KME technique maps joint, marginal and conditional probability distributions to vectors in a high (or even infinite) dimensional feature space that completely characterises the distribution (Fukumizu, Song, and Gretton 2011). Building upon KME, we used the maximum mean discrepancy (MMD) (Smola et al. 2007), a distance metric defined on the space of probability measures (here, EEG spectrograms) in combination with Kernel ridge regression (KRR) (Saunders, Gammerman, and Vovk (1998)). We refer to our approach as Kernel mean embedding regression (KMER), which we compare to KRR and ridge regression (RR). ...

Ridge Regression Learning Algorithm in Dual Variables
  • Citing Conference Paper
  • January 1998

... Before the appearance of inductive conformal predictors, several other possibilities had been studied, but not with great success. To speed computations up in a multi-class pattern recognition problem which uses support vector machines in its implementation, Saunders, Gammerman and Vovk (2000) used a hashing function to split the training set into smaller subsets, of roughly equal size, which are then used to construct a number of support vector machines. In a different way, just to mention but a few, Ho and Wechsler (2004) exploit the adiabatic version of incremental support vector machine, and lately Vovk (2013b) introduces Bonferroni predictors, a simple modification based on the idea of the Bonferroni adjustment of p-values. ...

Computationally Efficient Transductive Machines
  • Citing Conference Paper
  • January 2000

... In fact, I believe that work should be done in this direction, which is a promising field of investigation. Furthermore, it turns out that the best possible formal definition of technical complexity, as defined above, is that given by Kolmogorov (Gammerman and Vovk, 1999) describes the dynamics of the phenomenon. In this context, we should point out that "central banks are always faced with the problem of choosing the most appropriate model or class of models, given the existing economic situation" (ECB, 2001: p19). ...

Kolmogorov complexity: Sources, theory and applications
  • Citing Article
  • January 1999

The Computer Journal

... The proposed approach is based on the recently developed Venn Prediction (VP) framework [20] , which produces probabilistic bounds that are guaranteed to contain well-calibrated probabilities (up to statistical fluctuations). Until now the VP framework has been combined with the k -Nearest Neighbour classifier in [20] and [4], with Support Vector Machines in [9, 21], with Logistic Regression in [15] and with Artificial Neural Networks (ANN) in [16, 17]. Here we follow a modified version of the ANN-VP approach proposed in [17], so as to address the class imbalance problem of the particular task; the same approach with different oversampling and undersampling schemes was followed in [18, 19] for the detection of Vesicoureteral Reflux. ...

Multiprobabilistic Venn Predictors with Logistic Regression

IFIP Advances in Information and Communication Technology

... Conformal Prediction (CP) methods conformalize the prediction. CP methods, which are distribution-free and lightweight, have shown some successful application in health-related domains, mostly for cancer prediction ( [33], [34], [35], [32], [36], [34], [37], [38], [39]). ...

Conformal predictors in early diagnostics of ovarian and breast cancers

Progress in Artificial Intelligence

... Palabras clave: árboles tropicales; madera ilegal; imágenes macroscópicas; aprendizaje automático; herramientas de corte; microscopio portátil. contar con estrategias y decisiones en tiempo real (Nouretdinov et al., 2015;Porcelli & Martínez, 2020;Silver et al., 2018;Simić et al., 2018;Vinyals et al., 2019;Ye, 2015). ...

Multiprobabilistic prediction in early medical diagnoses

Annals of Mathematics and Artificial Intelligence

... Radial basis function classifiers were introduced in SVM to solve nonlinear separable problems (Scholkopf et al, 1997). The use of SVM for density estimation (Weston et al, 1997) and ANOVA decomposition (Stitson et al, 1997) has also been studied. Least squares SVM (LS-SVM) (Suykens and Vandewalle, 1999) modifies the equality constraints in the optimization problem to solve a set of linear equations instead of quadratic ones. ...

Support vector anova decomposition
  • Citing Article
  • January 1997

... Here, ∈ ℝ denotes the weight, and ∈ ℝ denotes the bias in (8). It is important to note that to train an SVM; the labels must be mapped either -1 and 1 or 0 and 1 (Stitson et al. (1996); Suthaharan and Suthaharan (2016b)). For linearly separable data, to obtain the final classifier ↦ sign ( + ) ...

Theory of Support Vector Machines
  • Citing Article
  • January 1996

... It can be checked that A is stable w.r.t. the extremes p 14 or the means p 23 permutations. P is stable for p 12 and p 34 permutations, while R is stable for p 13 and p 24 d, a, b)). This is observable on their truth tables: See the top part of Table 1, where only the 6 patterns that make the logical proportions true appear). ...

Computationally Ecient Transductive Machines