Article

Active learning of causal networks with intervention experiments and optimal designs

Journal of Machine Learning Research (impact factor: 2.56). 01/2008; 9:2523-2547.

ABSTRACT The causal discovery from data is important for various scientific investigations. Because we cannot distinguish the different directed acyclic graphs (DAGs) in a Markov equivalence class learned from observational data, we have to collect further information on causal structures from experiments with external interventions. In this paper, we propose an active learning approach for discovering causal structures in which we first find a Markov equivalence class from observational data, and then we orient undirected edges in every chain component via intervention experiments separately. In the experiments, some variables are manipulated through external interventions. We discuss two kinds of intervention experiments, randomized experiment and quasi-experiment. Furthermore, we give two optimal designs of experiments, a batch-intervention design and a sequential-intervention design, to minimize the number of manipulated variables and the set of candidate structures based on the minimax and the maximum entropy criteria. We show theoretically that structural learning can be done locally in subgraphs of chain components without need of checking illegal v-structures and cycles in the whole network and that a Markov equivalence subclass obtained after each inter-vention can still be depicted as a chain graph.

0 0
 · 
0 Bookmarks
 · 
38 Views
  • Source
    Chapter: SemCaDo: A Serendipitous Strategy for Learning Causal Bayesian Networks Using Ontologies
    [show abstract] [hide abstract]
    ABSTRACT: Learning Causal Bayesian Networks (CBNs) is a new line of research in the machine learning field. Within the existing works in this direction [8, 12, 13], few of them have taken into account the gain that can be expected when integrating additional knowledge during the learning process. In this paper, we present a new serendipitous strategy for learning CBNs using prior knowledge extracted from ontologies. The integration of such domain’s semantic information can be very useful to reveal new causal relations and provide the necessary knowledge to anticipate the optimal choice of experimentations. Our strategy also supports the evolving character of the semantic background by reusing the causal discoveries in order to enrich the domain ontologies.
    01/1970: pages 182-193;

Full-text

View
1 Download
Available from

Keywords

acyclic graphs
 
batch-intervention design
 
candidate structures
 
causal discovery
 
causal structures
 
chain graph
 
checking illegal v-structures
 
discovering causal structures
 
external interventions
 
intervention experiments
 
manipulated variables
 
Markov equivalence class
 
Markov equivalence subclass
 
maximum entropy criteria
 
optimal designs
 
randomized experiment
 
sequential-intervention design
 
subgraphs
 
various scientific investigations
 
whole network
 

Yang-Bo He