Horst Bunke’s research while affiliated with University of Bern and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (653)


On the Impact of Using Utilities Rather than Costs for Graph Matching
  • Article
  • Publisher preview available

October 2018

·

66 Reads

·

6 Citations

Neural Processing Letters

·

·

Horst Bunke

The concept of graph edit distance constitutes one of the most flexible graph matching paradigms available. The major drawback of graph edit distance, viz. the exponential time complexity, has been recently overcome by means of a reformulation of the edit distance problem to a linear sum assignment problem. However, the substantial speed up of the matching is also accompanied by an approximation error on the distances. Major contribution of this paper is the introduction of a transformation process in order to convert the underlying cost model into a utility model. The benefit of this transformation is that it enables the integration of additional information in the assignment process. We empirically confirm the positive effects of this transformation on five benchmark graph sets with respect to the accuracy and run time of a distance based classifier.

View access options


Data Mining in Time Series and Streaming Databases

January 2018

·

216 Reads

·

17 Citations

This compendium is a completely revised version of an earlier book, Data Mining in Time Series Databases, by the same editors. It provides a unique collection of new articles written by leading experts that account for the latest developments in the field of time series and data stream mining. The emerging topics covered by the book include weightless neural modeling for mining data streams, using ensemble classifiers for imbalanced and evolving data streams, document stream mining with active learning, and many more. In particular, it addresses the domain of streaming data, which has recently become one of the emerging topics in Data Science, Big Data, and related areas. Existing titles do not provide sufficient information on this topic. © 2018 by World Scientific Publishing Co. Pte. Ltd. All rights reserved.



Improved Graph Edit Distance Approximation with Simulated Annealing

May 2017

·

53 Reads

·

15 Citations

Lecture Notes in Computer Science

The present paper is concerned with graph edit distance, which is widely accepted as one of the most flexible graph dissimilarity measures available. A recent algorithmic framework for approximating the graph edit distance overcomes the major drawback of this distance model, viz. its exponential time complexity. Yet, this particular approximation suffers from an overestimation of the true edit distance in general. Overall aim of the present paper is to improve the distance quality of this approximation by means of a post-processing search procedure. The employed search procedure is based on the idea of simulated annealing, which turns out to be particularly suitable for complex optimization problems. In an experimental evaluation on several graph data sets the benefit of this extension is empirically confirmed.


Product Graph-based Higher Order Contextual Similarities for Inexact Subgraph Matching

February 2017

·

79 Reads

·

26 Citations

Pattern Recognition

Many algorithms formulate graph matching as an optimization of an objective function of pairwise quantification of nodes and edges of two graphs to be matched. Pairwise measurements usually consider local attributes but disregard contextual information involved in graph structures. We address this issue by proposing contextual similarities between pairs of nodes. This is done by considering the tensor product graph (TPG) of two graphs to be matched, where each node is an ordered pair of nodes of the operand graphs. Contextual similarities between a pair of nodes are computed by accumulating weighted walks (normalized pairwise similarities) terminating at the corresponding paired node in TPG. Once the contextual similarities are obtained, we formulate subgraph matching as a node and edge selection problem in TPG. We use contextual similarities to construct an objective function and optimize it with a linear programming approach. Since random walk formulation through TPG takes into account higher order information, it is not a surprise that we obtain more reliable similarities and better discrimination among the nodes and edges. Experimental results shown on synthetic as well as real benchmarks illustrate that higher order contextual similarities add discriminating power and allow one to find approximate solutions to the subgraph matching problem.


Product Graph-based Higher Order Contextual Similarities for Inexact Subgraph Matching

February 2017

·

2 Reads

Many algorithms formulate graph matching as an optimization of an objective function of pairwise quantification of nodes and edges of two graphs to be matched. Pairwise measurements usually consider local attributes but disregard contextual information involved in graph structures. We address this issue by proposing contextual similarities between pairs of nodes. This is done by considering the tensor product graph (TPG) of two graphs to be matched, where each node is an ordered pair of nodes of the operand graphs. Contextual similarities between a pair of nodes are computed by accumulating weighted walks (normalized pairwise similarities) terminating at the corresponding paired node in TPG. Once the contextual similarities are obtained, we formulate subgraph matching as a node and edge selection problem in TPG. We use contextual similarities to construct an objective function and optimize it with a linear programming approach. Since random walk formulation through TPG takes into account higher order information, it is not a surprise that we obtain more reliable similarities and better discrimination among the nodes and edges. Experimental results shown on synthetic as well as real benchmarks illustrate that higher order contextual similarities add discriminating power and allow one to find approximate solutions to the subgraph matching problem.


Approximation of Graph Edit Distance by Means of a Utility Matrix

September 2016

·

44 Reads

·

7 Citations

Lecture Notes in Computer Science

Graph edit distance is one of the most popular graph matching paradigms available. By means of a reformulation of graph edit distance to an instance of a linear sum assignment problem, the major drawback of this dissimilarity model, viz. the exponential time complexity, has been invalidated recently. Yet, the substantial decrease of the computation time is at the expense of an approximation error. The present paper introduces a novel transformation that processes the underlying cost model into a utility model. The benefit of this transformation is that it enables the integration of additional information in the assignment process. We empirically confirm the positive effects of this transformation on three standard graph data sets. That is, we show that the accuracy of a distance based classifier can be improved with the proposed transformation while the run time remains nearly unaffected.


Improved Quadratic Time Approximation of Graph Edit Distance by Combining Hausdorff Matching and Greedy Assignment

June 2016

·

213 Reads

·

46 Citations

Pattern Recognition Letters

Approximation of graph edit distance in polynomial time enables us to compare large, arbitrarily labeled graphs for structural pattern recognition. In a recent approximation framework, bipartite graph matching (BP) has been proposed to reduce the problem of edit distance to a cubic-time linear sum assignment problem (LSAP) between local substructures. Following the same line of research, first attempts towards quadratic-time approximation have been made recently, including a lower bound based on Hausdorff matching (Hausdorff Edit Distance) and an upper bound based on greedy assignment (Greedy Edit Distance). In this paper, we compare the two approaches and derive a novel upper bound (BP2) which combines advantages of both. In an experimental evaluation on the IAM graph database repository, we demonstrate that the proposed quadratic-time methods perform equally well or, quite surprisingly, in some cases even better than the cubic-time method.


Approximate Graph Edit Distance in Quadratic Time

September 2015

·

71 Reads

·

40 Citations

IEEE/ACM Transactions on Computational Biology and Bioinformatics

Graph edit distance is one of the most flexible and general graph matching models available. The major drawback of graph edit distance, however, is its computational complexity that restricts its applicability to graphs of rather small size. Recently the authors of the present paper introduced a general approximation framework for the graph edit distance problem. The basic idea of this specific algorithm is to first compute an optimal assignment of independent local graph structures (including substitutions, deletions, and insertions of nodes and edges). This optimal assignment is complete and consistent with respect to the involved nodes of both graphs and can thus be used to instantly derive an admissible (yet suboptimal) solution for the original graph edit distance problem in O(n3) time. For large scale graphs or graph sets, however, the cubic time complexity may still be too high. Therefore, we propose to use suboptimal algorithms with quadratic rather than cubic time for solving the basic assignment problem. In particular, the present paper introduces five different greedy assignment algorithms in the context of graph edit distance approximation. In an experimental evaluation we show that these methods have great potential for further speeding up the computation of graph edit distance while the approximated distances remain sufficiently accurate for graph based pattern classification.


Citations (88)


... The structural or syntactic approach is used to represent the hierarchical (tree-like) structure (as shown in Figure 4). Here, structural information is used for the recognition and measurements of patterns, both in a local and global context (Bunke et al.,1990). Its aim is to classify data based on the structural interrelationships of features. ...

Reference:

A Systematic Analysis of Various Word Sense Disambiguation Approaches
Syntactic and Structural Pattern Recognition — Theory and Applications
  • Citing Book
  • November 2011

... Several exact and approximate algorithms have been presented in the past related to the computation of the distance between two graphs, one of them the graph edit distance [29] [30] [31]. Other distance between graphs in [32] [33] [34]. In this work, we have used one of these distances based on the maximum common subgraph mcs(G 1 ,G 2 ) and minimum common supergraph MCS(G 1 ,G 2 ) given in [35], the distance between two graphs that we have used is : ...

Graph-Theoretic Techniques for Web Content Mining
  • Citing Book
  • November 2011

... Any OCR requires human annotated image datasets to train the model; good training improves accuracy, shortens runtime and makes the algorithm more robust (Fornés et al., 2017;Holley, 2009;Prasad et al., 2020;Shen et al., 2021;Terras, 2022). There is no dataset suitable for most handwritten cursive or for various languages (Dahl et al., 2021;Fischer et al., 2014;Nikolaidou et al., 2022). It is not just a matter of clipping out characters and then labeling them (e.g., n instances of a handwritten 'a'). ...

The HisDoc Project. Automatic Analysis, Recognition, and Retrieval of Handwritten Historical Documents for Digital Libraries
  • Citing Chapter
  • December 2014

·

Horst Bunke

·

·

[...]

·

... Computing the has an exponential computational cost in the number of nodes of the involved graphs (Garey and Johnson, 1990). For this reason, some heuristic algorithms have been presented that deduce a sub-optimal distance in polynomial time (Riesen and Bunke, 2009;Serratosa, 2014aSerratosa, ,b, 2015Riesen et al., 2018). These algorithms define the edit cost of two graphs given a specific node-to-node mapping between nodes of both graphs as the addition of the substitution, deletion and insertion costs of the local structure of nodes. ...

On the Impact of Using Utilities Rather than Costs for Graph Matching

Neural Processing Letters

... Simulated annealing is a general meta-heuristic to escape local optima in local search by accepting deteriorations from local optima with probabilities that decrease as the algorithm runs ( 47 ). For NeEDL, we adapted a simulating annealing algorithm for graph edit distance computation proposed by Riesen et al. ( 48 ), using its implementation available in GEDLIB ( 49 ,50 ). ...

Improved Graph Edit Distance Approximation with Simulated Annealing
  • Citing Conference Paper
  • May 2017

Lecture Notes in Computer Science

... This is the author's version which has not been fully edited and content may change prior to final publication. [9], VF2 [6], Ceci [10], FilM [11], VF3 [12], Kim et al [24] Heuristic Algorithm-based Subgraph Matching G-Finder [25], Saga [26], PG-N [27] Learning-based graph matching DLGM [17], PCA-GM [18], RDGCN [20], GMNN [21], INFMCS [28] Learning-based subgraph matching Sub-GMN [14], AEDNet [15] volutions. However, our ablation study demonstrates that the sigmoid function is more effective for subgraph matching. ...

Product Graph-based Higher Order Contextual Similarities for Inexact Subgraph Matching

Pattern Recognition

... The CCM performs the computation until all mesh routers are connected considering the maximum communication range of mesh routers. In addition, we propose a mesh router placement optimization approach that combines SA and Delaunay Edges (DE) [14] in CCM (called DECCM-based SA). The DECCM-based SA approach considers a more realistic placement of mesh clients. ...

Delaunay-supported edges for image graphs
  • Citing Conference Paper
  • September 2015

... The present paper is concerned with structural pattern recognition with a strong focus on graph-based data representations [20]. The field of structural and graph-based pattern recognition has a long tradition [1,21] and can roughly be subdivided into three areas, viz. ...

Approximation of Graph Edit Distance by Means of a Utility Matrix
  • Citing Conference Paper
  • September 2016

Lecture Notes in Computer Science