ArticlePDF Available

New Acyclic and Star Coloring Algorithms with Application to Computing Hessians

January 2007
SIAM Journal on Scientific Computing 29(3):1042-1072

January 2007
29(3):1042-1072

DOI:10.1137/050639879

Source
DBLP

Authors:

Assefaw Gebremedhin

Washington State University

Fredrik Manne

University of Bergen

Acyclic and star coloring problems are specialized vertex coloring problems that arise in the efficient computation of Hessians using automatic differentiation or finite differencing, when both sparsity and symmetry are exploited. We present an algorithmic paradigm for finding heuristic solutions for these two NP-hard problems. The underlying common technique is the exploitation of the structure of two-colored induced subgraphs. For a graph G on n vertices and m edges, the time complexity of our star coloring algorithm is O(nd2), where dk, a generalization of vertex degree, denotes the average number of distinct paths of length at most k edges starting at a vertex in G. The time complexity of our acyclic coloring algorithm is larger by a multiplicative factor involving the inverse of Ackermann's function. The space complexity of both algorithms is O(m). To the best of our knowledge, our work is the first practical algorithm for the acyclic coloring problem. For the star coloring problem, our algorithm uses fewer colors and is considerably faster than a previously known O(nd3)-time algorithm. Computational results from experiments on various large-size test graphs demonstrate that the algorithms are fast and produce highly effective solutions. The use of these algorithms in Hessian computation is expected to reduce overall runtime drastically.

(a): A partial star coloring of a graph; vertex v is the only vertex that remains to be colored. (b)-(f): A subset of the two-colored induced subgraphs incident on the vertex v; each subgraph is a collection of stars. The illustration uses the six colors red, blue, green, yellow, pink, and black (Bk).

…

Visualizing a step in greedy coloring algorithms; v is the vertex currently being colored. Table 6.1 shows how the set of forbidden colors for v is determined in various cases.

…

Execution time comparison. Vertices are colored in the orderings specified in Figure 7.1. The plots, in semilog scale, are normalized relative to the runtime of Algorithm D2, values listed under column D2 of Table 7.1.

…

3 Top row: Total number of colors and runtime summed over all test graphs, when vertices are colored in their natural order in an input graph (Nat). Bottom row: Color savings and relative time cost in comparison with coloring using an appropriate SL ordering (numbers listed in Table

…

Figures - uploaded by Fredrik Manne

Content may be subject to copyright.

Content uploaded by Fredrik Manne

Content may be subject to copyright.

SIAM J. SCI. COMPUT.c

2007 Society for Industrial and Applied Mathematics

Vol. 29, No. 3, pp. 1042–1072

NEW ACYCLIC AND STAR COLORING ALGORITHMS

WITH APPLICATION TO COMPUTING HESSIANS∗

ASSEFAW H. GEBREMEDHIN†, ARIJIT TARAFDAR†, FREDRIK MANNE‡,AND

ALEX POTHEN†

Abstract. Acyclic and star coloring problems are specialized vertex coloring problems that arise

in the eﬃcient computation of Hessians using automatic diﬀerentiation or ﬁnite diﬀerencing, when

both sparsity and symmetry are exploited. We present an algorithmic paradigm for ﬁnding heuristic

solutions for these two NP-hard problems. The underlying common technique is the exploitation of

the structure of two-colored induced subgraphs. For a graph Gon nvertices and medges, the time

complexity of our star coloring algorithm is O(nd2), where dk, a generalization of vertex degree,

denotes the average number of distinct paths of length at most kedges starting at a vertex in G.

The time complexity of our acyclic coloring algorithm is larger by a multiplicative factor involving

the inverse of Ackermann’s function. The space complexity of both algorithms is O(m). To the best

of our knowledge, our work is the ﬁrst practical algorithm for the acyclic coloring problem. For the

star coloring problem, our algorithm uses fewer colors and is considerably faster than a previously

known O(nd3)-time algorithm. Computational results from experiments on various large-size test

graphs demonstrate that the algorithms are fast and produce highly eﬀective solutions. The use of

these algorithms in Hessian computation is expected to reduce overall runtime drastically.

Key words. acyclic coloring, star coloring, computation of Hessians, automatic diﬀerentiation,

ﬁnite diﬀerences

AMS subject classiﬁcations. 05C15, 05C85, 68R10

DOI. 10.1137/050639879

1. Introduction. The cost involved in computing a Jacobian or a Hessian ma-

trix using automatic diﬀerentiation or ﬁnite diﬀerencing is proportional to the number

of columns in the matrix. When the target derivative matrix Ais sparse, huge sav-

ings in runtime can be attained by ﬁrst computing a compressed matrix B, where

each column of Bis a sum of several structurally “independent” columns of A, and

then recovering the nonzero elements in Afrom B. Thus, an important component

of the eﬃcient computation of a sparse derivative matrix, whose sparsity pattern is

known a priori, is the problem of partitioning the columns of the matrix into the

fewest groups, each consisting of structurally independent columns.

The appropriate notion of structural independence—and the corresponding par-

titioning problem—depends on whether the derivative matrix to be computed is sym-

metric or nonsymmetric, and on whether the nonzero entries are to be recovered

from the compressed matrix directly or via substitution. Conceptually, direct recovery

corresponds to solving a diagonal system of equations, and recovery via substitution

corresponds to solving a set of simple triangular systems of equations. Several pre-

vious studies have shown that these matrix partitioning problems can be modeled

and eﬀectively solved using specialized graph coloring problems and their algorithms

[9, 12, 13, 14, 22, 27]. See our earlier work [17] for a comprehensive review.

∗Received by the editors September 8, 2005; accepted for publication (in revised form) November

7, 2006; published electronically May 10, 2007. This work was supported by the U.S. National Science

Foundation through grants ACI 0203722 and CCF 0515218, by DOE grant DE-FC02-01ER25476 and

by subcontract B 533520 from the Lawrence Livermore National Laboratory.

http://www.siam.org/journals/sisc/29-3/63987.html

†Department of Computer Science and Center for Computational Science, Old Dominion Univer-

sity, Norfolk, VA 23529-0162 (assefaw@cs.odu.edu, tarafdar@cs.odu.edu, pothen@cs.odu.edu).

‡Department of Informatics, University of Bergen, N-5020 Bergen, Norway (fredrikm@ii.uib.no).

1042

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1043

This paper is concerned with the design and implementation of eﬀective algo-

rithms for star and acyclic coloring, models for eﬃcient computation of Hessians.

In a star coloring, adjacent vertices receive distinct colors (distance-1 coloring), and

every path on four vertices (P4) uses at least three colors. In an acyclic coloring,

adjacent vertices receive distinct colors, and every cycle uses at least three colors.

Coleman and Mor´e [13] showed that star coloring models the partitioning problem

that occurs in the computation of a Hessian via a direct method; Coleman and Cai [9]

showed that the corresponding model in a substitution-based computation is acyclic

coloring. Both problems, whose ob jectives are to minimize the number of colors used,

are known to be NP-hard [9, 13]. As we show in section 2, these problems are also

hard to approximate.

Star coloring as a model for direct Hessian computation was preceded by two

less accurate models: the distance-2 coloring model due to McCormick [27] and the

restricted star coloring model due to Powell and Toint [28]. Similarly, acyclic coloring

as a model for Hessian computation via substitution was preceded by a less accurate

model, triangular coloring, due to Coleman and Mor´e [13]. These models and the

interrelationships amongst them are discussed in detail in section 5.

In a previous work [17], we suggested a simple heuristic algorithm for the star

coloring problem. In each step of that algorithm, the color of a vertex is determined

by examining every P4starting at the vertex. The time complexity of the algorithm

for a graph on nvertices is O(nd3), where dkdenotes the average number of paths

of length at most kedges starting at a vertex. A similar O(nd2)-time algorithm for

restricted star coloring, originally due to Powell and Toint [28], exists. However, when

used for solving the star coloring problem, the latter algorithm uses signiﬁcantly more

colors than the O(nd3)-time algorithm mentioned earlier.

Although Coleman and Cai [9] identiﬁed acyclic coloring as the model for comput-

ing a Hessian via a substitution method, the algorithm they suggested is an algorithm

for triangular coloring. In their approach, ﬁrst a graph Gis constructed by adding

edges to the input graph G, and then a distance-1 coloring algorithm is applied on

the graph G. A similar approach for triangular coloring was taken earlier by Cole-

man and Mor´e [13]. An optimal solution for the triangular coloring problem is not

necessarily an optimal solution for the acyclic coloring problem. Moreover, working

with a denser graph Gis undesirable for large-scale problems, since the approach

may require excessive storage space and lead to memory overﬂow. As for the acyclic

coloring problem, we are not aware of any previous practically relevant algorithm.

In this paper, we present new algorithms for both the acyclic and the star coloring

problems. The common paradigm underlying the two algorithms is the exploitation

of the structure of two-colored induced subgraphs. In the acyclic coloring case, every

subgraph induced by any two color classes is a collection of tree s; the corresponding

structure in the star coloring case is a collection of stars. In both cases, the two-

colored induced subgraphs partition the edge set of the input graph. Our algorithms

are based on these observations.

The algorithms presented in this paper are greedy—they progressively extend a

partial coloring by processing one vertex at a time. In each step, a vertex is assigned

the smallest (assuming colors are positive integers) allowable color with respect to

the current partial coloring, and the color assigned to the vertex remains unchanged

during the course of the algorithm. Greedy coloring heuristics stand in contrast

to iterative, local improvement heuristics, where the color assigned to a vertex may

change several times during the course of an algorithm in an attempt to ultimately

1044 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

reduce the number of colors used. Greedy coloring heuristics are in general fast, which

makes them attractive approaches for solving subproblems in scientiﬁc computing

applications, such as the case in this paper. An important issue in greedy coloring

algorithms is the order in which vertices are processed. We consider several eﬀective

ordering techniques in this paper.

A key component of our new algorithms is the eﬃcient management of two-colored

induced subgraphs. For the acyclic coloring case, the disjoint-set data structure is used

for this purpose; for the star coloring case, a simpler data structure is shown to be

suﬃcient. In addition, a few intricate techniques which yet use simple data structures

are employed to avoid two-colored cycles in the acyclic coloring case, and two-colored

P4’s in the star coloring case. The time complexity of the star coloring algorithm

is O(nd2), and that of the acyclic coloring algorithm is larger by a multiplicative

factor involving the inverse of Ackermann’s function. The space complexity of both

algorithms is O(m), where mdenotes the number of edges in a graph. The acyclic

coloring algorithm will be presented (in section 3) before the star coloring algorithm

(in section 4) to reﬂect that the former in an algorithmic sense corresponds to the

more general case.

We have implemented the new acyclic and star coloring algorithms presented in

this paper as well as several previously known algorithms for related coloring problems;

the latter algorithms will be discussed in section 6. Results from experiments carried

out on various large-size test graphs show that the new algorithms are highly eﬀective,

both in terms of the number of colors used and in practical runtime. On the test graphs

used, the number of colors used by the acyclic coloring algorithm is observed to be

very close to (within a factor of two) the coloring number of a graph—the number

of colors used by a greedy distance-1 coloring algorithm that employs a smallest last

vertex ordering. The coloring number is an important graph parameter with close

relationship to graph notions such as degeneracy, maximal core, and arboricity. The

runtime of the acyclic coloring algorithm is observed to be nearly the same as that

of a greedy distance-2 coloring algorithm. In comparison with our implementation

of an algorithm for triangular coloring—the previously known approach for Hessian

computation via a substitution method—the acyclic coloring algorithm runs faster,

uses fewer colors, and requires less memory and storage space. In agreement with time

complexity analyses, our new star coloring algorithm is observed to be signiﬁcantly

faster than our previous O(nd3)-time algorithm—it oﬀers a relative speedup as high

as ten on the test graphs used. Moreover, the new algorithm uses slightly fewer colors.

In comparison with the restricted star coloring algorithm of Powell and Toint [28],

which is of the same time complexity as the new star coloring algorithm proposed here,

our algorithm signiﬁcantly reduces the number of colors used. In terms of observed

runtime, the new star coloring algorithm is slower than the restricted star coloring

algorithm by only a small factor, typically two.

2. Background.

2.1. Preliminary concepts and notations. Throughout this paper, we con-

sider simple, undirected graphs without loops or parallel edges. Two classes of

(sub)graphs that we have need for are trees and stars. A tree is a connected graph

without any cycle, and a forest is a collection of trees. A star is a complete bipartite

graph in which one of the vertex sets consists of a single vertex. In a star with at

least two edges, we call the vertex at which the edges meet the hub of the star.

We denote an edge whose endpoints are the vertices vand wby vw. Two distinct

vertices vand ware distance-kneighbors if a shortest path connecting them consists

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1045

Table 2.1

Characterization of subgraphs induced by a pair of color classes. The coloring variants are

listed in an increasing order of restrictiveness; correspondingly, the two-colored induced subgraphs

get increasingly less connected as one goes from top to bottom in the table.

Coloring variant A two-colored induced subgraph

Distance-1 coloring A bipartite graph

Acyclic coloring A collection of trees

Star coloring A collection of stars

Distance-2 coloring A distance-2 matching

of at most kedges. We denote the set of distance-kneighbors of a vertex v, excluding

the vertex vitself, by Nk(v). As usual, the degree of a vertex is the number of edges

incident on it. We extend this notion and deﬁne the degree-kof a vertex v, denoted

by dk(v), to be the number of distinct paths of length at most kedges starting at v.

(Two paths are distinct if they diﬀer in at least one edge.) We denote the average

degree-kin a graph by dk. Note that degree-1 is synonymous with degree, and that

the degree-2 of a vertex vis equal to the number of edges that are incident either on

vor on vertices adjacent to v. Note also that in general dk(v)≥|Nk(v)|.

Adistance-k coloring of a graph G=(V,E) is a mapping φ:V→{1,2,...,p}

such that whenever vertices vand ware distance-kneighbors, φ(v)=φ(w). By

deﬁnition, two distance-kneighboring vertices are distance-kneighbors for k>k,

and a distance-kcoloring is a distance-k coloring for k <k. The kth power of a

graph G=(V, E) is the graph Gk=(V,F), where vw ∈Fwhenever vertices vand w

are distance-kneighbors in G. A mapping φis a distance-kcoloring of a graph Gif

and only if φis a distance-1 coloring of the graph Gk.Astar coloring of a graph is a

distance-1 coloring with the additional requirement that every path on four vertices

uses at least three colors.Anacyclic coloring of a graph is a distance-1 coloring with

the further restriction that every cycle uses at least three colors. In a distance-k,a

star, an acyclic, or any other coloring variant on a graph, a color class is a set of

vertices that have received the same color.

The names star and acyclic coloring are due to the structures of subgraphs induced

by any two color classes. In an acyclically colored graph, a subgraph induced by any

two color classes is clearly a collection of trees, and thus acyclic. Similarly, in a star-

colored graph, a subgraph induced by any two color classes is a collection of stars.

It is easy to see that a two-colored connected induced subgraph here ought to be a

star, for otherwise there would exist a two-colored path on four vertices, violating a

condition of star coloring. These structures are used in the design of the acyclic and

star coloring algorithms to be presented in the next two sections. Although not so

useful for algorithm design, the structures of two-colored induced subgraphs in the

cases of distance-1 and distance-2 coloring are also interesting. A coloring of a graph

is a distance-1 coloring if and only if every two-colored induced subgraph is a bipartite

graph. In the case of a distance-2 coloring, every two-colored induced subgraph is a

set of edges in which a pair of edges is apart by a path of length at least two edges

in the original graph, a structure appropriately called a distance-2matching. These

observations on two-colored induced subgraphs are summarized in Table 2.1.

2.2. Complexity results. The objective of an optimization problem associated

with each of the coloring variants we have introduced is to minimize the number of

colors used. Lin and Skiena [26] proved that the distance-kgraph coloring problem

is NP-hard for every ﬁxed integer k≥1. Coleman and Mor´e [13] showed that the

1046 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

star coloring problem is NP-hard even if the graph is bipartite. The acyclic coloring

problem was proven to be NP-hard by Coleman and Cai [9].

The distance-1 coloring problem is also known to be hard to approximate: for a

graph on nvertices, there exists δ>0 such that approximating distance-1 coloring

within O(nδ) is NP-hard (Corollary 10.1 in [5]). We use this to prove similar inap-

proximability results for acyclic and star coloring. We ﬁrst introduce some notation.

The distance-k chromatic number of a graph G, denoted by χk(G), is the least number

of colors required to distance-kcolor G. The acyclic and star chromatic numbers are

deﬁned analogously, and are denoted by χaand χs, respectively.

Theorem 2.1. There exists ﬁxed >0such that it is NP-hard to approximate

the acyclic chromatic number χa(G)of a graph Gwithin O(n), where nis the number

of vertices in G.

Proof. The reduction is from distance-1 coloring, and it is a small modiﬁcation

of the reduction for proving the NP-completeness of acyclic coloring in Coleman and

Cai [9].

Assume that we are given a graph G=(V, E) which we are required to distance-1

color; from Gwe create a new graph H=(W, F ) for acyclic coloring as follows. Let

Δ denote the maximum degree of a vertex in G. Each edge uv ∈Eis replaced by

a subgraph (a gadget) in Hconsisting of vertices u,v, and Δ new vertices w1,w2,

...,wΔ; and edges join vertices uand vto every one of the Δ vertices wi.

We claim that Ghas a distance-1 coloring using pcolors if and only if Hhas an

acyclic coloring using pcolors, where 3 ≤p≤Δ.

Suppose Ghas a distance-1 coloring with pcolors. In any edge uv ∈E, the

vertices uand vreceive distinct colors. In the graph H, assign uthe same color as

u, and vthe same color as v. Since each vertex wiis adjacent only to the pair of

vertices uand v, it can be assigned any one of the p−2 colors distinct from the two

colors that have been used for uand v; note that the quantity p−2 is positive (there

exists a color for the vertex wi) since pis assumed to be at least three. The coloring

of Hobtained by “processing” every edge in Gin this fashion is acyclic. To see this,

notice that every cycle in Hcontains one uvertex, one vvertex, and at least one

wivertex from the gadget we created for some edge uv ∈E, and these three vertices

receive distinct colors in H. Hence every cycle in Huses at least three colors, and

one direction of the claim follows.

Now consider an acyclic coloring of Hwith pcolors, again where 3 ≤p≤Δ.

We claim that the vertices uand vcorresponding to an edge uv ∈Ereceive

distinct colors. Suppose not. Then every pair of vertices wiand wjin this gadget

would have to be colored with distinct colors, or else there would be a cycle u,wi,v,

wj,uwhich would be colored with two colors. But then this subgraph of Hwould

need Δ + 1 colors, contradicting the choice of p.

Thus in every gadget the vertices uand vreceive distinct colors in the acyclic

coloring, and these same colors can be assigned to the vertices uand vto obtain a

distance-1 coloring of Gusing at most pcolors.

There exists δ>0 such that approximating distance-1 coloring within O(nδ)is

NP-hard [5], where n=|V|, the number of vertices in G. Let Ndenote the number

of vertices in H. Then N=n+Δ|E|=O(n+n3). Thus it follows that there exists

an >0 such that approximating acyclic coloring within O(N) is NP-hard.

Under stronger complexity conjectures (NP =ZPP, the class of problems with

polynomial time zero-error randomized algorithms), we can prove that it is hard to

approximate acyclic coloring within O(n1/3−), using a technique similar to that in

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1047

Agnarsson and Halld´orsson [3].

We can also prove inapproximability results for star coloring, using the same

gadget used in the acyclic coloring proof. We omit the details.

Theorem 2.2. There exists ﬁxed >0such that it is NP-hard to approximate

the star chromatic number χs(G)of a graph Gwithin O(n).

2.3. Related work. Some of the coloring problems introduced in section 2.1

have been studied by other authors, from an application or a pure graph-theoretic

point of view. Krumke, Marathe, and Ravi [24] studied the role of distance-2 coloring

in channel assignment problems in radio networks. Kumar et al. [25] have undertaken

a related study on scheduling problems in wireless networks. In these two studies,

several approximation and distributed algorithms for distance-2 coloring of disk and

other special classes of graphs have been suggested. Balakrishnan et al. [6] showed

that the problem of maximizing concurrent transmissions in certain ad-hoc wireless

networks can be modeled as a maximum cardinality distance-2 matching problem, a

problem related to distance-2 coloring as discussed at the end of section 2.1. The

distance-2 matching problem was ﬁrst studied and shown to be NP-hard by Stock-

meyer and Vazirani [29].

Agnarsson and Halld´orsson [2] have studied distance-kcoloring of planar graphs,

and a similar study of chordal graphs is available in Agnarsson, Greenlaw, and

Halld´orsson [1]. Acyclic colorings were ﬁrst deﬁned and studied by Gr¨unbaum [21],

who asked whether the acyclic chromatic number χa(G) of a graph Gsatisﬁes the in-

equality χa(G)≤Δ + 1. A negative answer to this question was given by Erd¨os, who

proved probabilistically the existence of graphs where χa(G)≥Δ4/3−[23]. Borodin

[7] proved that every planar graph can be acyclically colored using at most ﬁve col-

ors. Star coloring has recently been studied by Albertson et al. [4], who showed that

every acyclic coloring that uses qcolors can be reﬁned to a star coloring with at most

2q2−qcolors. These authors also prove that planar graphs have star colorings with

at most 20 colors, and they exhibit an example in which 10 colors are required. For

more pointers to theoretical works on distance-k, acyclic, and star coloring, see the

monograph by Jensen and Toft [23] and our earlier work [17]. The focus of this paper

is on practical, fast, suboptimal algorithms for acyclic and star coloring.

3. The new acyclic coloring algorithm.

3.1. Overview. Suppose a subset U⊂Vof the vertices of a graph G=(V, E)

has been colored satisfying the two requirements of an acyclic coloring. As discussed

in section 2.1, every subgraph of Ginduced by a set of vertices assigned any two

colors is a forest. Furthermore, every edge in Gboth of whose endpoints are in the

set Uof colored vertices belongs to a unique tree (in a two-colored forest). Therefore,

the union of all possible two-colored forests induced by the vertices in the set Uis a

collection of edge-disjoint trees. Our algorithm maintains such a collection of trees

eﬃciently and extends the partial coloring by processing one vertex at a time.

We use the illustration in Figure 3.1 to provide an overview of the algorithm.

The graph shown at the left is the input graph Gwhich has been partially colored,

and vis the only vertex that remains to be colored. The right half of Figure 3.1

shows two-colored forests induced by the rest of the vertices of the graph. The input

graph currently uses four colors, red, blue, green, and yellow (to ease black-and-white

viewing, each colored vertex in the ﬁgure has been labeled with a letter reﬂecting the

color used). Thus there could exist at most six two-colored induced forests. Out of

these, the subgraph induced by green and yellow vertices is a graph whose edge set

1048 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

BGBR

YBR

YRG

RYBB

BGB

Fig. 3.1.Left: A partial acyclic coloring of a graph; vertex vis the only vertex that remains to

be colored. Every other vertex has been assigned one of the four colors red, blue, green, or yellow;

the label on each vertex reﬂects the color used. Right: All of the nonempty, two-colored induced

subgraphs; each subgraph is a forest.

is empty. The remaining ﬁve forests are depicted in the ﬁgure. Each of the red-blue

and blue-green forests consists of two trees, while each of the other forests consists of

a single tree. In the example, all of the two-colored trees are incident on the vertex v.

The task of choosing a valid color for the vertex vcan be divided into two natural

parts. The ﬁrst part involves excluding the colors of each of the distance-1 neighbors

of vfrom its set of allowable colors. In our illustration, this results in the exclusion of

the colors red and blue from the allowable set of colors for v. The second part involves

ensuring that a color chosen for the vertex vdoes not lead to a two-colored cycle in

the graph G. Since the vertex v(and, in general, any vertex in a graph) could be part

of many, arbitrarily long cycles, the second part is more diﬃcult than the ﬁrst, which

is conﬁned to the distance-1 neighborhood of vin G.

The key point in our algorithm is the detection of a potential two-colored cycle

that involves the vertex vin an eﬃcient way. In particular, the cycle detection is

done eﬃciently by examining two-colored trees incident on v, a task accomplished

by visiting the distance-2 (and not any further) neighborhood of vonce. Due to the

distance-1 coloring requirement in an acyclic coloring, an odd cycle would necessarily

use at least three colors. Thus in avoiding two-colored cycles, we will be concerned

only with even cycles.

A two-colored cycle that involves the vertex vcan arise only if the following two

conditions are satisﬁed:

1. the vertex vis adjacent to two vertices in a two-colored tree, and

2. these adjacent vertices are of the same color.

Note that condition 2 restricts the concern to only even cycles, for if the vertex vis

adjacent to two vertices with diﬀerent colors in a two-colored tree, the corresponding

cycle involving vis of odd length and thus uses three colors, posing no concern. For

example, in Figure 3.1, though vertex vis adjacent to two vertices in the upper red-

blue tree, vwill not induce a cycle in that tree since the adjacent vertices have distinct

colors.

Now let Fvdenote a collection of two-colored trees that are incident on the vertex

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1049

vand satisfy the aforementioned two conditions. For each tree Tv∈Fv, let c1denote

the color of the vertices adjacent to the vertex v, and let c2be the other color used

in the tree Tv. Then, for each tree Tv, we exclude the color c2from the allowable set

of colors for the vertex v, for otherwise vmight be assigned the color c2, creating a

two-colored cycle involving v. In Figure 3.1 this process would lead to the exclusion

of the color yellow from the allowable set of colors for the vertex v, for otherwise a

two-colored cycle could arise in the red-yellow tree.

When both the ﬁrst and the second tasks discussed above are completed for the

vertex v, we know exactly the colors that cannot be assigned to v, and thus we can

choose the smallest allowable color for v.

To complete the overview of the algorithm, we now need to update the collection

of two-colored trees to reﬂect the fact that the vertex vhas been colored. The update

can be performed in two stages. In the ﬁrst stage, edges vw that connect the vertex

vand an already colored vertex ware grouped into two-colored stars. In the second

stage, these stars are merged with existing two-colored trees as needed. A point that

should be noted here is that the second stage of the update may cause previously

disconnected trees in a two-colored forest to be connected. For example, if the vertex

vin Figure 3.1 receives the color green, then the two (currently disconnected) trees in

the green-blue forest need to be merged. Just as the task of choosing a color for the

vertex v, we will show that the task of updating the collection of two-colored trees

after vis assigned a color can be accomplished by visiting the distance-2 neighborhood

of vonce.

3.2. Detailed description. For an eﬃcient implementation of the algorithm

sketched in section 3.1, we use the disjoint-set data structure to maintain the collection

of two-colored trees. For a discussion of the disjoint-set data structure, see, e.g., [15].

A set in our context corresponds to a two-colored tree, and an element of a set

corresponds to an edge. The disjoint-set data structure supports three operations:

makeSet,find, and union. Operation makeSet(e) creates a new set consisting

of the element e. Operation find(e) returns the representative element of the set to

which element ebelongs, and operation union(e1,e

2) merges the two sets S(e1) and

S(e2) to which elements e1and e2belong.

Further, in our implementation of the sketched algorithm, we use the two arrays

color and forbiddenColors for coloring purposes. The vertex-indexed array color is used

to store the colors assigned to vertices. In particular, color[u]=cindicates that the

vertex uhas been assigned the color c, a value that once set remains unchanged during

the course of the algorithm. The color-indexed array forbiddenColors is used to mark

the colors that are forbidden to a speciﬁc vertex. In particular, forbiddenColors[c]=u

indicates that the color cis impermissible for the vertex u; otherwise the color cis a

candidate color for the vertex u. Unlike the array color, the value stored at a particular

index in the array forbiddenColors varies during the course of the algorithm, an issue

that will be clear in a moment.

3.2.1. The main routine. The driver routine in our acyclic coloring algorithm

is outlined in Algorithm 3.1 and the current section is devoted to its discussion. The

subroutines called by Algorithm 3.1 will be presented in section 3.2.2.

Leaving the initialization of various data structures in line 2 aside for a moment,

the body of the outermost for-loop of Algorithm 3.1 consists of two parts. The ﬁrst

part (lines 4–10) is concerned with ﬁnding a color for the vertex v, the current vertex

in the iteration over the vertex set V. The second part (lines 11–16) deals with

updating the collection of two-colored trees incident on the vertex v.

1050 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

Algorithm 3.1. Input: a graph G. Eﬀect: ﬁnds an acyclic coloring of G.

1: procedure AcyclicColoring(G=(V, E))

2: Initialize data structures

3: for each v∈Vdo

4: for each colored w∈N1(v)do

5: forbiddenColors[color[w]] ←v

6: for each colored w∈N1(v)do

7: for each colored x∈N1(w)do

8: if forbiddenColors[color[x]] =vthen

9: preventCycle(v, w, x)

10: color[v]←min{i>0,forbiddenColors[i]=v}

11: for each colored w∈N1(v)do grow two-colored stars around the vertex v

12: growStar(v, w)

13: for each colored w∈N1(v)do

14: for each colored x∈N1(w), x=vdo

15: if color[x]=color[v]then

16: mergeTrees(v, w, x)merge trees T1vw and T2wx if T1=T2

Finding a color. Pursuant to the ﬁrst requirement of acyclic coloring, the for-loop

in lines 4–5 marks the colors of the distance-1 neighbors of the vertex vas forbidden

colors to v. The subsequent for-loop in lines 6–9 forbids additional colors that can

result in two-colored cycles involving the vertex v, to enforce the second requirement

of acyclic coloring. Here, a subset of the two-colored trees incident on the vertex

v—through edges wx, where wis an already colored vertex adjacent to vand xis an

already colored vertex adjacent to w—is examined. The routine preventCycle (to

be discussed in section 3.2.2) checks whether the vertex vis connected to the two-

colored tree containing the edge wx via two edges. If this turns out to be the case, the

routine forbids the color of the vertex xfrom being a candidate color for the vertex

vby updating the global array forbiddenColors. The if-test in line 8 ensures that a

two-colored tree incident on the vertex vis investigated only if there is a potential for

a two-colored cycle to arise in the same tree; recall that the upper red-blue tree in

Figure 3.1 need not be examined while coloring the vertex vsince it cannot lead to

a two-colored cycle involving v. The if-test also avoids an unnecessary computation:

once a color cin a two-colored tree Tis forbidden to the vertex vto prevent a potential

two-colored cycle C(that involves Tand v), then the tree Tneed not be investigated

any further in the determination of a color for the vertex v, for the existence of any

other cycle C(that also involves Tand v) would only result in the exclusion of the

same color c.

Once all the colors that are not allowed for the vertex vare marked in the array

forbiddenColors, the array is scanned sequentially in search of the ﬁrst index in which

a value other than vis stored. This corresponds to the smallest allowable color for

the vertex vand is chosen as the color for v(line 10). The array forbiddenColors is

initialized at the beginning of the algorithm with some value outside the set V. This

array can be used without being reinitialized while coloring another vertex, since in

each iteration a unique vertex is colored (i.e., the identity of a vertex serves as a

“time-stamp”).

Updating the collection of two-colored trees. The remainder of Algorithm 3.1 is

concerned with creating and updating two-colored trees incident on the vertex v. This

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1051

task is done in two stages, reﬂected by the two for-loops in lines 11–12 and lines 13–16.

In the ﬁrst stage (the ﬁrst for-loop), two-colored trees consisting only of edges incident

on the vertex vare grown. More precisely, the structures grown are two-colored stars,

and the hub of each star is the vertex v. To this end, for every edge vw that connects

the vertex vwith an already colored vertex w, (1) a new two-colored star consisting

only of the edge vw is created, and (2) if there is already a similarly two-colored star

Swith one or more edges of the form vw(hence color[w]=color[w]), then the newly

created star (edge vw) is merged with S. Both of these items are achieved by the call

to the routine growStar in line 12; the details of this routine will be discussed in

section 3.2.2.

At the end of the for-loop in lines 11–12, every edge vw connecting vand an

already colored vertex whas been placed in a star in the right two-colored forest.

There may now be a need for another level of merges in a forest, constituting the

second stage of the data structure update. Such merges involve stars grown at the

current iteration, and trees grown in iterations prior to the coloring of v. The piece

of code in lines 13–16 determines whether there is a need for such merges, and if so,

updates the disjoint-set data structure appropriately. In particular, for each colored

vertex wadjacent to v, and each colored vertex xadjacent to wwith color[x]=color[v],

the routine mergeTrees is called in line 16 with the arguments v,w, and x. The

routine checks whether the tree that contains the edge vw is the same as the tree that

contains the edge wx; if the two trees turn out to be diﬀerent, then they are merged.

To illustrate the points discussed in the previous two paragraphs without revealing

the details of the routines growStar and mergeTrees, consider the example in

Figure 3.1. Suppose the vertex vhas been assigned the color green. At the end of

the for-loop in lines 11–12, two stars are grown around the vertex v. These are the

star (path) B–v–Band the star R–v–R, where Band Rstand for a blue and a red

neighbor of the vertex v, respectively. In the subsequent for-loop, the star R–v–Ris

merged with the only tree in the current red-green forest. In the same loop, the star

B–v–Bis ﬁrst merged with one of the two disconnected trees in the current blue-green

forest, and then the resulting tree is further merged with the other tree. As a result,

the blue-green forest now becomes a single tree (to be more precise, a path on eight

vertices).

3.2.2. The subroutines. Let us now take an inside look at the routines pre-

ventCycle,growStar, and mergeTrees. In the discussion below, we will refer

to the actions performed on the vertex vin lines 4–16 of Algorithm 3.1 as processing

vertex v.

Avoiding two-colored cycles. The routine preventCycle (see Algorithm 3.2)

takes three vertices v,w, and xas input parameters: vis the vertex currently being

processed, wis a colored vertex adjacent to v, and xis a colored vertex adjacent to

w. The primary task of the routine is to forbid the color of the vertex xfrom being

chosen as a color for the vertex v, if it could lead to a two-colored cycle in the input

graph. This would happen if the vertex vis incident on the two-colored tree to which

the edge wx belongs via two edges. An auxiliary task of the routine is to update an

associated data structure.

To determine whether or not a vertex vis connected to a two-colored tree via

two edges in an eﬃcient way, we mark each two-colored tree containing an edge wx

incident on the vertex v. While doing so, if a tree is marked twice, then we know

that the vertex vis incident on the tree via two edges. We use an edge-indexed array

called ﬁrstVisitToTree for this purpose. An index of the array ﬁrstVisitToTree is the

1052 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

Algorithm 3.2. Input: the vertex vbeing colored, a colored neighbor wof v, and a

colored neighbor xof w. Associated data structure: ﬁrstVisitToTree[e]=(p, q) indicates that

ﬁrstVisitToTree[e] was last set when processing vertex pand at that time the tree represented

by the edge ewas ﬁrst visited through the edge pq. Eﬀect: (1) updates the array ﬁrstVisit-

ToTre e , or (2) excludes color[x] from the allowable set of colors for vif it could otherwise

lead to a two-colored cycle involving v.

1: procedure preventCycle(v, w, x)

2: e←find(wx)edge wx belongs to the 2-colored tree Trepresented by edge e

3: (p, q)←ﬁrstVisitToTree[e]

4: if p=vthen Tis being visited from vertex vfor the ﬁrst time

5: ﬁrstVisitToTree[e]←(v, w)

6: else if q=wthen Tis connected to vertex vvia at least two edges

7: forbiddenColors[color[x]] ←v

ID of the representative edge of a two-colored tree. A vertex pair (p, q) is stored at

index eof the array to indicate that ﬁrstVisitToTree[e] was last set when processing

the vertex p, and at that time the two-colored tree whose representative edge is

ewas ﬁrst visited through the edge pq.InﬁrstVisitToTree and other similar data

structures used elsewhere in this paper, the word ﬁrst refers to the context in which

the neighborhood of the vertex pbeing processed is visited. The array ﬁrstVisitToTree

is globally declared and initialized at the beginning of Algorithm 3.1 with a pair of

values not corresponding to any vertex. As in the array forbiddenColors, the “marker”

vertex pin the array ﬁrstVisitToTree serves as a time-stamp. Hence, this array need

not be reinitialized while processing another vertex.

The routine preventCycle begins by getting the two-colored tree Tto which

the edge wx belongs. Let ebe the representative edge of Treturned by the find

operation. In line 3, the vertex pair (p, q) stored at index eof the array ﬁrstVisitToTree

is retrieved. If p=v, then the implication is that the tree Tis being visited from

the vertex vfor the ﬁrst time. To reﬂect this, the vertex pair (v,w) is stored at

ﬁrstVisitToTree[e]. If, on the other hand, p=v, then there are two possibilities,

corresponding to the cases q=wand q=w. The case q=wimplies that the tree

Thas previously been visited from the vertex v, but the visit was through the same

edge vw. This occurs if the vertex wis adjacent to at least two vertices (besides the

vertex v) in the tree T, and does not cause the color of the vertex xto be a forbidden

color for the vertex v. The second case, q=w, implies that the tree Thas previously

been visited from the vertex vthrough an edge other than vw. In this case, setting

color[v]=color[x] would create a two-colored cycle, and therefore color[x] is forbidden

from being a color for the vertex v.

Updating the collection of two-colored trees. As discussed earlier, the ﬁrst stage of

updating the collection of two-colored trees incident on the vertex vbeing processed

involves growing two-colored stars around the vertex v. This is achieved by the calls

growStar(v, w) made in line 12 of Algorithm 3.1 for each colored vertex wadjacent

to the vertex v.

To achieve the desired merging of edges into two-colored stars in an eﬃcient

manner, we maintain a color-indexed array, ﬁrstNeighbor, in which we store pairs

of vertices. The array is used to determine whether or not a particular color is

encountered for the ﬁrst time while visiting the distance-1 neighborhood of a particular

vertex. Speciﬁcally, we let ﬁrstNeighbor[c]=(p, q) to indicate that while processing

the vertex p, the vertex qwas the ﬁrst-to-be-encountered neighbor having the color

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1053

Algorithm 3.3. Input: the vertex vbeing processed and a colored neighbor wof v.

Associated data structure: ﬁrstNeighbor[c]=(p, q) indicates that the vertex qis the ﬁrst-to-

be-encountered neighbor of the vertex phaving the color c. Eﬀect: (1) creates a new two-

colored tree Tvw consisting only of the edge vw, and (2) either updates the array ﬁrstNeighbor

or merges the tree Tvw with a two-colored star being grown around v.

1: procedure growStar(v, w)

2: makeSet(vw)create a new tree Tvw consisting only of edge vw

3: (p, q)←ﬁrstNeighbor[color[w]]

4: if p=vthen a neighbor of vwith color color[w] encountered for 1st time

5: ﬁrstNeighbor[color[w]] ←(v, w)

6: else merge Tvw with a two-colored star being grown around v

7: e1←find(vw); e2←find(pq)

8: union(e1,e

Algorithm 3.4. Input: the vertex vbeing processed, a colored neighbor wof v, and a

neighbor xof whaving the same color as v. Eﬀect: merges the tree containing the edge vw

with the tree containing the edge wx, if the two trees are distinct.

1: procedure mergeTrees(v, w, x)

2: e1←find(vw); e2←find(wx)

3: if e1=e2then

4: union(e1,e

c. As the array ﬁrstVisitToTree discussed earlier, the array ﬁrstNeighbor is globally

declared and initialized in Algorithm 3.1, and can be used without reinitialization

while iterating through the vertex set V.

The routine growStar (Algorithm 3.3) begins by creating a two-colored tree

(star) consisting only of the edge vw, a task accomplished by the call makeSet(vw).

The routine then retrieves the vertex pair (p, q) stored at the index color[w]ofthe

array ﬁrstNeighbor.Ifp=v, then the implication is that the color used by the vertex

wis encountered for the ﬁrst time while visiting the vertices adjacent to the vertex

v. To signify this, the vertex pair (v,w) is stored at the index color[w] of the array

ﬁrstNeighbor. Otherwise, if p=v, then the implication is that a vertex q(distinct

from w) that has the same color as the vertex whas already been encountered while

visiting the neighbors of the vertex v. In this case, the edge vw—the two-colored star

just created—is merged with the (growing) two-colored star containing the edge pq.

The second stage of the update involves merging the two-colored stars just grown

with existing two-colored trees as needed. This is achieved by the calls to mergeTrees

made in line 16 of Algorithm 3.1. Each such call is made with the arguments v,w,

and x, where vis the vertex currently being processed, wis a colored vertex adjacent

to v, and xis a colored vertex adjacent to wwith color[x]=color[v]. The routine

mergeTrees (Algorithm 3.4) ﬁrst queries for the representative edges e1and e2of

the two-colored trees T1and T2to which the edges vw and wx, respectively, belong.

If e1=e2, then it means that the two trees T1and T2are distinct, and therefore they

need to be merged. Otherwise no further action needs to be taken.

3.3. Complexity. The outermost for-loop in Algorithm 3.1 consists of n=|V|

iterations. In each iteration, the two visits to the distance-2 neighborhood of the

vertex vbeing processed, i.e., the for-loops in lines 6–9 and 13–16, constitute the

dominant part of the computation. On each visited neighbor, a constant number

1054 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

of operations is performed. Thus, disregarding the time spent on updating and

querying the disjoint-set data structure, the time complexity of the algorithm is

O(v∈Vd2(v)) = O(nd2).

The disjoint-set data structure in our context contains at most mdistinct ele-

ments, the edges of G. Thus there can be at most m−1union calls and mmakeSet

calls. Further, there can be at most m=2n·d1+3n·d2≤5n·d2find calls, since the

routine find is called once in the procedure PreventCycle and twice in each of the

procedures growStar and mergeTrees. In our implementation of the disjoint-set

data structure, we use path compression for the find operation and union-by-size for

the union operation to achieve the best runtime performance. When these techniques

are used, the average time spent on each such operation is at most α(m,m), where α

is the inverse of Ackermann’s function [15]. Thus the overall time complexity of our

acyclic coloring algorithm is O(nd2·α(m,m)). Clearly, the space complexity of the

algorithm is O(m).

4. The new star coloring algorithm.

4.1. Overview. Our star coloring algorithm is based on the same idea as the

acyclic coloring algorithm presented in section 3. Instead of managing a collection

of edge-disjoint two-colored trees, we now manage a collection of edge-disjoint two-

colored stars. Recall that the hub of a star is the vertex at which the edges meet.

The key diﬀerence between the acyclic and the star coloring algorithms stems from

the following fact: An edge may be added to a tree at any vertex of the tree, whereas

the only way an edge may be added to a star is at the hub of the star. A consequence

of this fact is that the addition of an edge to a star does not lead to the merging of

two existing stars. Hence, a simpler data structure than disjoint-set suﬃces. We use

a one-dimensional array that maps edges to stars, and a similar array that maps stars

to hubs, to manage the collection of two-colored stars.

Figure 4.1 provides an overview of the algorithm. Part (a) of the ﬁgure shows

the input graph Gwhere every vertex except vhas been colored consistent with the

requirements of a star coloring. The illustration uses the six colors red, blue, green,

yellow, pink, and black (Bk). Parts (b) through (f) show a subset of the two-colored

stars with nonempty edge sets that are incident on v.

Since a star coloring is also a distance-1 coloring, the set of colors used by ver-

tices adjacent to the vertex vis impermissible for v. Enforcing this requirement is

straightforward. In addition, any color that leads to a two-colored path involving v

and three other vertices is impermissible for v. There are exactly two cases that need

to be considered to identify and forbid a color of this type.

•Case 1. If vertex vhas two adjacent vertices wand xof the same color, then

the color of every vertex adjacent to w, and the color of every vertex adjacent

to xshould be excluded from the allowable set of colors for v.

•Case 2. If vertex vhas exactly one adjacent vertex wof color c, and wbelongs

to a star Scontaining at least two edges but is not the hub of S, then the

color used by the hub of Sshould be disallowed from being a candidate color

for v.

Parts (b) and (c) of Figure 4.1 illustrate Case 1; neither the color blue nor the color

green can be assigned to the vertex v. Similarly, parts (d) and (e) illustrate Case 2;

the colors green and pink used by the hubs of the respective stars are not allowed for

the vertex v. As illustrated in part (f), if the star Sin a situation similar to Case 2

has only one edge, then it does not impose any additional color restriction on the

vertex v. We say that the hub of such a star consisting of a single edge is undeﬁned.

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1055

v v vv v v

(a) (b) (c) (d) (e) (f)

PBk Y

RRRR

BkP

Fig. 4.1. (a): A partial star coloring of a graph; vertex vis the only vertex that remains to

be colored. (b)–(f ): A subset of the two-colored induced subgraphs incident on the vertex v; each

subgraph is a collection of stars. The illustration uses the six colors red, blue, green, yellow, pink,

and black (Bk).

The hub is deﬁned later if and when another edge joins the star, in which case the

point of incidence between the two edges becomes the hub.

Like the acyclic coloring algorithm, our star coloring algorithm iterates over the

vertex set Vin some order, coloring one vertex at a time. In the step where a vertex v

is colored, the color used by each adjacent vertex and every additional color identiﬁed

by checking Cases 1 and 2 is excluded from the allowable set of colors for v. Then

the smallest permissible color is chosen and assigned to the vertex v. After this,

every edge vw connecting the vertex vand an already colored vertex wis placed in

the appropriate two-colored star. In particular, if an edge vw is incident on some

(color[v],color[w])-star S, then the edge vw joins the star S; otherwise, the edge vw

forms a star on its own with an undeﬁned hub. Unlike the acyclic coloring algorithm

in which the addition of an edge to a two-colored tree may result in the merging of

two existing trees, the addition of an edge to a two-colored star does not lead to the

merging of two existing stars.

4.2. Detailed description. To maintain the collection of two-colored stars ef-

fectively, we use an edge-indexed array, star, that maps edges to stars, and a star-

indexed array, hub, that maps stars to hubs. In particular, star[vw]=ssigniﬁes that

the edge vw belongs to the star s, and hub[s]=uindicates that the hub of the star

sis the vertex u. We work with the two separate arrays, star and hub, in order to

distinguish between stars consisting of at least two edges (having deﬁned hubs) and

stars consisting of a single edge (having undeﬁned hubs). In addition, we make use of

the two arrays color and forbiddenColors in just the same way as in section 3.

Our star coloring algorithm is outlined in Algorithm 4.1. The body of the outer

for-loop has two parts. The ﬁrst part, the piece of code in lines 4–16, is concerned

with ﬁnding a color for the vertex v, the current vertex in the iteration over the vertex

set. The second part deals with updating the collection of two-colored stars incident

on the vertex v, and is achieved by the call to updateStars in line 17. Hereafter,

we shall refer to the actions performed on the vertex vin lines 4–17 as processing the

vertex v.

Finding a color. The for-loop in lines 4–15 of Algorithm 4.1 determines the colors

that cannot be assigned to the vertex v. In line 5, the color used by a vertex w

adjacent to the vertex vis marked as a forbidden color for v, since a star coloring

is also a distance-1 coloring. To determine and forbid additional colors that lead to

two-colored paths involving vand three other vertices, Cases 1 and 2 outlined earlier

need to be checked. To detect Case 1 in an eﬃcient manner, we use the color-indexed

array ﬁrstNeighbor which stores pairs of vertices: ﬁrstNeighbor[c]=(p, q) indicates

1056 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

Algorithm 4.1. Input: a graph G. Associated data structure: ﬁrstNeighbor[c]=(p, q)

indicates that the vertex qis the ﬁrst-to-be-encountered neighbor of the vertex phaving the

color c;treated[y]=xindicates that the color of every vertex adjacent to the vertex yis

already forbidden from being a color for the vertex x. Eﬀect: ﬁnds a star coloring of G.

1: procedure StarColoring(G=(V,E))

2: Initialize data structures

3: for each v∈Vdo

4: for each colored w∈N1(v)do

5: forbiddenColors[color[w]] ←v

6: (p, q)←ﬁrstNeighbor[color[w]]

7: if p=vthen Case 1

8: if treated[q]=vthen

9: treat(v,q)forbid colors of neighbors of q

10: treat(v,w)forbid colors of neighbors of w

11: else

12: ﬁrstNeighbor[color[w]] ←(v, w)

13: for each colored x∈N1(w), x=vdo

14: if x=hub[star[wx]] then potential Case 2

15: forbiddenColors[color[x]] ←v

16: color[v]←min{i>0,forbiddenColors[i]=v}

17: updateStars(v)

Algorithm 4.2. Input: the vertex vbeing processed and a colored vertex wadjacent to

v. Eﬀect: excludes the colors used by vertices adjacent to wfrom the allowable set of colors

for v, and updates the array treated to reﬂect this.

1: procedure treat(v, w)

2: for each colored x∈N1(w)do

3: forbiddenColors[color[x]] ←v

4: treated[w]←v

that when processing the vertex p, the vertex qwas the ﬁrst neighbor with color cto

be encountered. (An array with the same name was used for a similar purpose in the

acyclic coloring algorithm discussed in the previous section.) In line 6, a lookup in

the table ﬁrstNeighbor at the index color[w] is made and the stored value is retrieved

in the vertex pair (p, q).

If p=v, then the implication is that a vertex which is distinct from, but has

the same color as, the vertex w—the vertex q—has already been encountered while

visiting the distance-1 neighbors of the vertex v. This clearly corresponds to Case

1. The appropriate action to be taken is to forbid the colors used by the vertices

adjacent to qand the colors used by the vertices adjacent to w. The calls to the

subroutine treat (outlined in Algorithm 4.2) in lines 9 and 10 do precisely that. In

connection with this case, we use the vertex-indexed array treated in which we store

vertices; treated[y]=xindicates that the color of every vertex adjacent to the vertex

yhas already been marked as a forbidden color for the vertex x. The if-test in line 8 is

performed to avoid unnecessary computation in the case where color[w] is encountered

for a third time or later.

If, on the other hand, p=v, then the implication is that the color used by the

vertex wis encountered for the ﬁrst time while visiting the distance-1 neighborhood

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1057

Algorithm 4.3. Input: the vertex vbeing processed. Eﬀect: adds edges vw, where wis

colored, to appropriate two-colored stars.

1: procedure updateStars(v)

2: for each colored w∈N1(v)do

3: if ∃x∈N1(w) where x=vand color[x]=color[v]then vw,wx ∈E

4: hub[star[wx]] ←w

this may already be true

5: star[vw]←star[wx]

6: else

7: (p, q)←ﬁrstNeighbor[color[w]]

8: if (p=v) and (q=w)then vw, vq ∈E∧color[w]=color[q]

9: hub[star[vq]] ←v

this may already be true

10: star[vw]←star[vq]

11: else vw forms a new star

12: starID ←starID +1

13: star[vw]←starID

of the vertex v. This information is recorded in line 12 for future use. The piece of

code in the subsequent for-loop takes a proactive measure in anticipation of Case 2. In

particular, the loop iterates over edges wx that belong to existing two-colored stars.

For each star to which an edge wx belongs, if the star has a deﬁned hub and the hub

is the vertex x, then the color of xis forbidden from being a color for the vertex v

(line 14). Potentially, this corresponds to Case 2; if the vertex vis later found to have

a neighbor vertex w=wwith the same color as w, then it reduces to Case 1. Recall

that only stars containing at least two edges have deﬁned hubs; for a star with only

one edge the test in line 14 will fail and therefore will not impose an additional color

restriction (cf. Figure 4.1 (f)).

At the end of the for-loop in lines 4–15, every color that is not allowed to be a

color for the vertex vis marked in the array forbiddenColors. The smallest permissible

color is then chosen and assigned to the vertex vin line 16.

As with the arrays used in section 3, in the arrays ﬁrstNeighbor and treated the

marker vertex pserves as a time-stamp; it is used to determine whether the infor-

mation stored in the data structures concerns the vertex currently being processed.

These data structures are initialized at the beginning of Algorithm 4.1, and no reini-

tialization is required in the course of iteration over the vertex set.

Updating the collection of two-colored stars. The call to the routine updateStars

(outlined in Algorithm 4.3) is made at line 17 to update the collection of two-colored

stars to reﬂect the fact that the vertex vhas been assigned a color. In Algorithm 4.3,

each edge vw that joins the vertex vwith an already colored vertex wis placed in

an existing or a new two-colored star depending on the colors of the vertex vand its

distance-2 neighbors. The edge vw joins an existing star if (1) there exists an edge wx

and color[v]=color[x], or (2) there exists an edge vq and color[w]=color[q]. The ﬁrst

case can be detected by looking at the distance-1 neighborhood of the vertex w; for the

second case, the information in the array ﬁrstNeighbor, which is set in Algorithm 4.1,

is utilized. Clearly, in the ﬁrst case the vertex wbecomes the hub of the resulting star

(if it was not already so), and in the second case the vertex vbecomes the hub (again,

if it was not already so). If neither of cases (1) or (2) turns out to be positive, then

the edge vw forms a new two-colored star on its own with an undeﬁned hub. Notice

that the deﬁnitions of the hubs in lines 4 and 9 are necessary, since the edges wx and

1058 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

vq could be single-edge-stars with undeﬁned hubs at the time the routine is called.

4.3. Complexity. Given a graph on nvertices and medges, the computational

work involved in each of the niterations of the outer for-loop in Algorithm 4.1 is

proportional to d2(v). Hence the overall time complexity of our star coloring algorithm

is O(nd2). Its space complexity is clearly O(m).

5. Coloring models in Hessian computation. We have experimentally com-

pared the new algorithms for acyclic and star coloring presented in this paper with

previously suggested algorithms for these and other related coloring problems that

occur in the computation of sparse Hessians. In this section, we brieﬂy review such

coloring models and discuss their interrelationships. In section 6, we will discuss

corresponding greedy algorithms.

One step in an eﬃcient computation of a sparse derivative matrix Ausing auto-

matic diﬀerentiation (or ﬁnite diﬀerencing) is to use structural information about A

to partition its ncolumns into pdisjoint groups, with pas small as possible. This

step can be posed as a problem of seeking an n×pseed matrix Swhose (j, k) entry

is deﬁned as follows:

(5.1) sjk =1 if column ajbelongs to group k,

0 otherwise.

Since the matrix Sspeciﬁed by (5.1) corresponds to a partitioning of the columns of

the matrix A, in every row rof the matrix Sthere exists exactly one column cin

which the entry src is equal to one. In the automatic diﬀerentiation literature, there

exist approaches that use a seed matrix where a row-sum is not necessarily equal to

one [19]. Such approaches are not partitioning-based as they allow a column of Ato

be placed in more than one group. In this paper we consider only seed matrices that

deﬁne a partition.

The speciﬁc criteria used to deﬁne a seed matrix Sdepend on whether the entries

of the matrix Aare to be recovered from the compressed representation AS directly

(by conceptually solving a diagonal system of equations) or via substitution (by con-

ceptually solving a set of triangular systems of equations). The criteria also depend on

whether the derivative matrix Ais Jacobian (nonsymmetric) or Hessian (symmetric).

In general, a seed matrix Ssuitable for a direct method requires a larger pcompared

to one suitable for a substitution method. On the other hand, the recovery of entries

via substitution incurs an additional, but aﬀordable, computational cost.

5.1. Models for direct methods. Curtis, Powell, and Reid [16] observed that

astructurally orthogonal partition of a Jacobian matrix A—a partition of the columns

of Ain which no two columns in a group share a nonzero at the same row index—

gives a seed matrix Swhere the entries of Acan be directly recovered from the

compressed representation AS. Obviously, a structurally orthogonal partition could

also be used in the context of computing a Hessian if one were to settle for not

exploiting the available symmetry. Following this approach, McCormick [27] showed

that a structurally orthogonal partition of a Hessian is equivalent to a distance-2

coloring of its adjacency graph.

The adjacency graph G(A) of a Hessian Ahas a vertex for each column, and an

edge joins column vertices aiand ajwhenever the entry aij ,i=j, is nonzero; the

diagonal entries in Aare assumed to be nonzero, and they are not explicitly repre-

sented by edges in G(A). In the distance-2 coloring model for structurally orthogonal

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1059

partition as well as in other models discussed in this section, a set of vertices having

the same color corresponds to a set of columns that belong to the same group in the

induced partition. In other words, each column of the seed matrix corresponds to a

color class.

Powell and Toint [28] were the ﬁrst to suggest symmetry-exploiting partitions for

both direct and substitution-based methods for computing Hessians. When translated

to a coloring φof the adjacency graph, the partition Powell and Toint suggested for

a direct Hessian computation requires that (1) φbe a distance-1 coloring, and (2)

in every path on three vertices P3=v,w, x, the vertices vand xreceive diﬀerent

colors whenever φ(w)>min{φ(v),φ(x)}. We call a coloring that satisﬁes these two

requirements a restricted star coloring. Notice that, unlike in a distance-2 coloring, in

a restricted star coloring, the terminal vertices vand xin a path v, w, x are allowed

to have the same color as long as the color of the middle vertex wis lower in value.

Coleman and Mor´e [13] generalized the approach of Powell and Toint and showed

that a symmetrically orthogonal partition of a Hessian suﬃces for a direct recovery

of its entries from a compressed representation. A partition of the columns of a

Hessian matrix Ais symmetrically orthogonal if for every nonzero element aij, either

(1) the group containing column ajhas no other column with a nonzero in row i,

or (2) the group containing column aihas no other column with a nonzero in row j.

Coleman and Mor´e established that a symmetrically orthogonal partition of a Hessian

is equivalent to a star coloring of its adjacency graph.

5.2. Models for substitution methods. Based on the work of Powell and

Toint [28], Coleman and Mor´e [13] found a coloring model for a partitioning of a

Hessian matrix Athat deﬁnes a seed matrix Ssuch that the nonzero entries of Acan

be recovered from the compressed representation AS via a (restricted) substitution

method. The recovery of the entries of the matrix Afrom the matrix AS requires

solving a triangular system of equations, a system that heavily relies on the structure

of A. Appropriately, Coleman and Mor´e called the model triangular coloring. Given

a graph G=(V,E), a mapping φ:V→{1,2,...,p}is a triangular coloring if

there exists a vertex ordering πsuch that (1) φis a distance-1 coloring, and (2) in

every P3=v, w, x, the vertices vand xreceive diﬀerent colors whenever π(w)>

max{π(v),π(x)}. A vertex ordering πthat is well suited for triangular coloring exists

and will be discussed in section 5.3.

Triangular coloring exploits symmetry only to a limited extent. This limitation

was later addressed by Coleman and Cai [9], who introduced a generalized notion of

substitutable partitions for which they provided a simple graph-theoretic characteri-

zation. In particular, they proved that an acyclic coloring of the adjacency graph of

a Hessian induces a substitutable partition of its columns. They also showed that,

using an acyclic coloring, the nonzeros of the Hessian can be recovered from its com-

pressed representation by considering a pair of color classes (groups of columns) at a

time. The key diﬀerence between a partition induced by a triangular coloring and a

partition induced by an acyclic coloring is the following. Triangular coloring induces

a partition that deﬁnes one large, coupled triangular system of equations, whereas

acyclic coloring induces a partition that deﬁnes several smaller, decoupled triangular

systems of equations, each deﬁned by a pair of color classes.

Table 5.1 summarizes the coloring models in Hessian computation discussed thus

far. In the table, the coloring variants are listed in an increasing order of restrictive-

ness, with the least constrained variant at the top and the most constrained variant

at the bottom. In the following subsection, we present results that make the interre-

1060 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

Table 5.1

Overview of coloring models used in Hessian computation. In each case, the adjacency graph

G=(V, E)of the underlying Hessian is used, and the mapping φ:V→{1,2,...,p}deﬁnes

a coloring. In addition to the input graph G, the deﬁnition of triangular coloring assumes the

existence of a vertex ordering π. *Powell and Toint [28] stated these models using their equivalent

matrix partitioning counterparts; the translations into graph colorings are due to Coleman and

Mor´e[13].

Name Coloring conditions Computation modeled

Acyclic 1. vw ∈E⇒φ(v)=φ(w) substitution

2. every cycle uses ≥3 colors Coleman and Cai [9]

Triangular 1. vw ∈E⇒φ(v)=φ(w) restricted substitution

2. in every P3=v, w, x : Powell and Toint [28]∗

π(w)>max{π(v),π(x)}⇒φ(v)=φ(x) Coleman and Mor´e [13]

Star 1. vw ∈E⇒φ(v)=φ(w) direct

2. every P4uses ≥3 colors Coleman and Mor´e [13]

Restricted 1. vw ∈E⇒φ(v)=φ(w) restricted direct

star 2. in every P3=v, w, x : Powell and Toint [28]∗

φ(w)>min{φ(v),φ(x)}⇒φ(v)=φ(x) Coleman and Mor´e [13]

Distance-2 1. vw ∈E⇒φ(v)=φ(w) direct (ignores symm.)

2. P3=v, w, x ∈G⇒φ(v)=φ(x) McCormick [27]

lationships among these variations more precise.

5.3. Interrelationships. Recall that we denote specialized chromatic numbers

by χwith an appropriate subscript. For example, χa(G) denotes the acyclic chromatic

number of G. In Lemma 5.1, we relate the acyclic, star, and restricted star chromatic

numbers of a graph. Similarly, in Lemma 5.2 we show the relationship among the

acyclic, triangular, and restricted star chromatic numbers of a graph.

Lemma 5.1. For every graph G,χa(G)≤χs(G)≤χrs(G).

Proof. Let φbe a star coloring of a graph G. Observe that a cycle in Gon three

vertices uses three colors, since φis also a distance-1 coloring of G. Moreover, a cycle

on at least four vertices contains a P4and therefore uses at least three colors. Hence

φis an acyclic coloring of G, and the ﬁrst inequality follows.

For the second inequality, we show that if φis a restricted star coloring of G, then

φis also a star coloring of G. Let P=v, w, x beaP3in G. Since φis a restricted

star coloring, φ(v)=φ(w) and φ(w)=φ(x). Furthermore, either φ(v)=φ(x)

or φ(v)=φ(x)>φ(w) holds. In the former case, in any P4in Gthat contains

the path P, adjacent vertices have diﬀerent colors, and the P4uses at least three

colors, satisfying the requirements of a star coloring. To see that the latter case also

fulﬁlls the requirements of a star coloring, consider any P4=v, w, x, y in Gthat

extends the path Pin one of its ends. (A similar argument would hold for a path

u, v, w, x that extends Pin the other end.) Again, since φis a restricted star coloring,

φ(y)=φ(x). Moreover, the relationship φ(y)=φ(w) holds, since otherwise the

relationship φ(x)>φ(w)=φ(y) in the path w , x, y would hold, contradicting the

fact that φis a restricted star coloring. Hence, in the path v , w, x, y, adjacent vertices

have diﬀerent colors, and at least three colors are used. Therefore φis a star coloring

of G.

Lemma 5.2. For every graph G,χa(G)≤χt(G)≤χrs(G).

Proof. Suppose φis a triangular coloring of a graph Gfor a vertex ordering π.

We show that φis then an acyclic coloring of G. Consider any cycle Cin G, and let

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1061

zbe the vertex in Cwith the highest index in the ordering π. Since Cis a cycle,

there exists a path P=u, z, v in Cfor some vertices uand v. Further, since φis a

triangular coloring, φ(u)=φ(v) by choice of z. Since φis also a distance-1 coloring,

both of these colors are distinct from φ(z). Hence, the cycle Cuses at least three

colors, and therefore φis an acyclic coloring of G.

We now show the second inequality. Suppose φis a restricted star coloring of

a graph Gon nvertices. Consider a vertex ordering π=v1,v

2,...,v

nsuch that

the sequence {φ(vi)}is nondecreasing. In other words, the ordering πis such that

vertices in the same color class are listed consecutively, and the color classes in turn

are listed in increasing order. Consider any path P=v, w, x on three vertices in G.

Since φis a restricted star coloring, a pair of adjacent vertices in the path Phave

diﬀerent colors. There are two possibilities regarding the terminal vertices vand x

in P: either φ(v)=φ(x)orφ(v)=φ(x)>φ(w). In the former case, the path P

trivially satisﬁes the requirements of a triangular coloring. Given the ordering πas

deﬁned in the proof, the path Pin the latter case also satisﬁes the requirements of

a triangular coloring, since π(w)<min{π(v),π(x)}. Hence, the coloring φand the

ordering πtogether deﬁne a triangular coloring of G.

The inequalities in Lemmas 5.1 and 5.2 can easily be extended in both ends.

Clearly, χ1(G)≤χa(G) holds, since an acyclic coloring is also a distance-1 color-

ing. In a distance-2 colored graph, the vertices in every P3use three distinct colors.

Hence, a distance-2 coloring is also a restricted star coloring, implying the relationship

χrs(G)≤χ2(G). We summarize these relationships in Theorem 5.3.

Theorem 5.3. For every graph G, the following relationships hold:

χ1(G)≤χa(G)≤χt(G)≤χrs(G)≤χ2(G)=χ1(G2),

χ1(G)≤χa(G)≤χs(G)≤χrs(G)≤χ2(G)=χ1(G2).

The relationship between χt(G) and χs(G) is not clear. There are examples of

star colorings where there does not exist a vertex ordering that makes a given star

coloring a triangular coloring. An example would be a star coloring of a graph in

which every vertex is the hub of a two-colored P3. Conversely, there are examples of

colorings that are triangular but not star. A simple example is a two-colored P4in

which adjacent vertices have diﬀerent colors. Thus the class Cs(G) of star colorings

of a graph Gand the class Ct(G) of triangular colorings of Gare such that neither

Ct(G)⊆Cs(G) nor Cs(G)⊆Ct(G) holds. However, the relationship χt(G)≤χs(G)

might still hold. We leave settling the latter issue as an open problem.

The last equality in Theorem 5.3 follows from the fact that a distance-2 coloring

of Gis equivalent to a distance-1 coloring of the square graph G2. In fact, most of

the coloring variants on a graph G=(V,E) listed in Table 5.1 can also be viewed as

a distance-1 coloring of an appropriately deﬁned “ﬁlled” graph G=(V, E ∪F).

When the original graph Gis the adjacency graph of a Hessian, the ﬁlled graphs in

the distance-2 and triangular coloring cases have interesting interpretations. The ﬁlled

graph in the distance-2 coloring case (the square graph G2)isthecolumn intersection

graph of the underlying Hessian A. The column intersection graph of a matrix has

a vertex for each column, and an edge between a pair of columns exists whenever

the two columns share nonzero entries at some common row index. In the case of a

triangular coloring for a given vertex ordering π, the ﬁlled graph G, in which each

ﬁll edge fis generated as per the speciﬁcation

∀P3=v, w, x in G:π(w)>max{π(v),π(x)}⇒f=vx,(5.2)

1062 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

is the column intersection graph of the lower triangular part Lπof the permuted

matrix PTAP , where Pis a permutation matrix deﬁned by the ordering π. These

relationships were observed and used by Coleman and Mor´e [13], who showed that

asmallest last (SL) vertex ordering as a choice for πis particularly well suited for

triangular coloring.

Given a graph on nvertices, a vertex ordering v1,v

2,...,v

nis an SL ordering

if for all 1 ≤i≤n, the vertex viis a vertex with the smallest degree in the graph

induced by vertices v1,...,v

i. Here is why an SL ordering is well suited for triangular

coloring: Among the n! possible orderings (permutations) π, an SL ordering minimizes

the maximum number of nonzeros in a row in Lπ. Thus, an SL ordering provides a

lower bound on the triangular chromatic number, a result proven by Coleman and

Mor´e [13].

Orthogonal to its role in triangular coloring, an SL ordering is also known to

be one of the eﬀective ordering techniques for reducing the number of colors used

by a greedy coloring algorithm. The number of colors used by a greedy distance-1

coloring algorithm that employs an SL ordering to color a graph Gis an important

graph parameter. The parameter is known as the coloring number col(G)ofGand

is closely related to several other graph notions, including degeneracy, maximal core,

and arboricity [17]. In our context, since an SL ordering can be computed in linear

time in the number of edges, the coloring number is an easy-to-compute, nontrivial

upperbound for the distance-1 chromatic number. As mentioned in the previous

paragraph, the coloring number is also a lower bound for the triangular chromatic

number. Theorem 5.4 summarizes these relationships.

Theorem 5.4. For every graph G,χ1(G)≤col(G)≤χt(G).

6. Previous algorithms. Coleman and Cai [9] accurately modeled the parti-

tioning problem that occurs in substitution-based Hessian computation as the acyclic

coloring problem. However, the algorithm they suggested is an algorithm for tri-

angular coloring. Speciﬁcally, they suggested that a ﬁlled graph Gbe explicitly

constructed as described by the expression (5.2), using an SL vertex ordering for π,

and then that Gbe distance-1 colored. The same approach has been taken earlier

by Coleman and Mor´e [13] and in the related software papers [10, 11]. However, as

we will soon show in the current section, it is possible to achieve a triangular col-

oring of Gby using a greedy algorithm that works directly on G. This section also

discusses previously known greedy algorithms for star and distance-kcoloring. All of

the algorithms discussed in this section are included in our experiments.

In a previous work [17], we suggested an algorithm for star coloring that enforces

the coloring requirements in a straightforward manner—by checking paths on four

vertices. We refer to this algorithm here as NaiveStarColoring (NS ). Powell and

Toint [28] suggested an algorithm of the same spirit, formulated in the language of

matrices, for the restricted star coloring problem. We will call this algorithm here

RestrictedStarColoring (RS ).

Algorithms NS and RS are both greedy. In determining a set of forbidden colors

for a vertex v, algorithm NS consults the colors used by the distance-3 neighbors of v,

whereas algorithm RS checks the colors used by only the distance-2 neighbors. Once

the set of forbidden colors is obtained, both algorithms choose the smallest color not

included in the set as the color for the vertex v. In a similar fashion, one can design

greedy algorithms for distance-1 coloring, distance-2 coloring, and triangular coloring

for a given ordering π. We refer to such algorithms as Distance1Coloring (D1),

Distance2Coloring (D2), and TriangularColoring (T-π). The suﬃx in the

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1063

Fig. 6.1.Visualizing a step in greedy coloring algorithms; vis the vertex currently being colored.

Table 6.1 shows how the set of forbidden colors for vis determined in various cases.

Table 6.1

Conditions under which a particular color is forbidden to v(see in conjunction with Figure 6.1).

Here φis a coloring, and πis a vertex ordering.

1x2

D1always

D2always always always

NS always always only if φ(w)=φ(y)

RS always always only if φ(w)>φ(x)

T-πalways only if only if

π(w)>max{π(v),π(x)}π(w)>max{π(v),π(x)}

notation T-πsigniﬁes that a speciﬁc vertex ordering πis used by the algorithm.

We use Figure 6.1 in conjunction with Table 6.1 to illustrate how the set of

forbidden colors for a vertex is determined by the various greedy algorithms in coloring

a graph Gon nvertices and medges. The tree depicted in Figure 6.1 is not a

subgraph of G, but rather a visualization of the classiﬁcation of the neighborhood

Nk(v) of the vertex vbeing colored at the current step of the particular algorithm.

The classiﬁcation is based on whether a vertex in the set Nk(v) is already colored

or not. In the ﬁgure, the vertices of the graph Gthat are one, two, and three edges

away from the vertex vare represented by the tree nodes at the levels w,x, and y,

respectively. A shaded node signiﬁes already colored vertices and an unshaded node

signiﬁes vertices not yet colored. The shaded nodes at level xare further labeled x1

and x2, to distinguish between the cases where a vertex win a path v, w,x is already

colored or not.

Table 6.1 summarizes the conditions under which a color used by a vertex in the

scenario described in Figure 6.1 is forbidden to the vertex vfor the various algorithms.

Cases x1and x2correspond to distinct decisions in algorithms NS and RS. These two

cases are indistinguishable in algorithms D2 and T-πand are irrelevant to algorithm

D1. As column x1in Table 6.1 shows, in algorithms NS and RS, the color of vertex

xin a path v, w, x where vertex wis not yet colored is immediately forbidden to the

vertex v. This ensures that any extension v, w, x, y of the path v, w, x would end

up using at least three colors, as desired, since in the step in which the vertex wis

colored, the distance-1 coloring requirement guarantees that the vertex wgets a color

distinct from the colors of vertices vand x. In contrast, in the case where vertex

wis already colored (see column x2in the table), determining whether the color of

vertex xshould be forbidden to vertex vor not requires a further check. In algorithm

NS, the check involves vertices wand y(note that yis three edges away from v). In

1064 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

algorithm RS, the check involves vertices wand x. In each of the algorithms listed in

Table 6.1, once the set of forbidden colors to the vertex vis determined, the smallest

color not included in the set is chosen as the color for v.

The illustration should make it clear that the complexity of algorithm D1is

O(nd1)=O(m), the complexity of each of algorithms D2, RS, and T-πis O(nd2),

and the complexity of algorithm NS is O(nd3).

7. Experimental results.

7.1. Setup. To demonstrate the performance of our new algorithms, we ran

experiments on 29 graphs obtained from molecular dynamics applications [8, 30].

The test graphs were chosen because of their large size and irregular structure. The

left half of Table 7.1 shows the number of vertices |V|, the number of edges |E|, the

maximum degree Δ, and the average degree d1in each test graph. The right half of the

table shows certain computed values that we will discuss shortly. All of the algorithms

used in our experiments were implemented in C++. The hardware used was a Linux

machine equipped with Dual AMD Opteron Processor 244, 1.8 GHz, 2 GB memory,

and 1 GB cache. The operating system was RHEL 3 and the compiler was GCC.

7.2. Algorithms compared and orderings used. In addition to the new

acyclic and star coloring algorithms presented in sections 3 and 4—abbreviated hence-

forth as Aand S—in this section, we report experimental results on the algorithms

D1, D2, RS,NS, and T-sl discussed in section 6. The suﬃx sl in the triangular

coloring algorithm indicates that we used an SL vertex ordering as the ordering π

to deﬁne the coloring, since this is optimal as discussed in the paragraph leading to

Theorem 5.4. Algorithms D1 and D2 are included in our experiments primarily to

serve as references against which quantities could be normalized.

The order in which vertices are colored in a greedy algorithm determines the

number of colors used by the algorithm. As mentioned earlier, an SL ordering (more

precisely, a distance-1 SL ordering) is one of the eﬀective known ordering techniques

for distance-1 coloring. As a natural extension, for algorithms such as D2, where the

distance-2 neighborhood of a vertex is explored at each step, a distance-2 SL ordering

(D2SL) can be deﬁned and used to attain a similar beneﬁt. Speciﬁcally, we deﬁne a

D2SL ordering using the quantity |N2(v)|associated with each vertex vin a graph.

Using linear-time sorting techniques, a D2SL ordering can be computed within the

same order of time as a subsequent distance-2 coloring, i.e., in O(nd2) time.

In our experiments (results to be presented in section 7.3), for each of the al-

gorithms D1, D2, RS,NS,S, and A, we used one of the orderings among natural

(the order in which vertices appear in the input graph), D1SL, and D2SL orderings

that led to the fewest colors on a majority of the test cases. For algorithm D1, this

ordering was D1SL; for algorithms D2, RS,NS, and S, this was the D2SL ordering;

and for algorithm A, it was the natural ordering. We have also experimented with

appropriately extended variants of incidence degree and largest degree ﬁrst orderings

[17]. The results we obtained using these are in general similar to or worse than the

ones obtained using the SL orderings and hence are not reported here.

In a greedy algorithm for triangular coloring, vertex orderings arise in two diﬀerent

contexts: (1) as part of the deﬁnition of the coloring, and (2) as part of the coloring

algorithm. In terms of an ordering (π) that deﬁnes the coloring, a D1SL ordering

in the input graph Gis optimal. The same ordering can be used while coloring the

vertices sequentially, and that is what we did in algorithm T-sl. To (potentially)

reduce the number of colors used, one could also use a D1SL coloring order based on

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1065

Table 7.1

Test graph statistics (left half ) and computed values (right half ). Column col(G)lists the

coloring number of each graph, and columns D1and D2list the runtime in seconds of algorithms

D1and D2. The last column shows the ratio of number of edges in a derived graph Gto that in an

original G; the graph Gis obtained from the graph Gas per the speciﬁcation in expression (5.2)

where πis an SL vertex ordering in G. The symbol *** indicates that a graph G, being too large

to ﬁt in memory, could not be computed.

ID Name |V||E|Δd1col(G) Runtime (sec) |E|

|E|

D1D2

1 er-gre-2 36,573 53,046 8 3 4 0.01 4.44 1.1

2 apoa1-2 92,224 139,351 8 3 4 0.00 31.74 1.1

3 HIV-3 11,414 64,134 20 11 8 0.00 0.44 1.6

4 er-gre-3 36,573 169,787 19 9 8 0.00 4.34 1.7

5 popc-br-3 24,916 102,058 22 8 9 0.01 2.05 1.6

6 apoa1-3 92,224 456,220 20 10 9 0.02 26.85 1.7

7 mol1-2 131,072 1,179,648 18 18 11 0.08 65.47 2.8

8 HIV-4 11,414 130,332 39 23 14 0.00 0.58 2.5

9 apoa1-4 92,224 1,131,436 43 25 15 0.04 28.39 3

10 er-gre-4 36,573 451,355 42 25 15 0.01 4.91 3

11 popc-br-4 24,916 255,047 43 20 15 0.01 2.35 2.7

12 popc-br-5 24,916 497,269 75 40 24 0.01 3.16 4.2

13 apoa1-5 92,224 2,254,805 73 49 25 0.08 33.02 5.2

14 HIV-6 11,414 412,623 116 72 34 0.01 1.74 6

15 er-gre-6 36,573 1,482,904 116 81 36 0.04 9.76 7.8

16 popc-br-6 24,916 850,043 125 68 38 0.03 4.89 6.4

17 apoa1-6 92,224 3,864,429 123 84 38 0.11 42.14 8.1

18 mol1-3 131,072 5,636,096 86 86 42 0.27 123.59 9.8

19 popc-br-7 24,916 1,316,771 192 106 54 0.04 8.39 9

20 popc-br-8 24,916 1,914,476 275 154 72 0.05 14.67 12.4

21 popc-br-9 24,916 2,668,555 380 214 98 0.06 25.78 16.4

22 er-gre-10 36,573 6,511,122 460 356 124 0.16 92.92 30

23 popc-br-10 24,916 3,587,724 514 288 126 0.08 43.83 21

24 mol1-4 131,072 14,680,064 224 224 104 0.59 354.34 ***

25 HIV-10 11,414 1,655,383 454 290 115 0.03 18.29 ***

26 apoa1-10 92,224 17,100,850 503 371 135 0.39 305.52 ***

27 HIV-12 11,414 2,683,956 760 470 183 0.05 47.78 ***

28 er-gre-12 36,573 10,928,521 775 598 198 0.24 275.53 ***

29 popc-br-12 24,916 5,924,137 855 476 199 0.14 116.7 ***

tot (1–23) 1,139,699 35,129,231

tot (1–29) 1,447,312 88,102,142

vertex degrees in the ﬁlled graph Gderived from Gaccording to the speciﬁcation

given by expression (5.2).

In a method they call slsl, Coleman and Mor´e [13] explicitly construct such a ﬁlled

graph Gusing a D1SL ordering in Gas π, and then distance-1 color the graph G

using a D1SL ordering in Gto obtain a triangular coloring of the original graph G.

We have implemented this method as well; here we call it T-slsl to reﬂect that it is a

triangular coloring of the original graph. The last column of Table 7.1 shows the ratio

of the number of edges in the derived graph Gto that in the original graph G. For 6

out of the 29 test graphs we used (the last six in Table 7.1), the graph Gcould not

be computed as it was too big to ﬁt in the size of the available memory. The asterisks

used in the corresponding entries in the last column of Table 7.1 indicate this.

1066 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

0 5 10 15 20 25 30

1.5

2.5

3.5

4.5

5.5

Graph id

Colors normalized wrt D1

MaxDeg

0 5 10 15 20 25 30

1.5

2.5

3.5

4.5

Graph id

Colors normalized wrt D1

MaxDeg

T−sl

Fig. 7.1.Normalized number of colors used by various greedy algorithms. In algorithm T-sl,

aD1SL ordering of the vertices in Gis used to both deﬁne and perform the coloring. Algorithm

D1uses a D1SL ordering for coloring. In algorithms D2, RS, NS, and S, vertices are colored in

aD2SL ordering in G; in algorithm A, vertices are colored in the natural order they appear in G.

The plots are normalized relative to the number of colors used by algorithm D1, i.e., the coloring

numbers of the graphs. These numbers are listed in Table 7.1.

7.3. Results. We will now describe a few ﬁgures and tables that show our ex-

perimental results. The ﬁgures will be discussed collectively in section 7.4.

The upper graph of Figure 7.1 displays plots of the number of colors used by

algorithms D2, RS,NS, and Snormalized relative to the number of colors used by

algorithm D1, the coloring numbers of the graphs. The bottom graph shows analogous

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1067

0 5 10 15 20 25 30

−3

−2

−1

Graph id

Runtime normalized wrt D2

T−sl

Fig. 7.2.Execution time comparison. Vertices are colored in the orderings speciﬁed in Fig-

ure 7.1. The plots, in semilog scale, are normalized relative to the runtime of Algorithm D2, values

listed under column D2of Table 7.1.

plots for algorithms T-sl and A. The common curve in the two parts is the plot of the

maximum degrees in the graphs, again normalized relative to the coloring numbers.

The coloring numbers of the graphs, sorted in increasing order, are listed under column

col(G) in Table 7.1.

Figure 7.2 shows semilog plots of the runtimes of algorithms RS,NS,S,T-sl,

and Anormalized relative to the runtime of algorithm D2, the values of which are

listed in the column labeled D2 in Table 7.1. Column D1 in the same table shows

the runtimes of algorithm D1. In these two columns as well as in execution times

reported elsewhere in this section, the reported times include time spent on ordering,

when applicable, but not time for reading test graph ﬁles from disk.

Figure 7.3 shows a comparison between algorithms Aand T-slsl. The results

reported in the ﬁgure concern the test graphs on which the latter could be run suc-

cessfully, the ﬁrst 23 of the 29 test graphs used.

Table 7.2 shows a summary of the results for the various algorithms; the upper

row gives data summed over all test cases, and the lower row shows data summed

over test cases on which algorithm T-slsl was successfully run.

To show the reduction in number of colors that algorithms D1, D2, RS,NS,

and Swere able to get through their use of D1SL or D2SL orderings, we ran these

algorithms with natural vertex ordering as well. Table 7.3 summarizes the results

obtained. In addition to absolute ﬁgures, the table shows the savings in number of

colors (as a percentage) and the relative time cost in comparison with a coloring that

uses an appropriate SL ordering (data listed in Table 7.2).

1068 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

0 5 10 15 20 25

1.5

2.5

3.5

Graph id

Colors normalized wrt D1

T−slsl

0 5 10 15 20 25

−3

−2

−1

Graph id

Runtime normalized wrt D2

T−slsl

Fig. 7.3.Comparison of normalized number of colors (left) and normalized runtimes (right)

between algorithms A and T-slsl. The number of colors are normalized relative to that used by

algorithm D1, and the runtimes are normalized relative to that of algorithm D2.

Table 7.2

Top row: Total number of colors and runtime summed over all test graphs. Bottom row: Totals

over test graphs on which algorithm T-slsl could be run successfully (i.e., graphs without asterisks

in Table 7.1). The order in which vertices are colored is as speciﬁed in Figure 7.1.

D2RS NS S T-sl T-slsl A D1

Colors 9,240 8,749 7,636 7,558 5,065 4,110 1,757

Runtime (min) 28.2 34.4 930 162 12.4 32.5 0.04

Colors 3,951 3,734 3,207 3,191 2,189 2,149 1,709 823

Runtime (min) 9.6 11 128 34.5 3.8 41.8 7 0.02

Table 7.3

Top row: Total number of colors and runtime summed over all test graphs, when vertices are

colored in their natural order in an input graph (Nat). Bottom row: Color savings and relative time

cost in comparison with coloring using an appropriate SL ordering (numbers listed in Table 7.2).

D2RS NS S D1

Colors 11,429 11,275 9,973 9,470 2,384

Runtime (min) 4.32 9.60 679 126 0.03

SL vs. Nat colors saved (%) 19 22 23 20 26

SL to Nat time ratio 6.5 3.6 1.4 1.3 1.3

7.4. Discussion. A reader can make several observations from Figures 7.1–7.3

and Tables 7.1–7.3. We highlight the following few points to supplement such obser-

vations in the context of Hessian computation.

•The advantage of exploiting sparsity using coloring cannot be overstated. If

all of the Hessians (with the nonzero structures of the test graphs) were to be

computed without exploiting sparsity or symmetry, the time required would

be proportional to 1.5 million steps (the sum total of the vertices in Table 7.1).

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1069

If the strongest coloring model considered here—acyclic coloring—were to be

used, all of these Hessians could be evaluated in about four thousand time

steps (the number of colors, or compressed columns, in the Acolumn at the

top half of Table 7.2). Moreover, in general, coloring constitutes only a small

fraction (typically far less than 10%) of the overall computation time involved

in obtaining numerical values in a derivative matrix [18, 19]. Note that the

total coloring time required to acyclic-color all of the test graphs, totaling

over 88 million edges, is about 30 minutes.

•As the upper part of Figure 7.1 shows, among the three algorithms RS,NS,

and Sfor direct Hessian computation where symmetry is exploited, algorithm

Suses the fewest colors. Algorithm RS uses slightly fewer colors than the

symmetry-ignoring algorithm D2. The diﬀerence in the number of colors

used by algorithms RS and Sis signiﬁcant, whereas that between NS and S

is marginal.

•The maximum degree Δ in a graph is a lower bound for the distance-2 chro-

matic number; this fact is clearly reﬂected in Figure 7.1. One can also see

that for a majority of our test graphs, algorithms RS,NS, and Sused more

than Δ colors. However, there were cases, noticeably among the relatively

sparse graphs of ID less than 10, where these algorithms used fewer than Δ

colors.

•The more important advantage of algorithm Sover algorithm NS is its lower

time complexity, a fact supported by the nearly sixfold reduction in observed

running times, as Figure 7.2 and Table 7.2 attest. Algorithms Sand RS have

the same theoretical complexity, and, in practice, algorithm Sis observed to

be slower by a small factor.

•As the bottom graph of Figure 7.1 illustrates, the coloring algorithms for

Hessian computation via substitution—algorithms T-sl and A—use many

fewer colors than the coloring algorithms for direct Hessian computation—

algorithms RS,NS, and S. On the test graphs we used, our acyclic coloring

algorithm Aconsistently used fewer colors than the triangular coloring algo-

rithm T-sl. Moreover, in all of our test cases, algorithm Aused fewer than Δ

colors, while for algorithm T-sl the statement holds true for a majority, but

not all, of the test cases. Interestingly, the exceptions in the latter case are

regular graphs (graphs 7, 18, and 24 in Table 7.1). Note that the reference

against which the number of colors in Figures 7.1 and 7.3 are normalized is

a lower bound for triangular coloring.

•As Figure 7.3 shows, our acyclic coloring algorithm Aalso used fewer colors

than algorithm T-slsl. On the subset of the test cases on which algorithm T-

slsl could be run successfully, algorithm Aused 20% fewer colors than T-slsl.

In terms of runtime, algorithms Aand T-sl are comparable, whereas T-slsl is

much slower.

•In comparison with a natural ordering, an appropriate SL vertex (re)ordering

for coloring reduced the number of colors used by algorithms D1, D2, RS,

NS, and Sby roughly 20% (see Table 7.3). In contrast, the numbers listed

in Table 7.2 show that the savings in number of colors rendered by algorithm

T-slsl compared to algorithm T-sl is only 1.8%, obtained at a cost of runtime

that is larger by a factor of 11. The acyclic coloring algorithm Adid not

beneﬁt from an SL ordering.

•Disregarding time for computing vertex orderings, the practical runtimes of

algorithms D2, RS,S,T-sl, and Aare expected to be similar since the time

1070 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

complexity of each of these algorithms is nearly O(nd2). In the observed

runtimes in Figure 7.2, the plots for T-sl and Afor the most part lie below

the line y= 1, which corresponds to the runtime of algorithm D2. This is

mainly due to the variation in the orderings used: for algorithm D2 (as well

as RS and S) a D2SL ordering in Gwas used, for algorithm T-sl a D1SL in

Gwas used, and for algorithm Athe natural ordering was used.

•Even when vertices were colored in natural order, the acyclic coloring algo-

rithm Awas observed to be slightly faster than the star coloring algorithm S.

On the other hand, time complexity analyses of the two algorithms show A

to be slightly slower than S. The most likely reason for Sbeing slower than A

in practice is that there are many more two-colored stars to be maintained in

algorithm Sthan there are two-colored trees to be maintained in algorithm A,

which in turn makes the former use more memory operations, and therefore

more runtime.

8. Conclusion. We have presented new algorithms for star and acyclic coloring,

models for eﬃcient Hessian computation via direct and substitution methods. We

experimentally demonstrated that the new algorithms outperform previously known

methods. Our acyclic coloring algorithm runs faster, uses fewer colors, and requires

less memory and storage space than the T-slsl triangular coloring method of Coleman

and Mor´e [13]. In comparison with our implementation of the T-sl triangular coloring

algorithm—which unlike the original algorithm of Coleman and Mor´e works directly

on an input graph G—the new acyclic coloring algorithm produces fewer colors at

comparable runtime. Our new star coloring algorithm runs faster and uses fewer

colors than our previous algorithm for the same problem [17]. In comparison with the

previously known restricted star coloring algorithm, our new star coloring algorithm

yields a signiﬁcant reduction in number of colors at the cost of a slight increase in

observed runtime.

Using an acyclic coloring, Coleman and Cai [9] showed that the nonzero entries

of a Hessian can be evaluated via substitution by considering a pair of color classes

at a time. Our new acyclic coloring algorithm has such structures (two-colored trees)

in place, which facilitates an eﬃcient implementation of a procedure for recovering

a Hessian from a compressed representation. We have implemented such a routine.

In collaboration with Andrea Walther at the Technical University of Dresden, our

coloring and Hessian recovery codes have been incorporated in the automatic diﬀer-

entiation tool ADOL-C [20]. A paper that demonstrates the eﬃcacy of the new star

and acyclic coloring algorithms in the overall process of computing the Hessian of

a given function using automatic diﬀerentiation has been submitted for publication

elsewhere [18]. The results we obtained in our experiments using ADOL-C show that

the new coloring algorithms render huge savings in overall runtime, and that Hessian

computation via substitution using acyclic coloring is faster than a direct computation

using star coloring, considering the overall process.

The underlying idea in the new algorithms, the exploitation of two-colored struc-

tures, can be extended to bicoloring models in eﬃcient computation of Jacobians [14].

We are currently working in this direction.

We conclude this paper by pointing out a few other directions for further work.

•As discussed in the paragraph immediately after Theorem 5.3, the relation-

ship between the triangular and star chromatic numbers of a graph is not

clear. Prove or disprove the claim χt(G)≤χs(G).

NEW ACYCLIC AND STAR COLORING ALGORITHMS 1071

•The size of a largest clique in a graph is an obvious lower bound for the

distance-1 chromatic number χ1(G). Analogously, Δ + 1 is a lower bound for

the distance-2 chromatic number χ2(G), since there exists a clique of that

size in the square graph G2. Finding nontrivial lower bounds for acyclic and

star chromatic numbers is a worthwhile issue.

•Appropriate extensions of orderings such as smallest last,largest ﬁrst, and

incidence degree helped reduce the number of colors used by our star color-

ing algorithm, but not that of the acyclic coloring algorithm. It would be

interesting to ﬁnd orderings suitable for the latter algorithm.

Acknowledgments. We thank Jorge Mor´e for suggesting a comparison between

acyclic and triangular coloring and for his helpful comments on a draft of this paper.

We also thank Paul Hovland for making the test graphs available to us. Our sincere

thanks go to the anonymous referees for their valuable comments on the content and

presentation of this paper and for bringing related works in the literature to our

attention.

REFERENCES

[1] G. Agnarsson, R. Greenlaw, and M. M. Halld ´

orsson,On powers of chordal graphs and

their colorings, Congr. Numer., 100 (2000), pp. 41–65.

[2] G. Agnarsson and M. M. Halld´

orsson,Coloring powers of planar graphs, SIAM J. Discrete

Math., 16 (2003), pp. 651–662.

[3] G. Agnarsson and M. M. Halld´

orsson,Coloring powers of planar graphs, in Proceedings of

the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms, SIAM, Philadelphia,

2000, pp. 654–662.

[4] M. O. Albertson, G. G. Chappell, H. A. Kierstead, A. K¨

undgen, and R. Ramamurthi,

Coloring with no 2-colored P4’s, Electron. J. Combin., 11 (2004), article R26.

[5] S. Arora and C. Lund,Hardness of approximations, in Approximation Algorithms for NP-

Hard Problems, D. S. Hochbaum, ed., PWS Publishing, Boston, MA, 1997, Chap. 10, pp.

399–446.

[6] H. Balakrishnan, C. L. Barrett, V. S. A. Kumar, M. V. Marathe, and S. Thite,The

distance-2matching problem and its application to the MAC-layer capacity of ad hoc

wireless networks, IEEE J. Selected Areas in Commun., 22 (2004), pp. 1069–1079.

[7] O. V. Borodin,On acyclic colorings of planar graphs, Discrete Math., 25 (1979), pp. 211–236.

[8] B. Brooks, R. Bruccoleri, B. Olafson, D. States, S. Swaminathan, and M. Karplus,

CHARMM: A program for macromolecular energy, minimization, and dynamics calcula-

tions, J. Comput. Chem., 4 (1983), pp. 187–217.

[9] T. F. Coleman and J.-Y. Cai,The cyclic coloring problem and estimation of sparse Hessian

matrices, SIAM J. Algebraic Discrete Methods, 7 (1986), pp. 221–235.

[10] T. F. Coleman, B. Garbow, and J. J. Mor´

e,Software for estimating sparse Jacobian ma-

trices, ACM Trans. Math. Softw., 10 (1984), pp. 329–347.

[11] T. F. Coleman, B. Garbow, and J. J. Mor´

e,Software for estimating sparse Hessian matri-

ces, ACM Trans. Math. Softw., 11 (1985), pp. 363–377.

[12] T. F. Coleman and J. J. Mor´

e,Estimation of sparse Jacobian matrices and graph coloring

problems, SIAM J. Numer. Anal., 20 (1983), pp. 187–209.

[13] T. F. Coleman and J. J. Mor´

e,Estimation of sparse Hessian matrices and graph coloring

problems, Math. Program., 28 (1984), pp. 243–270.

[14] T. F. Coleman and A. Verma,The eﬃcient computation of sparse Jacobian matrices using

automatic diﬀerentiation, SIAM J. Sci. Comput., 19 (1998), pp. 1210–1233.

[15] T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein,Introduction to Algorithms,

2nd ed., MIT Press, Cambridge, MA, 2001.

[16] A. R. Curtis, M. J. D. Powell, and J. K. Reid,On the estimation of sparse Jacobian

matrices, J. Inst. Math. Appl., 13 (1974), pp. 117–119.

[17] A. H. Gebremedhin, F. Manne, and A. Pothen,What color is your Jacobian? Graph

coloring for computing derivatives, SIAM Rev., 47 (2005), pp. 629–705.

[18] A. H. Gebremedhin, A. Pothen, A. Tarafdar, and A. Walther,Eﬃcient computation of

sparse Hessians: An experimental study using ADOL-C, submitted, 2006.

1072 A. H. GEBREMEDHIN, A. TARAFDAR, F. MANNE, AND A. POTHEN

[19] A. Griewank,Evaluating Derivatives: Principles and Techniques of Algorithmic Diﬀerentia-

tion, SIAM, Philadelphia, 2000.

[20] A. Griewank, D. Juedes, and J. Utke,ADOL-C: A package for the automatic diﬀerentiation

of algorithms written in C/C++, ACM Trans. Math. Softw., 22 (1996), pp. 131–167.

[21] B. Gr¨

unbaum,Acyclic colorings of planar graphs, Israel J. Math., 14 (1973), pp. 390–408.

[22] S. Hossain and T. Steihaug,Computing a sparse Jacobian matrix by rows and columns,

Optim. Methods Softw., 10 (1998), pp. 33–48.

[23] T. Jensen and B. Toft,Graph Coloring Problems, Wiley-Interscience, New York, 1995.

[24] S. O. Krumke, M. V. Marathe, and S. S. Ravi,Models and approximation algorithms for

channel assignment in radio networks, Wireless Networks, 7 (2001), pp. 567 – 574.

[25] V. S. A. Kumar, M. V. Marathe, S. Parthasarathy, and A. Srinivasan,End-to-end packet-

scheduling in wireless ad-hoc networks, in Proceedings of the Fifteenth Annual ACM-SIAM

Symposium on Discrete Algorithms, SIAM, Philadelphia, 2004, pp. 1014–1023.

[26] Y.-L. Lin and S. S. Skiena,Algorithms for square roots of graphs, SIAM J. Discrete Math., 8

(1995), pp. 99–118.

[27] S. T. McCormick,Optimal approximation of sparse Hessians and its equivalence to a graph

coloring problem, Math. Program., 26 (1983), pp. 153–171.

[28] M. J. D. Powell and P. L. Toint,On the estimation of sparse Hessian matrices, SIAM J.

Numer. Anal., 16 (1979), pp. 1060–1074.

[29] L. Stockmeyer and V. Vazirani,NP-completeness of some generalizations of the maximum

matching problem, Inform. Process. Lett., 15 (1982), pp. 14–19.

[30] M. M. Strout and P. D. Hovland,Metrics and models for reordering transformations,in

Proceedings of the 2004 Workshop on Memory System Performance, ACM, New York,

2004, pp. 23–34.

The Complexity of Star Colouring and its Relatives

Thesis

Full-text available

May 2023

Cyriac Antony

(Short description) Star colouring, restricted star colouring (abbreviated rs colouring) and acyclic colouring are variants of graph colouring used as models in the computation of sparse Hessian matrices. We study the complexity of these three colouring variants (i) in well-known graph classes such as planar graphs and bipartite graphs, and (ii) with respect to the maximum degree of the graph, focusing on graphs of maximum degree d and d-regular graphs. We also prove some interesting structural results on star colouring of regular graphs.

Complexity of Restricted Star Colouring

Preprint

Full-text available

Aug 2021

Restricted star colouring is a variant of star colouring introduced to design heuristic algorithms to estimate sparse Hessian matrices. For $k\in\mathbb{N}$, a $k$-restricted star colouring ($k$-rs colouring) of a graph $G$ is a function $f:V(G)\to{0,1,\dots,k-1}$ such that (i)$f(x)\neq f(y)$ for every edge $xy$ of G, and (ii) there is no bicoloured 3-vertex path ($P_3$) in $G$ with the higher colour on its middle vertex. We show that for $k\geq 3$, it is NP-complete to test whether a given planar bipartite graph of maximum degree $k$ and arbitrarily large girth admits a $k$-rs colouring, and thereby answer a problem posed by Shalu and Sandhya (Graphs and Combinatorics, 2016). In addition, it is NP-complete to test whether a 3-star colourable graph admits a 3-rs colouring. We also prove that for all $\epsilon > 0$, the optimization problem of restricted star colouring a 2-degenerate bipartite graph with the minimum number of colours is NP-hard to approximate within $n^{(1/3)-\epsilon}$. On the positive side, we design (i) a linear-time algorithm to test 3-rs colourability of trees, and (ii) an $O(n^3)$-time algorithm to test 3-rs colourability of chordal graphs.

The complexity of restricted star colouring

Article

Full-text available

May 2021
DISCRETE APPL MATH

Restricted star colouring is a variant of star colouring introduced to design heuristic algorithms to estimate sparse Hessian matrices. For k∈N, a k-restricted star colouring (k-rs colouring) of a graph G is a function f:V(G)→{0,1,…,k−1} such that (i) f(x)≠f(y) for every edge xy of G, and (ii) there is no bicoloured 3-vertex path (P3) in G with the higher colour on its middle vertex. We show that for k≥3, it is NP-complete to test whether a given planar bipartite graph of maximum degree k and arbitrarily large girth admits a k-rs colouring, and thereby answer a problem posed by Shalu and Sandhya (2016). In addition, it is NP-complete to test whether a 3-star colourable graph admits a 3-rs colouring. We also prove that for all ϵ>0, the optimization problem of restricted star colouring a 2-degenerate bipartite graph with the minimum number of colours is NP-hard to approximate within n13−ϵ. On the positive side, we design (i) a linear-time algorithm to test 3-rs colourability of trees, and (ii) an O(n3)-time algorithm to test 3-rs colourability of chordal graphs.

Hardness transitions and uniqueness of acyclic colouring

Article

Mar 2024
DISCRETE APPL MATH

Star Chromatic Number of the Edge Corona of Two Graph

Conference Paper

Full-text available

Mar 2023

Acyclic Coloring of Certain Graphs

Article

Jan 2023

A graph G with acyclic coloring has no two adjacent vertices with the same color and no bichromatic cycle. Also, the coloring results in a forest when any two-color classes are combined. The concept of acyclic coloring plays a pivotal role in the computation of Hessians, Kekule structures classification, coding theory, and statistical mechanics. In this paper, the acyclic chromatic number of generalized fan graph, generalized Möbius ladder graph and flower snark graph have been determined.

Unified Modeling for Emulating Electric Energy Systems: Toward Digital Twin That Might Work

Chapter

Sep 2022

Large-scale computing, including machine learning (MI) and AI, offer a great promise in enabling sustainability and resiliency of electric energy systems. At present, however, there is no standardized framework for systematic modeling and simulation of system response over time to different continuous- and discrete-time events and/or changes in equipment status. As a result, there is generally a poor understanding of the effects of candidate technologies on the quality and cost of electric energy services. In this chapter, the authors discuss a unified, physically intuitive multi-layered modeling of system components and their mutual dynamic interactions. The fundamental concept underlying this modeling is the notion of interaction variables whose definition directly lends itself to capturing modular structure needed to manage complexity. As a direct result, the same modeling approach defines an information exchange structure between different system layers, and hence can be used to establish structure for the design of a dedicated computational architecture, including AI methods.

Star chromatic number of some graph products

Article

Oct 2022

A star coloring of a graph [Formula: see text] is a proper vertex coloring in which every path on four vertices uses at least three distinct colors. Equivalently, in a star coloring, the induced subgraphs formed by the vertices of any two colors have connected components that are star graphs. A graph [Formula: see text] is [Formula: see text]- star-colorable if there exists a star coloring of [Formula: see text] from a set of [Formula: see text] colors. The minimum positive integer [Formula: see text] for which [Formula: see text] is [Formula: see text]-star-colorable is the star chromatic number of [Formula: see text] and is denoted by [Formula: see text]. In this paper, upper and lower bounds are presented for the star chromatic number of the rooted product, hierarchical product, and lexicographic product.

The Complexity of Star Colouring in Bounded Degree Graphs and Regular Graphs

Chapter

Jan 2022

A k-star colouring of a graph G is a function f:V(G)→{0,1,⋯,k-1} such that f(u)≠f(v) for every edge uv of G, and G does not contain a 4-vertex path bicoloured by f as a subgraph. For k∈N, the problem k-Star Colourability takes a graph G as input and asks whether G is k-star colourable. By the construction of Coleman and Moré (SIAM J. Numer. Anal., 1983), for all k≥3, k-Star Colourability is NP-complete for graphs of maximum degree d=k(k-1+⌈k⌉). For k=4 and k=5, the maximum degree in this NP-completeness result is d=20 and d=35 respectively. We reduce the maximum degree to d=4 in both cases: i.e., 4-Star Colourability and 5-Star Colourability are NP-complete for graphs of maximum degree four. We also show that for all k≥3 and d

Packing, partitioning, and covering symresacks

Article

Mar 2020
DISCRETE APPL MATH

Christopher Hojny

In this paper, we consider symmetric binary programs that contain set packing, partitioning, or covering inequalities. To handle symmetries as well as set packing, partitioning, or covering constraints simultaneously, we introduce constrained symresacks which are the convex hulls of all binary points that are lexicographically not smaller than their image w.r.t. a coordinate permutation and which fulfill packing, partitioning, or covering constraints. We show that linear optimization problems over constrained symresacks can be solved in cubic time. Furthermore, we derive complete linear descriptions of constrained symresacks for particular classes of symmetries. These inequalities can then be used as strong symmetry handling cutting planes in a branch-and-bound procedure. Numerical experiments show that we can benefit from incorporating set packing, partitioning, or covering constraints into symmetry handling inequalities.

Optimal Approximation of Sparse Hessians and Its Equivalence to a Graph Coloring Problem

Article

Full-text available

Jun 1983

S. Thomas Mccormick

We consider the problem of approximating the Hessian matrix of a smooth non-linear function using a minimum number of gradient evaluations, particularly in the case that the Hessian has a known, fixed sparsity pattern. We study the class of Direct Methods for this problem, and propose two new ways of classifying Direct Methods. Examples are given that show the relationships among optimal methods from each class. The problem of finding a non-overlapping direct cover is shown to be equivalent to a generalized graph coloring problem—the distance-2 graph coloring problem. A theorem is proved showing that the general distance-k graph coloring problem is NP-Complete for all fixedk≥2, and hence that the optimal non-overlapping direct cover problem is also NP-Complete. Some worst-case bounds on the performance of a simple coloring heuristic are given. An appendix proves a well-known folklore result, which gives lower bounds on the number of gradient evaluations needed in any possible approximation method.

Coloring with no $2$-Colored $P_4$'s

Article

Mar 2004
ELECTRON J COMB

A proper coloring of the vertices of a graph is called a star coloring if every two color classes induce a star forest. Star colorings are a strengthening of acyclic colorings, i.e., proper colorings in which every two color classes induce a forest. We show that every acyclic $k$-coloring can be refined to a star coloring with at most $(2k^2-k)$ colors. Similarly, we prove that planar graphs have star colorings with at most 20 colors and we exhibit a planar graph which requires 10 colors. We prove several other structural and topological results for star colorings, such as: cubic graphs are $7$-colorable, and planar graphs of girth at least $7$ are $9$-colorable. We provide a short proof of the result of Fertin, Raspaud, and Reed that graphs with tree-width $t$ can be star colored with ${t+2\choose2}$ colors, and we show that this is best possible.

Hardness of Approximations; in: Approximation Algorithms for NP-Hard Problems

Article

Jan 1996

Acyclic colorings of planar graphs

Article

Dec 1979
DISCRETE MATH

O.V. Borodin

The conjecture of B. Grünbaum on existing of admissible vertex coloring of every planar graph with 5 colors, in which every bichromatic subgraph is acyclic, is proved and some corollaries of this result are discussed in the present paper.

Introduction To Algorithms

Book

Jan 2001
J OPER RES SOC

Estimation of Sparse Jacobian Matrices and Graph Coloring Blems

Article

Feb 1983

Given a mapping with a sparse Jacobian matrix, the problem of minimizing the number of function evaluations needed to estimate the Jacobian matrix by differences is investigated. This problem can be attacked as a graph coloring problem and this approach leads to very efficient algorithms. The behavior of these algorithms is studied and, in particular, it is proved that two of the algorithms are optimal for band graphs. Numerical evidence is presented which indicates that these two algorithms are nearly optimal on practical problems.

On the Estimation of Sparse Hessian Matrices

Article

Dec 1979
SIAM J NUMER ANAL

This paper studies automatic procedures for estimating second derivatives of a real valued function of several variables. The estimates are obtained from differences in first derivative vectors, and it is supposed that the required matrix is sparse and that its sparsity structure is known. Our main purpose is to find ways of taking advantage of the sparsity structure and the symmetry of the second derivative matrix, in order to make small the number of first derivative vectors that have to be calculated. Two new algorithms are proposed, which seem to be very successful in practice and which do not require much computer arithmetic. One is a direct method and the other is a substitution method, these terms being explained in the paper. Some examples show, however, that the given methods may not minimize the number of first derivative vector calculations.

Estimation of sparse Jacobian matrices and graph coloring problems

Article

Jan 1984
SIAM J NUMER ANAL

The Cyclic Coloring Problem and Estimation of Sparse Hessian Matrices

Article

Apr 1986

Numerical optimization algorithms often require the (symmetric) matrix of second derivatives, $\nabla ^2 f( x )$. If the Hessian matrix is large and sparse, then estimation by finite differences can be quite attractive since several schemes allow for estimation in much fewer than n gradient evaluations. The purpose of this paper is to analyze, from a combinatorial point of view, a class of methods known as substitution methods. We present a concise characterization of such methods in graph-theoretic terms. Using this characterization, we develop a complexity analysis of the general problem and derive a roundoff error bound on the Hessian approximation. Moreover, the graph model immediately reveals procedures to effect the substitution process optimally (i.e. using fewest possible substitutions given the differencing directions) in space proportional to the number of nonzeros in the Hessian matrix.

CHARMM: A Program for Macromolecular Energy, Minimization, and Dynamics Calculations

Article

Sep 2004
J COMPUT CHEM

CHARMM (Chemistry at HARvard Macromolecular Mechanics) is a highly flexible computer program which uses empirical energy functions to model macromolecular systems. The program can read or model build structures, energy minimize them by first- or second-derivative techniques, perform a normal mode or molecular dynamics simulation, and analyze the structural, equilibrium, and dynamic properties determined in these calculations. The operations that CHARMM can perform are described, and some implementation details are given. A set of parameters for the empirical energy function and a sample run are included.

New Acyclic and Star Coloring Algorithms with Application to Computing Hessians

Abstract and Figures

Recommended publications

Ecien t Computation of Sparse Hessians using Coloring and Automatic Dieren tiation

Efficient Computation of Sparse Hessians Using Coloring and Automatic Differentiation

The Optimal Computation of Hessians in Optimization

A Parallel Distance-2 Graph Coloring Algorithm for Distributed Memory Computers