ArticlePDF Available

Effective nuclei segmentation with sparse shape prior and dynamic occlusion constraint for glioblastoma pathology images

Authors:

Abstract

We propose a segmentation method for nuclei in glioblastoma histopathologic images based on a sparse shape prior guided variational level set framework. By spectral clustering and sparse coding, a set of shape priors is exploited to accommodate complicated shape variations. We automate the object contour initialization by a seed detection algorithm and deform contours by minimizing an energy functional that incorporates a shape term in a sparse shape prior representation, an adaptive contour occlusion penalty term, and a boundary term encouraging contours to converge to strong edges. As a result, our approach is able to deal with mutual occlusions and detect contours of multiple intersected nuclei simultaneously. Our method is applied to several whole-slide histopathologic image datasets for nuclei segmentation. The proposed method is compared with other state-of-the-art methods and demonstrates good accuracy for nuclei detection and segmentation, suggesting its promise to support biomedical image-based investigations.
Effective nuclei segmentation with
sparse shape prior and dynamic
occlusion constraint for glioblastoma
pathology images
Pengyue Zhang
Fusheng Wang
George Teodoro
Yanhui Liang
Mousumi Roy
Daniel Brat
Jun Kong
Pengyue Zhang, Fusheng Wang, George Teodoro, Yanhui Liang, Mousumi Roy, Daniel Brat, Jun Kong,
Effective nuclei segmentation with sparse shape prior and dynamic occlusion constraint for
glioblastoma pathology images,J. Med. Imag. 6(1), 017502 (2019),
doi: 10.1117/1.JMI.6.1.017502.
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
Effective nuclei segmentation with sparse
shape prior and dynamic occlusion constraint
for glioblastoma pathology images
Pengyue Zhang,aFusheng Wang,bGeorge Teodoro,cYanhui Liang,dMousumi Roy,aDaniel Brat,eand
Jun Kongf,g,*
aStony Brook University, Department of Computer Science, Stony Brook, New York, United States
bStony Brook University, Department of Biomedical Informatics and Computer Science, Stony Brook, New York, United States
cUniversity of Brasìlia, Department of Computer Science, Brasìlia, Brazil
dGoogle Inc., Mountain View, California, United States
eNorthwestern University, Department of Pathology, Chicago, Illinois, United States
fEmory University, Department of Computer Science and Biomedical Informatics, Atlanta, Georgia, United States
gGeorgia State University, Department of Mathematics and Statistics, Atlanta, Georgia, United States
Abstract. We propose a segmentation method for nuclei in glioblastoma histopathologic images based on
a sparse shape prior guided variational level set framework. By spectral clustering and sparse coding, a set
of shape priors is exploited to accommodate complicated shape variations. We automate the object contour
initialization by a seed detection algorithm and deform contours by minimizing an energy functional that incor-
porates a shape term in a sparse shape prior representation, an adaptive contour occlusion penalty term, and
a boundary term encouraging contours to converge to strong edges. As a result, our approach is able to deal
with mutual occlusions and detect contours of multiple intersected nuclei simultaneously. Our method is applied
to several whole-slide histopathologic image datasets for nuclei segmentation. The proposed method is com-
pared with other state-of-the-art methods and demonstrates good accuracy for nuclei detection and segmenta-
tion, suggesting its promise to support biomedical image-based investigations. ©The Authors. Published by SPIE under a
Creative Commons Attribution 4.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original
publication, including its DOI. [DOI: 10.1117/1.JMI.6.1.017502]
Keywords: nuclei segmentation; level set; sparse representation; graph learning; spectral clustering.
Paper 18262R received Dec. 5, 2018; accepted for publication Feb. 19, 2019; published online Mar. 14, 2019.
1 Introduction
Image segmentation is a fundamental problem in computer
vision and image analysis. In the image segmentation commu-
nity, level set-based approaches are important tools, because
they are able to handle nuclei shapes and contours with complex
variations. Chan and Vese initiated a region-based active contour
model with a level set formulation based on Mumford-Shahs
functional.1,2This model does not depend on gradient informa-
tion and thus can detect nuclei contours with weak edges.
However, ChanVese model does not include prior shape
knowledge to restrain shapes in appearance and may, therefore,
detect meaningless shapes. This drawback becomes more seri-
ous when nuclei are partially occluded, corrupted, or repre-
sented in low contrast imaging data. In the literature, some
research work emerged to mitigate this problem by incorporat-
ing shape prior information into the level set formulation so
that the detected shape can be regulated by a selected reference
shape.35For example, Chan and Zhu6proposed a variational
model introducing a shape difference term with a shape prior
as the reference shape.1With prior information in place, shapes
similar to the shape prior can be successfully segmented. Mean-
while, nuclei presenting meaningless shapes are restrained.
Based on the shape prior segmentation model, Yan et al.7intro-
duced a geodesic length term to drive contours to nuclei edges.
Ali and Madabhushi8proposed an active contour model to
accommodate complex shapes by learning shape priors through
principal component analysis.
Nevertheless, shape prior-based level set segmentation meth-
ods are still challenged by multiple problems. (1) It is difficult to
segment nuclei from raw images without knowing the number
and position of nuclei. Thus, one important step in this class of
segmentation approaches is to appropriately identify number
and positions of nuclei of interest for initialization purpose.
(2) Thus far, shape prior-based segmentation approaches exploit
libraries consisting of a small number of shape priors as refer-
ence to nuclei in similar shapes. However, in most real-world
scenarios, such as nuclei segmentation, it could be very complex
to model shape variations explicitly. Therefore, it is practically
difficult to find a limited number of shape priors that could re-
present all shapes reasonably well. (3) Simultaneous segmenta-
tion of mutually occluded nuclei remains challenging. Recent
development in level set methods introduced a repulsion term
to prevent two nuclei from becoming identical through evolu-
tion. However, it is common to have more than two nuclei
involved in mutual occlusion. As a result, we conceive that
larger penalties should be given to regions shared by more
nuclei.
Nuclei initialization is an important step that guides the
following modules in segmentation. The geometric centers of
nuclei, so-called seeds, are natural nuclei indicators in practice,
as nuclei contours can be appropriately initialized with them.
Parvin et al.9proposed an iterative voting method for detect-
ing seeds of overlapping nuclei. Al-Kofahi et al.10 proposed a*Address all correspondence to Jun Kong, E-mail: jkong@gsu.edu
Journal of Medical Imaging 017502-1 JanMar 2019 Vol. 6(1)
Journal of Medical Imaging 6(1), 017502 (JanMar 2019)
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
multiscale Laplacian of Gaussian (LoG) filtering method for
automatic detection and segmentation of nuclei. By distance-
map-based adaptive scale selection, nuclear seed points were
detected to perform an initial segmentation. However, the multi-
scale LoG method is sensitive to minor peaks on the distance
map, resulting in oversegmentation and detection. Qi et al.11
improved the method by applying a shifted Gaussian kernel and
mean shift to a single-pass voting algorithm. The basic idea is to
search for the voting direction over a cone-shaped voting area.
The two-dimensional (2-D) Gaussian kernel was designed to
assign larger weights for area centers in the voting process.
Finally, mean shift was applied to determine the seed points.
To accommodate the variation in nuclei shapes, a large set of
shape priors can be used as a reference library. However, it is
challenging to model the relationship between the nuclei of
interest to be segmented and the shape priors. Representation
learning methods, such as graph learning and sparse coding,
were successfully applied in many areas of computer vision
research, such as face recognition,12 image restoration,13 graph
learning,14 and medical image analysis.15 Wright et al.12 pro-
posed a robust face recognition algorithm based on sparse
representation. The method can effectively handle occlusion and
corruption errors by enforcing sparse distribution with respect
to the pixel/sample basis. Zhang et al.15 proposed a sparse shape
composition model to incorporate shape prior information for
detection and segmentation tasks in radiologic images. Cheng
et al.14 introduced a process to build L1graph and multiple
algorithms for data clustering, subspace learning, and semi-
supervised learning based on L1graph. Inspired by theories
in graph learning, spectral clustering, and sparse representation,
an L1graph-based shape proximity was learned to cluster the
shape priors with which a compact dictionary was created for
sparse coding. Given a shape of interest, it can be approximated
by a sparse representation based on the compact dictionary
within the level set framework. The resulting reconstruction
error can be included as a shape penalty term in the variational
level set model.
To date, numerous investigators have devised methods to
segment overlapping nuclei. These methods extended the shape
prior segmentation approaches to segment multiple similar
nuclei under mutual occlusion. In Refs. 1618, various repulsive
force terms were proposed and included in the cost functional to
penalize two overlapping nuclei contours and prevent them from
becoming identical. In our approach, we extend this idea by
introducing an adaptive occlusion penalty term that penalizes
regions occupied by multiple nuclei based on the scope of occlu-
sion. Following this principle, we assign a larger penalty to a
region overlapped by more nuclei.
Recently, deep learning-based approaches greatly improved
the state-of-the-art in research fields, such as computer vision,
speech recognition, and natural language processing. Deep
learning methods require large-scale annotated data to preserve
generative power and prevent overfitting. However, collecting
both data and annotation for medical imaging tasks is time-
consuming and requires much domain expertise. Specifically,
a large number of bounding boxes or shape masks of nuclei
are required to train the network for deep learning-based nuclei
detection and segmentation methods. Compared to the formi-
dable data scale required by deep learning methods, the only
prior information used in our method is a set of representative
shapes in vector representations with only nuclei contour co-
ordinates.
In this paper, we propose a level set-based variational model
that allows simultaneous segmentation of multiple nuclei with
mutual occlusion. The main contributions of our work are
summarized as follows:
A seed detection algorithm is proposed to automate nuclei
identification and contour initialization. The proposed
seed detection algorithm utilizes joint information of
spatial connectivity, distance constraint, image edge map,
and a shape-based voting map derived from eigenvalue
analysis of Hessian matrix across multiple scales.
We use a list of contour co-ordinates to represent each
annotated nuclei contour. The resulting shape annotation
representations are then clustered with an L1graph-based
manifold learning method to establish a concise and rep-
resentative shape prior library.
A sparse representation-based shape term is introduced to
better guide nuclei contour convergence with shape prior
information from the shape library.
A dynamic nuclei occlusion term is proposed to dynami-
cally deal with occlusion events involving a variable num-
ber of nuclei.
We learn and address the nuclei detection and segmenta-
tion problem in an unsupervised manner with no explicit
training process in our approach. The only prior informa-
tion we used is the shape library consisting of shape priors
from other glioblastoma histopathology images.
Our approach is successfully applied to two datasets of
glioblastoma histopathology images for nuclei segmenta-
tion task. Our extensive experiments demonstrate that the
proposed method can appropriately handle nuclei mutual
occlusions and accurately differentiate nuclei from other
structures.
As shown in Fig. 1, our method framework consists of a
seed detection algorithm for nuclei contour initialization, and an
integrated contour deformable model that incorporates region,
shape, and boundary information. The final nuclei contours are
converged through an iterative contour evolution process.
This journal paper extends our earlier work19,20 through sub-
stantial method improvement on shape prior library generation,
more comprehensive experiments, more in-depth parameter sen-
sitivity analysis. In detail, these important extensions include:
We include an extensive set of annotated nuclei shapes
to include more shape variation information. We propose
a way to dynamically construct shape prior dictionary by
applying an L1graph-based manifold learning method.
With the approach, representative shape priors covering
most shape variations can be automatically produced in
a data-driven manner.
We provide an in-detail theoretical derivation for level set
functional optimization and offer a solution to optimize
the loss function.
The proposed method is evaluated on two datasets, namely
GBM40 dataset and TCGA dataset. The GBM40 dataset
consists of glioblastoma tissues from Emory University
Hospital archive and TCGA dataset is a publicly avail-
able dataset used for method robustness test and efficacy
Journal of Medical Imaging 017502-2 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
validation. Therefore, we significantly extend our dataset
scale for testing and validating the robustness and accuracy
of our proposed method in this journal work.
We present results from our method for seed detection
and nuclei segmentation in different image regions and
compare our method with other state-of-the-art methods.
In addition, we design multiple analyses with various
parameter settings to investigate behaviors of individual
terms included in our method. Demonstrated by these
extensive tests, the proposed method has manifested
promising robustness and accuracy for nuclei detection
and segmentation.
2 Shape Representation
Let us consider an image that contains Nnuclei of interest
O1;O2;:::;ON, where each nuclei Oihas a closed and
bounded shape contour Oiin image domain ΩR2. The basic
idea of the level set framework is to implicitly represent Oias
a zero level set of a Lipschitz function ϕiΩR.ϕiðxÞhas
positive and negative value when xis inside and outside Oi,
respectively. Note that due to memory limit, we use image
patches instead of whole slide images in our experiments.
2.1 Distance Map Function
A distance map function Γ½ϕiðxÞ represents the shortest dis-
tance dðx;OiÞfrom a current pixel xin the image domain
Ωto a given nuclei contour Oi¼fxjϕiðxÞ¼0;xΩg:
EQ-TARGET;temp:intralink-;e001;326;521Γ½ϕiðxÞ ¼ 8
<
:
0;x∈∂Oi
dðx;OiÞ;xOi
dðx;OiÞ;otherwise
:(1)
In this way, any nuclei in the image domain can be represented
as a distance map or vice versa. Given a distance map, the cor-
responding nuclei contour can be recovered by searching for the
zero-valued pixel locations. As 2-D images only capture 2D pro-
jections of 3D nuclei, it is not uncommon to observe overlapped
nuclei. Instead of partitioning an image into disjoint regions, we
allow a pixel to be associated with multiple nuclei in an image
with intersecting contours. Therefore, the corresponding level
set functions of all those intersecting nuclei have positive values
over the overlapped regions.
2.2 Shape Prior Modelling
Instead of a single shape prior by previous shape prior segmen-
tation methods, we use a large set of shape priors to deal with
the complex shape variation observed in most real-world appli-
cations. Training shape priors are manually extracted from
raw images and aligned with generalized Procrustes analysis,21
as illustrated in Fig. 2. The resulting shapes are represented
by a set of vectors of uniformly sampled local landmarks:
fγ1;γ2;:::;γKg, where Kis the size of the shape prior set.
For better computational efficiency, the shape priors are par-
titioned into Mclusters, where MK. We cluster the shape
priors with a learned L1graph learned by solving the minimi-
zation problem for each γi:14
Shape Dictionary
Integrated Cost
Functional
Whole-slide image
Shape Prior
Dataset
Dynamic
Occlusion
Term
Sparse Shape
Term
Stable Term
Smooth Term
Image Patch Initialization
Seed
Detection
Patch
Extraction
Sparse
Manifold
Learning
Contour
Deformation
Iteration 10
Contour
Deformation
Iteration 20
Contour
Deformation
Final Contour
Cost
Minimization
Cost
Minimization
=
Sparse Coding
Input
Dictionary
Coefficient
Fig. 1 A schema of our proposed detection and segmentation method for nuclei in histopathologic
images.
Fig. 2 Left: illustration of representative shape priors in the shape prior library. Right: Overlaid 100
shape priors. The shape priors are manually collected by pathologist and serve as shape reference.
Journal of Medical Imaging 017502-3 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
EQ-TARGET;temp:intralink-;e002;63;752^
zi¼kγiBzik2
2þλkzik1;i¼1;:::;K s:t:zii ¼0;(2)
where B¼ðγ1;γ2;:::;γK;IÞ,I; is an identity matrix, and λis
the parameter balancing the two terms. ziis the ith weighting
coefficient and zij is the jth entry of zi. Let Wbe a graph weight
matrix, where each entry Wij represents the similarity between
the ith and jth shape priors. We formulate graph matrix as
follows:
EQ-TARGET;temp:intralink-;e003;63;664Wij ¼ð^zij þ^zjiÞ2;i;j¼1;:::;K: (3)
With the graph matrix, the un-normalized spectral clustering
method22,23 is used to partition the shape priors into Mclusters.
The resulting average shape of each cluster is computed and
used as a representative shape for the cluster. For simplicity, we
denote the corresponding distance maps of the average shapes as
Ψ¼ðψ1;ψ2;:::;ψMÞ. These distance maps are normalized to
have unit Frobenius norm so that the impact of each shape prior
is balanced.
2.3 Shape Alignment
To represent a nuclei Oiwith shape priors in the training set Ψ,
we first align the associated distance map Γ½ϕiðxÞ to shape pri-
ors by rotation and translation. The transformation from a pixel
x¼½x1x2Tto the corresponding point ˜
xiin shape prior after
alignment is formulated as follows:
EQ-TARGET;temp:intralink-;e004;63;459
˜
xi¼sicosðθiÞsinðθiÞ
sinðθiÞcosðθiÞxþti;(4)
where θidenotes the rotation angle and ti¼½ti1ti2Trepresents
the translation displacement. As we observe that all shape priors
Ψhave nearly same scales, we simply set si¼1,ito achieve
better computational efficiency.
2.4 Sparse Shape Representation
Given the set of distance maps derived by mapping the ith
nuclei to shape prior set Ψð˜
xiÞ, we assume that the distance map
ΓðϕiðxÞÞ of Oican be approximately represented as a linear
composition of shape priors in Ψð˜
xiÞ. The distance maps and
shape priors are vectorized and denoted as Γ½ϕiðxÞ and Ψð˜
xiÞ¼
ðψ1;ψ2;:::;ψMÞ, respectively. By the linearity assumption,
Γ½ϕiðxÞ can be represented as follows:
EQ-TARGET;temp:intralink-;e005;63;260Γ½ϕiðxÞ ¼ Ψð˜
xiÞαiþei;(5)
where αiis a coefficient vector representing the weights of shape
priors in reconstruction and eiis the error vector.
To avoid the curse of dimensionality problem, we reduce Γ
dimensionality, leading to a less computational cost. The proc-
ess of dimension reduction can be modeled as left multiplication
by a nonzero projection matrix RRd×m,dm, where mis
the total number of pixels in an image. As proved in 12, the
choice of matrix Rdoes not critically affect the ability to recover
the sparse solution. For computation simplicity, we compose
a matrix Rwith entries randomly generated from standard
Gaussian distribution and subject to unit length for all rows.
The corresponding low dimension representations are denoted
as
ˇ
ΓðϕiÞ¼RΓðϕiÞ,
ˇ
Ψ¼RΨ, and ˇ
ei¼Rei. Thus, we have
EQ-TARGET;temp:intralink-;e006;63;85
ˇ
Γ½ϕiðxÞ ¼
ˇ
Ψð˜
xiÞαiþˇ
ei¼
ˇ
Ψð˜
xiÞαiþIˇ
ei;(6)
where IRd×dis an identity matrix and eiis the reconstruction
error. Note that for an overcomplete system ΨRd×M, the
linear system of Eq. (6) may not produce a unique solution.
Therefore, more constraints on αiand eiare needed. Similar
to face recognition applications,12 we observe that given a large
enough training set, a test shape should be sufficiently repre-
sented using a small number of similar shape priors. In addition,
the number of corrupted pixels in the original image and the
derived distance map is assumed to be small. This suggests
that the coefficient vectors αiand eican be sparse as most
entries are zero. In this way, most dissimilar prior shapes are
suppressed due to large penalty measured by the reconstruction
error, which enables us to differentiate Oifrom dissimilar shape
priors. To ensure the sparsity, we use L0norm regularization
in the formulation. The sparse representation problem is formu-
lated as follows:
EQ-TARGET;temp:intralink-;e007;326;576
^
αi;^
ei¼arg min
αi;ˇ
ei
kˇ
Γ½ϕiðxÞ ˇ
Ψð˜
xiÞαiIˇ
eik2
2
þλ1kαik0þλ2kˇ
eik0:(7)
However, the problem of finding the sparsest solution in an
underdetermined linear system such as Eq. (7) is proved to be
nondeterministic polynomial-time hard.24 Due to recent devel-
opments in sparse theory, it has been revealed that such a prob-
lem can be solved via L1relaxation:2527
EQ-TARGET;temp:intralink-;e008;326;463
^
αi;^
ei¼arg min
αi;ˇ
ei
kˇ
Γ½ϕiðxÞ ˇ
Ψð˜
xiÞαiIˇ
eik2
2
þλ1kαik1þλ2kˇ
eik1:(8)
The L1minimization problem can be solved via standard linear
programming methods. By this approach, we obtain the sparse
coefficients ^αiand ^eiand use the coefficients for sparse
reconstruction.
3 Seed Detection
In this section, we present an algorithm to recognize nuclei spa-
tial locations by nuclei seed detection. Nuclei seed detection is
essential for the follow up nuclei segmentation, as it decides the
number and initial contour locations. The proposed seed detec-
tion algorithm utilizes joint information of spatial connectivity,
distance constraint, image edge map, and a shape-based voting
map derived from eigenvalue analysis of Hessian matrix across
multiple scales.
3.1 Initialization and Preprocessing
The pathology images of tumor specimens for routine diagnos-
tics usually contain two primary chemical stains: hematoxylin
and eosin (H&E). Hematoxylin presents a dark blue or violet
color, which positively binds to nuclei. We decompose signal
intensity components of H&E stains in original images and
use only signals of hematoxylin stain for analysis. Based on
LambertBeers law, the relationship between the intensity of
light entering a specimen Liand that through a specimen Lo
can be characterized as Lo¼LieðAbÞ, where Aand bare the
amount of stain and the absorption factor, respectively. By this
law, we can compute the stain-specific strength values with a
predefined stain decomposition matrix Mand the observed
optical density (OD): Y¼logðLoLiÞ.28 The stain composi-
tion matrix Mconsists of three normalized column vectors
Journal of Medical Imaging 017502-4 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
representing OD values of red, green, and blue channels from
hematoxylin, eosin, and null stain. Therefore, stain specific opti-
cal signals can be computed by X¼M1Y. We retain the first
entry of the resulting stain specific signal vector at each pixel
and denote the decomposed hematoxylin channel signal as IH.
Note that IHis often coupled with noise produced during the
slide preparation process, including uneven tissue slide cut,
heterogeneous histology process, staining artifacts, and digital
scanning noise. We can normalize the background noise in
IHby morphological reconstruction with two morphological
components, namely mask image Iand marker image I.
Initially, we set Ias complementary image of IHand Ias
IHρr, where denotes the morphological dilation operator
and ρris a circular structural element with radius r. With marker
and mask image, an image RðI;I
Þcan be reconstructed by
iterative dilation and intersection until convergence.29 The dif-
ference image hðI;I
;ρrÞ¼IRðI;I
Þconsists of a near
zero-level background and a group of enhanced foreground
peaks, each representing an object of interest. In Fig. 4(a),we
present a typical background normalization result with morpho-
logical reconstruction.
3.2 Voting Map Construction
Assuming a typical nuclei shape is close to either a circle or
an ellipse, we proceed with a shape-based voting process by
analyzing nuclei structures within local image regions. Before
the voting process, we first enhance nuclei regions by eigenvalue
analysis with the Hessian matrix convolved with Gaussian filters
at multiple scales.30 With this approach, we search for circular
structures based on geometric structures characterized by the
neighboring pixel intensity profiles. Specifically, for a pixel at
ði0;j
0Þ, its local image intensity change can be represented by
Taylor expansion:
EQ-TARGET;temp:intralink-;e009;326;303
hσði0þδi; j0þδjÞ¼hσði0;j
0Þþðδi; δjÞDðhσði0;j
0ÞÞ
þ1
2ðδi; δjÞD2ðhσði0;j
0ÞÞðδi; δjÞT
þO½ðδ3Þ;(9)
where hσ¼hGσand Gσis a Gaussian filter with standard
deviation σ;D2ðhσði0;j
0ÞÞ is Hessian of hσði0;j
0Þthat is sym-
metric and thus diagonalizable with two resulting eigenvalues λi,
i¼1;2, where λ1λ2. To make our analysis invariant to nuclei
size, we adopt a family of Gaussian filters with distinctive scales
to convolve with hði; jÞ:Gσi,i¼1;2;···;S. As pixels close to
nuclei centers have larger intensity values than peripheral points,
signs of both eigenvalues for pixels within a nucleus are nega-
tive. Even if a nucleus is overlapped with other nuclei, such
negative eigenvalue pairs can still be identified at different scales
σiwithin an appropriately chosen scale range because signs of
such eigenvalues depend on relative pixel intensity change in
nuclei neighborhoods. Leveraging such a signature property
of neighboring pixel intensity change, the presence of nuclei,
even in clumps, can be indicated by the number of times
Fig. 3 Analysis results of typical examples of overlapped nuclei with distinct overall clump shapes are
presented at different steps. (a) Clumped nuclei instances with distinct overall shapes; (b) 3-D illustration
of complementary image of decomposed hematoxylin signal surface overlaid with original image;
(c) 3-D illustration of the difference image Ihsurface after background normalization; and (d) 3-D
illustration of voting map after eigenvalue sign analysis.
Journal of Medical Imaging 017502-5 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
Hessian matrix produces negative eigenvalue pairs across differ-
ent Gaussian filter scales. In Fig. 3, we present typical examples
of overlapped nuclei with distinct overall clump shapes after dif-
ferent analysis steps. By comparison, voting maps produced
from eigenvalue sign analysis encoding pixel intensity profile
in nuclei neighborhoods have superior contrast to complemen-
tary image of decomposed hematoxylin signal IHand the differ-
ence image Ih. It demonstrates that the voting map derived from
eigenvalue sign analysis is an effective way to characterize over-
lapped nuclei local intensity profile features regardless of overall
clump shapes. Next, we begin the voting process with a zero-
valued voting map for all pixels in the image domain. We only
increment the voting map value Vði; jÞwhen λ2ði; jÞ<0.If
λ2ði; jÞ>0, we consider pixel ði; jÞis not within a nuclei struc-
ture and hence set zero vote for such locations. This process is
repeated for all scales. A voting map overlaid with a typical his-
tological image region, where nuclei regions in dark exhibit high
votes, is presented in Fig. 4(b).
3.3 Dynamic Seeds Detection and Merging
Given the derived voting map, we dynamically adjust a seed list
based on candidate spatial connectivity, distance constraint, and
image edge map. The proposed method can produce robust and
accurate seed detection result, especially for overlapped nuclei.
As any pixel on the voting map is no larger than the number
of scales S, we consider the voting map as a surface in a three-
dimensional space, as shown in Fig. 4(b). Those strong voting
peaks, representing consistent negative eigenvalue pairs from
Hessian matrix at different smoothing scales due to radially out-
ward decreasing pixel intensity profiles in local regions, suggest
the presence of nuclei. The strong peaks on the voting surface
can be detected as we gradually slide down an imaginary hori-
zontal plane (e.g., from the blue to green plane) intersecting with
the voting surface. We can generate a binary image from the
original voting map with threshold vfor each intersection plane.
We begin with an empty seed list Land append to it the resulting
centroids of voting peaks satisfying all the following conditions:
(a) such voting peak centers do not exist in the peak list Lyet.
(b) They come from strong peaks that suggest consistent local
intensity change property, a key to a reliable nuclei detection.
Such strong peaks can be identified with their sizes no less
than area threshold AðvÞ¼ηþexp½ρfðvÞ, where ρand ηare
predefined scalars that determine the lower area threshold for
detected peaks in the voting map at each step and fðvÞ¼
vmaxv
vmaxvmin is a function normalizing the voting value v; specifi-
cally, ηþ1is the lowest peak cross-section size expectation
when the imaginary horizontal plane is the highest. ρdetermines
how peak areas are expected to grow as the imaginary horizontal
plane slides down. Their choices depend on the nuclei texture
homogeneity and nuclei size. We choose ηto be small enough
not to miss any strong peaks, and ρbig enough to represent the
rapid growth of strong voting peak cross-sections as the imagi-
nary horizontal plane slides down. (c) Such points are within the
foreground region in the binary mask detected by the adaptive
thresholding method.
With a list of seed candidates, we perform seed pruning to
eliminate false positive seeds. We compute the pairwise distan-
ces for all peaks in Land iteratively merge adjacent centroids
within distance threshold D1. This is followed by a second
round of distance-based merging with a relaxed threshold D2
(D1<D2). In the second round, two peaks are merged only
if the following conditions are true: (a) distance between these
two points is less than D2and (b) the path connecting a pair of
points does not intersect with any canny edge derived from the
original image. The edge map is used to prevent centroids of
closely clumped nuclei from being merged. With this seed merg-
ing process, we can retain seeds from true clumped nuclei in
histological images without tedious parameter tuning process.
4 Occluded Nuclei Segmentation
With nuclei detected in Sec. 3, we further develop an improved
level set based segmentation method to drive initialized nuclei
contours to true nuclei edges. Our development is driven by the
use case of automatic analysis of occluded nuclei in whole-slide
histopathologic images. In our method, each nuclei is described
Fig. 4 Voting map derived from a typical image region is demonstrated. (a) Top: the deconvolved hema-
toxylin channel, eosin channel, marker image; bottom: reconstructed, difference, and voting map surface
overlaid with original image. (b) Illustration of the voting peak detection by sliding down an imaginary
horizontal plane that intersects with the voting surface.
Journal of Medical Imaging 017502-6 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
by a level set function and we aim to simultaneously obtain all
Nlevel set functions by optimizing a variational model.
4.1 Prior Information Integration
As described in Sec. 2.4, given a large enough set of shape priors
Ψ¼fψjg, a shape of interest ϕican be encoded as sparse linear
superposition of shape priors. Thus, we define a shape term as
follows:
EQ-TARGET;temp:intralink-;e010;63;656E1¼X
N
i¼1ZΩΓ½ϕiðxÞ X
M
j¼1
ψjð˜
xiÞ^
αij2
dx; (10)
where ^
αij is the jth entry of ^
αi. This shape prior term describes
the difference between the target distance map and the best
sparsely reconstructed distance map derived from shape priors.
Through this optimization process, the target distance map
ΓðϕiÞevolves to fit itself to the feature space spanned by the
shape priors.
4.2 Adaptive Penalty on Mutual Occlusion
It is common to have mutually occluded nuclei in 2-D histopa-
thologic images due to the fact that these images represent 2-D
projected signals of tissue nuclei in 3-D space. It is a challenging
task to segment mutually occluded nuclei and identify hidden
boundaries across each other. This problem becomes exponen-
tially complicated when there are more than two nuclei involved
in occlusion. In the level set framework, intersecting nuclei may
all have positive function values after contour evolutions, mak-
ing it difficult to differentiate them from each other. To address
this problem, we introduce an adaptive occlusion penalty term to
dynamically suppress nuclei intersection events. The occlusion
penalty term is determined by the number of nuclei that are
overlapped. Meanwhile, this term prevents deformable contours
from becoming identical after iterations of evolution. Speci-
fically, we define the adaptive occlusion penalty term as follows:
EQ-TARGET;temp:intralink-;e011;63;344E2¼ZΩX
N
i¼1
H½ϕiðxÞ 1Y
N
k¼1
ð1HðϕkðxÞÞÞdx; (11)
where Hð·Þis the Heaviside function. The first term in Eq. (11)
is a superposition of the Heaviside functions of all nuclei at pixel
xand the second term is a binary function indicating if pixel x
is inside any nuclei. This penalty term describes the extent of
image regions occupied by multiple nuclei and the associated
number of intersecting nuclei in such regions. Equivalently, it
represents a dynamic occlusion penalty power on overlapped
nuclei.
4.3 Edge Guided Contour and Evolution Stability
To further encourage contour convergence and retain contour
smoothness, we define an edge-guided contour regularity term
as follows:
EQ-TARGET;temp:intralink-;e012;63;143E3¼X
N
i¼1ZΩ
QðxÞjH½ϕiðxÞjdx; (12)
where QðxÞ¼expðjIðxÞj2
2σ2Þis an exponential function mono-
tonously decreasing as the image magnitude of gradient at x
increases and jH½ϕiðxÞj is nonzero only on nuclei boundaries.
As a result, this term drives contours to strong edge locations
and helps to regulate smoothness of nuclei boundaries.
Additionally, we include an evolutionary stability term31 to
regulate the property of level set function as follows:
EQ-TARGET;temp:intralink-;e013;326;708E4¼X
N
i¼1ZΩ
RðjϕiðxÞjÞdx; (13)
where RðxÞis defined as a double-well potential function:
EQ-TARGET;temp:intralink-;e014;326;648RðxÞ¼(1
ð2πÞ2½1cosð2πxÞ;if x1
1
2ðx1Þ2;if x1:(14)
This term in Eq. (13) is used to retain signed distance property
for stable level set function computation.
4.4 Improved Variational Level Set Formulation
By combining terms in Eqs. (10)(13), we formulate our method
with a variational level set framework.1To evolve nuclei con-
tours to desired nuclei boundaries, we minimize the following
functional:
EQ-TARGET;temp:intralink-;e015;326;500
EðC; Φ;Θ;TÞ¼ZΩ
Lðx; Φ;C;Θ;TÞdx¼Ecv þX
4
i¼1
Ei
¼λoX
N
i¼1ZΩ
ðIciÞ2H½ϕiðxÞdx
þλbZΩ
ðIcbÞ2Y
N
i¼1
½1HðϕiðxÞÞdx
þνX
N
i¼1ZΩΓðϕiðxÞÞ X
M
j¼1
ψjðxiÞ^
αij2
dx
þωZΩX
N
i¼1
H½ϕiðxÞ 1Y
N
k¼1
ð1HðϕkðxÞÞÞdx
þμX
N
i¼1ZΩ
QðxÞjH½ϕiðxÞjdx
þξX
N
i¼1ZΩ
RðjϕiðxÞjÞdx; (15)
where fcigand cbare scalars that define the average pixel inten-
sities in regions of nuclei and background, respectively. We
denote C¼fci;c
bg,Φ¼fϕig,Θ¼fθig, and T¼fti1;t
i2g,
i¼1;2;:::;N as sets of variables of the energy functional.
Parameters fλo;λb;μ;ξ;ω;νgare weights of different terms
in the functional.
5 Numerical Computations
In this section, we present numerical computations to minimize
the functional in Eq. (15). We optimize the functional iteratively
by updating functions Φand variables fC; Θ;Tgalternatively.
We begin with updating functions Φfirst. Parameterizing iter-
ation as an artificial time variable t>0, we minimize the func-
tional by solving EulerLagrange equation based on theory of
calculus of variations:
Journal of Medical Imaging 017502-7 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
EQ-TARGET;temp:intralink-;e016;63;752
ϕi
t¼E
ϕi
¼Lðx;ϕiÞ
ϕi
X
2
j¼1
xjLðx;ϕiÞ
ϕi
xj
¼λoðIciÞ2δðϕiÞ
þλbðIcbÞ2Y
N
k¼1;k¼i
ð1HðϕkðxÞÞÞδðϕiÞ
þμδðϕiÞQðxÞ·ϕi
jϕijþQðxÞdivϕi
jϕij
þξdivR0ðjϕi
jϕijϕi
ωδðϕiÞ1Y
N
k¼1;k¼i
ð1HðϕkðxÞÞÞ
2νΓðϕiðxÞÞ X
M
j¼1
ψjð˜
xiÞ^
αij;(16)
where δð·Þ¼H0ð·Þdenotes the Dirac delta function.
Next, we fix Φand update transformation parameters. We
derive updating equations for transformation parameters by
computing gradient descent of functional EðΘ;TÞ:
EQ-TARGET;temp:intralink-;e017;63;486
θi
t¼2νZΩΓðϕiðxÞÞ X
M
j¼1
ψjð˜
xiÞ^
αij
X
M
j¼1
ð˜
xiψj·θi
˜
xiÞ^
αijdx;(17)
where ˜
xiψjhψj
˜
xi1
ψj
˜
xi2iTand θi
˜
xi¼
siðx1sinðθiÞþx2cosðθiÞÞ
siðx1cosðθiÞx2sinðθiÞÞ
EQ-TARGET;temp:intralink-;e018;63;357
tik
t¼2νZΩΓðϕiðxÞÞ X
M
j¼1
ψjð˜
xiÞ^
αij
X
M
j¼1
ð˜
xiψj·tik
˜
xiÞ^
αijdx; (18)
where k¼1or 2:
EQ-TARGET;temp:intralink-;e019;63;260tik
˜
xi¼½10T;if k¼1
½01T;if k¼2:(19)
Finally, we fix Φ, and fΘ;Tg, and update ciand cbby setting
the partial derivative of Eðci;c
bÞwith respective to ciand cband
solving these equations, respectively. The optimal values turn
out to be the average image intensities in corresponding area:1
EQ-TARGET;temp:intralink-;e020;63;170
Eðci;c
bÞ
ci
¼0ci¼RΩIðxÞHðϕiÞdx
RΩHðϕiÞdx;(20)
EQ-TARGET;temp:intralink-;e021;63;113
Eðci;cbÞ
cb
¼0cb¼RΩIðxÞQN
i¼1½1HðϕiðxÞÞdx
RΩQN
i¼1½1HðϕiðxÞÞdx:(21)
In this way, this minimization problem is solved by iterative
computation of EulerLagrange equation and gradient descent
approach until the functional is converged. The zero level sets of
the converged functions Φindicate the final nuclei contours.
6 Experimental Results
6.1 Dataset and Parameter Setting
We present experimental results of our algorithm for analysis
of nuclei within histopathologic images of glioblastoma brain
tumor specimens. The effectiveness of our algorithm is verified
with two datasets of H&E stained GBM specimens captured at
40×magnification, namely GBM40 dataset and TCGA FFPE
dataset: http://cancergenome.nih.gov/. These images are ob-
tained after glioblastoma brain tissues are processed with a tis-
sue preparation protocol. Image patches with size 512 ×512 are
used for experiments due to memory limit. For GBM40 dataset,
there are 5396 and 1849 manually annotated nuclei for seed
detection and boundary delineation, respectively. For TCGA
dataset, the total number of annotated nuclei for boundary seg-
mentation is 4961. Shape profiles of 27,000 glioblastoma brain
nuclei are manually extracted from the dataset to form the set of
shape priors. All shape priors are aligned with generalized
Procrustes analysis.21 As discussed in Secs. 2.1 and 2.2, we re-
present shapes as distance maps, cluster shape priors with an L1
graph into Mgroups and use one representative shape from each
group to form a training shape dictionary Ψfor sparse represen-
tation. We apply the proposed method to datasets with the fol-
lowing parameter setup: r¼10,σi¼f3.3;3.6;:::;10g,D1¼
15,D2¼25,λo¼λb¼1,μ¼5000,ξ¼2,ω¼2000,ν¼
3000,M¼100,η¼10,ρ¼10. Note that our approach is
an image data driven process. Therefore, the scanner setting such
as magnification factor does not have significant impact on the
parameter setting. As seeds are detected in the voting map that is
produced by eigenvalue analysis of image local Hessian matrix
associated with a set of Gaussian filter scales, transforming from
the original image to voting map can partially help take care of
nuclei scale variations. In our study, this set of Gaussian filter
scales is chosen to cover varying nuclei sizes, with 3 maxiðσiÞ
approximately equal to the radius of a typical large nucleus.
For the weighting parameters for nuclei segmentation, we
assign parameter values so that all terms in Eq. (15) are appro-
priately balanced in numerical values. In our experiment, follow-
ing three parameters have similar value and scale: the coefficient
for edge-gradient weighted contour length μ, the coefficient for
the occlusion penalty term ω, and the coefficient for squared
fitting error of shape-derived distance map ν. The weight ξis
numerically set by referring to typical value of the double-well
potential function. We use the l1-ls MATLAB toolbox32 to solve
the L1-minimization problem.
6.2 Seed Detection
For quantitative analysis, we assess performance of seed detec-
tion method with reference to annotations by a pathologist. Note
that we evaluate our approach with nontouching and occluded
nuclei in each image separately. Five metrics are computed from
each image to show seed detection performance: nuclei number
error (C), miss detection (M), false recognition (F), over seg-
mentation (O), under segmentation (U), and count error (CE)%.
Nuclei number error is used to demonstrate the absolute differ-
ence between the number of nuclei detected by machine and
that reported by human expert. Miss and false detection re-
present the number of missing and false recognition events when
Journal of Medical Imaging 017502-8 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
machine-based seed detection method is used to detect individ-
ual nuclei with no occluded neighbors. Meanwhile, we use over-
and undersegmentation to record events where the number of
machined-identified nuclei from a nuclei clump is more and less
than the true number marked by human expert, respectively.
Finally, we use the count error% to represent the nuclei number
error in reference to the true nuclei number. The resulting out-
puts are shown in Tables 1and 2. Additionally, we compare our
method with a single-pass voting method based on mean shift11
and an oriented kernel-based iterative voting method.9By con-
trast to our method, both the single-pass voting method and the
iterative voting method tend to miss detecting nuclei. Note that
GBM40 dataset is produced by our local laboratory. Compared
to the public TCGA dataset, GBM40 dataset is more carefully
prepared with better staining and contrast, leading to higher
SNR. This results in better performances of all methods for com-
parison consistently. We also illustrate seed detection results
on several typical image patches in Fig. 5.
6.3 Nuclei Segmentation
We model the nuclei segmentation problem by the variational
level set framework mathematically described in Eq. (15) and
solve it by the numerical computations in Sec. 4. The level set
functions do not stop evolving until they either reach conver-
gence or exceed iteration number limit. We present the evolution
results of zero level sets at iteration 10, 20, and 30 in Fig. 6. The
detected nuclei shapes are well defined, as shown in Fig. 6.
Notably, those overlapping nuclei can also be correctly seg-
mented, with occluded boundaries reasonably recovered. It is
also observed that most zero level sets can rapidly converge
to true nuclei boundaries within 10 iterations. After that, zero
level sets fine tune themselves to better fit to nuclei contour
details, especially over those overlapped nuclei. With our
experiment data, we only observe minor improvements in nuclei
segmentation from 10 to 30 iterations. In general, the optimal
number of iterations depends on data properties. End users can
determine the accuracy-speed tradeoff and select the best num-
ber of iterations based on individual experiment scenarios.
In addition, we present experimental results of three images
presenting the best, median, and worst overall segmentation per-
formance in Fig. 7. The bar charts in Fig. 7present our method
efficacy measured by the Jaccard coefficient, precision, recall,
and F1score. The small variation in these metrics over the best,
median, and worst images suggests a good consistency and gen-
eralization ability of our method.
To quantitatively assess our methods performance, we use
human annotations from pathologist as ground truth. By com-
paring the experimental result Bwith ground truth annotation A,
we evaluate the performance of our method with multiple
metrics:36,37 Jaccard coefficient JðA; BÞ¼jABj
jABj, precision rate
PðA; BÞ¼jABj
B, recall rate RðA; BÞ¼jABj
A, F1 score
F1ðA; BÞ¼2PðA;BÞRðA;BÞ
PðA;BÞþRðA;BÞ, and Hausdorff distance HðA; BÞ¼
maxðhðA; BÞ;hðB; AÞÞ, where hðA;BÞ¼maxaAminbBkabk.
The testing results evaluated with these metrics are presented
in Table 3. Our proposed method achieves better performance
in most metrics than the other methods. A qualitative perfor-
mance comparison across several compared methods and our
proposed method is shown in Fig. 8. We also demonstrate
the results of the proposed nuclei segmentation method with
different seed detection methods in Fig. 10. It is notable that
our proposed method is able to better capture nuclei boundaries
than other methods for comparison. In particular, only our
method can recover boundaries of overlapped nuclei by compar-
isons. This property makes our method superior to the other
Table 1 Image-wise seed detection performance on GBM40 dataset.
Metric CMFOUCount error (%)
IVW33 15.85 8.42 2.84 2.23 5.76 3.05 12.85 5.67 0.07 0.27 11.14
MOW34 32.15 4.86 1.62 1.94 11.15 3.48 22.69 4.53 0.08 0.28 22.60
RACM35 34.46 10.60 5.00 2.38 7.08 3.88 32.08 7.42 0.00 0.00 24.23
CNN 32.17 6.61 1.67 1.03 6.16 2.48 27.67 6.15 0.00 0.00 22.65
Ours 2.15 1.51 2.85 1.99 1.30 1.18 0.33 0.66 0.50 0.72 1.51
Note: Bold numbers indicate the best performance.
Table 2 Image-wise seed detection performance on TCGA dataset.
Metric CMF OUCount error (%)
IVW33 36.85 10.99 0.77 0.83 32.92 9.28 6.31 2.46 1.62 1.33 29.71
MOW34 55 14.17 1.38 1.26 55.38 14.55 3.62 2.33 2.62 1.45 44.35
RACM35 54.23 13.28 1 1.47 42.15 12.42 13 5.42 0.08 0.28 43.73
CNN 48.20 7.59 0.00 0.00 20.81 9.34 27.42 9.81 0.00 0.00 38.87
Ours 11.92 6.45 2.46 1.20 15.69 6.18 1.38 1.56 2.69 1.65 9.61
Note: Bold numbers indicate the best performance.
Journal of Medical Imaging 017502-9 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
methods when analytics of occluded nuclei is crucial in
investigations.
6.4 Parameter Sensitivity Analysis for Segmentation
We further investigate the segmentation contribution from indi-
vidual terms in our variational model by testing with different
parameters and comparing their associated segmentation results.
The segmentation result with our default parameter setting is
shown in Fig. 9(a). When ν¼0, the shape prior fitting term
E1does not take effect in the contour deformation process.
The resulting segmentation outcome is presented in Fig. 9(b),
where finalized nuclei contours are less regulated. Similarly,
we can remove the occlusion penalty term E2from the varia-
tional model by setting ω¼0. The associated result is illus-
trated in Fig. 9(c). Under this setting, the detected nuclei
present a strong inclination to overlap with each other. When
the shape prior fitting term E1, the dynamic occlusion penalty
term E2, and the evolutionary stability term E4are all removed
(ν¼ω¼ξ¼0), the resulting nuclei contours become signifi-
cantly degraded, as shown in Fig. 9(d). Note that shapes appear
to be less regulated in Figs. 9(b)9(d) without one or more terms
Fig. 6 Nuclei contour deformation: initial zero level set and that after 10, 20, 30 evolving iterations,
respectively. Those overlapping nuclei can be well separated, as shown in close-up views on the right.
(a) The initial contour, (b) 10 iterations, (c) 20 iterations, (d) 30 iterations, and (e) zoomed-in views.
Fig. 7 Metrics of the samples with the best, median, and worst
Jaccard coefficients overall nuclei segmentation result of all testing
images.
Fig. 5 Comparisons of seed detection results: (a) ground truth; (b) IVW;33 (c) MOW;34 (d) RACM;35 and
(e) our proposed method. We use arrows to demonstrate representative tissue areas, where our
approach can correctly detect seeds while other methods fail.
Journal of Medical Imaging 017502-10 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
Table 3 The performance comparison of different nuclei segmentation methods measured by region- and distance-based metrics on GBM40 dataset and TCGA dataset.
Methods
GBM40 dataset TCGA dataset
JPRF
1HJPRF
1H
MS38 0.44 0.27 0.48 0.30 0.90 0.17 0.56 0.27 29.20 28.94 0.41 0.26 0.52 0.32 0.76 0.26 0.54 0.27 30.12 30.58
ISO39 0.59 0.28 0.62 0.30 0.93 0.15 0.69 0.28 18.91 29.11 0.55 0.26 0.61 0.28 0.87 0.21 0.67 0.25 16.34 14.52
SBGFRLS40 0.72 0.22 0.73 0.22 0.98 0.04 0.81 0.18 9.48 9.66 0.68 0.25 0.73 0.26 0.90 0.15 0.78 0.21 14.13 13.02
IVW33 0.68 0.18 0.74 0.20 0.94 0.14 0.80 0.15 6.53 5.55 0.70 0.19 0.86 0.16 0.82 0.21 0.81 0.16 6.94 6.55
MOW34 0.74 0.20 0.79 0.20 0.93 0.13 0.83 0.16 6.22 6.01 0.72 0.21 0.90 0.17 0.80 0.20 0.82 0.17 6.90 6.97
RACM35 0.63 0.19 0.82 0.21 0.77 0.19 0.75 0.16 7.82 6.09 0.63 0.18 0.89 0.18 0.71 0.19 0.76 0.16 8.08 6.65
mRLS41 0.53 0:17 0.83 0:17 0.64 0.23 0.67 0.15 10.02 6.65 0.51 0.20 0.90 0.12 0.56 0.24 0.65 0.18 10.93 7.33
CNN42 0.62 0.25 0.74 0.30 0.82 0.14 0.73 0.22 16.41 22.09 0.66 0.24 0.80 0.26 0.83 0.18 0.77 0.20 11.74 14.51
Ours 0.78 0.17 0.79 0.17 0.99 0.03 0.87 0.13 4.07 2.76 0.83 0.24 0.89 0.20 0.90 0.20 0.88 0.20 4.85 7.28
Note: Bold numbers indicate the best performance.
Journal of Medical Imaging 017502-11 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
Fig. 9 Segmented nuclei with distinct parameter settings: (a) μ¼5000,ξ¼2,ω¼2000,ν¼3000;
(b) same as (a) except ν¼0; (c) same as (a) except ω¼0; (d) same as (a) except ξ¼0,ω¼0,
ν¼0; (e) same as (a) except μ¼3900; (f) same as (a) except μ¼4500; (g) same as (a) except
ν¼5000; and (h) same as (a) except μ¼3900 and ν¼5000.
(b) (c) (d) (e) (a) (f) (g)
Fig. 8 Comparison of segmentation results: (a) original image; (b) ground truth; (c) MS;38 (d) ISO;39
(e) SBGFRLS,40 (f) vanilla ChanVese model,1and (g) our proposed method. We use arrows to dem-
onstrate typical tissue areas, where our approach can correctly segment nuclei while other methods fail.
Journal of Medical Imaging 017502-12 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
in the variational model. In addition to the final results with only
a subset of terms in Figs. 9(b)9(d), we also investigate the sen-
sitivity of parameters to final results. Our investigations with
parameter deviation from our proposed value set suggest that
results remain similar even when we change μby 22%
(μ¼3900), 10% (μ¼4500), and νby 67% (ν¼5000), as pre-
sented in Figs. 9(e)9(g). Figure 9(h) presents results when μ
and νare simultaneously changed by 22% and 67%. Overall,
a larger μleads to a better contour convergence to true nuclei
boundary by energy term E3; a larger νforces contours to look
more from the reconstructed sparse shape priors by E1; a larger
ωtends to prevent nuclei more from overlapping with each other
by E2.
To illustrate the effect of seed detection, we demonstrate the
segmented nuclei contours in Fig. 10 by replacing the seed
detection algorithm in our pipeline with other seed detection
methods. Since the number of detected nuclei depends on the
seed-based initialization, miss detection of seeds may result
in undersegmentation of nuclei, as shown in Figs. 10(b) and
10(c). We also test the impact of the shape prior library size
on segmentation by changing the number of shape prior clusters
M. The segmented nuclei contours in Fig. 11 show that the pro-
posed method does not degrade significantly with less shape
priors. In practice, we carefully choose the number of clusters
M¼100 so that Mis large enough to cover shape variations,
but small enough to avoid high computation burden.
6.5 Limitation and Future Work
Although the proposed method can achieve good quantitative
and visual results, it is limited by the following factors:
(a) Nuclei contour evolution depends on accurate detection of
seeds, therefore, the segmentation performance may degrade
when seeds are not correctly detected. In cases where seeds are
missing, our proposed method, like other level set based meth-
ods in literature, does not produce correct segmentation result as
no level set function is correctly initialized for deformation.
However, we have shown in our experimental tests that our pro-
posed seed detection has a good performance with a very low
missing rate, as suggested in Tables 1and 2. When nuclei seeds
are slightly shifted within nuclei regions, our approach is able
to produce similar and correct segmented nuclei contours for
overlapping nuclei. (b) Application of our method to whole slide
images can be time-consuming compared to other state-of-the-
art methods. To address the problem, we will further develop a
MapReduce based high performance image analysis framework
to make this process scalable and cost effective.43,44
7 Conclusion
In this paper, we propose a nuclei segmentation method based
on the level set framework, aiming to simultaneously identify
contours of multiple nuclei with mutual occlusion. We present
our method with its application to nuclei segmentation using
histopathologic images of glioblastoma brain tumors. First, a
seed detection method is developed to automatically initialize
nuclei contours. For better nuclei contour deformation, we
incorporate into the model a set of typical nuclei shapes as
prior information through spectral clustering on L1graph.
Meanwhile, an adaptive nuclei occlusion penalty term is de-
signed to dynamically penalize nuclei contour occlusion based
on the number of overlapped nuclei. It also prevents recognized
contours of overlapped nuclei from being identical. A numerical
Fig. 10 Comparison of segmentation results with nuclei initialization by different seed detection methods:
(a) original image; (b) ground truth; (c) single-pass voting with mean shift11 using ImageJ plugin; (d) iter-
ative voting;9and (e) our proposed method.
Journal of Medical Imaging 017502-13 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
optimization algorithm is used to iteratively search for the
desired level set functions. Experiments on glioblastoma brain
histopathologic images produce better results as compared with
other state-of-the-art experiments, suggesting the effectiveness
of our detection and segmentation approach for nuclei analysis
with glioblastoma histopathologic images.
Disclosures
The authors state no conflicts of interest and have nothing to
disclose.
Acknowledgments
This research is supported in part by grants from National
Institute of Health under Grant No. K25CA181503, National
Science Foundation under Grant Nos. ACI 1443054 and IIS
1350885, and CNPq.
References
1. T. F. Chan and L. Vese, Active contours without edges,IEEE Trans.
Image Process. 10(2), 266277 (2001).
2. D. Mumford and J. Shah, Optimal approximations by piecewise
smooth functions and associated variational problems,Commun.
Pure Appl. Math. 42(5), 577685 (1989).
3. A. Tareef et al., Automatic segmentation of overlapping cervical smear
cells based on local distinctive features and guided shape deformation,
Neurocomputing 221,94107 (2017).
4. F. Xing, Y. Xie, and L. Yang, An automatic learning-based framework
for robust nucleus segmentation,IEEE Trans. Med. Imaging 35(2),
550566 (2016).
5. Z. Lu, G. Carneiro, and A. P. Bradley, An improved joint optimization
of multiple level set functions for the segmentation of overlapping
cervical cells,IEEE Trans. Image Process. 24(4), 12611272 (2015).
6. T. Chan and W. Zhu, Level set based shape prior segmentation,in
IEEE Comput. Soc. Conf. Comput. Vision and Pattern Recognit.,
Vol. 2, pp. 11641170 (2005).
7. P. Yan et al., Automatic segmentation of high-throughput RNAi
fluorescent cellular images,IEEE Trans. Inf. Theory. Biomed. 12(1),
109117 (2008).
8. S. Ali and A. Madabhushi, An integrated region-, boundary-, shape-
based active contour for multiple object overlap resolution in histologi-
cal imagery,IEEE Trans. Med. Imaging 31(7), 14481460 (2012).
9. B. Parvin et al., Iterative voting for inference of structural saliency and
characterization of subcellular events,IEEE Trans. Image Process.
16(3), 615623 (2007).
10. Y. Al-Kofahi et al., Improved automatic detection and segmentation of
cell nuclei in histopathology images,IEEE Trans. Biomed. Eng. 57(4),
841852 (2010).
11. X. Qi et al., Robust segmentation of overlapping cells in histopathol-
ogy specimens using parallel seed detection and repulsive level set,
IEEE Trans. Biomed. Eng. 59(3), 754765 (2012).
12. J. Wright et al., Robust face recognition via sparse representation,
IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210227 (2009).
13. J. Mairal, M. Elad, and G. Sapiro, Sparse representation for color
image restoration,IEEE Trans. Image Process. 17(1), 5369 (2008).
14. B. Cheng et al., Learning with L1-graph for image analysis,IEEE
Trans. Image Process. 19(4), 858866 (2010).
15. S. Zhang et al., Sparse shape composition: a new framework for
shape prior modeling,in IEEE Comput. Soc. Conf. Comput. Vision and
Pattern Recognit., pp. 10251032 (2011).
Fig. 11 Segmentation with different number of shape clusters: (a) 10; (b) 20; (c) 50; (d) 100; (e) 200;
and (f) 300.
Journal of Medical Imaging 017502-14 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
16. Q. Zhang and R. Pless, Segmenting multiple familiar objects under
mutual occlusion,in Proc. Int. Conf. Image Process., 197200 (2006).
17. C. Le Guyader and L. A. Vese, Self-repelling snakes for topology-
preserving segmentation models,IEEE Trans. Image Process. 17(5),
767779 (2008).
18. A. Dufour et al., Segmenting and tracking fluorescent cells in dynamic
3-d microscopy with coupled active surfaces,IEEE Trans. Image
Process. 14(9), 13961410 (2005).
19. J. Kong et al., Robust cell segmentation for histological images of glio-
blastoma,in Proc. IEEE 13th Int. Symp. Biomed. Imaging, IEEE,
pp. 10411045 (2016).
20. P. Zhang et al., Automated level set segmentation of histopathologic
cells with sparse shape prior support and dynamic occlusion constraint,
in Proc. IEEE 14th Int. Symp. Biomed. Imaging, IEEE, pp. 718722
(2017).
21. C. Goodall, Procrustes methods in the statistical analysis of shape,
J. R. Stat. Soc. 53(1), 285339 (1991).
22. B. Mohar et al., The Laplacian spectrum of graphs,in Graph Theory,
Combinatorics and Applications, Vol. 2, Springer, Berlin, p. 12 (1991).
23. B. Mohar, Some applications of Laplace eigenvalues of graphs,
Graph Symmetry, Springer, Dordrecht (1997).
24. D. L. Donoho and M. Elad, Optimally sparse representation in general
(nonorthogonal) dictionaries via l1 minimization,Proc. Natl. Acad.
Sci. U. S. A. 100(5), 21972202 (2003).
25. E. J. Candes, J. K. Romberg, and T. Tao, Stable signal recovery from
incomplete and inaccurate measurements,Commun. Pure Appl. Math.
59(8), 12071223 (2006).
26. E. J. Candes and T. Tao, Near-optimal signal recovery from random
projections: universal encoding strategies?IEEE Trans. Inf. Theory
52(12), 54065425 (2006).
27. D. L. Donoho, For most large underdetermined systems of linear
equations the minimal l1-norm solution is also the sparsest solution,
Commun. Pure Appl. Math. 59(6), 797829 (2006).
28. A. Ruifrok and D. A. Johnston, Quantification of histochemical stain-
ing by color deconvolution,Anal. Quant. Cytol. Histol. 23(4), 291299
(2001).
29. L. Vincent, Morphological grayscale reconstruction in image analysis:
applications and efficient algorithms,IEEE Trans. Image Process.
2(2), 176201 (1993).
30. A. F. Frangi et al., Multiscale vessel enhancement filtering,Lect.
Notes Comput. Sci. 1496, 130137 (1998).
31. C. Li et al., Distance regularized level set evolution and its applica-
tion to image segmentation,IEEE Trans. Image Process. 19(12),
32433254 (2010).
32. S. Kim et al., A method for large-scale l1-regularized least squares
problems with applications in signal processing and statistics,IEEE
J. Sel. Top. Signal. Process. 1(4), 606617 (2007).
33. H. Xu, C. Lu, and M. Mandal, An efficient technique for nuclei seg-
mentation based on ellipse descriptor analysis and improved seed detec-
tion algorithm,IEEE J. Bio. Health Info. 18(5), 17291741 (2014).
34. C. Jung and C. Kim, Segmenting clustered nuclei using h-minima
transform-based marker extraction and contour parameterization,
IEEE Trans. Biomed. Eng. 57(10), 26002604 (2010).
35. F. Xing et al., Automatic ki-67 counting using robust cell detection and
online dictionary learning,IEEE Trans. Biomed. Eng. 61(3), 859870
(2014).
36. T. Fawcett, An introduction to roc analysis,Pattern. Recognit. Lett.
27(8), 861874 (2006).
37. D. P. Huttenlocher et al., Comparing images using the Hausdorff dis-
tance,IEEE Trans. Pattern Anal. Mach. Intell. 15(9), 850863 (1993).
38. D. Comaniciu and M. Peter, Mean shift: a robust approach toward fea-
ture space analysis,IEEE Trans. Pattern Anal. Mach. Intell. 24(5),
603619 (2002).
39. L. Grady and E. Schwartz, Isoperimetric graph partitioning for image
segmentation,IEEE Trans. Pattern Anal. Mach. Intell. 28(3), 469475
(2006).
40. K. Zhang et al., Active contours with selective local or global segmen-
tation: a new formulation and level set method,Image. Vision Comput.
28(4), 668676 (2010).
41. H. Xu et al., Automatic nuclear segmentation using multi-scale radial
line scanning with dynamic programming,IEEE Trans. Biomed. Eng.
PP(99), 11 (2017).
42. O. Ronneberger, P. Fischer, and T. Brox, U-net: convolutional net-
works for biomedical image segmentation,Lect. Notes Comput. Sci.
9351, 234241 (2015).
43. H. Vo et al., Cloud-based whole slide image analysis using
MapReduce,in Proc. VLDB Workshop Data Manange. Anal. Med.
Healthcare, Springer, pp. 6277 (2016).
44. G. Teodoro et al., Algorithm sensitivity analysis and parameter
tuning for tissue image segmentation pipelines,Bioinformatics 33(7),
10641072 (2017).
Pengyue Zhang is a PhD candidate at the Department of Computer
Science at Stony Brook University. His research interests include
medical image analysis, computer vision, and machine learning.
Fusheng Wang is an associate professor at the Department of
Biomedical Informatics and Department of Computer Science at
Stony Brook University. He received his PhD in computer science
from the University of California, Los Angeles, in 2004. Prior to joining
Stony Brook University, he was an assistant professor at Emory
University. His research interest crosscuts data management and
biomedical informatics.
George Teodoro received his PhD in computer science from the
Universidade Federal de Minas Gerais (UFMG), Brazil, in 2010.
Currently, he is an assistant professor in the Computer Science
Department at the University of Brasilia (UnB), Brazil. His primary
areas of expertise include high performance runtime systems for
efficient execution of biomedical and data-mining applications on
distributed heterogeneous environments.
Yanhui Liang is works at Google Brain as a software engineer.
She received her PhD in biomedical informatics from Stony Brook
University. Her research interests include 2-D/3-D medical imaging,
computer vision, machine learning, and large-scale spatial data
analytics.
Mousumi Roy is a PhD student at Stony Brook University with
research focus on computer vision, data analysis, and machine
learning modeling for biomedical research.
Daniel Brat received his MD and PhD degrees from Mayo Medical
and Graduate Schools. He completed residency and a fellowship
at Johns Hopkins Hospital. He is Magerstadt professor and chair of
pathology at the Northwestern University Feinberg School of Medicine
and pathologist-in-chief of Northwestern Memorial Hospital. He
directs a basic and translational research lab that investigates mech-
anisms of glioma progression, including the contributions of hypoxia,
genetics, tumor microenvironment, and stem cells.
Jun Kong received a PhD in electrical and computer engineering
from the Ohio State University. Currently, he is an associate professor
in the Department of Mathematics and Statistics at Georgia State
University. He is an affiliated faculty and a member of the Winship
Cancer Institute in Emory university. His primary areas of expertise
include biomedical image analysis algorithms, computer-aided diag-
nosis systems, and large-scale integrative analysis for quantitative
biomedical research.
Journal of Medical Imaging 017502-15 JanMar 2019 Vol. 6(1)
Zhang et al.: Effective nuclei segmentation with sparse shape prior and dynamic. . .
Downloaded From: https://www.spiedigitallibrary.org/journals/Journal-of-Medical-Imaging on 09 Apr 2019
Terms of Use: https://www.spiedigitallibrary.org/terms-of-use
... Shape-based methods without requiring prior object detection have also been proposed. These methods use variational level sets and shape priors (e.g., [39], [40], [41], [42]). Statistical shape priors have been used to segment overlapping cells in histological images, and were initialized by the watershed transform [39] or a variety of features including second-order image statistics [41], [42]. ...
... These methods use variational level sets and shape priors (e.g., [39], [40], [41], [42]). Statistical shape priors have been used to segment overlapping cells in histological images, and were initialized by the watershed transform [39] or a variety of features including second-order image statistics [41], [42]. In [40], [42], the sparsity of shape representations in level set models was exploited for faster energy minimization. ...
... Statistical shape priors have been used to segment overlapping cells in histological images, and were initialized by the watershed transform [39] or a variety of features including second-order image statistics [41], [42]. In [40], [42], the sparsity of shape representations in level set models was exploited for faster energy minimization. However, these methods do not yield a globally optimal solution. ...
Article
Cell nuclei segmentation is challenging due to shape variation and closely clustered or partially overlapping objects. Most previous methods are not globally optimal, limited to elliptical models, or are computationally expensive. In this work, we introduce a globally optimal approach based on deformable shape models and global energy minimization for cell nuclei segmentation and cluster splitting. We propose an implicit parameterization of deformable shape models and show that it leads to a convex energy. Convex energy minimization yields the global solution independently of the initialization, is fast, and robust. To jointly perform cell nuclei segmentation and cluster splitting, we developed a novel iterative global energy minimization method, which leverages the inherent property of superadditivity of the convex energy. This property exploits the lower bound of the energy of the union of the models and improves the computational efficiency. Our method provably determines a solution close to global optimality. In addition, we derive a closed/form solution of the proposed global minimization based on the superadditivity property for non/clustered cell nuclei. We evaluated our method using fluorescence microscopy images of five different cell types comprising various challenges, and performed a quantitative comparison with previous methods. Our method achieved state/of/the/art or improved performance.
... Thus, their performances highly depend on the marker initialization and can be significantly deteriorated by oversegmentation (Kumar et al., 2017). Active contour model (ACM)-based algorithms are also widely used for pathology nuclei segmentation (Zhang et al., 2019). Although promising segmentation performance can be achieved, it is time-consuming for such algorithms as RACM (Xing et al., 2014) to complete segmentation. ...
... As it is difficult to collect sufficient high-quality image data and precise ground truth from experts in most cases, the development of effective solutions that can assure and boost the training success with limited annotated data becomes an important but challenging research subject. Second, it is noted that most conventional energy-based methods, such as watershed and ACM algorithms, can explicitly use such morphology information as shape prior for enhanced nuclei contour identification (Zhang et al., 2019). By contrast, CNN models cannot explicitly leverage the prior knowledge on nuclei shapes. ...
... Although it is easy to acquire numerous nuclei samples, it is not straightforward to identify representative shapes to be included in the shape library. Taking the computational efficiency into account, we cluster N nuclei samples from a large number of histology images into K groups where K ( N. Following the method in prior research work, K is set to 100 in this work, and it can be changed depending on the specific task and dataset (Zhang et al., 2019). All training nuclei samples are aligned by the generalized Procrustes analysis (Goodall, 1991). ...
Article
Motivation: In most tissue-based biomedical research, the lack of sufficient pathology training images with well-annotated ground truth inevitably limits the performance of deep learning systems. In this study, we propose a convolutional neural network with foveal blur enriching datasets with multiple local nuclei regions of interest derived from original pathology images. We further propose a human-knowledge boosted deep learning system by inclusion to the convolutional neural network new loss function terms capturing shape prior knowledge and imposing smoothness constraints on the predicted probability maps. Results: Our proposed system outperforms all state-of-the-art deep learning and non-deep learning methods by Jaccard coefficient, Dice coefficient, Accuracy, and Panoptic Quality in three independent datasets. The high segmentation accuracy and execution speed suggest its promising potential for automating histopathology nuclei segmentation in biomedical research and clinical settings. Availability: The codes, the documentation, and example data are available on an open source at: https://github.com/HongyiDuanmu26/FovealBoosted. Supplementary information: Supplementary data are available at Bioinformatics online.
... Motivated by a significant increase in corresponding publications (Fig. 3), this review focuses on the diagnostic tasks of subtyping, grading, molecular marker prediction, and survival prediction. Studies primarily addressing image segmentation [11][12][13][14][15] , image retrieval 16 , or tumor heterogeneity [17][18][19][20][21][22] have been considered out of scope. The reviewed studies are examined with regard to diagnostic tasks and methodological aspects of WSI processing, as well as discussed by addressing limitations and future directions. ...
Article
Full-text available
In recent years, the diagnosis of gliomas has become increasingly complex. Analysis of glioma histopathology images using artificial intelligence (AI) offers new opportunities to support diagnosis and outcome prediction. To give an overview of the current state of research, this review examines 83 publicly available research studies that have proposed AI-based methods for whole-slide histopathology images of human gliomas, covering the diagnostic tasks of subtyping (23/83), grading (27/83), molecular marker prediction (20/83), and survival prediction (29/83). All studies were reviewed with regard to methodological aspects as well as clinical applicability. It was found that the focus of current research is the assessment of hematoxylin and eosin-stained tissue sections of adult-type diffuse gliomas. The majority of studies (52/83) are based on the publicly available glioblastoma and low-grade glioma datasets from The Cancer Genome Atlas (TCGA) and only a few studies employed other datasets in isolation (16/83) or in addition to the TCGA datasets (15/83). Current approaches mostly rely on convolutional neural networks (63/83) for analyzing tissue at 20x magnification (35/83). A new field of research is the integration of clinical data, omics data, or magnetic resonance imaging (29/83). So far, AI-based methods have achieved promising results, but are not yet used in real clinical settings. Future work should focus on the independent validation of methods on larger, multi-site datasets with high-quality and up-to-date clinical and molecular pathology annotations to demonstrate routine applicability.
Conference Paper
Full-text available
In this paper, we propose a novel segmentation method for cells in histopathologic images based on a sparse shape prior guided variational level set framework. We automate the cell contour initialization by detecting seeds and deform contours by minimizing a new energy functional that incorporates a shape term involving sparse shape priors, an adaptive contour occlusion penalty term, and a boundary term encouraging contours to converge to strong edges. As a result, our approach is able to accommodate mutual occlusions and detect contours of multiple intersected cells. We apply our algorithm to a set of whole-slide histopathologic images of brain tumor sections. The proposed method is compared with other popular methods, and demonstrates good accuracy for cell segmentation by quantitative measures, suggesting its promise to support biomedical image-based investigations.
Conference Paper
Full-text available
Systematic analysis of high resolution whole slide images enables more effective diagnosis, prognosis and prediction of cancer and other important diseases. Due to the enormous sizes and dimensions of whole slide images, the analysis requires extensive computing resources which are not commonly available. Images have to be divided into smaller regions for processing due to computer memory limitations, which will lead to inaccurate results due to the ignorance of boundary crossing objects. In this paper, we propose a highly scalable and cost effective MapReduce based image analysis framework for whole slide image processing, and provide a cloud based implementation. The framework takes a grid-based overlapping partitioning scheme, and provides parallelization of image segmentation based on MapReduce. It provides graceful handling of boundary objects with a highly efficient spatial indexing based matching method, thus avoiding loss of accuracy due to partitioning. We demonstrate that the system achieves high scalability and is cost-effective – our experiments demonstrate that it costs less than fifteen cents to analyze one image on average using Amazon Elastic MapReduce.
Article
Full-text available
Motivation: Sensitivity analysis and parameter tuning are important processes in large-scale image analysis. They are very costly because the image analysis workflows are required to be executed several times to systematically correlate output variations with parameter changes or to tune parameters. An integrated solution with minimum user interaction that uses effective methodologies and high performance computing is required to scale these studies to large imaging datasets and expensive analysis workflows. Results: The experiments with two segmentation workflows show that the proposed approach can (i) quickly identify and prune parameters that are non-influential; (ii) search a small fraction (about 100 points) of the parameter search space with billions to trillions of points and improve the quality of segmentation results (Dice and Jaccard metrics) by as much as 1.42× compared to the results from the default parameters; (iii) attain good scalability on a high performance cluster with several effective optimizations. Conclusions: Our work demonstrates the feasibility of performing sensitivity analyses, parameter studies and auto-tuning with large datasets. The proposed framework can enable the quantification of error estimations and output variations in image segmentation pipelines. Availability and implementation: Source code: https://github.com/SBU-BMI/region-templates/ CONTACT: teodoro@unb.brSupplementary information: Supplementary data are available at Bioinformatics online.
Article
Full-text available
Automated segmentation of cells from cervical smears poses great challenge to biomedical image analysis because of the noisy and complex background, poor cytoplasmic contrast and the presence of fuzzy and overlapping cells. In this paper, we propose an automated segmentation method for the nucleus and cytoplasm in a cluster of cervical cells based on distinctive local features and guided sparse shape deformation. Our proposed approach is performed in two stages: segmentation of nuclei and cellular clusters, and segmentation of overlapping cytoplasm. In the first stage, a set of local discriminative shape and appearance cues of image superpixels is incorporated and classified by the Support Vector Machine (SVM) to segment the image into nuclei, cellular clusters, and background. In the second stage, a robust shape deformation framework is proposed, based on Sparse Coding (SC) theory and guided by representative shape features, to construct the cytoplasmic shape of each overlapping cell. Then, the obtained shape is refined by the Distance Regularized Level Set Evolution (DRLSE) model. We evaluated our approach using the ISBI 2014 challenge dataset, which has 135 synthetic cell images for a total of 810 cells. Our results show that our approach outperformed existing approaches in segmenting overlapping cells and obtaining accurate nuclear boundaries.