Conference PaperPDF Available

Sparse representation-based archetypal graphs for spectral clustering

Authors:

Abstract and Figures

We propose sparse representation-based archetypal graphs as input to spectral clustering for anomaly and change detection. The graph consists of vertices defined by data samples and edges which weights are determines by sparse representation. Besides relationships between all data samples, the graph also encodes the relationship to extremal points, so-called archetypes, which leads to an easily interpretable clustering result. We compare our approach to k-means clustering performed on the original feature representation and to k-means clustering performed on the sparse representation activations. Experiments show that our approach is able to deliver accurate and interpretable results for anomaly and change detection .
Content may be subject to copyright.
SPARSE REPRESENTATION-BASED ARCHETYPAL GRAPHS FOR SPECTRAL
CLUSTERING
Ribana Roscher, Lukas Drees, Susanne Wenzel
University of Bonn, Institute of Geodesy and Geoinformation
ABSTRACT
We propose sparse representation-based archetypal graphs as
input to spectral clustering for anomaly and change detec-
tion. The graph consists of vertices defined by data samples
and edges which weights are determines by sparse represen-
tation. Besides relationships between all data samples, the
graph also encodes the relationship to extremal points, so-
called archetypes, which leads to an easily interpretable clus-
tering result. We compare our approach to k-means cluster-
ing performed on the original feature representation and to k-
means clustering performed on the sparse representation acti-
vations. Experiments show that our approach is able to deliver
accurate and interpretable results for anomaly and change de-
tection.
Index TermsSparse representation, spectral clustering,
sparse graphs, anomaly detection, change detection
1. INTRODUCTION
The task of finding clusters in data sets has been an active
research area for a long time. Especially if the cluster distri-
bution is complex and different clusters cannot be separated
linearly in input space, there is a demand for powerful clus-
tering techniques. Therefore, spectral clustering has become
a popular clustering approach (e.g., [1, 2]), since it often out-
performs conventional cluster algorithms such as k-means or
EM-algorithm. For spectral clustering, singular value decom-
position of a matrix, representing relationships between data
samples, is used to derive the clustering result. One of the key
factors, influencing the performance of spectral clustering,
is the construction of the relationship-encoding graph form
which the matrix is derived. Commonly used approaches
build dense graphs comprising pairwise distances between all
data samples. More efficient sparse graphs comprise pairwise
distances between each sample and its neighbors in a small
neighborhood (e.g., [3]). The authors of [4] propose an alter-
native based on sparse representation, which fulfill the desired
graph characteristics of high discriminating power, sparsity
and an adaptive neighborhood. They show that so-called L1-
sparse graphs achieve a higher accuracy for clustering and for
semi-supervised classification, and its success is also under-
lined by [5].
Our contribution is the introduction of a sparse represen-
tation-based graph as input to spectral clustering, which in-
cludes information about relationships between all data sam-
ples, but also information about the relation to archetypes
(i.e., extremal data samples). Archetypes has been proved
to be suitable representatives of data samples in e.g., [6] and
[7], which can be efficiently extracted using simplex volume
maximization (SiVM, [8]1). Due to the relation of all samples
to archetypes, the approach is able to find a suitable number
of clusters and provides a spectral information which can be
easily interpreted as anomaly in single images or change in
time-series data. In our experiments, we show the suitability
of sparse representation-based graphs for spectral clustering
by means of plant disease symptom detection in a hyperspec-
tral image and by means of a change detection task. These
tasks have already been addressed by approaches using sparse
representation, e.g., [9], [10]), but none of them uses sparse
representation-based graphs.
2. METHODS
2.1. Spectral Clustering
Spectral clustering is the task of finding clusters using a spec-
tral decomposition of a matrix, interpretable as graph, de-
rived from the data. In order to represent relationships be-
tween the samples, a weight matrix Wis defined by weights
wnm >0being connections between two samples xnand
xmand wnm = 0, otherwise. We further define the normal-
ized graph Laplacian Lsym =D
1
2LD
1
2with L=DW.
The degree matrix is given by D= diag (Pmwm), where
wmis the m-th row of W. Since spectral decomposition of
the Graph Laplacian is intractable for large matrices, we use
Nystrm approximation[11] to derive a smaller matrix for de-
composition. We use the singular vectors to the smallest sin-
gular values as input to k-means clustering to obtain the final
clustering result.
2.2. Graph Construction with Sparse Representation
In our approach, we follow the work of [4] by designing
a sparse graph in which the estimated sparse coefficients,
computed by a nonnegative least squares optimization, are
1www-ai.cs.uni- dortmund.de/weblab/code.html
used to characterize relationships between data samples
X= [x1,...,xn,...,xN]with Nbeing the number of
samples. We define a directed graph G={X,W}with the
samples Xbeing the vertices and the matrix Wbeing the
edge weights. The graph is constructed in an unsupervised
way using sparse representation, where the activations are
interpreted as weights in the graph.
In terms of sparse coding a sample xnis represented
by a weighted linear combination of a few elements taken
from a (M×(N1))-dimensional dictionary T, such that
xn=Tαn+γwith kγk2being the reconstruction error.
The dictionary T= [x1,...,xn1,xn+1 . . . , xN]is embod-
ied by all other samples except xn. The coefficient vector,
comprising the activations, is given by αn, from which we
can derive wm= [α1,...,αn1,0,αn+1,...,αN], being
the rows of W. The optimization problem for the determina-
tion of optimal ˆαnis given by
ˆαn= argmin kTαnxnk2,subject to αn0(1)
where the first term is the reconstruction error and the second
term is the non-negativity constraint. Since non-negativity
alone leads to a sufficient sparsity, we do not introduce a fur-
ther sparsity enforcing prior.
Let us call the weight matrix comprising the relationships
between all samples WX. For spectral clustering, we con-
struct the weight matrix W=WXWAby extending WX
by an additional weight matrix WAbased on sparse repre-
sentation on archetypal dictionaries [6]. The weight matrix
WAincludes the independently computed result of (1) of all
samples with archetypes serving as dictionary. Therefore the
relationship between all samples is encoded, but also the rela-
tionship to archetypes leading to an easily interpretable clus-
tering result. More specifically, the cluster assignments of
the archetypes can be related to all samples in the same clus-
ter and thus, each cluster can be interpreted by an expert by
means of its assigned archetype.
3. EXPERIMENTAL SETUP AND RESULTS
3.1. Datasets
3.1.1. Hyperspectral Sugar Beet Plant
We use a hyperspectral image of a plant, cv. Pauletta (KWS,
Einbeck, Germany), which was cultivated for 8 weeks in
a controlled environment in a greenhouse. The plant was
inoculated with Cercospora beticola, the causal agent of
Cercospora leaf spot. We use the hyperspectral pushbroom
sensor unit VISNIR-camera ImSpector V10E (Specim, Oulu,
Finland) with 1600 pixel, observing a spectral signature with
211 spectral bands in the range of 400 1000 nm. For eval-
uation purposes, we manually annotated disease symptoms.
Due to the error-prone labeling of the exact area of the symp-
toms, we decided to exclude this effect from the analysis by
relying only on the symptom centers which are labeled more
robust. This dataset is used to evaluate our proposed approach
for anomaly detection, where disease symptoms are defined
as anomalies.
3.1.2. The Bastrop Complex Wildfire
This dataset comprises 4 satellite images of size 1534 ×
808 pixel of different sensors acquired over the Bastrop
County, Texas (USA). In September 2011, most of the area
has been destroyed by wildfire. One image acquired by
Landsat 5 TM sensor depicts the area before the event and
three images acquired by Landsat 5 TM, Advanced Land
Imager (ALI) from the EO-1 mission, and Landsat 8 Oper-
ational Land Imager show the area after the event. The data
was collected by NASA Land Processes Distributed Active
Archive Center Program and ground truth is available indi-
cating changed areas [12]. We use this dataset to evaluate our
proposed approach for change detection. Here, it is common
to take the differences of images as input features for cluster-
ing. Instead, for our experiments, we stack all images and use
this as input to our clustering approach, because we found that
spectral clustering with sparse representation-based graphs is
more stable for high dimensional feature spaces. However,
for indicating change only Landsat 5 TM images can be
directly compared.
3.2. Experimental Setup
For our experiments, we construct an archetypal sparse graph
as explained in Section 2. In order to be efficient in deriv-
ing WX, we restrict the dictionary for sparse representation
to the 3000 spatially nearest neighbors. The extraction of
archetypes follows the idea of [13] by using multiple ini-
tial points in order to ensure a complete final set. The fi-
nal number of clusters is fixed to the number of extracted
archetypes for all tested algorithms. We compare our ap-
proach, using spectral clustering on a sparse representation-
based archetypal graph (SCASR), with k-means in original
data representation (k-meansO), and k-means on coeffi-
cients obtained from sparse representation with archetypal
dictionaries (k-meansASR).
For our approach, we define the archetypes as indicator
for change. Please note, that an archetype represents a pixel
in the stacked image at four points in time. An archetype is
supposed to indicate change, if its RMS difference between
the two Landsat 5 TM images, before and after the event, is
higher than a threshold. The threshold is given as the 0.9
quantile value obtained from all archetypes’ Landsat 5 TM
RMS differences. For k-meansO we use the obtained cluster
centers in the same way, as indicators for change.
3.3. Results
Fig. 2 shows the obtained results for the hyperspectral plant
dataset. By visual inspection, it can be seen that our approach
SCASR is able to find different clusters for healthy plant parts,
specular reflections, leaf veins and disease symptoms. More
precisely, it is able to cluster two types of disease symptoms
(blue and red areas in Fig. 2 (d) and (h)), which are assigned
to two disease archetypes, illustrated in Fig. 1. However, also
the leaf borders are assigned to disease symptom, resulting
from erroneous measurements similar to the spectra of dis-
ease symptoms. In contrast to this, k-meansO is not able
Fig. 1. Left: Spectrum of healthy plant; Middle and right:
Two spectra of disease symptoms, which were identified by
archetypal analysis.
to cluster disease symptoms. The approach k-meansASR
detects disease symptoms, however it assigns only one clus-
ter to all of them. Moreover, it gives less smooth results and
noisy detections for some disease symptoms in comparison
to SCASR. After a visual inspection of the archetypes and a
manual assignment to specific archetypes to disease symp-
toms, a comparison with our annotation yield 163 out of 173
(94.2%) detected disease symptoms for SCASR and 137 out
of 173 (79.2%) for k-meansASR.
Fig. 3 shows the result for the Bastrop fire dataset. By
visual inspection, all three approaches are able to distin-
guish most of the areas affected and not affected by fire by
assigning different cluster sets to both areas. That means,
that the cluster centers detected by k-means as well as the
obtained archetypes are able to describe clusters for change.
However, a comparison of change gives significant differ-
ences. Although k-meansO delivers multiple clusters which
could be assigned to change, some parts are missing and
are thus, assigned to clusters defined by no change. For
SCASR, the region affected by fire is clearly visible and less
effected by noise. A qualitative comparison to the ground
truth yielded a kappa coefficient of 0.61 for k-meansO,
0.49 for k-meansASR and 0.80 for SCASR, which under-
lines that our approach is suitable for change detection with
limited manual input. We tested various quantile values to
decide on indicators for change, but for k-meansO as well
as k-meansASR a decrease of the quantile value led to an
increase of falsely detected change.
4. CONCLUSION
We propose a novel design for a directed graph as input to
spectral clustering. The graph is designed using sparse repre-
sentation, which includes information about relationships be-
tween all data samples, but also information about the relation
to archetypes. Our experiments confirm that the clustering re-
sult yield discriminative and interpretable clusters for the task
of change detection and anomaly detection. Our approach can
also be used for multi-sensoral datasets, since the archetypes
are interpretable by experts.
Acknowledgements
The authors would like to thank Jan Behmann and Anne-
Katrin Mahlein for providing the hyperspectral image data
and helpful conversations. The Bastrop data is available from
the U.S. Geological Survey (http://earthexplorer.usgs.gov/).
REFERENCES
[1] A. Y. Ng, M. I. Jordan, and Y. Weiss, “On spectral clustering:
Analysis and an algorithm,” Adv. Neural. Inf. Process. Syst.,
vol. 2, pp. 849–856, 2002.
[2] F. Hu, G.S. Xia, X. Wang, Z.and Huang, L. Zhang, and H. Sun,
“Unsupervised feature learning via spectral clustering of multi-
dimensional patches for remotely sensed scene classification,”
IEEE J. Sel. Topics Appl. Earth Observ. in Remote Sens., vol.
8, no. 5, 2015.
[3] U. Von Luxburg, “A tutorial on spectral clustering, Stat. and
Comput., vol. 17, no. 4, pp. 395–416, 2007.
[4] J. Wright, Y. Ma, J. Mairal, G. Sapiro, T. S. Huang, and S. Yan,
“Sparse representation for computer vision and pattern recog-
nition,” Proc. of the IEEE, vol. 98, no. 6, pp. 1031–1044, 2010.
[5] S. Yan and H. Wang, “Semi-supervised learning by sparse rep-
resentation.,” in SDM. SIAM, 2009, pp. 792–801.
[6] R. Roscher, C. R¨
omer, B. Waske, and L. Pl¨
umer, “Landcover
classification with self-taught learning on archetypal dictionar-
ies,” in IEEE IGARSS, July 2015, pp. 2358–2361.
[7] C. R ¨
omer, M. Wahabzada, A. Ballvora, F. Pinto, M. Rossini,
C. Panigada, J. Behmann, J. L´
eon, C. Thurau, and C. Bauck-
hage, “Early drought stress detection in cereals: simplex vol-
ume maximisation for hyperspectral image analysis,” Funct.
Plant Biol., vol. 39, no. 11, pp. 878–890, 2012.
[8] C. Thurau, K. Kersting, and C. Bauckhage, “Yes we can:
simplex volume maximization for descriptive web-scale matrix
factorization,” in Proc. CIKM. ACM, 2010, pp. 1785–1788.
[9] R. Roscher, J. Behmann, A.-K. Mahlein, J. Dupuis,
H. Kuhlmann, and L. Pl¨
umer, “Detection of disease symp-
toms on hyperspectral 3D plant models,” in ISPRS Annals of
Photogrammetry, Remote Sensing and Spatial Information Sci-
ences, 2016, pp. 89–96.
[10] A. Adler, M. Elad, Y. Hel-Or, and E. Rivlin, “Sparse coding
with anomaly detection,” J. Signal. Process. Syst., vol. 79, no.
2, pp. 179–188, 2015.
[11] Anna Choromanska, Tony Jebara, Hyungtae Kim, Mahesh Mo-
han, and Claire Monteleoni, “Fast spectral clustering via the
nystr¨
om method,” in ALT. Springer, 2013, pp. 367–381.
[12] M. Volpi, G. Camps-Valls, and D. Tuia, “Spectral alignment
of multi-temporal cross-sensor images with automated kernel
canonical correlation analysis,” ISPRS J. Photogramm. Remote
Sens., vol. 107, pp. 50–63, 2015.
[13] R. Roscher, S. Wenzel, and B. Waske, “Discriminative archety-
pal self-taught learning for multispectral landcover classifica-
tion,” in Proc. PRRS, Workshop at ICPR, 2016.
(a) (b) (c) (d)
(e) (f) (g) (h)
Fig. 2. Results obtained for hyperspectral plant dataset. (a) Image data, (e) Image data, detail, (b) and (f) Clustering result with
k-means on original representation, (c) and (g) Clustering result with k-means on sparse representation coefficients, (d) and (h)
Spectral clustering result on a sparse representation-based archetypal graph. Colors are arbitrary and indicate assignment to
clusters. Therefore, they are not related between the images.
(a) (b) (c)
(d) (e) (f)
(g) (h) (i)
Fig. 3. Results obtained for the Bastrop fire dataset. (a) Image data pre-event, (b) Image data post-event, (c) Ground-truth
information for change, (d) Clustering result with k-means on original representation, (e) Clustering result with k-means on
sparse representation coefficients, (f) Spectral clustering result on a sparse representation-based archetypal graph, (g) Detected
change using k-means on original representation, (h) Detected change using k-means on sparse representation coefficients, (i)
Detected change using spectral clustering result.
... Numerous techniques [19]- [22], [27]- [28] have been developed to evaluate the similarities among data points. Among these, sparse representation stands out, especially noted for its superior performance in managing highdimensional or sparse datasets. ...
... Thus, to effectively utilize both local and global information, we suggest a method based on sparse representation [27], [28] to determine the weights of each data point. Drawing from [22], we define the objective function of our method as follows: ...
Article
Full-text available
The approach to calculating density significantly impacts the clustering efficacy of the Density Peak Clustering (DPC) method, with various density calculation methods tailored for different datasets. To address this, this study introduces a Self-Representation Weighted Density Peak Clustering (SR-DPC) method. Unlike traditional DPC, SR-DPC not only utilizes the local data point information but also amplifies the impact of different data points on the data center by implementing a weighting strategy, enhancing the precision with which data centers are identified. Moreover, SR-DPC adaptively reflects the influence of diverse data points on the data centers via feature representation and employs a weighted Gaussian kernel distance instead of the Euclidean distance to boost its clustering capabilities. Experimental evaluations on both synthetic and real datasets demonstrate the effectiveness and practicality of the SR-DPC method.
... So far, many methods [16][17][18][19][24][25] are proposed to measure the similarities of all the data points. In these methods, sparse representation is a typical resolution and has been confirmed its outperformed performance on the high-dimensional and/or highly sparse datasets. ...
... Therefore, in order to make full use of the local and the global information, we propose a sparse representation-based method [24,25] to calculate the weight of each data point, referring to [19], the objective function of the proposed method is defined as follows: ...
Preprint
Full-text available
The calculation method of density has an important influence on the clustering performance of density peak clustering method (DPC), the different density calculation methods are applied for the different datasets. To solve this problem, a self-representation weighted density peak clustering method (SR-DPC) is proposed in this study. Different from DPC, SR-DPC not only considers the local information of data points, but also enhances the influence of different data points on the data center by introducing the idea of weighting, so as to improve the accuracy of finding the data center.Furthermore, SR-DPC can adaptively reflect the influence of different data points on the data center points through the feature representation among data points, and the weighted Gaussian kernel distance is adopted to replace the Euclidean distance, so as to improve its clustering performance. Experimental results based on synthetic datasets and real datasets show that SR-DPC method is effective and practical.
... The same methodology, improved by Yang et al. in [54] by applying deep canonical correlation analysis, could not perform better than 0.947 (standard deviation 0.02) in OA and 0.71 (standard deviation 0.1) in KC. In [55], Roscher et al. achieved a KC of 0.80 on the same dataset but by using two Landsat 5 TM images, so performing homogeneous CD. Overall, our unsupervised method consistently outperforms the aforementioned methods, both in terms of OA and KC. ...
Preprint
Full-text available
Change detection in heterogeneous multitemporal satellite images is an emerging and challenging topic in remote sensing. In particular, one of the main challenges is to tackle the problem in an unsupervised manner. In this paper we propose an unsupervised framework for bitemporal heterogeneous change detection based on the comparison of affinity matrices and image regression. First, our method quantifies the similarity of affinity matrices computed from co-located image patches in the two images. This is done to automatically identify pixels that are likely to be unchanged. With the identified pixels as pseudo-training data, we learn a transformation to map the first image to the domain of the other image, and vice versa. Four regression methods are selected to carry out the transformation: Gaussian process regression, support vector regression, random forest regression, and a recently proposed kernel regression method called homogeneous pixel transformation. To evaluate the potentials and limitations of our framework, and also the benefits and disadvantages of each regression method, we perform experiments on two real data sets. The results indicate that the comparison of the affinity matrices can already be considered a change detection method by itself. However, image regression is shown to improve the results obtained by the previous step alone and produces accurate change detection maps despite of the heterogeneity of the multitemporal input data. Notably, the random forest regression approach excels by achieving similar accuracy as the other methods, but with a significantly lower computational cost and with fast and robust tuning of hyperparameters.
... The same methodology, improved by Yang et al. in [54] by applying deep canonical correlation analysis, could not perform better than 0.947 (standard deviation 0.02) in OA and 0.71 (standard deviation 0.1) in KC. In [55], Roscher et al. achieved a KC of 0.80 on the same dataset but by using two Landsat 5 TM images, so performing homogeneous CD. Overall, our unsupervised method consistently outperforms the aforementioned methods, both in terms of OA and KC. ...
Article
Full-text available
Change detection (CD) in heterogeneous multitemporal satellite images is an emerging and challenging topic in remote sensing. In particular, one of the main challenges is to tackle the problem in an unsupervised manner. In this paper, we propose an unsupervised framework for bitemporal heterogeneous CD based on the comparison of affinity matrices and image regression. First, our method quantifies the similarity of affinity matrices computed from colocated image patches in the two images. This is done to automatically identify pixels that are likely to be unchanged. With the identified pixels as pseudotraining data, we learn a transformation to map the first image to the domain of the other image and vice versa. Four regression methods are selected to carry out the transformation: Gaussian process regression, support vector regression, random forest regression (RFR), and a recently proposed kernel regression method called homogeneous pixel transformation. To evaluate the potentials and limitations of our framework and also the benefits and disadvantages of each regression method, we perform experiments on two real data sets. The results indicate that the comparison of the affinity matrices can already be considered a CD method by itself. However, image regression is shown to improve the results obtained by the previous step alone and produces accurate CD maps despite of the heterogeneity of the multitemporal input data. Notably, the RFR approach excels by achieving similar accuracy as the other methods, but with a significantly lower computational cost and with fast and robust tuning of hyperparameters.
Conference Paper
Full-text available
Self-taught learning (STL) has become a promising paradigm to exploit unlabeled data for classification. The most commonly used approach to self-taught learning is sparse representation, in which it is assumed that each sample can be represented by a weighted linear combination of elements of a unlabeled dictionary. This paper proposes discriminative archetypal self-taught learning for the application of landcover classification, in which unlabeled discriminative archetypal samples are selected to build a powerful dictionary. Our main contribution is to present an approach which utilizes reversible jump Markov chain Monte Carlo method to jointly determine the best set of archetypes and the number of elements to build the dictionary. Experiments are conducted using synthetic data, a multi-spectral Landsat 7 image of a study area in the Ukraine and the Zurich benchmark data set comprising 20 multispectral Quickbird images. Our results confirm that the proposed approach can learn discriminative features for classification and show better classification results compared to self-taught learning with the original feature representation and compared to randomly initialized archetypal dictionaries.
Article
Full-text available
We analyze the benefit of combining hyperspectral images information with 3D geometry information for the detection of Cercospora leaf spot disease symptoms on sugar beet plants. Besides commonly used one-class Support Vector Machines, we utilize an unsupervised sparse representation-based approach with group sparsity prior. Geometry information is incorporated by representing each sample of interest with an inclination-sorted dictionary, which can be seen as an 1D topographic dictionary. We compare this approach with a sparse representation based approach without geometry information and One-Class Support Vector Machines. One-Class Support Vector Machines are applied to hyperspectral data without geometry information as well as to hyperspectral images with additional pixelwise inclination information. Our results show a gain in accuracy when using geometry information beside spectral information regardless of the used approach. However, both methods have different demands on the data when applied to new test data sets. One-Class Support Vector Machines require full inclination information on test and training data whereas the topographic dictionary approach only need spectral information for reconstruction of test data once the dictionary is build by spectra with inclination.
Conference Paper
Full-text available
This paper introduces archetypal dictionaries for a self-taught learning framework for the application of landcover classification. Self-taught learning, an unsupervised representation learning method, is exploited to learn low-dimensional and discriminative higher-level features, which are used as input into a classification algorithm. Experiments are conducted using a multi-spectral Landsat 5 TM image of a study area in the north of Novo Progresso located in South America. Our results confirm that self-taught learning with archetypal dictionaries provide features, which can be used as input into a linear logistic regression classifier. The obtained classification accuracies are comparable to kernel-based classifier using the original features.
Article
Full-text available
In this paper we present an approach to perform relative spectral alignment between optical cross-sensor acquisitions. The proposed method aims at projecting the images from two different and possibly disjoint input spaces into a common latent space, in which standard change detection algorithms can be applied. The system relies on the regularized kernel canonical correlation analysis transformation (kCCA), which can accommodate nonlinear dependencies between pixels by means of kernel functions. To learn the projections, the method employs a subset of samples belonging to the unchanged areas or to uninteresting radiometric differences. Since the availability of ground truth information to perform model selection is limited, we propose a completely automatic strategy to select the hyperparameters of the system as well as the dimensionality of the transformed (latent) space. The proposed scheme is fully automatic and allows the use of any change detection algorithm in the transformed latent space. A synthetic problem built from real images and a case study involving a real cross-sensor change detection problem illustrate the capabilities of the proposed method. Results show that the proposed system outperforms the linear baseline and provides accuracies close the ones obtained with a fully supervised strategy. We provide a MATLAB implementation of the proposed method as well as the real cross-sensor data we prepared and employed at https://sites.google.com/site/michelevolpiresearch/codes/cross-sensor.
Article
Full-text available
Early water stress recognition is of great relevance in precision plant breeding and production. Hyperspectral imaging sensors can be a valuable tool for early stress detection with high spatio-temporal resolution. They gather large, high dimensional data cubes posing a significant challenge to data analysis. Classical supervised learning algorithms often fail in applied plant sciences due to their need of labelled datasets, which are difficult to obtain. Therefore, new approaches for unsupervised learning of relevant patterns are needed. We apply for the first time a recent matrix factorisation technique, simplex volume maximisation (SiVM), to hyperspectral data. It is an unsupervised classification approach, optimised for fast computation of massive datasets. It allows calculation of how similar each spectrum is to observed typical spectra. This provides the means to express how likely it is that one plant is suffering from stress. The method was tested for drought stress, applied to potted barley plants in a controlled rain-out shelter experiment and to agricultural corn plots subjected to a two factorial field setup altering water and nutrient availability. Both experiments were conducted on the canopy level. SiVM was significantly better than using a combination of established vegetation indices. In the corn plots, SiVM clearly separated the different treatments, even though the effects on leaf and canopy traits were subtle.
Article
Full-text available
Techniques from sparse signal representation are beginning to see significant impact in computer vision, often on nontraditional applications where the goal is not just to obtain a compact high-fidelity representation of the observed signal, but also to extract semantic information. The choice of dictionary plays a key role in bridging this gap: unconventional dictionaries consisting of, or learned from, the training samples themselves provide the key to obtaining state-of-the-art results and to attaching semantic meaning to sparse signal representations. Understanding the good performance of such unconventional dictionaries in turn demands new algorithmic and analytical techniques. This review paper highlights a few representative examples of how the interaction between sparse signal representation and computer vision can enrich both fields, and raises a number of open questions for further study.
Article
Scene classification plays an important role in the interpretation of remotely sensed high-resolution imagery. However, the performance of scene classification strongly relies on the discriminative power of feature representation, which is generally hand-engineered and requires a huge amount of domain-expert knowledge as well as time-consuming hand tuning. Recently, unsupervised feature learning (UFL) provides an alternative way to automatically learn discriminative feature representation from images. However, the performances achieved by conventional UFL methods are not comparable to the state-of-the-art, mainly due to the neglect of locally substantial image structures. This paper presents an improved UFL algorithm based on spectral clustering, named UFL-SC, which cannot only adaptively learn good local feature representations but also discover intrinsic structures of local image patches. In contrast to the standard UFL pipeline, UFL-SC first maps the original image patches into a low-dimensional and intrinsic feature space by linear manifold analysis techniques, and then learns a dictionary (e.g., using K-means clustering) on the patch manifold for feature encoding. To generate a feature representation for each local patch, an explicit parameterized feature encoding method, i.e., triangle encoding, is applied with the learned dictionary on the same patch manifold. The holistic feature representation of image scenes is finally obtained by building a bag-of-visual-words (BOW) model of the encoded local features. Experiments demonstrate that the proposed UFL-SC algorithm can extract efficient local features for image scenes and show comparable performance to the state-of-the-art approach on open scene classification benchmark.
Conference Paper
We propose and analyze a fast spectral clustering algorithm with computational complexity linear in the number of data points that is directly applicable to large-scale datasets. The algorithm combines two powerful techniques in machine learning: spectral clustering algorithms and Nyström methods commonly used to obtain good quality low rank approximations of large matrices. The proposed algorithm applies the Nyström approximation to the graph Laplacian to perform clustering. We provide theoretical analysis of the performance of the algorithm and show the error bound it achieves and we discuss the conditions under which the algorithm performance is comparable to spectral clustering with the original graph Laplacian. We also present empirical results.