Conference PaperPDF Available

A Retrieval System for Remotely Sensed Imagery

Authors:

Abstract and Figures

The retrieval of images in professional remote sensing databases is based on world-oriented information which is often not meaningful for users who have a limited knowledge about remote sensing but nevertheless have to work with satellite imagery. Moreover many applications require a different query formulation that cannot be performed with the provided standard mechanisms. Therefore this paper presents a content-based retrieval system that utilises the spectral information content of multispectral satellite imagery as well as the world-oriented features. The actual content-based query operates on the query-by-example approach – a powerful tool for both professional and non-professional remote sensing users. A further improved flexibility for posing queries is gained by not only supporting a-priori extracted features for a fast retrieval but also a dynamic approach which increases the quality of the retrieved images, i.e. the degree of image similarity.
Content may be subject to copyright.
A Retrieval System for Remotely Sensed Imagery
T. Bretschneider, O. Kao
Department of Computer Science, Technical University of Clausthal,
Julius-Albert-Strasse 4, 38678 Clausthal-Zellerfeld, Germany
Tel: +495323727140; Fax: +495323727149
{bretschneider,okao}@informatik.tu-clausthal.de
Abstract: The retrieval of images in professional re-
mote sensing databases is based on world-oriented
information which is often not meaningful for users
who have a limited knowledge about remote sensing
but nevertheless have to work with satellite imagery.
Moreover many applications require a different query
formulation that cannot be performed with the pro-
vided standard mechanisms. Therefore this paper pre-
sents a content-based retrieval system that utilises the
spectral information content of multispectral satellite
imagery as well as the world-oriented features. The
actual content-based query operates on the query-by-
example approach a powerful tool for both profes-
sional and non-professional remote sensing users. A
further improved flexibility for posing queries is
gained by not only supporting a-priori extracted fea-
tures for a fast retrieval but also a dynamic approach
which increases the quality of the retrieved images,
i.e. the degree of image similarity.
Keywords: Content-based retrieval, satellite im-
agery, multispectral feature extraction
1 Introduction
The development and application of orbital re-
mote sensing platforms result in the production of
huge amounts of image data – more than one
terabyte per satellite and day in the near future.
The obtained image data has to be systematically
collected, registered, organised, and classified in
a way that works completely unsupervised. Fur-
thermore, adequate search procedures and meth-
ods to formulate queries have to be provided to
access the information.
The retrieval of images in professional remote
sensing databases [1], [2], [3], [4], [5], [6] is
based on world-oriented information like the lo-
cation of the scene, the utilised scanner, and the
date of acquisition. However, these descriptions
are not meaningful for many users who have a
limited knowledge about remote sensing but nev-
ertheless have to work with satellite imagery. A
biologist who needs for a general assessment on
swampland in a large-scale region all corre-
sponding images depicts an example.
Therefore a content-based retrieval system
named (RS)2I (Retrieval System for Remotely
Sensed Imagery) was developed which uses an
example image as a starting point. Although the
idea is not new to the research community, none
of the published approaches [7], [8], [9], [10],
[11], [12] provides the required flexibility in gen-
eral. Either the systems are limited to a specific
satellite or need supervision. Both constraints
restrict the purpose of a general system beyond
the acceptable limit since the approaches are only
of use for an archive which is specialised on one
type of scanner or with enough man-power to
supervise the processing, respectively. This is
seldom true for users with a non-professional re-
mote sensing background. The proposed system
in this paper follows two different approaches
which can be combined, too. First all archived
satellite images are described by a selection of a-
priori extracted features, which are subsequently
stored in a relational database system. At query
time the user submits a sample image, which is
processed in the same way as the images in the
database. The features extracted from the sample
image can be compared with the corresponding
features of the archived images using a multidi-
mensional distance function. Based on the com-
puted distance a similarity ranking is created. The
second approach is based on dynamic feature ex-
traction. The main idea behind dynamic retrieval
is to extract the features after the query was for-
mulated and thus to adapt the extraction process
to the inherent requirements of the query image
and the preference of the user. Since the varia-
tions of the specific parameters are unlimited the
corresponding feature vectors cannot be stored
permanently. Finally, in either case a ranking is
presented to the user at the end. To get back to
the mentioned query on swampland, the most
sensitive approach is first to use the static world-
oriented features to restrict the further amount of
data to be processed. Then in the next step a-pri-
ori extracted features are used, e.g. contained
classes of ground cover which exhibit a high per-
centage of water. Since the features are most of
the time fairly abstract, the selected satellite im-
ages are dynamically post-processed to generate
the final query result. Note that each of the re-
trieval results can be used for a refinement of the
query parameters and feature values. The search
process is repeated iteratively, until the desired
image(s) occur(s) in the similarity ranking.
This paper is organised as follows: Section 2
describes the main system architecture of the
(RS)2I. Two of the required processing steps are
described in Section 3 in more detail. Finally the
conclusions and outlook complete the paper.
2 System Architecture
The architecture of the (RS)2I is similar to tradi-
tional image databases and consists of
Graphical user interface supporting query-by-
example,
Relational database system storing the meta-
information, i.e. the world-oriented features,
and the a-priori extracted features,
Index structures for accelerated access to all a-
priori known features, and
Parallel infrastructure for the efficient proc-
essing of the satellite imagery to enable dy-
namic feature extraction while guaranteeing a
reasonable response time.
The following sub-sections describe these com-
ponents in more detail.
2.1 Graphical User Interface
As already noted the graphical user interface al-
lows the user to initiate a similarity search using
a sample image. An illustration of the interface
layout is presented in Figure 1.
Figure 1: Graphical user interface of (RS)2I
The left hand side shows a vertical list of arbi-
trary user-defined query images while the right
hand side displays the ordered retrieval result
according to the chosen similarity measure. The
best match is displayed in the upper left corner of
the retrieval area and is certainly the query image
itself while the following sub-scenes are sections
of the same original satellite image with a quite
similar ground characteristic. On the later ranks
there are images of other scanners which exhibit
a comparable spectral information of related
ground cover.
The feature extraction and retrieval methods
were applied to a collection of multispectral
scenes of Landsat, SPOT, MOMS-02, and
IKONOS which offer spatial resolutions between
4m and 30m. The corresponding image archive
covers an area of more than 204,150 km2 and
contains a mixture of urban, agriculture, and for-
estry areas around the world. An in-depth analy-
sis of the retrieval results can be found in [13]
and is omitted here due to the limited space.
2.2 Database Features
The relational database stores the world-oriented
as well as the extracted content-based features.
The first set mainly consists of the type of scan-
ner, spectral characteristic of the system, acquisi-
tion time, scene location etc. which are supplied
by the respective satellite agency. The spectral
features are based on the outcome of a classifica-
tion process. i.e. the determined classes of ground
cover (Subsection 3.1). In total the feature vector
consists of 24 different elements whereby some
of them are related to the individual spectral band
of the scanner, thus extending the length of the
feature vector even further. The most dominant
features are the means and standard deviation of
the classes, spatial class neighbours, class com-
pactness in the image, size of classes etc. More-
over general features are stored which are not
related to a specific class but the whole classes
together, e.g. the mean over all class centres, av-
erage distance between classes. These enable the
database not just to search for individual classes
but also for groups of classes.
2.3 Index structures
The (RS)2I supports VP-trees (Vantage point) as
an index structure for the content-based retrieval
of the satellite imagery, which were introduced
by CHIUEH [14] and enable an efficient next
neighbour search. The advantage is obvious since
many queries in remote sensing are related to
geographical locations and the direct surround-
ing. The argumentation holds also true for fea-
tures extracted from spectral content since gener-
ally the focus is on the similarity of the spectral
characteristic.
Let M denote a set of n-dimensional vectors
with image features as components. In the inside
nodes of a VP-tree the vantage points are stored.
The leaf nodes contain the values of the extracted
features. The distribution of these values to the
vantage point can be easily demonstrated, if a
binary relation – as shown in Figure 2 – is given.
All values are sorted according to the distance to
the randomly chosen vantage point. Let
µ
be the
median of all distances, thus each sub tree, I> and
I contains half of all points, i.e. features related
to the actual images. Both-sub trees have to be
searched if the distance of the search parameter p
to the vantage point v is in the interval [
µ
-
σ
,
µ
+
σ
].
The next neighbour search starts in the root
node by calculating the distance of the search
parameter q to the vantage point v and
σ
is the
estimated maximum distance. The next
neighbour is element of a subset, if the condition
µ
i-
σ
< d(q, v) <
µ
i+1+σ holds.
Figure 2: Sub-division of a-priori extracted
features according to a randomly chosen
vantage point v
All subsets fulfilling this criterion are examined
until the next neighbour is found. An unsuccess-
ful search occurs if the value
σ
is too small, thus
the whole process has to be repeated with a modi-
fied
σ
. This adaptation can be performed by ad-
dition or multiplication of constant value. More-
over, the original algorithm is modified in the
way, that a ranking of the k>1 next neighbours
can be generated.
2.4 Parallel Infrastructure
Dynamic retrieval of satellite imagery requires
the analysis of all image sections in the remote
sensing database and produces an enormous
computational load since the requirements grow
exponentially by using arbitrary sections instead
of pre-defined tiles (Subsection 3.2). Therefore,
the utilisation of parallel architectures is neces-
sary for the solution of the performance problem.
An analysis of the main demands of the (RS)2I
shows that not the computing power but the huge
data volume and the communication are the lim-
iting factors [15], [16]. Thus cluster architectures
are the most suitable choice due to the shared
data transfer effort and the reasonable price per
node.
The proposed architecture for the (RS)2I is
based on a Beowulf cluster which consist of a
master for receiving queries, broadcasting the
work schedule, and unifying the intermediate re-
sults. The actual processing is done by the com-
puting nodes which access a disjoint subset of the
images in the database. Those are content in-
dependently partitioned because every content re-
lated partitioning restricts the general flexibility.
Note that only the dynamic feature extraction
requires parallel computing resources to keep the
response times short. All query operations based
on a-priori information are executed on the mas-
ter node since the implemented VP-trees enables
efficient processing.
Usually a-priori extracted features and dy-
namic approaches are used together in a query,
e.g. in the mentioned query on swampland. In the
first processing step the world-oriented and static
features lead to a selection of images which are
arbitrary distributed over the entire cluster – in
the worst case on a single node. Only a redistri-
bution of the data that has to be processed in the
dynamic phase enables the full use of the avail-
able computing resources. A simple but efficient
strategy for this workload balancing for the
(RS)2I was described in [17].
3 Data Processing
The key factors for the system are the processing
of incoming data to extract the feature vectors
and the processing for the dynamic retrieval.
From the perspective of signal processing both
parts are considered as being identically and there
will be no distinction henceforth.
3.1 Feature Extraction
In the following a brief overview of the feature
extraction technique will be given which results
in the described feature vector mentioned in Sub-
section 2.2. The emphasis is on a stable technique
for a variety of different multispectral satellite
images and not on a highly precise classification
map since the focus for the retrieval is on meas-
uring the similarity of images rather than on the
exact description of the ground cover.
The algorithm consists of three stages: First
the involved number of classes is estimated
whereby a larger number guarantees that a good
classification result even for scenes with a mani-
fold ground cover can be obtained. Scenes with
less various ground cover will be post-processed
to avoid the break-up of otherwise consistent
classes. Subsequently the actual classification
process is performed based on a modified version
of the algorithm by LOONEY [18]. The advantage
of the technique using a fuzzy approach in com-
bination with a standard k-means classifier is its
stability and reliability. However, the algorithm
was not developed for remotely sensed imagery
and therefore leads to an inadequate number of
resulting classes. Thus, in a second iterative proc-
essing stage classes are combined which do not
fulfil certain criteria, e.g. their appearance is not
of significance for the feature extraction. Note
that this constraint is not necessarily related to the
size of a class but also to its distance towards the
next closest class. A third step in the processing
chain post-processes the data and satisfy physical
constraints. For a detailed description of the pro-
posed method refer to [13].
3.2 Sub-division of Imagery
The unsupervised classification of remotely
sensed imagery, i.e. the feature extraction, is a
computational burdensome process, particularly
due to the size of the data which can easily be
larger than 3000×3000 pixel per image. The re-
quired processing time depends highly on the
image size and grows, like shown in Figure 3(a),
exponentially for a pre-defined number of con-
tained classes. A smaller, but still significant,
influence has the number of classes, in which the
satellite image has to be classified. Figure 3(b)
depicts an overview for a fixed image size of
256×256 pixel. In summary it is reasonable to
separate the data in independent sections, which
can be processed by the cluster simultaneously.
Moreover a sub-division of the images in the
database is believed to be advisable since the
huge coverage in a scene results in a large num-
ber of arising classes. In most cases this number
is considerably larger than the one in the query
image since usually the user’s region of interest
contains less variety in ground cover due to its
smaller extent. Thus a similarity measure be-
tween two different scenes might be influenced in
an undesired way.
A well-known solution for the sub-division of
images is the quad-tree structure. The algorithm
works recursively and divides a given image in
four sub-images if the image fulfils a given crite-
rion, e.g. its variance is greater than a pre-defined
threshold. Figure 4(a) shows an example where
the solid lines indicate the first stage of the algo-
rithm, the dashed lines the second stage, and the
dotted lines the smallest desired division into
sub-images.
0
100
200
300
400
0 100 200 300 400 500
Image size
Processing time
(a)
0
2
4
6
8
10
12
14
3 4 5 6 7 8 9 10
Number of Classes
Processing time
(b)
Figure 3: Required processing time for classi-
fication, (a) variable image size and constant
number of classes, (b) variable number of
classes and constant image size
The approach has the advantages to enable pow-
erful indexing techniques in the final database
and to reduce the number of feature vectors
which have to be stored. However, the idea to
find homogenous areas is not always beneficial,
e.g. if a medium sized urban area along the lake-
shore is of interest. The quad-tree would separate
the water from the building area. A way out is not
just to classify the resulting tiles at the bottom
level of the tree structure but the intermediate
sub-images, too. This is not feasible in practice
due to the significant increase of the burdensome
computation for the larger sub-images, like indi-
cated in Figure 3(a). Therefore this paper follows
the approach to arrange the remotely sensed
image in tiles of the same size. A second set of
tiles, which overlaps the first set, is used to partly
compensate for the discontinuities along the
edges of the tiles. The process is shown in Figure
4(b) where the solid lines indicate the first set and
the dashed lines the second overlapping set.
(a)
(b)
Figure 4: Sub-division of an image: (a) quad-
tree approach, (b) overlapping tiles
Although the approach increases the storage re-
quirements for the feature vectors, it is compu-
tational more economically than the quad-tree in
this particular case. However, there are a few
drawbacks to this solution. The main disadvan-
tage is that the size of the tiles has to be deter-
mined in advanced and does not necessarily
match an appropriate choice with respect to the
size of the query image. One way out of this
problem is the application of the dynamic ap-
proach which increases the needed processing
time dramatically. Instead a modified quad-tree
approach is proposed that is not based on the
pixel values itself but on the next higher level of
abstraction – namely the extracted features. Thus
a new feature vector is created without process-
ing the image data. Although the results are dif-
ferent from the actual extracted features the de-
gree of similarity is sufficiently large due to the
immense reduction of the dimensionality from
the image to the describing vector which sup-
presses not required details. The fusion process is
straightforward and is based on statistics since
the classes themselves are expressed by statistical
measures.
4 Conclusion
A system for the content-based retrieval of mul-
tispectral satellite imagery was presented. The
approach combines the idea of using world-ori-
ented features as well as a-priori and dynamically
extracted feature to enable queries which were
not possible with standard image databases for
remotely sensed optical data. The results have
shown that the utilisation of the provided mecha-
nisms makes complex retrieval operation even for
persons without in-depth remote sensing back-
ground knowledge possibly.
The two main foci of the paper are on the
image database system architecture with its un-
derlying functionality and specific parts of the
data processing.
Future work includes the incorporation of ad-
ditional feature extraction mechanisms beyond
the usage of the spectral information. Particularly
the exploitation of spatial features is of interest
which is not only characterised by shape but also
by texture, i.e. the spatial distribution of spectral
information.
References
[1] J.T. Anderson, M. Stonebraker, “Sequoia 2000
metadata schema for satellite images”,
SIGMOD Record, vol. 23, no. 4, pp. 42–48,
1994.
[2] C.-S. Li, J.J. Turek, and E.Feig, “Progressive
template matching for content-based retrieval
of earth observing satellite image database”,
Proceedings of the SPIE, vol. 2606, pp. 134–
144, 1995.
[3] T. Stephenson, H. Voorhees, “IMACT: An
interactive, multiterabyte image archive”, Pro-
ceedings of the IEEE Symposium on Mass
Storage Systems, pp. 146–161, 1995.
[4] D. Chang, B. Moon, A. Acharya, and C. Shock,
“Titan: A high-performance remote sensing
database”, Proceedings of the Conference on
Data Engineering, pp. 375–383, 1997.
[5] C.T. Shock, C. Chang, B. Moon, A. Acharya,
L. Davis, J. Saltz, and A. Sussman, “The de-
sign and evaluation of a high-performance
earth science database”. Parallel Computing,
vol. 24, pp. 65–89, 1998.
[6] G.B. Marchisio, W.-H. Li, M. Sannella, and
J.R. Goldschneider, “GeoBrowser: An inte-
grated environment for satellite image retrieval
and mining”, Proceedings of the IEEE Interna-
tional Geoscience and Remote Sensing Sympo-
sium, vol. 2, pp. 669–673, 1998.
[7] A. Vellaikal, C. Kuo, and S. Dao, “Content-
based retrieval of remote sensed images using
vector quantization”, Proceedings of SPIE, vol.
2488, pp. 178–189, 1995.
[8] J. Barros, J. French, and W. Martin, “System
for indexing multi-spectral satellite images for
efficient content-based retrieval”, Proceedings
of the SPIE, vol. 2420, pp. 228–237, 1995.
[9] K. Seidel, M. Schröder, H. Rehrauer, G.
Schwarz, and M. Datcu, “Query by image
content from remote sensing archives”, Pro-
ceedings of the IEEE International Conference
on Geoscience and Remote Sensing, vol. 1, pp.
393–396, 1998.
[10] D.G. Healey, A. Jain, “Retrieving multispectral
satellite images using physics-based invariant
representations“, IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 18, no.
8, pp. 842–848, 1996.
[11] G.B. Marchisio, J. Cornelison, “Content-based
search and clustering of remotely sensing im-
agery”, Proceedings of the IEEE International
Conference on Geoscience and Remote Sens-
ing, vol. 1, pp. 290–292, 1999.
[12] L. M. del Val Cura, N.J. Leite, and C.B.
Medeiros, “An architecture for content-based
retrieval of remote sensing images”. Proceed-
ings of the IEEE International Conference on
Multimedia and Expo, vol. 1, pp. 303–306,
2000.
[13] T. Bretschneider, R. Cavet, and O.Kao, “Re-
trieval of Remotely Sensed Imagery Using
Spectral Information Content”, to appear in:
Proceedings of the International Geoscience
and Remote Sensing Symposium 2002, Toronto,
Canada.
[14] T. Chiueh, “Content-based image indexing”,
Proceedings of the 20th Conference on Very
Large Databases, 582--593, 1994.
[15] T. Bretschneider, S. Geisler, and O. Kao,
“Simulation-based assessment of parallel ar-
chitectures for image databases”, to be pub-
lished in: Proceedings of the Conference on
Parallel Computing, 2001.
[16] S. Geisler, O. Kao, and T. Bretschneider,
“Analysis of cluster topologies for workload
balancing strategies in image databases”, Pro-
ceedings of the International Conference on
Parallel and Distributed Processing Tech-
niques and Applications, pp. 874–880, 2001.
[17] T. Bretschneider, O. Kao, “Retrieval of Multi-
spectral Satellite Imagery on Cluster Architec-
tures”, submitted to EuroPar 2002, Paderborn,
Germany.
[18] C. Looney, “A fuzzy clustering and fuzzy
merging algorithm”, Technical Report, CS-
UNR-101-1999, 1999.
... Again, these are defined through the particular application and can vary from low-dimensional vectors with scalar values to high-dimensional data with a complex structure. This section briefly introduces the features' characteristics that were derived for a CBIR database specialised on multispectral satellite imagery [8], [9]. ...
... In the VP de-clustering strategy, a vantage point is selected by the same algorithm that was used for the actual tree [12]. For example, if the points in Fig. 1 are de-clustered and distributed to three computing nodes, then according to the PI algorithm the combinations (2,7,9), (1,4,6) and (2,5,8) are assigned to different computing nodes. The VP algorithm distributes the information over the computing nodes as (1,4,7), (2,5,8) and (3,6,9). ...
... For example, if the points in Fig. 1 are de-clustered and distributed to three computing nodes, then according to the PI algorithm the combinations (2,7,9), (1,4,6) and (2,5,8) are assigned to different computing nodes. The VP algorithm distributes the information over the computing nodes as (1,4,7), (2,5,8) and (3,6,9). This is just one example and Section 5 provides more detailed experiments. ...
Conference Paper
Full-text available
Most modern image database systems employ content-based image re trieval techniques and various multi-dimensional indexing structures to speed up the query performance. While the first aspect ensures an intuitive re trieval for the user, the latter guarantees an efficient han dling of huge data amounts. How ever, beyond a system inherent threshold only the simultaneous paral lelisa tion of the indexing structure can improve the system’s performance. In such an ap proach one of the key factors is the de-clustering of the data. To tackle the high lighted issues, this pa per proposes an effective multi-dimensional in dex strat egy with de-clustering based on the vantage point tree with suitable simi lar ity measure for content-based re trieval. The conducted experiments show the effec tive and efficient behaviour for an actual image database.
... Content-based image retrieval takes images as queries, rather than keywords, whose performance therefore is extremely dependent on the visual features [22,23]. For promoting the accuracy of RS image retrieval, early studies mainly focused on seeking various feature representation methods, hoping to find more discriminating image features [24][25][26][27][28][29][30][31][32][33][34][35][36][37][38][39][40][41][42] or feature combinations [43][44][45][46][47][48][49][50][51][52][53]. Nevertheless, due to the drastically increasing volume and complexity of RS data, visual features may become subjective and ambiguous in some situations [4]. ...
Preprint
Remote sensing (RS) image retrieval is of great significant for geological information mining. Over the past two decades, a large amount of research on this task has been carried out, which mainly focuses on the following three core issues: feature extraction, similarity metric and relevance feedback. Due to the complexity and multiformity of ground objects in high-resolution remote sensing (HRRS) images, there is still room for improvement in the current retrieval approaches. In this paper, we analyze the three core issues of RS image retrieval and provide a comprehensive review on existing methods. Furthermore, for the goal to advance the state-of-the-art in HRRS image retrieval, we focus on the feature extraction issue and delve how to use powerful deep representations to address this task. We conduct systematic investigation on evaluating correlative factors that may affect the performance of deep features. By optimizing each factor, we acquire remarkable retrieval results on publicly available HRRS datasets. Finally, we explain the experimental phenomenon in detail and draw conclusions according to our analysis. Our work can serve as a guiding role for the research of content-based RS image retrieval.
... Although the spectral feature is one of the simplest feature representations, it encodes the reflectance of the corresponding areas of the Earth surface and depicts the most prominent information of RS imagery [60]. In literature, the spectral feature has been adopted in many RS image retrieval methods [123][124][125][126]. However, the spectral feature based RS image retrieval methods often present serious sensitivity to noise and illumination change [64]. ...
Article
The blooming proliferation of aeronautics and astronautics platforms, together with the ever-increasing remote sensing imaging sensors on these platforms, has led to the formation of rapidly-growing earth observation data with the characteristics of large volume, large variety, large velocity, large veracity and large value, which raises awareness about the importance of large-scale image processing, fusion and mining. Unconsciously, we have entered an era of big earth data, also called remote sensing (RS) big data. Although RS big data provides great opportunities for a broad range of applications such as disaster rescue, global security, and so forth, it inevitably poses many additional processing challenges. As one of the most fundamental and important tasks in RS big data mining, image retrieval (i.e., image information mining) from RS big data has attracted continuous research interests in the last several decades. This paper mainly works for systematically reviewing the emerging achievements for image retrieval from RS big data. And then this paper further discusses the RS image retrieval based applications including fusion-oriented RS image processing, geo-localization and disaster rescue. To facilitate the quantitative evaluation of the RS image retrieval technique, this paper gives a list of publicly open datasets and evaluation metrics, and briefly recalls the mainstream methods on two representative benchmarks of RS image retrieval. Considering the latest advances from multiple domains including computer vision, machine learning and knowledge engineering, this paper points out some promising research directions towards RS big data mining. From this survey, engineers from industry may find skills to improve their RS image retrieval systems and researchers from academia may find ideas to conduct some innovative work.
... Content-based image retrieval indexes with query images, rather than keywords, so its performance is extremely dependent on the visual features of the images. For promoting the accuracy of RS image retrieval, early studies mainly focused on seeking various feature representation methods, hoping to find the best image feature [21,[26][27][28][29][30][31][32][33][34][35][36][37][38][39][40][41][42][43] or feature combination [44][45][46][47][48][49][50][51][52][53][54]. ...
Article
Full-text available
Remote sensing (RS) image retrieval based on visual content is of great significance for geological information mining. Over the past two decades, a large amount of research on this task has been carried out, which mainly focuses on the following three core issues of image retrieval: visual feature, similarity metric and relevance feedback. Along with the advance of these issues, the technology of RS image retrieval has been developed comparatively mature. However, due to the complexity and multiformity of high-resolution remote sensing (HRRS) images, there is still room for improvement in the current methods on HRRS data retrieval. In this paper, we analyze the three key aspects of retrieval and provide a comprehensive review on content-based RS image retrieval methods. Furthermore, for the goal to advance the state-of-the-art in HRRS image retrieval, we focus on the visual feature aspect and delve how to use powerful deep representations in this task. We conduct systematic investigation on evaluating factors that may affect the performance of deep features. By optimizing each factor, we acquire remarkable retrieval results on publicly available HRRS datasets. Finally, we explain the experimental phenomenon in detail and draw instructive conclusions according to our analysis. Our work can serve as a guiding role for the research of content-based RS image retrieval.
... Similarity based image retrieval has been proposed as an automated method for managing and interacting with the the growing collections of remote sensed imagery. As in other domains, a variety of descriptors have been investigated including spectral[3,4], shape[14], texture[11,7,10,15,20], and combinations such as multi-spectral texture[19]. While the most effective descriptor is problem dependent, texture features have enjoyed success since, unlike spectral features, they incorporate spatial information which is clearly important for remote sensed imagery but avoid the difficult pre-processing step of segmentation needed to extract shape features. ...
Conference Paper
Full-text available
We investigate the application of a new category of low-level image descriptors termed interest points to remote sensed image analysis. In particular, we compare how scale and rotation invariant descriptors extracted from salient image locations perform compared to proven global texture fea- tures for similarity retrieval. Qualitative results using a geo- graphic image retrieval application and quantitative results using an extensive ground truth dataset show that inter- est point descriptors support effective similarity retrieval in large collections of remote sensed imagery.
... This allows a user to specify that the region of interest should be cloud free. Users are allowed to retrieve Landsat images using generic features that are dynamic or determined a-priori such as urban, agricultural, or forest areas in [17]. A toolbox using Gibbs Random Fields, a texture based approach, is described in [18]. ...
Conference Paper
Full-text available
This paper presents a shape-based retrieval system and its application to infrared satellite images. A complete system is presented, from region extraction of a full hemisphere scan to the actual retrieval mechanism. After region extraction, polygonal approximation is applied to the region shape, and local features of the polygons are hashed to provide an association space. This space becomes the indexing structure through which retrieval takes place. Although the indexing stage, containing region extraction and polygonal approximation, is slow, the actual retrieval is very fast. On average, retrieval of a query shape from a database of 1965 shapes takes 0.7 seconds for the more reduced representation, and 2.8 seconds for the less reduced representation consisting of 1914 shapes. The overall design is good for a moderately sized database, and extensions could be made to apply the method to a massive database. The results show that the approach performs well, and that there is a substantial speed benefit for using the local association hashing method.
Chapter
Content based satellite image retrieval is among the most vital technologies for military application. Huge volumes of satellite imagery get generated day by day. Management of huge archives of satellite image requires intelligent image retrieval system. In this paper, we have carried out study using SOTA Vision Transformer and other deep CNN based features in conjunction with various matching and indexing methods. We conducted exhaustive experiments by employing various deep features and different variants of Vision Transformer with variety of feature vector matching techniques. Experimental results conducted on various benchmark satellite image retrieval dataset demonstrate the promising capability of the Vision Transformer in conjunction with Ball tree. ResNet & DenseNet have outperformed on two benchmark datasets namely RS19, & PatterNet. ViT has outperformed on RSSC7N dataset & has performed on par with ResNet & DenseNet on UCM dataset.KeywordsResNetDenseNetViTBall treeBERT
Article
This paper presents a novel content-based remote sensing (RS) image retrieval system that consists of the following. First, an image description method that characterizes both spatial and spectral information content of RS images. Second, a supervised retrieval method that efficiently models and exploits the sparsity of RS image descriptors. The proposed image description method characterizes the spectral content by three different novel spectral descriptors that are: raw pixel values, simple bag of spectral values and the extended bag of spectral values descriptors. To model the spatial content of RS images, we consider the well-known scale invariant feature transform-based bag of visual words approach. With the conjunction of the spatial and the spectral descriptors, RS image retrieval is achieved by a novel sparse reconstruction-based RS image retrieval method. The proposed method considers a novel measure of label likelihood in the framework of sparse reconstruction-based classifiers and generalizes the original sparse classifier to the case both single-label and multi-label RS image retrieval problems. Finally, to enhance retrieval performance, we introduce a strategy to exploit the sensitivity of the sparse reconstruction-based method to different dictionary words. Experimental results obtained on two benchmark archives show the effectiveness of the proposed system.
Article
Conventional relevance feedback (RF) schemes improve the performance of content-based image retrieval (CBIR) requiring the user to annotate a large number of images. To reduce the labeling effort of the user, this paper presents a novel active learning (AL) method to drive RF for retrieving remote sensing images from large archives in the framework of the support vector machine classifier. The proposed AL method is specifically designed for CBIR and defines an effective and as small as possible set of relevant and irrelevant images with regard to a general query image by jointly evaluating three criteria: uncertainty; diversity; and density of images in the archive. The uncertainty and diversity criteria aim at selecting the most informative images in the archive, whereas the density criterion goal is to choose the images that are representative of the underlying distribution of data in the archive. The proposed AL method assesses jointly the three criteria based on two successive steps. In the first step, the most uncertain (i.e., ambiguous) images are selected from the archive on the basis of the margin sampling strategy. In the second step, the images that are both diverse (i.e., distant) to each other and associated to the high-density regions of the image feature space in the archive are chosen from the most uncertain images. This step is achieved by a novel clustering-based strategy. The proposed AL method for driving the RF contributes to mitigate problems of unbalanced and biased set of relevant and irrelevant images. Experimental results show the effectiveness of the proposed AL method.
Article
This paper investigates local invariant features for geographic (overhead) image retrieval. Local features are particularly well suited for the newer generations of aerial and satellite imagery whose increased spatial resolution, often just tens of centimeters per pixel, allows a greater range of objects and spatial patterns to be recognized than ever before. Local invariant features have been successfully applied to a broad range of computer vision problems and, as such, are receiving increased attention from the remote sensing community particularly for challenging tasks such as detection and classification. We perform an extensive evaluation of local invariant features for image retrieval of land-use/land-cover (LULC) classes in high-resolution aerial imagery. We report on the effects of a number of design parameters on a bag-of-visual-words (BOVW) representation including saliency- versus grid-based local feature extraction, the size of the visual codebook, the clustering algorithm used to create the codebook, and the dissimilarity measure used to compare the BOVW representations. We also perform comparisons with standard features such as color and texture. The performance is quantitatively evaluated using a first-of-its-kind LULC ground truth data set which will be made publicly available to other researchers. In addition to reporting on the effects of the core design parameters, we also describe interesting findings such as the performance-efficiency tradeoffs that are possible through the appropriate pairings of different-sized codebooks and dissimilarity measures. While the focus is on image retrieval, we expect our insights to be informative for other applications such as detection and classification.
Article
Full-text available
In this paper we present a progressive template matching algorithm that can be used when performing content-based retrieval on images or videos that are stored using DCT based block transforms such as those used in the JPEG compression standard. In the proposed method, the template matching is initially performed on a low-resolution version of the image consisting of the low frequency coefficients from the DCT-transformed image. The template is then matched against the neighborhood of the resulting hit(s) by incorporating additional DCT coefficients into the analysis. We have conducted preliminary experiments on a database consisting of large satellite images; our results show that the progressive template matching methodology yields significant computational speedup over the conventional approach.
Conference Paper
Full-text available
The retrieval of images in professional remote sensing databases is based on world-oriented information like for example the location of the scene, the utilised scanner, and the date of acquisition. However, these descriptions are not meaningful for many users who have a limited knowledge about remote sensing but nevertheless have to work with satellite imagery. Therefore the retrieval of an appropriate scene is often almost impossible since adequate techniques are not provided by the database systems. This paper proposes the content-based retrieval for satellite imagery using the query-by-example approach. The main contribution is the reliable and scanner independent technique for the feature extraction and representation. The entire approach is unsupervised to stay abreast of the daily increasing huge data volume.
Conference Paper
Full-text available
Intelligent query and retrieval techniques from remote sensing archives become more and more important with the increasing number of satellites in orbit acquiring more and more data. At the same time the increasing resolution of the sensors produces images of higher complexity. To allow easy access to this information, thus to enhance usage of the data, content-based query techniques are mandatory. The authors present the structure of a new intelligent remote sensing image archive providing query by image content. First they capture the information in the image using a family of robust signal models which are not selected according to a certain application but instead are able to describe the information in a condensed way. In a next level of information processing the images in the archive are clustered using conjectural models. Finally, the application-oriented user query is answered on a semantic level. In this way, users of various fields of remote sensing applications are able to see their data in the archive
Conference Paper
The retrieval of images in remote sensing databases is based on world-oriented information like the location of the scene, the utilised scanner, and the date of acquisition. However, these descriptions are not meaningful for many users who have a limited knowledge about remote sensing but nevertheless have to work with satellite imagery. Therefore a content-based dynamic retrieval technique using a cluster architecture to fulfil the resulting computational requirements is proposed. Initially the satellite images are distributed evenly over the available computing nodes and the retrieval operations are performed simultaneously. The dynamic strategy creates the need for a workload balancing before the sub-results are joined in a final ranking.
Conference Paper
This paper presents an architecture of a system for content-based retrieval of remote sensing images (RSI). This architecture relies on combining results from image processing, pattern recognition and multimedia databases. RSI retrieval from a database is based on the notion of image similarity. A novel aspect in the paper is that the degree of similarity among images is refined during a query session, and is computed dynamically based on user-provided texture and color image patterns, as well as information on how images were pre-processed before being stored in the database
Conference Paper
Current feature-based image databases can typically perform efficient and effective searches on scalar feature information. However, many important features, such as graphs, histograms, and probability density functions, have more complex structure. Mechanisms to manipulate complex feature data are not currently well understood and must be further developed. The work we discuss in this paper explores techniques for the exploitation of spectral distribution information in a feature-based image database. A six band image was segmented into regions and spectral information for each region was maintained. A similarity measure for the spectral information is proposed and experiments are conducted to test its effectiveness. The objective of our current work is to determine if these techniques are effective and efficient at managing this type of image feature data.
Article
Sequoia 2000 schema development is based on emerging geospatial standards to accelerate development and facilitate data exchange. This paper focuses on the metadata schema for digital satellite images. We examine how satellite metadata are defined, used, and maintained. We discuss the geospatial standards we are using, and describe a SQL prototype that is based on the Spatial Archive and Interchange Format (SAIF) standard and implemented in the Illustra object-relational database.
Conference Paper
The increasing amount of imagery to be collected by the Earth Observing System Data Information System (EOSDIS) emphasizes the need for intelligent retrieval infrastructures which enable model fitting and hypothesis testing on a very large scale, rather than on a small subset of the available data. The authors' work addresses two strategic challenges: data streaming and organization. The first involves the reduction of raw multispectral image data into attribute terms, which quantify a number of parameters of scientific interest and their temporal evolution at different spatial scales. To this end, they are developing algorithms for feature extraction which hold great promise for the automatic categorization of large collections of images. The second challenge is to provide the technology which can organize the extracted features and turn them into information. The solution requires the customization and embedding of sophisticated statistical algorithms in a database management system. Examples are agglomerative or divisive clustering of database attributes, multivariate techniques for inspecting the dependence of any attribute on two or more other attributes, and classification and regression trees for discovering hidden relationships among attributes. The emerging field of knowledge discovery in databases (KDD) has spawned a renaissance in statistical computing and has much to offer to the remote sensing community, if they are to make the best use of the volume of the new measurements