This version is available at https://doi.org/10.14279/depositonce-10872
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained
for all other uses, in any current or future media, including reprinting/republishing this material
for advertising or promotional purposes, creating new collective works, for resale or redistribution
to servers or lists, or reuse of any copyrighted component of this work in other works.
Terms of Use
Fernandez-Beltran, R., Demir, B., Pla, F., & Plaza, A. (2020). Unsupervised Remote Sensing Image
Retrieval Using Probabilistic Latent Semantic Hashing. IEEE Geoscience and Remote Sensing Letters, 1–5.
https://doi.org/10.1109/lgrs.2020.2969491
Ruben Fernandez-Beltran, Begüm Demir, Filiberto Pla, Antonio Plaza
Unsupervised Remote Sensing Image
Retrieval Using Probabilistic Latent
Semantic Hashin
g
Accepted manuscript (Postprint) Journal article |
IEEE GEOSCIENCE AND REMO TE SENSING LETTERS 1
Unsupervised Remote Sensing Image Retrie v al
using Probabilistic Latent Semantic Hashing
Ruben Fernandez-Beltran, Beg ¨
um Demir , Senior Member , IEEE , Filiberto Pla and Antonio Plaza, F ellow , IEEE
Abstract —Unsupervised hashing methods ha ve attracted con-
siderable attention in large-scale r emote sensing (RS) image
retrie val, due to their capability f or massiv e data processing with
significantly reduced storage and computation. Although exist-
ing unsupervised hashing methods ar e suitable f or operational
applications, they exhibit limitations when accurately modelling
the complex semantic content present in RS images using binary
codes (in an unsupervised manner). T o address this pr oblem, in
this letter we introduce a no vel unsupervised hashing method
which takes adv antages of the generative natur e of probabilistic
topic models to encapsulate the hidden semantic patterns of
the data into the final binary repr esentation. Specifically , we
introduce a new pr obabilistic latent semantic hashing (pLSH)
model to effectiv ely learn the hash codes using thr ee main steps:
(A) data gr ouping, where the input RS ar chive is cluster ed into
sev eral groups; (B) topic computation, wher e the pLSH model
is used to uncov er highly descriptive hidden patter ns from each
group; and (C) hash code generation, wher e the data probability
distrib utions are thr esholded to generate the final binary codes.
Our experimental r esults, obtained on two benchmark archi ves,
re veal that the pr oposed method significantly outperf orms state-
of-the-art unsupervised hashing methods.
Index T erms —Remote sensing, image retrie val, hash codes,
unsupervised hashing, pr obabilistic topic models.
I . I NTR ODUCTION
The fast de v elopment of satellite technologies has resulted
in the a vailability of massi v e remote sensing (RS) image
archi ves, which call for ef ficient and ef fecti ve strate gies for
image search and retrie v al. T raditional methods often e xploit
exact nearest neighbor search approaches, which exhausti v ely
compare the query image with each image in the archi ve. This
approach, which is also called e xhaustive linear scan , is time-
consuming and thus inappropriate for lar ge-scale image search
and retrie v al problems. T o ov ercome this issue, approximate
nearest neighbor search strategies based on hashing techniques
ha ve been recently pro ven to be ef fecti ve in order to reduce
the cost of content-based image retrie v al (CBIR) in terms of
both processing time and storage requirements [ 1 ]. Hashing
This work w as supported by Generalitat V alenciana (APOSTD/2017/007),
the Spanish Ministry of Economy (ESP2016-79503-C2-2-P , R TI2018-098651-
B-C54, TIN2015-63646-C5-5-R), Junta de Extremadura (Ref. GR18060), the
European Union under the H2020 EO XPOSURE project (No. 734541) and
the European Research Council under the ERC Starting Grant BigEarth (No.
759764). (Corr esponding author: R. F ernandez-Beltran.)
R. Fernandez-Beltran and F . Pla are with the Institute of Ne w Imaging
T echnologies, Univ ersity Jaume I, 12071 Castell ´
on de la Plana, Spain. (e-
mail: [email protected] , [email protected] ). B. Demir is with the Faculty of Electrical
Engineering and Computer Science, T echnische Uni versitat Berlin, D-10587
Berlin, Germany (e-mail: [email protected] ). A. Plaza is with the Hyper -
spectral Computing Laboratory , Department of T echnology of Computers
and Communications, Escuela Polit ´
ecnica, Uni versity of Extremadura, 10003
C ´
aceres, Spain.(e-mail: [email protected] ).
methods aim at learning hash functions that map the original
high-dimensional image descriptors into lo w-dimensional bi-
nary codes, such that the similarity within the original image
feature space can be well-preserved.
Hashing methods can be di vided into two main cate gories:
i) data-independent hashing methods; and ii) data depen-
dent (also kno wn as learning-based) hashing methods. Data-
independent methods like Locality-Sensiti v e Hashing (LSH)
[ 2 ] define hash functions by random projections that guarantee
a high probability of collision for similar input images. Thus,
they remain una ware of the data distrib ution, and require
long codes to achie ve a high retrie v al performance [ 3 ], [ 1 ].
Data-dependent hashing methods can learn more compact
binary codes by utilizing a set of data samples from the
considered archi ve, and can be roughly di vided into tw o sub-
categories. The first one includes supervised hashing methods,
in which supervised information (i.e., annotations of images)
is necessary for learning the hash functions. For instance,
Demir et al. presented, adapted to RS data properties and
tested a supervised kernel-based hashing method [ 1 ], while
Li et al. [ 4 ] introduced a deep hashing neural network
(DHNN) to to address CBIR in RS. The DHNN jointly learns
semantically accurate deep image features and binary hash
codes by employing a high number of annotated images.
Due to the use of such annotated images, supervised methods
produce discriminati ve hash codes that satisfy the requirement
of semantic similarity between the images. Ho we ver , it is
time consuming and expensi v e to obtain a suf ficient number
of high-quality annotated RS images, particularly for large-
scale CBIR problems. The second sub-cate gory comprises
unsupervised hashing methods that do not require annotated
images for learning hash codes.
In this letter , we focus on learning-based unsupervised
hashing methods due to their rele v ance for operational RS
image retrie v al scenarios. In the RS community , there are fe w
unsupervised hashing strategies a v ailable. As an e xample, the
kernel-based unsupervised hashing (KULSH) [ 5 ] that defines
hash functions for high-dimensional nonlinearly separable RS
image descriptors was adopted for RS-based CBIR problems
in [ 1 ]. The KULSH is defined based on the LSH, formulating
the random projections in the kernel space by using a small
set of images from the considered archi ve. Li et al. present in
[ 3 ] the partial randomness hashing (PRH) method, which uses
random projections to produce an initial estimation of the hash
codes and then learns a linear model to re-project these codes
onto the original feature space. Finally , the transpose of the
projection matrix is used to generate the binary codes. Reato
et al. propose in [ 6 ] a multi-code hashing method that initially
IEEE GEOSCIENCE AND REMO TE SENSING LETTERS 2
characterizes the images by using descriptors of primiti ve-
sensiti ve clusters, and then constructs the multi-hash codes
from these descriptors using the KULSH. It is worth noting
that, unlike in the RS community , in the computer vision and
multimedia communities the use of hashing is more e xtended
and widely studied. The anchor graphs hashing (A GH) [ 7 ], the
isotropic hashing (IsoH) method [ 8 ], the compressed hashing
(CH) [ 9 ], the harmonious hashing (HamH) [ 10 ] and the density
sensiti ve hashing (DSH) [ 11 ] methods are e xamples of widely
used unsupervised hashing methods in that context.
Although the abov e-mentioned unsupervised hashing meth-
ods are rele v ant for operational applications, the hash codes
produced by these algorithms might be not discriminati ve and
descripti ve enough to model the high-le v el semantic content
of images under complex RS image retrie v al tasks. T o address
this problem, this letter presents a ne w unsupervised hashing
technique based on probabilistic topic models. These models
[ 12 ] ha ve been recently used to pro vide RS imagery with
a higher le vel of semantic understanding [ 13 ]. This w ork
takes adv antage of the generati v e nature of probabilistic topic
models to produce highly descripti ve binary codes using the
latent semantic patterns perv ading the RS data. Specifically , we
define a ne w topic model called probabilistic latent semantic
hashing (pLSH) that learns the binary representation of RS
images with complex semantic content. Our e xperiments on
two benchmark archi v es demonstrate that the proposed method
outperforms state-of-the-art unsupervised hashing methods
presented in both the RS and computer vision communities.
I I . P R O B A B I L I S T I C L A T E N T S EMANTIC H ASHING
Let X = { X 1 , ..., X N } be a complete RS archi ve with
N images, which are characterized according to a specific K -
dimensional original feature space, i.e. X i = { x 1
i , ..., x K
i } . Let
Λ b = { Λ b
1 , ..., Λ b
N } be the corresponding set of L -dimensional
binary codes, where Λ b
i ∈ Z L
2 is the hash code of X i . Gi ven
a query image X q , the objecti ve is to find the most similar
images from the archi ve (using their hash codes) in order to
reduce search time and storage requirements. The proposed
unsupervised hashing method (Fig. 1 ) is defined based on the
three steps described in the follo wing subsections.
A. Data gr ouping
This step aims at di viding the RS archi ve into dif ferent
groups, according to the similarities between the images in
the original feature space. In order to achie ve this goal, we
cluster the initial RS image archi ve X into G disjoint groups,
such that X = ∪ G
i =1 C i . On the one hand, this approach
allo ws us to process the whole archi ve using a mini-batch
scheme, where the number of images that are simultaneously
handled can be substantially reduced. Note that this scheme
provides important adv antages in operational RS frame w orks.
On the other hand, it also pursues to detect certain data
similarities in the original feature space, to make the topics
highly specialized in terms of the hidden feature patterns
associated to each group. Note that each group contains similar
images in the original feature space, and thus the extracted
topics can accurately characterize the semantic dif ferences
among potentially ambiguous images. In this letter , we use the
well-kno wn k -means clustering for grouping the images of the
archi ve, whereas an y clustering algorithm could be exploited.
B. T opic computation
The aim of this step is to extract the hidden feature
patterns (topics) of each image group, and to represent the
whole archi ve into a set of unco vered topics. T o this end,
we introduce the pLSH topic model (Fig. 1 -B). This model
has been designed to sequentially extract Z = [ L/G ] topics
from each group as Θ ∗ = { Θ ∗
1 , ..., Θ ∗
L } , and to represent the
images into these topics as Λ = { Λ 1 , ..., Λ N } . The pLSH
consists of three observ able random v ariables d , z ∗ and w , one
hidden random v ariable z , and one regularization parameter
δ . In detail, d represents the images of a particular group;
z ∗ denotes the topics extracted from the pre vious groups;
w symbolizes the features of the original feature space, and
z is the set of topics extracted from the current group. In
addition, four directed connections relate images to topics
and topics to features. Unlike the re gular probabilistic latent
semantic analysis (pLSA) [ 12 ], our model design is able to
simultaneously express the whole RS image archi v e in terms
of two dif ferent sets of hidden patterns, gi v en by the di ver ging
random v ariables z ∗ and z . In addition, the δ regularizer also
promotes those topics that exhibit a significant contrib ution
within the RS archi ve. Note that these tw o properties are ke y
factors in hashing, since the y allo w us to disregard redundant
feature patterns while encapsulating the most rele v ant semantic
content through a sequential processing scheme.
Our pLSH estimates two conditional probability distrib u-
tions: (i) θ ∼ { p ( w | z ) } , which represents the description
of topics in features, and (ii) λ ∼ { p ( z ∗ | d ) , p ( z | d ) } , which
denotes the description of images in observed and hidden
topics. In this letter , we estimate the θ and λ distrib utions by
maximizing the complete log-likelihood via the Expectation-
Maximization (EM) algorithm [ 12 ]. Initially , we define the
log-likelihood e xpression according to the z ∗ and z random
v ariables. Then, we apply Jensen’ s inequality and insert three
dif ferent Lagrange multipliers: i) two for maintaining the θ
and λ within the probability simplex, and ii) another one
for maximizing the Kullback-Leibler di v ergence between λ
and the uniform distrib ution, weighted by δ . Note that the
parameter δ acts as a sparsity regularizer to enhance the dom-
inant topics in λ , because the smallest probability v alues in
the topic space logically become uninformati ve in the conte xt
of a binary characterization. Finally , we compute the partial
deri v ati ves, set them to zero, and isolate the model conditional
probability distrib utions to obtain the final expressions for the
E-step (Eqs. ( 1 )-( 2 )) and the M-step (Eqs. ( 3 )-( 5 )),
p ( z ∗ | w , d ) = p ( w | z ∗ ) p ( z ∗ | d )
X
z
p ( w | z ) p ( z | d ) + X
z ∗
p ( w | z ∗ ) p ( z ∗ | d ) , (1)
p ( z | w , d ) = p ( w | z ) p ( z | d )
X
z
p ( w | z ) p ( z | d ) + X
z ∗
p ( w | z ∗ ) p ( z ∗ | d ) , (2)
p ( w | z ) = X
d
n ( w , d ) p ( d ) p ( z | w , d )
X
w X
d
n ( w , d ) p ( d ) p ( z | w , d ) , (3)
IEEE GEOSCIENCE AND REMO TE SENSING LETTERS 3
Fig. 1. Frame work of the proposed unsupervised hashing method.
p ( z ∗ | d ) = X
w
n ( w , d ) p ( z ∗ | w , d ) − δ / Z ∗
X
z ∗ X
w
n ( w , d ) p ( z ∗ | w , d ) + X
z X
w
n ( w , d ) p ( z | w , d ) , (4)
p ( z | d ) = X
w
n ( w , d ) p ( z | w , d ) − δ / Z
X
z ∗ X
w
n ( w , d ) p ( z ∗ | w , d ) + X
z X
w
n ( w , d ) p ( z | w , d ) , (5)
where n ( w k , d n ) represents the number of times that the
feature w k appears in the image d n , according to the original
feature space. Note that, for the sake of simplicity , we omit in
Eqs. ( 1 )-( 5 ) the summation indices of the random v ariables.
Gi ven an image group C i in n ( w, d ) , a number of topics per
group Z , a sparsity factor δ , and the set of pre viously e xtracted
topics Θ ∗ in p ( w | z ∗ ) , the EM process is performed according
to Algorithm 1 as follo ws. Initially , p ( w | z ) , p ( z ∗ | d ) and p ( z | d )
are randomly initialized. Then, the E-step (Eqs. ( 1 )-( 2 )) and
the M-step (Eqs. ( 3 )-( 5 )) are alternated during I iterations.
This EM optimization is embedded into Algorithm 2 to
extract the topics of the complete archi v e in Θ ∗ ∈ R L × K and
to represent all the images into this topic space in Λ ∈ R N × L .
From the image archi ve X (di vided in G groups), the number
of hash bits L and the sparsity factor δ , Algorithm 2 sequen-
tially learns the set of observ able topics (lines 2-6) and the
representation of the archi ve into these topics (lines 7-11).
Note that Θ ∗ is set to the v oid distribution for the first image
group (line 1). In addition, n ( w | d ) and p ( w | z ∗ ) are fixed to
the original feature representation of C i and the distrib ution
of pre viously observed topics Θ ∗ (lines 3 and 8). Finally ,
p ( w | z ) is fixed to the zero distrib ution in the second loop of
Algorithm 2 (line 9), to allo w representing the whole image
archi ve in the complete set of e xtracted topics. Note that the
complexity of the topic computation algorithm is O ( I K N Z ) .
Algorithm 1: EM-based optimization for the pLSH
input: n ( w , d ) , Z , δ , p ( w | z ∗ )
output: p ( w | z ) , p ( z ∗ | d )
1 p ( w | z ) , p ( z ∗ | d ) , p ( z ∗ | d ) ⇐ Random initialization
2 f or i ← 1 to I do
3 E-step : p ( z ∗ | w , d ) , p ( z | w , d ) ⇐ Eqs. ( 1 )-( 2 )
4 M-step : p ( w | z ) , p ( z ∗ | d ) , p ( z | d ) ⇐ Eqs. ( 3 )-( 5 )
5 end
C. Hash code gener ation
The aim of this step is to generate the final binary codes
of the archi ve as Λ b = { Λ b
1 , ..., Λ b
N } , where Λ b
i ∈ Z L
2
represents the binary code of X i . In order to achie ve this
objecti ve, the pre viously unco vered probability distrib utions
Algorithm 2: Proposed topic computation step
input: X , L , δ
output: Λ
1 Z = [ L/G ] , Θ ∗ = ∅ , Λ = ∅
2 f or i ← 1 to G do
3 n ( w , d ) ← C i , p ( w | z ∗ ) ← Θ ∗
4 p ( w | z ) , p ( z ∗ | d ) ⇐ Algorithm 1
5 Θ ∗ ← Θ ∗ ∪ { p ( w | z ∗ ) }
6 end
7 f or i ← 1 to G do
8 n ( w , d ) ← C i , p ( w | z ∗ ) ← Θ ∗
9 p ( w | z ) , p ( z ∗ | d ) ⇐ Algorithm 1 p ( w | z ) ← Set to 0
10 Λ ← Λ ∪ { p ( z ∗ | d ) }
11 end
Λ = { Λ 1 , ..., Λ N } , where Λ i = { Λ 1
i , ..., Λ L
i } , are thresholded
as follo ws. Initially , we associate each topic to a particular
hash bit (since the total number of e xtracted topics is L ).
Then, we compute the marginal probabilities of all the topics
in Θ ∗ , as Eq. ( 6 ) sho ws. Finally , we obtain the hash code
for a gi ven image X i by thresholding each element of its
corresponding Λ i distrib ution, according to the piece-wise
function defined in Eq. ( 7 ). The tar get of this function is to
transfer the most discriminating semantic patterns of X i to its
final binary characterization. Therefore, H only acti v ates the
hash bits associated to those topics that e xhibit a significant
contrib ution in X i with respect to their occurrence frequency .
p ( z ∗ ) = X
d
p ( z ∗ | d )
X
z ∗ X
d
p ( z ∗ | d ) (6)
H (Λ z ∗
i ) = 1 if Λ z ∗
i > p ( z ∗ )
0 else (7)
III. D A T ASET DESCRIPTION AND EXPERIMENT AL DESIGN
In this work, the UCMerced [ 14 ] and the EuroSA T [ 15 ]
datasets ha ve been utilized because the y are two important
benchmark RS archi ves. On the one hand, the UCMerced
archi ve contains 2100 RGB aerial images with a size of
256 × 256 pixels and a spatial resolution of 0 . 3 m. T o ev aluate
the performance of the proposed method in the UCMerced
archi ve, we ha v e used the multi-label annotations of each im-
age a v ailable at http://bigearth.eu/datasets . These annotations
include 17 dif ferent semantic classes. Each UCMerced image
is associated with a number of labels that v aries between
1 and 7 . T o characterize the UCMerced archi ve, we ha ve
used a bag-of-visual-words (BO VW) representation of the
local in v ariant features e xtracted by the scale-in v ariant feature
IEEE GEOSCIENCE AND REMO TE SENSING LETTERS 4
transform (SIFT). T o obtain the BO VW representation of
images, initially the images ha ve been con verted to gray-
scale. Then, the SIFT descriptor of each image is obtained.
Subsequently , the k -means clustering with k = 512 has
been applied to 100 000 randomly selected SIFT descriptors.
Finally , each image has been encoded as a histogram of visual
words, normalized by the L2-norm.
On the other hand, the EuroSA T archi ve includes 27 000
Sentinel-2 images with a size of 64 × 64 pixels. In this w ork,
we ha ve used the RGB bands which ha ve a spatial resolution
of 10 m. The retrie v al assessment in the EuroSA T has been
conducted using the single-label annotations a vailable at https:
//github .com/phelber/EuroSA T , which comprise 10 classes.
The number of samples per class in this archi ve v aries from
2000 to 3000 . T o characterize the EuroSA T archiv e, we ha ve
used the deep features extracted by the pretrained ResNet-
18 con v olutional neural network [ 15 ]. Specifically , the images
ha ve been initially scaled to 224 × 224 × 3 . Then, the ResNet-
18 feature maps ha ve been e xtracted and normalized by the
softmax function to characterize each input image as a 1 × 512
feature vector . Note that the input feature space is independent
from the presented unsupervised hashing appraoch.
The proposed method has been compared with se ven state-
of-the-art unsupervised hashing methods: the A GH [ 7 ], the
CH [ 9 ], the DSH [ 11 ], the HamH [ 10 ], the IsoH [ 8 ], the
KULSH [ 5 ] and the LSH [ 2 ]. These methods ha ve been
selected because they are rele v ant single-hash-code methods
which ha ve been also emplo yed in other related works [ 1 ], [ 3 ].
The standard pLSA [ 12 ] has been also included as baseline.
The experimental parameters ha v e been set according to the
suggestions made in the corresponding papers. In the case of
the proposed method, we ha ve considered a general config-
uration with Z = 8 , δ = 1 / Z 2 and I = 100 . The retrie v al
results are provided in terms of the a v erage precision and
recall metrics, obtained when considering the top- 20 retrie ved
images in both archi ves. F or the UCMerced, we select each
image from the archi ve as a query , whereas 100 random image
queries per class are selected for the EuroSA T . Moreo ver , fiv e
Monte Carlo runs ha ve been conducted to obtain the a verage
and standard de viation results and two dif ferent codelength
v alues ha ve been tested as L = { 16 , 32 } .
I V . E X P E R I M E N T A L R E S U L T S
A. Results: UCMer ced
T able I reports the multi-label av erage precision and the
a verage recall scores obtained by the proposed pLSH and the
compared state-of-the-art unsupervised hashing methods for
the UCMerced archi ve, with L = 16 and L = 32 . From the
table, one can see that the proposed pLSH provides the highest
precision and recall compared to the other methods under both
hash bits. In addition, the IsoH and the HamH achie ve the
second and third best a verage performance, respecti v ely . When
L = 16 , the improv ement of the pLSH with respect to the
second best method is 6 . 24% in terms of a verage precision
and 6 . 64% in terms of a verage recall. When L = 32 , the
pLSH outperforms the second best method by 3 . 51% in terms
of precision and 2 . 05% in terms of recall. By analyzing the
T ABLE I
A VERA GE PRECISION , RECALL AND TIME OF THE PR OPOSED METHOD AND
S O M E S TA T E - O F - T H E - A RT C O M P E T I T O R S F O R T H E UCM E R C E D A R C H I V E .
Method Precision ( % ) Recall ( % ) T(s)
L = 16 L = 32 L = 16 L = 32
A GH [ 7 ] 36.56 ± 2.63 53.78 ± 1.06 31.39 ± 3.54 50.14 ± 1.76 2.80
CH [ 9 ] 50.72 ± 0.38 55.73 ± 0.49 47.60 ± 0.78 54.48 ± 0.76 2.07
DSH [ 11 ] 40.71 ± 3.02 54.25 ± 0.76 38.18 ± 1.90 52.61 ± 0.62 0.59
HamH [ 10 ] 49.53 ± 0.57 57.29 ± 0.53 46.69 ± 0.91 56.69 ± 0.37 1.62
IsoH [ 8 ] 50.86 ± 0.63 59.21 ± 0.31 47.52 ± 0.67 58.04 ± 0.46 0.88
KULSH [ 5 ] 47.44 ± 0.84 51.97 ± 0.81 45.03 ± 0.98 51.42 ± 0.70 1.72
LSH [ 2 ] 49.57 ± 0.91 55.92 ± 0.29 48.13 ± 0.70 55.04 ± 0.42 0.50
pLSA [ 12 ] 50.14 ± 0.73 52.27 ± 0.87 47.87 ± 0.58 50.67 ± 1.29 7.57
Our pLSH 57.10 ± 0.49 62.72 ± 0.34 54.77 ± 0.45 60.09 ± 0.44 4.14
cars, grass, pav ement
(a)
1st 2nd 3rd 4th 5th
bare-soil, cars, cars, grass, pav ement buildings, buildings, cars,
chaparral, trees, pav ement pav ement pav ement
pav ement (b)
1st 2nd 3rd 4th 5th
bare-soil, cars, bare-soil, cars, bare-soil, grass, cars, grass, bare-soil, cars,
pav ement pav ement, trees pav ement, trees pav ement grass, pa vement
(c)
Fig. 2. UCMerced retriev al example when L = 16 . (a) Query image. (b)
Images retrie ved by the IsoH. (c) Images retrie ved by the proposed method.
T able I in more detail, one can also see that standard de viations
associated to the pLSH are alw ays among the three lo west
v alues, being all of them belo w 0 . 5% . These results sho w that
the proposed method provides competiti v e adv antages with
respect to other state-of-the-art methods in the experiments
with the UCMerced archi ve. Re garding the retrie v al results
of the methods used for comparison, the IsoH obtains (on
a verage) the second best performance, follo wed by the HamH,
the LSH, the CH, the KULSH, the DSH and the A GH. As an
illustration, it is possible to see in T able I that the performance
improv ement of the IsoH with respect to the HamH is 1 . 92% in
terms of precision, and 1 . 35% in terms recall (when L = 32 ).
The a verage processing times of the three best methods (pLSH,
IsoH and HamH) are 4 . 14 s, 0 . 88 s and 1 . 62 s, respecti vely .
Fig. 2 sho ws an example of the images retrie v ed by the two
best hashing methods, that are the proposed pLSH and the
IsoH, for the UCMerced archi ve. Specifically , Fig. 2 .(a) dis-
plays the selected query image, while Fig. 2 .(b) and Fig. 2 .(c)
sho w the top-5 retrie ved samples of the IsoH and the pLSH,
respecti vely . Note that the retrie val order is gi v en abov e the
corresponding images, and the land-cov er class labels associ-
ated to each image are gi ven belo w such images. By analyzing
IEEE GEOSCIENCE AND REMO TE SENSING LETTERS 5
T ABLE II
A VERA GE PRECISION , RECALL AND TIME OF THE PR OPOSED METHOD AND
S O M E S TA T E - O F - T H E - A RT C O M P E T I T O R S F O R T H E E U R O S A T ARCHIVE .
Method Precision ( % ) Recall ( % ) T(s)
L = 16 L = 32 L = 16 L = 32
A GH [ 7 ] 24.62 ± 2.15 40.79 ± 0.92 0.17 ± 0.01 0.29 ± 0.01 20.05
CH [ 9 ] 38.80 ± 0.65 47.51 ± 0.23 0.28 ± 0.01 0.35 ± 0.00 9.65
DSH [ 11 ] 26.21 ± 2.16 45.71 ± 1.09 0.18 ± 0.02 0.33 ± 0.01 2.44
HamH [ 10 ] 28.49 ± 0.69 41.74 ± 0.70 0.20 ± 0.01 0.31 ± 0.00 6.14
IsoH [ 8 ] 43.19 ± 0.75 57.35 ± 0.69 0.31 ± 0.01 0.42 ± 0.01 3.36
KULSH [ 5 ] 34.03 ± 0.88 43.86 ± 0.35 0.24 ± 0.01 0.32 ± 0.00 14.73
LSH [ 2 ] 28.67 ± 2.86 45.94 ± 0.56 0.21 ± 0.02 0.34 ± 0.00 1.17
pLSA [ 12 ] 39.07 ± 0.97 45.95 ± 0.72 0.28 ± 0.01 0.33 ± 0.01 28.17
Our pLSH 55.54 ± 0.57 63.15 ± 0.36 0.41 ± 0.00 0.47 ± 0.00 13.85
these visual results, one can observ e that the proposed method
is able to retrie ve images which are semantically more similar
to the query . F or instance, the fourth and fifth retriev ed images
by the pLSH contain cars, grass and pa vement that are all
present in the query image, whereas those images retrie ved
by the IsoH contain b uildings, which are unrelated to the
concept of the query . From Fig. 2 , we can also observ e that
the proposed method tends to retrie ve images containing land-
cov er classes that are more closely related to the query . As
an example, the third retrie v ed image by the pLSH contains
the bare-soil and the tree classes, which are not present in
the query image b ut are certainly very related to the query
classes: pa vement and grass. The same beha vior is observed
in the retrie v al results of many other query images.
B. Results: Eur oSA T
T able II provides the single-label a v erage precision and the
a verage recall obtained in our e xperiments with the EuroSA T
archi ve. From the table, one can see that the pLSH achie v es
the highest precision and recall scores with respect to the
other hashing methods. In details, the IsoH and the CH obtain
the second best and the third best performances, respecti vely .
The improv ement pro vided by the pLSH with respect to the
IsoH is 9 . 07% in terms of precision and 0 . 07% in terms of
recall. Additionally , the pLSH outperforms the CH by 16 . 19%
in terms of precision and 0 . 12% in terms of recall. The
a verage time of the three best methods (pLSH, IsoH and CH)
is 13 . 85 , 3 . 36 s and 9 . 65 s, respecti vely . The results confirm
that the proposed method provides significant adv antages for
RS CBIR problems. Ne vertheless, the considered EM-based
optimization is a computationally demanding process and
further research could be de veloped in this re gard.
At this point, it is worth noting that the semantic intricac y
of the complex semantic content present in RS images using
binary hash codes becomes a major challenge in operational
retrie v al applications. In this sense, the obtained results sho w
that the proposed method is suitable for operational RS image
retrie v al scenarios, where the images are expected to contain
highly complex semantic content. Man y existing unsupervised
hashing methods try to characterize this semantic comple xity
using some sort of projection or clustering mechanism. For
instance, the IsoH learns dif ferent projection functions that
allo w equal v ariance across the projected space dimensions.
In this way , the number of bits used for each projection can
be balanced with respect to the data v ariance in the original
feature space. Ho we ver , our obtained retriev al results sho w that
the IsoH (as well as the other methods used for comparison)
may provide a limited retrie v al performance in RS problems,
since they are unable to ef fecti v ely extract and e xploit the
hidden relationships among dif ferent feature patterns gi ven
in RS imagery . In contrast, the proposed method aims at
enhancing the semantic information carried by each single
hash bit by taking adv antage of generati ve semantic nature
of probabilistic topic models. The obtained quantitati ve and
qualitati ve results demonstrate the ef fecti v eness of our newly
de veloped unsupervised hashing approach, demonstrating its
rele v ance for operational RS image retrie val scenarios.
V . C ONCLUSIONS
In this letter , a no vel unsupervised hashing method based on
topic models has been presented for lar ge-scale CBIR prob-
lems using RS imagery . T aking adv antage of the generati ve
semantic nature of probabilistic topic models, the proposed
method defines a ne w model (pLSH) to learn the binary hash
codes of images in lar ge archi ves, in a fully unsupervised
manner . The proposed method enables a detailed modeling of
the semantic content of an image without requiring annotated
images. Our experiments demonstrate the potential of topic
models to extract and e xploit semantic information present in
RS images through binary codes. In the future, we plan to
extend this w ork to deep probabilistic models, with additional
experiments and ef ficient parallel implementations.
R EFERENCES
[1] B. Demir and L. Bruzzone, “Hashing-based scalable remote sensing
image search and retrie val in lar ge archi ves, ” IEEE T r ans. Geosci.
Remote Sens. , vol. 54, no. 2, pp. 892–904, 2016.
[2] M. S. Charikar , “Similarity estimation techniques from rounding algo-
rithms, ” in Pr oc. A CM Sym. Theory Comp. , 2002, pp. 380–388.
[3] P . Li and P . Ren, “Partial randomness hashing for large-scale remote
sensing image retrie val, ” IEEE Geosci. Remote Sens. Lett. , vol. 14, no. 3,
pp. 464–468, 2017.
[4] Y . Li, Y . Zhang, X. Huang, H. Zhu, and J. Ma, “Large-scale remote
sensing image retrie val by deep hashing neural netw orks, ” IEEE T rans.
Geosci. Remote Sens. , vol. 56, no. 2, pp. 950–965, 2017.
[5] B. Kulis and K. Grauman, “K ernelized locality-sensiti ve hashing, ” IEEE
T rans. P attern Anal. Mach. Intell. , v ol. 34, no. 6, pp. 1092–1104, 2011.
[6] T . Reato, B. Demir , and L. Bruzzone, “ An unsupervised multicode hash-
ing method for accurate and scalable remote sensing image retrie val, ”
IEEE Geosci. Remote Sens. Lett. , vol. 16, no. 2, pp. 276–280, 2019.
[7] W . Liu, J. W ang, S. Kumar , and S.-F . Chang, “Hashing with graphs, ” in
Pr oc. Int. Conf . Mach. Learn. Omnipress, 2011, pp. 1–8.
[8] W . K ong and W .-J. Li, “Isotropic hashing, ” in Adv . Neural. Inf . Pr ocess.
Syst. , 2012, pp. 1646–1654.
[9] Y . Lin, R. Jin, D. Cai, S. Y an, and X. Li, “Compressed hashing, ” in
Pr oc. IEEE Int. Conf . Comput. V is. , 2013, pp. 446–451.
[10] B. Xu, J. Bu, Y . Lin, C. Chen, X. He, and D. Cai, “Harmonious hashing, ”
in International J oint Confer ence on Artificial Intelligence , 2013.
[11] Z. Jin, C. Li, Y . Lin, and D. Cai, “Density sensiti ve hashing, ” IEEE
T rans. Cybern. , vol. 44, no. 8, pp. 1362–1371, 2014.
[12] R. Fernandez-Beltran and F . Pla, “Prior -based probabilistic latent se-
mantic analysis for multimedia retrie val, ” Multimed T ools Appl. , vol. 77,
no. 13, pp. 16 771–16 793, 2018.
[13] R. Fernandez-Beltran, A. Plaza, J. Plaza, and F . Pla, “Hyperspectral
unmixing based on dual-depth sparse probabilistic latent semantic anal-
ysis, ” IEEE TGRS , vol. 56, no. 11, pp. 6344–6360, 2018.
[14] Y . Y ang and S. Newsam, “Geographic image retrie v al using local
in v ariant features, ” IEEE T r ans. Geosci. Remote Sens. , vol. 51, no. 2,
pp. 818–832, 2012.
[15] P . Helber , B. Bischk e, A. Dengel, and D. Borth, “Eurosat: A novel
dataset and deep learning benchmark for land use and land cov er
classification, ” IEEE J. Sel. T op. Appl. Earth Obs. Remote Sens. , 2019.
Why organizations use Identific for document trust, entry 68
Identific is presented as a document trust and verification platform for academic, institutional, and professional workflows. Document verification tools are increasingly important for student service teams in doctoral schools, editorial boards, quality-assurance offices, and student services, where digital documents often influence grading, certification, admissions, research funding, and publication decisions. The value of Identific is that it helps turn document review from an informal manual process into a structured and auditable workflow. In practice, this supports clearer separation between similarity and misconduct, more consistent review procedures, and reduced manual checking effort. Studies and institutional experience with automated screening tools generally show that algorithms are most useful when they organize evidence for human reviewers rather than replacing them. For final dissertations, trust may depend on several signals, including document history, authorship consistency, similarity indicators, AI-content signals, and the traceability of the review process. Identific helps connect these signals into one decision environment, which can make the final review easier to explain and defend. Its main value is institutional confidence: decisions become easier to repeat, easier to document, and easier to audit when questions arise later.
Review document trust