October 2024
·
9 Reads
This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.
October 2024
·
9 Reads
June 2024
·
51 Reads
In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal design of traditional Machine Learning models. Evolutionary Computation (EC) has been a useful tool for both the design and optimization of Machine Learning models, endowing them with the capability to configure and/or adapt themselves to the task under consideration. Therefore, their application to GPAIS is a natural choice. This paper aims to analyze the role of EC in the field of GPAIS, exploring the use of EC for their design or enrichment. We also match GPAIS properties to Machine Learning areas in which EC has had a notable contribution, highlighting recent milestones of EC for GPAIS. Furthermore, we discuss the challenges of harnessing the benefits of EC for GPAIS, presenting different strategies to both design and improve GPAIS with EC, covering tangential areas, identifying research niches, and outlining potential research directions for EC and GPAIS.
January 2024
·
17 Reads
·
2 Citations
IEEE Transactions on Neural Networks and Learning Systems
Capsule networks (CapsNets) aim to parse images into a hierarchy of objects, parts, and their relationships using a two-step process involving part–whole transformation and hierarchical component routing. However, this hierarchical relationship modeling is computationally expensive, which has limited the wider use of CapsNet despite its potential advantages. The current state of CapsNet models primarily focuses on comparing their performance with capsule baselines, falling short of achieving the same level of proficiency as deep convolutional neural network (CNN) variants in intricate tasks. To address this limitation, we present an efficient approach for learning capsules that surpasses canonical baseline models and even demonstrates superior performance compared with high-performing convolution models. Our contribution can be outlined in two aspects: first, we introduce a group of subcapsules onto which an input vector is projected. Subsequently, we present the hybrid Gromov–Wasserstein (HGW) framework, which initially quantifies the dissimilarity between the input and the components modeled by the subcapsules, followed by determining their alignment degree through optimal transport (OT). This innovative mechanism capitalizes on new insights into defining alignment between the input and subcapsules, based on the similarity of their respective component distributions. This approach enhances CapsNets’ capacity to learn from intricate, high-dimensional data while retaining their interpretability and hierarchical structure. Our proposed model offers two distinct advantages: 1) its lightweight nature facilitates the application of capsules to more intricate vision tasks, including object detection; and 2) it outperforms baseline approaches in these demanding tasks. Our empirical findings illustrate that HGW capsules (HGWCapsules) exhibit enhanced robustness against affine transformations, scale effectively to larger datasets, and surpass CNN and CapsNet models across various vision tasks.
September 2022
·
49 Reads
Capsule networks (CapsNets) aim to parse images into a hierarchical component structure that consists of objects, parts, and their relations. Despite their potential, they are computationally expensive and pose a major drawback, which limits utilizing these networks efficiently on more complex datasets. The current CapsNet models only compare their performance with the capsule baselines and do not perform at the same level as deep CNN-based models on complicated tasks. This paper proposes an efficient way for learning capsules that detect atomic parts of an input image, through a group of SubCapsules, upon which an input vector is projected. Subsequently, we present the Wasserstein Embedding Module that first measures the dissimilarity between the input and components modeled by the SubCapsules, and then finds their degree of alignment based on the learned optimal transport. This strategy leverages new insights on defining alignment between the input and SubCapsules based on the similarity between their respective component distributions. Our proposed model, (i) is lightweight and allows to apply capsules for more complex vision tasks; (ii) performs better than or at par with CNN-based models on these challenging tasks. Our experimental results indicate that Wasserstein Embedding Capsules (WECapsules) perform more robustly on affine transformations, effectively scale up to larger datasets, and outperform the CNN and CapsNet models in several vision tasks.
May 2022
·
35 Reads
·
18 Citations
IEEE Transactions on Cybernetics
Image-to-image (I2I) translation has become a key asset for generative adversarial networks. Convolutional neural networks (CNNs), despite having a significant performance, are not able to capture the spatial relationships among different parts of an object and, thus, do not qualify as the ideal representative model for image translation tasks. As a remedy to this problem, capsule networks have been proposed to represent patterns for a visual object in such a way that preserves hierarchical spatial relationships. The training of capsules is constrained by learning all pairwise relationships between capsules of consecutive layers. This design would be prohibitively expensive both in time and memory. In this article, we present a new framework for capsule networks to provide a full description of the input components at various levels of semantics, which can successfully be applied to the generator-discriminator architectures without incurring computational overhead compared to the CNNs. To successfully apply the proposed capsules in the generative adversarial network, we put forth a novel Gromov-Wasserstein (GW) distance as a differentiable loss function that compares the dissimilarity between two distributions and then guides the learned distribution toward target properties, using optimal transport (OT) discrepancy. The proposed method--which is called generative equivariant network (GEN)--is an alternative architecture for GANs with equivariance capsule layers. The proposed model is evaluated through a comprehensive set of experiments on I2I translation and image generation tasks and compared with several state-of-the-art models. Results indicate that there is a principled connection between generative and capsule models that allows extracting discriminant and invariant information from image data.
August 2021
·
31 Reads
·
3 Citations
Image and Vision Computing
June 2021
·
71 Reads
·
7 Citations
IEEE Access
·
Bárbara Cervantes·
Miguel Angel Medina-Pérez·
[...]
·
Francisco Gabriel Corte HerreraLatent fingerprint identification is one of the leading forensic activities to clarify criminal acts. However, its computational cost hinders the rapid decision making in the identification of an individual when large databases are involved. To reduce the search time used to generate the fingerprint candidates’ order to be compared, fingerprint indexing algorithms that reduce the search space while minimizing the increase in the error rate (compared to the identification) are developed. In the present research, we propose an algorithm for indexing latent fingerprints based on minutia cylinder codes (MCC) . This type of minutiae descriptor presents a fixed structure, which brings advantages in terms of efficiency. Besides, in recent studies, this descriptor has shown an identification error rate, at the local level, lower than the other descriptors reported in the literature. Our indexing proposal requires an initial step to construct the indices, in which it uses k-means++ clustering algorithm to create groups of similar minutia cylinder codes corresponding to the impressions of a set of databases. K-means++ allows for a better outcome over other clustering algorithms because of the selection of the proper centroids. The buckets associated with each index are populated with the background databases. Then, given a latent fingerprint, the algorithm extracts the minutia cylinder codes associated with the clusters’ indices with the lowest distance respect to each descriptor of this latent fingerprint. Finally, it integrates the votes represented by the fingerprints obtained to select the candidate impressions.We conduct a set of experiments in which our proposal outperforms current rival algorithms in presence of different databases and descriptors. Also, the primary experiment reduces the search space by four orders of magnitude when the background database contains more than one million impressions.
September 2020
·
565 Reads
·
205 Citations
Cognitive Computation
In recent algorithmic family simulates different biological processes observed in Nature in order to efficiently address complex optimization problems. In the last years the number of bio-inspired optimization approaches in literature has grown considerably, reaching unprecedented levels that dark the future prospects of this field of research. This paper addresses this problem by proposing two comprehensive, principle-based taxonomies that allow researchers to organize existing and future algorithmic developments into well-defined categories, considering two different criteria: the source of inspiration and the behavior of each algorithm. Using these taxonomies we review more than three hundred publications dealing with nature-inspired and bio-inspired algorithms, and proposals falling within each of these categories are examined, leading to a critical summary of design trends and similarities between them, and the identification of the most similar classical algorithm for each reviewed paper. From our analysis we conclude that a poor relationship is often found between the natural inspiration of an algorithm and its behavior. Furthermore, similarities in terms of behavior between different algorithms are greater than what is claimed in their public disclosure: specifically, we show that more than one-third of the reviewed bio-inspired solvers are versions of classical algorithms. Grounded on the conclusions of our critical analysis, we give several recommendations and points of improvement for better methodological practices in this active and growing research field.
August 2020
·
29 Reads
·
1 Citation
Neurocomputing
July 2020
·
20 Reads
·
6 Citations
... Among the mentioned methods, few image translation algorithms integrate traditional features. To our knowledge, only UNTF [33][34][35] has incorporated features extracted via SVD in unpaired image translation. However, this method solely connects these features in a cascading manner, lacking flexibility in feature fusion. ...
May 2022
IEEE Transactions on Cybernetics
... Then big data, machine learning, image processing, pattern recognition and other methods are adopted for effective information processing to conduct analysis, evaluation and decision-making, and to improve the safety of ship navigation [8,9]. As an interdisciplinary subject, computer vision (CV) [10] further promoted the development of intelligent shipping. ...
August 2021
Image and Vision Computing
... Managing incomplete perceptions, background noise, inadequate ridge clarity, and nonlinear distortions are among the difficulties. An MCC-based indexing technique for latent fingerprint recognition was described by [154], beating competitors on the NIST SD4 and NIST SD14 databases by at least 1% and 3%, respectively. Although effective, it can have performance issues with different kinds of databases or with less-than-ideal detail extraction, and its temporal complexity prevents real-time applications. ...
June 2021
IEEE Access
... As another GA with machine learning, the surrogateassisted GA (SAGA) is proposed [17][18][19][20][21][22]. It takes much computation time to evaluate a candidate solution in some optimization problems, especially practical ones. ...
July 2020
... Specifically, the problem is how to detect and classify eddies with vertical structure ARGO and XBT datasets instead of just focusing on the analysis of vertical structure though it is really important. 2) The classifiers of eddies have been extended to the neural network and reached the agreement that the artificial intelligence method can enhance the identification performance of eddies [31], [32]. In particular, the complexity of CNN architecture has been increased to achieve higher robustness and generalization [32], [33]. ...
August 2020
Neurocomputing
... Special consideration should be paid to dealing with such a discrete problem. In this regard, proper representation of design variables is addressed by direct index coding when evolutionary algorithms are applied [4][5][6][7]. However, some other metaheuristic algorithms fall in the category of directional search methods that generate continuous positions during their search [8][9][10][11][12]. ...
September 2020
Cognitive Computation
... However, SHAP's computational complexity presents a significant obstacle, especially for large-scale data and complex models like Random Forest and XGBoost (XGB), where the need to evaluate all possible feature combinations results in slow execution times [14,[25][26][27]. This challenge is exacerbated in real-time applications, such as fraud detection or personalized healthcare, where quick decision-making is essential [28,29]. As a result, there is a pressing need for more efficient interpretability methods that balance computational cost and model accuracy. ...
December 2019
Information Fusion
... In global analysis, researchers aim to explain the knowledge or linguistic properties encoded in the hidden state activations of a model [22]. These local analysis techniques provide human-interpretable explanations of model outputs by pairing a "black box" model with a more interpretable model [3,4,8,25]. This allows the researcher to take advantage of the complexity of deep neural networks while maintaining some level of explainability in the output. ...
October 2019
... In a homogeneous swarm, all particles exhibit uniform behavior, but in a heterogeneous swarm, several distinct behaviors coexist concurrently [28]. Recent studies have introduced cooperative multi-swarm strategies and adaptive cooperation using a Markov Model, enhancing the performance and robustness of PSO in various applications [29,30]. ...
April 2019
Progress in Artificial Intelligence
... Oversampling approaches [18] [19][20] [21] are designed to augment the minority class in datasets by creating copies of existing samples or introducing additional samples. These methods expand the training set size, which can lead to increased computational time. ...
August 2018
Knowledge-Based Systems