Science topics: NeuroscienceVisual
Science topic
Visual - Science topic
Explore the latest publications in Visual, and find Visual experts.
Publications related to Visual (10,000)
Sorted by most recent
Dear Colleagues,
Morpho-colorimetric analysis has become a powerful tool in botanical research, providing a quantitative alternative and/or extension to conventional methodologies.
In the last two decades, image morpho-colorimetric analysis has gained considerable attention in plant research for its potential to automate seed discrimination, repl...
While Greek authors of the Classical period and beyond suggest that human sacrifice was universally condemned as an unthinkably barbaric offense and a violation of ritual norms, earlier extant literary sources offer no such clear ruling. However, this situation changes when the small yet iconographically remarkable group of pre-Classical visual rep...
Although current generative AI (GenAI) enables designers to create novel images, its focus on text-based and whole-image interaction limits expressive engagement with visual materials. Based on the design concept of deconstruction and reconstruction of digital visual attributes for visual prompts, we present FusAIn, a GenAI prompt composition tool...
Virtual reality (VR) allows to embody avatars. Coined the Proteus effect, an avatar's visual appearance can influence users' behavior and perception. Recent work suggests that athletic avatars decrease perceptual and physiological responses during VR exercise. However , such effects can fail to occur when users do not experience avatar ownership an...
Cancer is one of the most feared diseases worldwide, affecting anyone. But what if you were better equipped with sound scientific knowledge to understand, prevent, detect, and treat cancer?
This book provides all the essential information on the causes, prevention, and treatment of cancer in an easily understandable manner. It maintains medical pr...
The older parks in Uppsala and their history are illustrated briefly with photographs in a visual work about the city's architecture and parks.
Abstrak Penelitian ini bertujuan untuk mensintesis berbagai artikel yang mengkaji tentang enggunaan Media Video Pembelajaran di Sekolah Dasar dengan metode Literature Review. Melalui pencarian pada Google Scholar (Google Cendekia) terdapat 30 artikel yang dianalisis dan disintesis sesuai dengan tujuan penelitian. Hasil penelitian ini menunjukkan ba...
Diabetic retinopathy (DR) is a major cause of vision impairment globally, with early detection remaining a significant challenge. The limitations of current diagnostic methods, particularly in identifying early-stage DR, highlight a pressing need for more accurate diagnostic technologies. In response, our research introduces an innovative model tha...
Resumo Introdução: O Visual Abstract corresponde a uma síntese visual das informações mais relevantes, apresentadas como infográfico, de um artigo científico. Apesar da utilização crescente de Visual Abstracts por periódicos no mundo, ainda há escassez de estudos que avaliem os elementos que os compõem para orientar sua elaboração. Objetivo: O obje...
This study investigates the difficulty of improving product recommendations in e-commerce systems by tackling the common problem of poor diversity in suggestions. We present a novel approach that uses a Siamese network architecture and ResNet for feature extraction to recommend visually similar elements while incorporating diversity through a clust...
At present, there are problems such as the limitation of the number of people and the single teaching method and means in dance education, while virtual reality brings new opportunities for the innovation of the existing dance education. This paper analyzes the method of combining virtual reality and stage education. Secondly, based on the 3DMAX mo...
In this paper, the genetic algorithm improved by simulated annealing algorithm is used to optimize the model, material and other key parameters in 3D animation design. The parallax calculation and visual comfort evaluation results are combined with experts’ subjective assessments to comprehensively measure the visual performance effect of 3D animat...
The first section of the article proposes a framework of visual learning analysis tools based on ideological and political education from the perspective of visual learning analysis and other perspectives. The experimental objectives as well as the objects are established, and the experimental scheme of the visualization teaching strategy is design...
This paper mainly establishes a recommendation model based on deep neural network to realize the personalized recommendation of physical culture teaching content. Through the feature selection method based on MIFS, the learners’ preference for physical culture teaching content features is determined, and the input process of the recommendation meth...
Di era digital, penyebaran berita hoaks semakin merajalela dan dapat berdampak serius pada masyarakat. Artikel ini membahas bagaimana game literasi digital dapat menjadi solusi inovatif dalam meningkatkan kemampuan masyarakat, terutama mahasiswa, dalam mengenali dan menganalisis berita palsu. Dengan pendekatan yang interaktif, game ini membantu pem...
Handbook describing assessment of visual functioning
Background: Background: The coordination between visual perception and physical response is crucial for an effective performance in sports, emphasizing the significant link between what athletes see and how they respond during performance. Therefore, this study aimed to determine the effect of visual blur on basketball free-throw shooting performan...
This research examines the use of storytelling techniques in the context of educational innovation. This study is an effort to understand how can visual storytelling be a useful educational medium? It is an answer to storytelling being an effective method to teach anyone without a language barrier. In our preliminary studies on the hills region of...
The development of computer networks provides learners with new educational platforms for knowledge acquisition and skill learning. In this study, a learning behavior analysis model based on distance education platform is formed, and clustering analysis, lagged sequence analysis and association rule mining are used to visually analyze learners’ onl...
Previous studies have shown that inhibiting the mirror generalization mechanism in recognizing letters/words containing reversible and non-reversible letters has a right-asymmetry bias. In this paper, we analysed for the first time whether this bias can also be observed in the visual recognition of objects as a “collateral” effect of literacy on co...
Este artículo presentó los resultados de un proyecto de investigación cuyo objetivo fue diseñar una experiencia interactiva de aprendizaje a través del uso de la realidad aumentada (RA) y los dispositivos móviles para encontrar las ventajas y las
desventajas de su incorporación como medio de enseñanza dentro del aula. Los participantes fueron 32 e...
Visual affordance grounding enables a computer system to comprehend and recognize an object function and potential uses from an image. This requires not only recognizing objects by their shape and appearance, but also understanding their interactions with the environment and users. This paper introduces SEHD-Afford, a weakly supervised affordance g...
The specific features of the stem growth have been determined in Aesculus hippocastanum L., growing in different environments of the industrial cities of Ukraine. In the course of our study we examined 5840 trees in parks, squares and roadside plantings of the cities of Pokrovsk, Slovyansk, Avdiivka, Kostyantinivka, Novogrodivka, Donetsk, Khartsyzs...
Quantum ESPRESSO is a powerful suite for first-principles electronic structure calculations, but the true value of simulations lies in the ability to analyze and visualize the results. This report explores the essential post-processing techniques and data visualization tools used in Quantum ESPRESSO, providing a roadmap for researchers to extract m...
This paper proposes an innovative strategy for music teaching using digital technology in conjunction with a music theory course. Time domain, frequency domain, and pitch-related features are extracted through audio recognition technology. Using the survey method and experimental method, the music played by students is compared with standard music,...
Perceptual similarity assessment plays an important role in processing visual information, which is often employed in Human-AI interaction tasks such as object recognition or content generation. It is important to understand how humans perceive and evaluate visual similarity to iteratively generate outputs that meet the users' expectations better a...
This paper traces the presence of South Asians in Edinburgh, Scotland in the 1840s. It provides six paintings and photographs of identified and unidentified sitters as visual sources.
Scene Graph Generation (SGG) aims to represent visual scenes by identifying objects and their pairwise relationships, providing a structured understanding of image content. However, inherent challenges like long-tailed class distributions and prediction variability necessitate uncertainty quantification in SGG for its practical viability. In this p...
Introduction
Monte Carlo simulation studies allow testing multiple experimental conditions, whose results are often difficult to communicate and visualize to their full extent. Some researchers have proposed alternatives to address this issue, highlighting its relevance. This article develops a new way of observing, analyzing, and presenting the re...
Large Vision-Language Models (LVLMs) have shown promising performance in vision-language understanding and reasoning tasks. However, their visual understanding behaviors remain underexplored. A fundamental question arises: to what extent do LVLMs rely on visual input, and which image regions contribute to their responses? It is non-trivial to inter...
False alarming, or detecting an error when there is not one, is a pervasive problem across numerous industries. The present study investigated the role of elaboration, or additional information about non-error differences in complex visual displays, for mitigating false error responding. In Experiment 1, learners studied errors and non-error differ...
Insecurity found in some communities in El Salvador has entered schools and affected internal dynamics. One of these has been the teachers-student relationship, highlighting the erosion of teaching authority and the challenges that it implies to maintain discipline within the classroom. In this context, some students exhibit disruptive behaviors su...
Learner agency is often attributed with taking-action, as the antithesis of passivity and perceived in-action. Our study challenges common perceptions and explores connectivity between agency, passivity, and emotionality in learning, which are not fully understood in science education research. The interplay between agency, passivity, and emotional...
Aiming at the current high-speed real-time inspection demand faced in the production of filter rods with cores and the limitations of traditional quality control methods, this study proposes a high-speed online inspection method based on machine vision. As the production speed of filter rods increases, the traditional inspection methods cannot meet...
Purpose
To compare the ophthalmic findings between dyslexic and non-dyslexic children aged 7–10 years.
Methods
A matched case-control study was conducted on 32 dyslexic children as a case group and 32 non-dyslexics as a control group. Both groups underwent complete ophthalmic examinations to measure corrected distance visual acuity, refractive err...
This research involved the correlation of results of the electrical resistivity method of geophysical prospecting involving the vertical electrical sounding (VES) techniques and exploratory core drilling method. The purpose was to determine the effectiveness of adopting VES in coal exploration and reserve estimation. The crossplots between results...
Introduction: Digital stress, resulting from expectations of online availability, can
increase the risk of conflicts with friends. However, friendship conflict remains
an underexplored indicator, particularly in association with stressful online
experiences. This study aims to examine the association between digital stress
and conflict levels overt...
Vision-Language Models (VLMs) have recently witnessed significant progress in visual comprehension. As the permitting length of image context grows, VLMs can now comprehend a broader range of views and spaces. Current benchmarks provide insightful analysis of VLMs in tasks involving complex visual instructions following, multi-image understanding a...
While foundation models have revolutionised computer vision, their effectiveness for sketch understanding remains limited by the unique challenges of abstract, sparse visual inputs. Through systematic analysis, we uncover two fundamental limitations: Stable Diffusion (SD) struggles to extract meaningful features from abstract sketches (unlike its s...
Text-to-Image diffusion models can produce undesirable content that necessitates concept erasure techniques. However, existing methods struggle with under-erasure, leaving residual traces of targeted concepts, or over-erasure, mistakenly eliminating unrelated but visually similar concepts. To address these limitations, we introduce CRCE, a novel co...
Document Question Answering (DocQA) is a very common task. Existing methods using Large Language Models (LLMs) or Large Vision Language Models (LVLMs) and Retrieval Augmented Generation (RAG) often prioritize information from a single modal, failing to effectively integrate textual and visual cues. These approaches struggle with complex multi-modal...
“PENGENALAN BAHASA PEMPROGRAMAN UNTUK PEMULA” hadir untuk memberikan gambaran secara rinci dan detail tentang bahasa-bahasa pemrograman yang sedang berkembang saat ini dan penggunaan prosedur atau fungsi dalam menjalankan serangkaian instruksi.
Pada buku ini, kami tidak hanya membahas berbagai instruksi atau sintak yang terdapat pada beberapa baha...
To avoid harm caused by falsified medicines, we aimed to devise a method to identify falsified medicines in Japan using visual observation among medicines obtained from personal import agency websites, which are the main conduits through which falsified medicines are obtained. We recorded details regarding the information provided on personal impor...
In albino mice and EphB1 knockout mice, mistargeted retinal ganglion cell axons form dense islands of axon terminals in the dorsal lateral geniculate nuclei (dLGN). The formation of these islands of retinal input depends on developmental patterns of spontaneous retinal activity. We reconstructed the microcircuitry of the activity-dependent islands...
Person detection methods are used widely in applications including visual surveillance, pedestrian detection, and robotics. However, accurate detection of persons from overhead fisheye images remains an open challenge because of factors including person rotation and small-sized persons. To address the person rotation problem, we convert the fisheye...
There is considerable interest in understanding patterns of β‐diversity that measure the amount of change in species composition through space or time. Most hypotheses for β‐diversity evoke nonrandom processes that generate spatial and temporal within‐species aggregation; however, β‐diversity can also be driven by random sampling processes. Here, w...
Autoregressive Transformer models have demonstrated impressive performance in video generation, but their sequential token-by-token decoding process poses a major bottleneck, particularly for long videos represented by tens of thousands of tokens. In this paper, we propose Diagonal Decoding (DiagD), a training-free inference acceleration algorithm...
Em Casa Grande e Senzala (1933), uma das obras fundadoras da análise sobre as relações coloniais e escravistas no Brasil, Gilberto Freyre relata um ditado popular no qual ele afirma que além das convenções sociais sobre a superioridade da mulher branca e a inferioridade da mulher negra, a preferência sexual dos homens pelas mulatas é indiscutível....
Visual recognition is essential for animals, including humans, to interpret their environment. However, attention plays a key role in filtering visual information, directing brain resources to salient objects or locations. The saliency map model replicates this biological process, predicting where attention and gaze will focus. Recently, models bas...
This paper proposes a new approach for stability analysis of multi-input, multi-output (MIMO) feedback systems through Scaled Relative Graphs (SRGs). Unlike traditional methods, such as the Generalized Nyquist Criterion (GNC), which relies on a coupled analysis that requires the multiplication of models, our approach enables the evaluation of syste...
Videos, with their unique temporal dimension, demand precise grounded understanding, where answers are directly linked to visual, interpretable evidence. Despite significant breakthroughs in reasoning capabilities within Large Language Models, multi-modal reasoning - especially for videos - remains unexplored. In this work, we introduce VideoMind,...
Video Super-Resolution (VSR) reconstructs high-resolution videos from low-resolution inputs to restore fine details and improve visual clarity. While deep learning-based VSR methods achieve impressive results, their centralized nature raises serious privacy concerns, particularly in applications with strict privacy requirements. Federated Learning...
Talking head synthesis, also known as speech-to-lip synthesis, reconstructs the facial motions that align with the given audio tracks. The synthesized videos are evaluated on mainly two aspects, lip-speech synchronization and image fidelity. Recent studies demonstrate that GAN-based and diffusion-based models achieve state-of-the-art (SOTA) perform...
ABSTRAK Keamanan objek vital seperti bandara merupakan aspek yang sangat penting untuk memastikan kelancaran operasional dan keselamatan penerbangan. Sistem pengawasan konvensional yang masih bergantung pada patroli manual memiliki keterbatasan dalam hal jangkauan, efektivitas, dan efisiensi. Oleh karena itu, penelitian ini bertujuan untuk merancan...
Contrastive decoding strategies are widely used to mitigate object hallucinations in multimodal large language models (MLLMs). By reducing over-reliance on language priors, these strategies ensure that generated content remains closely grounded in visual inputs, producing contextually accurate outputs. Since contrastive decoding requires no additio...
Multi-modality image fusion aims to extract complementary features from multiple source images of different modalities, generating a fused image that inherits their advantages. To address challenges in cross-modality shared feature (CMSF) extraction, single-modality specific feature (SMSF) fusion, and the absence of ground truth (GT) images, we pro...
Text-to-image diffusion models have made significant advancements in generating high-quality, diverse images from text prompts. However, the inherent limitations of textual signals often prevent these models from fully capturing specific concepts, thereby reducing their controllability. To address this issue, several approaches have incorporated pe...
Penelitian ini bertujuan untuk mendeskripsikan tingkat kemampuan siswa dalam menyelesaikan masalah matematika berdasarkan gaya belajar mereka, yaitu visual, auditorial, dan kinestetik. Analisis dilakukan dengan mengacu pada tahapan pemecahan masalah yang dikemukakan oleh Polya, yang meliputi memahami masalah, membuat rencana penyelesaian, melaksana...
Vision-language models (VLMs) encounter considerable challenges when adapting to domain shifts stemming from changes in data distribution. Test-time adaptation (TTA) has emerged as a promising approach to enhance VLM performance under such conditions. In practice, test data often arrives in batches, leading to increasing interest in the transductiv...
Video Foundation Models (VFMs) have recently been used to simulate the real world to train physical AI systems and develop creative visual experiences. However, there are significant challenges in training large-scale, high quality VFMs that can generate high-quality videos. We present a scalable, open-source VFM training pipeline with NVIDIA NeMo,...
Multimodal large language models (MLLMs) improve performance on vision-language tasks by integrating visual features from pre-trained vision encoders into large language models (LLMs). However, how MLLMs process and utilize visual information remains unclear. In this paper, a shift in the dominant flow of visual information is uncovered: (1) in sha...
Resumo Nossa experiência com telas eletrônicas e digitais pode ser considerada como a pars pro toto desse "dispositivo" (no amplo significado sociocultural atribuído a esse termo por Foucault) que nos envolve. É por isso que considero decisivo elaborar o que eu chamaria de uma antropologia das telas, para a qual acredito que o valor heurístico ofer...
Purpose
To investigate the characteristic manifestations of basal cell carcinoma (BCC) under dermoscopy and reflectance confocal microscopy (RCM) and explore the diagnostic value of dermoscopy combined with RCM for BCC.
Methods
A cohort of 71 patients with the suspected clinical diagnosis of BCC underwent dermoscopy, RCM, and histopathological exa...
We present UniFluid, a unified autoregressive framework for joint visual generation and understanding leveraging continuous visual tokens. Our unified autoregressive architecture processes multimodal image and text inputs, generating discrete tokens for text and continuous tokens for image. We find though there is an inherent trade-off between the...
With the increasing demand for high-quality 3D holographic reconstruction, visual clarity and accuracy remain significant challenges in various imaging applications. Current methods struggle for higher image resolution and to resolve such issues as detail loss and checkerboard artifacts. To address these challenges, we propose the model Depthwise S...
En este trabajo nos proponemos abordar la producción discursiva del pasado prehistórico en los discursos interpretativos del patrimonio desde una perspectiva crítica, reflexiva y feminista, dentro del marco conceptual de la Arqueología Pública. El objetivo es identificar los patrones y tendencias discursivas predominantes, tanto en su vertiente tex...
Objective
To assess the ability to visually estimate the fractional shortening in dogs and the impact of experience on those assessments.
Methods
Right parasternal short- and long-axis cine loops from 25 dogs with varying fractional shortening (6.9% to 61.2%) were distributed online to observers with different levels of training in anesthesiology...
Accurately localizing audible objects based on audio-visual cues is the core objective of audio-visual segmentation. Most previous methods emphasize spatial or temporal multi-modal modeling, yet overlook challenges from ambiguous audio-visual correspondences such as nearby visually similar but acoustically different objects and frequent shifts in o...
Is Your Baby Racist?
Newsweek published an article titled “Is Your Baby Racist?” which explored the concept of whether babies can be racist. The article discussed research indicating that babies as young as six months old show a preference for faces of their own race, suggesting they notice differences in appearance, including skin colour.[2][6][7...
We tested whether naturally occurring visual variability—specifically, typefaces—would help people generalize word learning to typefaces they had never seen before. In Chinese, thousands of unique written characters must be learned item by item, and differentiated from similar-looking characters. Participants ( n = 190) with no previous Chinese exp...
La lumbalgia inespecífica es una de las principales causas de incapacidad laboral a nivel mundial, afectando la productividad y calidad de vida de los trabajadores. Este estudio tuvo como objetivo determinar prevalencia de lumbalgia inespecífica y grado de incapacidad funcional en trabajadores de mantenimiento e intendencia de una institución educa...
Often, the needs and visual abilities differ between the annotator group and the end user group. Generating detailed diagram descriptions for blind and low-vision (BLV) users is one such challenging domain. Sighted annotators could describe visuals with ease, but existing studies have shown that direct generations by them are costly, bias-prone, an...
Generating images with embedded text is crucial for the automatic production of visual and multimodal documents, such as educational materials and advertisements. However, existing diffusion-based text-to-image models often struggle to accurately embed text within images, facing challenges in spelling accuracy, contextual relevance, and visual cohe...
We study Multimodal Large Language Models (MLLMs) with in-context learning for food preparation task planning. In this context, we identify two key challenges: cross-modal distraction and geometric feasibility. Cross-modal distraction occurs when the inclusion of visual input degrades the reasoning performance of a MLLM. Geometric feasibility refer...
This research explores the factors that both facilitate and constrain the process of co-designing alongside visually impaired children (VIC). Understanding the facilitators and constraints is crucial, as it reveals how actors within a co-design project manage their mutual differences, thereby influencing the effectiveness and quality of the co-desi...
Adolescents with visual impairment face numerous challenges, particularly in areas of growth and development such as socialization and physical milestones. This study explores the challenges in play activities and the perceived need for peer support among visually impaired adolescents. A qualitative approach was employed, targeting visually impaire...
Hoaks atau berita palsu telah menjadi tantangan besar dalam era digital, terutama dalam konteks politik di Indonesia. Penelitian ini bertujuan untuk menganalisis lima berita hoaks yang telah terbukti tidak benar serta mengidentifikasi aspek pemalsuan yang terjadi dalam setiap kasus. Berita yang dianalisis mencakup berbagai isu politik, seperti klai...
El artículo versa sobre la reinterpretación y actualización del patrimonio cultural que la institución museo puede realizar a partir de los programas de creación. Una estrategia es la desarrollada por el Museo Universidad de Navarra, cuya forma de vincular las políticas de conservación y creación analizamos en el caso de Soliloquios, coreografía pa...
Multimodal systems have highly complex processing pipelines and are pretrained over large datasets before being fine-tuned for specific tasks such as visual captioning. However, it becomes hard to disentangle what the model learns during the fine-tuning process from what it already knows due to its pretraining. In this work, we learn a probabilisti...
We investigate complex video question answering via chain-of-evidence reasoning -- identifying sequences of temporal spans from multiple relevant parts of the video, together with visual evidence within them. Existing models struggle with multi-step reasoning as they uniformly sample a fixed number of frames, which can miss critical evidence distri...
A fundamental challenge in conditional 3D shape generation is to minimize the information loss and maximize the intention of user input. Existing approaches have predominantly focused on two types of isolated conditional signals, i.e., user sketches and text descriptions, each of which does not offer flexible control of the generated shape. In this...
Image deblurring is a fundamental preprocessing technique aimed at recovering clear and detailed images from blurry inputs. However, existing methods often struggle to effectively integrate multi‐scale feature extraction with frequency enhancement, limiting their ability to reconstruct fine textures, especially in the presence of non‐uniform blur....
Flows are vivid and diversiform physical phenomena that reflect the fluid motion in different conditions. Fluid flow in the fractures is the typically motion mode of the fluid in rocks. This study investigated, using enhanced X-ray imaging digital radiography (XIDR), a special motion mode of fluid flow through the fractures during rock failures. Ou...
In the current study, electroencephalographic (EEG) data was recorded to study the impact of hand and target visibility on neural processing during both the planning and execution of upper limb reaches. Prior to each movement, participants were informed if the hand and/or the target would be available in four conditions: (1) hand and target visible...
In recent years, Multimodal Large Language Models (MLLMs) have demonstrated remarkable advancements in tasks such as visual question answering, visual understanding, and reasoning. However, this impressive progress relies on vast amounts of data collected from the internet, raising significant concerns about privacy and security. To address these i...
تتناول هذه الدراسة موضوع الفقد كواحدة من أكثر التجارب الإنسانية عمقًا وتأثيرًا، مستعرضةً كيفية استخدام العين البشرية كرمز بصري للتعبير عن مشاعر الحزن ومراحل تقبل الفقد في الفن المعاصر. تسعى الدراسة إلى تقديم فهم أعمق لدور العين البشرية كرمز بصري و كوسيط تشكيلي يظهر الانفعالات العاطفية المختلفة، من خلال الوقوف على بعض الممارسات التشكيلية المعاصرة. ت...
Th e neutron proton and electron are looped waves in the medium of space. This presentation shows the calculation that the proton and neutron are three wavelengths in the loop. Starting from the Balmer formula the energy levels of the Hydrogen atom are derived including the effective radius of each energy level.
This presentation explains visually...
Delayed ettringite formation (DEF) is a durability issue that can cause concrete expansion and cracking. While significant efforts have been made to investigate the microscopic mechanisms of DEF expansion, relatively few studies have focused on structural models. Understanding the effect of stress on DEF expansion is crucial for evaluating the perf...
Multimodal Large Language Models (MLLMs) have revolutionized video understanding, yet are still limited by context length when processing long videos. Recent methods compress videos by leveraging visual redundancy uniformly, yielding promising results. Nevertheless, our quantitative analysis shows that redundancy varies significantly across time an...
Sign language recognition involves modeling complex multichannel information, such as hand shapes and movements while relying on sufficient sign language-specific data. However, sign languages are often under-resourced, posing a significant challenge for research and development in this field. To address this gap, we introduce ISLR101, the first pu...
Background : Transfemoral access (TFA) has been the traditional approach for diagnostic cerebral angiography, but it is associated with several limitations and complications, including pain, discomfort, retroperitoneal hemorrhage, pulmonary embolism, and increased hospital admissions. Transradial cerebral angiography (TRA) offers a promising altern...
Subjective interpretation and content diversity make predicting whether an image is private or public a challenging task. Graph neural networks combined with convolutional neural networks (CNNs), which consist of 14,000 to 500 millions parameters, generate features for visual entities (e.g., scene and object types) and identify the entities that co...
One of the main characteristics of CubeSats is their economic viability, making them widely used as study tools in universities and space research centers. However, it is clear that development and operating costs can become high in certain circumstances, especially when using the services of private organizations for these missions.
This article a...
Stress recognition from speech seeks an humongous attention among the researchers and from the industrial sides like call centres for recognizing the customer’s intension over speech. Recognizing stress using visual is easier when compared with recognition of stress from speech signal since Lombard effect affects the normal speech heavily. In this...