Science topics: NeuroscienceVisual
Science topic

Visual - Science topic

Explore the latest publications in Visual, and find Visual experts.
Filters
All publications are displayed by default. Use this filter to view only publications with full-texts.
Publications related to Visual (10,000)
Sorted by most recent
Research Proposal
Full-text available
Dear Colleagues, Morpho-colorimetric analysis has become a powerful tool in botanical research, providing a quantitative alternative and/or extension to conventional methodologies. In the last two decades, image morpho-colorimetric analysis has gained considerable attention in plant research for its potential to automate seed discrimination, repl...
Chapter
Full-text available
While Greek authors of the Classical period and beyond suggest that human sacrifice was universally condemned as an unthinkably barbaric offense and a violation of ritual norms, earlier extant literary sources offer no such clear ruling. However, this situation changes when the small yet iconographically remarkable group of pre-Classical visual rep...
Conference Paper
Full-text available
Although current generative AI (GenAI) enables designers to create novel images, its focus on text-based and whole-image interaction limits expressive engagement with visual materials. Based on the design concept of deconstruction and reconstruction of digital visual attributes for visual prompts, we present FusAIn, a GenAI prompt composition tool...
Conference Paper
Full-text available
Virtual reality (VR) allows to embody avatars. Coined the Proteus effect, an avatar's visual appearance can influence users' behavior and perception. Recent work suggests that athletic avatars decrease perceptual and physiological responses during VR exercise. However , such effects can fail to occur when users do not experience avatar ownership an...
Book
Full-text available
Cancer is one of the most feared diseases worldwide, affecting anyone. But what if you were better equipped with sound scientific knowledge to understand, prevent, detect, and treat cancer? This book provides all the essential information on the causes, prevention, and treatment of cancer in an easily understandable manner. It maintains medical pr...
Chapter
Full-text available
The older parks in Uppsala and their history are illustrated briefly with photographs in a visual work about the city's architecture and parks.
Article
Full-text available
Abstrak Penelitian ini bertujuan untuk mensintesis berbagai artikel yang mengkaji tentang enggunaan Media Video Pembelajaran di Sekolah Dasar dengan metode Literature Review. Melalui pencarian pada Google Scholar (Google Cendekia) terdapat 30 artikel yang dianalisis dan disintesis sesuai dengan tujuan penelitian. Hasil penelitian ini menunjukkan ba...
Article
Full-text available
Diabetic retinopathy (DR) is a major cause of vision impairment globally, with early detection remaining a significant challenge. The limitations of current diagnostic methods, particularly in identifying early-stage DR, highlight a pressing need for more accurate diagnostic technologies. In response, our research introduces an innovative model tha...
Article
Full-text available
Resumo Introdução: O Visual Abstract corresponde a uma síntese visual das informações mais relevantes, apresentadas como infográfico, de um artigo científico. Apesar da utilização crescente de Visual Abstracts por periódicos no mundo, ainda há escassez de estudos que avaliem os elementos que os compõem para orientar sua elaboração. Objetivo: O obje...
Article
Full-text available
This study investigates the difficulty of improving product recommendations in e-commerce systems by tackling the common problem of poor diversity in suggestions. We present a novel approach that uses a Siamese network architecture and ResNet for feature extraction to recommend visually similar elements while incorporating diversity through a clust...
Article
Full-text available
At present, there are problems such as the limitation of the number of people and the single teaching method and means in dance education, while virtual reality brings new opportunities for the innovation of the existing dance education. This paper analyzes the method of combining virtual reality and stage education. Secondly, based on the 3DMAX mo...
Article
Full-text available
In this paper, the genetic algorithm improved by simulated annealing algorithm is used to optimize the model, material and other key parameters in 3D animation design. The parallax calculation and visual comfort evaluation results are combined with experts’ subjective assessments to comprehensively measure the visual performance effect of 3D animat...
Article
Full-text available
The first section of the article proposes a framework of visual learning analysis tools based on ideological and political education from the perspective of visual learning analysis and other perspectives. The experimental objectives as well as the objects are established, and the experimental scheme of the visualization teaching strategy is design...
Article
Full-text available
This paper mainly establishes a recommendation model based on deep neural network to realize the personalized recommendation of physical culture teaching content. Through the feature selection method based on MIFS, the learners’ preference for physical culture teaching content features is determined, and the input process of the recommendation meth...
Article
Full-text available
Di era digital, penyebaran berita hoaks semakin merajalela dan dapat berdampak serius pada masyarakat. Artikel ini membahas bagaimana game literasi digital dapat menjadi solusi inovatif dalam meningkatkan kemampuan masyarakat, terutama mahasiswa, dalam mengenali dan menganalisis berita palsu. Dengan pendekatan yang interaktif, game ini membantu pem...
Book
Full-text available
Handbook describing assessment of visual functioning
Article
Full-text available
Background: Background: The coordination between visual perception and physical response is crucial for an effective performance in sports, emphasizing the significant link between what athletes see and how they respond during performance. Therefore, this study aimed to determine the effect of visual blur on basketball free-throw shooting performan...
Article
Full-text available
This research examines the use of storytelling techniques in the context of educational innovation. This study is an effort to understand how can visual storytelling be a useful educational medium? It is an answer to storytelling being an effective method to teach anyone without a language barrier. In our preliminary studies on the hills region of...
Article
Full-text available
The development of computer networks provides learners with new educational platforms for knowledge acquisition and skill learning. In this study, a learning behavior analysis model based on distance education platform is formed, and clustering analysis, lagged sequence analysis and association rule mining are used to visually analyze learners’ onl...
Article
Full-text available
Previous studies have shown that inhibiting the mirror generalization mechanism in recognizing letters/words containing reversible and non-reversible letters has a right-asymmetry bias. In this paper, we analysed for the first time whether this bias can also be observed in the visual recognition of objects as a “collateral” effect of literacy on co...
Article
Full-text available
Este artículo presentó los resultados de un proyecto de investigación cuyo objetivo fue diseñar una experiencia interactiva de aprendizaje a través del uso de la realidad aumentada (RA) y los dispositivos móviles para encontrar las ventajas y las desventajas de su incorporación como medio de enseñanza dentro del aula. Los participantes fueron 32 e...
Article
Full-text available
Visual affordance grounding enables a computer system to comprehend and recognize an object function and potential uses from an image. This requires not only recognizing objects by their shape and appearance, but also understanding their interactions with the environment and users. This paper introduces SEHD-Afford, a weakly supervised affordance g...
Article
Full-text available
The specific features of the stem growth have been determined in Aesculus hippocastanum L., growing in different environments of the industrial cities of Ukraine. In the course of our study we examined 5840 trees in parks, squares and roadside plantings of the cities of Pokrovsk, Slovyansk, Avdiivka, Kostyantinivka, Novogrodivka, Donetsk, Khartsyzs...
Method
Full-text available
Quantum ESPRESSO is a powerful suite for first-principles electronic structure calculations, but the true value of simulations lies in the ability to analyze and visualize the results. This report explores the essential post-processing techniques and data visualization tools used in Quantum ESPRESSO, providing a roadmap for researchers to extract m...
Article
Full-text available
This paper proposes an innovative strategy for music teaching using digital technology in conjunction with a music theory course. Time domain, frequency domain, and pitch-related features are extracted through audio recognition technology. Using the survey method and experimental method, the music played by students is compared with standard music,...
Conference Paper
Full-text available
Perceptual similarity assessment plays an important role in processing visual information, which is often employed in Human-AI interaction tasks such as object recognition or content generation. It is important to understand how humans perceive and evaluate visual similarity to iteratively generate outputs that meet the users' expectations better a...
Article
Full-text available
This paper traces the presence of South Asians in Edinburgh, Scotland in the 1840s. It provides six paintings and photographs of identified and unidentified sitters as visual sources.
Preprint
Full-text available
Scene Graph Generation (SGG) aims to represent visual scenes by identifying objects and their pairwise relationships, providing a structured understanding of image content. However, inherent challenges like long-tailed class distributions and prediction variability necessitate uncertainty quantification in SGG for its practical viability. In this p...
Article
Full-text available
Introduction Monte Carlo simulation studies allow testing multiple experimental conditions, whose results are often difficult to communicate and visualize to their full extent. Some researchers have proposed alternatives to address this issue, highlighting its relevance. This article develops a new way of observing, analyzing, and presenting the re...
Preprint
Full-text available
Large Vision-Language Models (LVLMs) have shown promising performance in vision-language understanding and reasoning tasks. However, their visual understanding behaviors remain underexplored. A fundamental question arises: to what extent do LVLMs rely on visual input, and which image regions contribute to their responses? It is non-trivial to inter...
Article
Full-text available
False alarming, or detecting an error when there is not one, is a pervasive problem across numerous industries. The present study investigated the role of elaboration, or additional information about non-error differences in complex visual displays, for mitigating false error responding. In Experiment 1, learners studied errors and non-error differ...
Article
Full-text available
Insecurity found in some communities in El Salvador has entered schools and affected internal dynamics. One of these has been the teachers-student relationship, highlighting the erosion of teaching authority and the challenges that it implies to maintain discipline within the classroom. In this context, some students exhibit disruptive behaviors su...
Article
Full-text available
Learner agency is often attributed with taking-action, as the antithesis of passivity and perceived in-action. Our study challenges common perceptions and explores connectivity between agency, passivity, and emotionality in learning, which are not fully understood in science education research. The interplay between agency, passivity, and emotional...
Article
Full-text available
Aiming at the current high-speed real-time inspection demand faced in the production of filter rods with cores and the limitations of traditional quality control methods, this study proposes a high-speed online inspection method based on machine vision. As the production speed of filter rods increases, the traditional inspection methods cannot meet...
Article
Full-text available
Purpose To compare the ophthalmic findings between dyslexic and non-dyslexic children aged 7–10 years. Methods A matched case-control study was conducted on 32 dyslexic children as a case group and 32 non-dyslexics as a control group. Both groups underwent complete ophthalmic examinations to measure corrected distance visual acuity, refractive err...
Article
Full-text available
This research involved the correlation of results of the electrical resistivity method of geophysical prospecting involving the vertical electrical sounding (VES) techniques and exploratory core drilling method. The purpose was to determine the effectiveness of adopting VES in coal exploration and reserve estimation. The crossplots between results...
Article
Full-text available
Introduction: Digital stress, resulting from expectations of online availability, can increase the risk of conflicts with friends. However, friendship conflict remains an underexplored indicator, particularly in association with stressful online experiences. This study aims to examine the association between digital stress and conflict levels overt...
Preprint
Full-text available
Vision-Language Models (VLMs) have recently witnessed significant progress in visual comprehension. As the permitting length of image context grows, VLMs can now comprehend a broader range of views and spaces. Current benchmarks provide insightful analysis of VLMs in tasks involving complex visual instructions following, multi-image understanding a...
Preprint
Full-text available
While foundation models have revolutionised computer vision, their effectiveness for sketch understanding remains limited by the unique challenges of abstract, sparse visual inputs. Through systematic analysis, we uncover two fundamental limitations: Stable Diffusion (SD) struggles to extract meaningful features from abstract sketches (unlike its s...
Preprint
Full-text available
Text-to-Image diffusion models can produce undesirable content that necessitates concept erasure techniques. However, existing methods struggle with under-erasure, leaving residual traces of targeted concepts, or over-erasure, mistakenly eliminating unrelated but visually similar concepts. To address these limitations, we introduce CRCE, a novel co...
Preprint
Full-text available
Document Question Answering (DocQA) is a very common task. Existing methods using Large Language Models (LLMs) or Large Vision Language Models (LVLMs) and Retrieval Augmented Generation (RAG) often prioritize information from a single modal, failing to effectively integrate textual and visual cues. These approaches struggle with complex multi-modal...
Book
Full-text available
“PENGENALAN BAHASA PEMPROGRAMAN UNTUK PEMULA” hadir untuk memberikan gambaran secara rinci dan detail tentang bahasa-bahasa pemrograman yang sedang berkembang saat ini dan penggunaan prosedur atau fungsi dalam menjalankan serangkaian instruksi. Pada buku ini, kami tidak hanya membahas berbagai instruksi atau sintak yang terdapat pada beberapa baha...
Article
Full-text available
To avoid harm caused by falsified medicines, we aimed to devise a method to identify falsified medicines in Japan using visual observation among medicines obtained from personal import agency websites, which are the main conduits through which falsified medicines are obtained. We recorded details regarding the information provided on personal impor...
Article
Full-text available
In albino mice and EphB1 knockout mice, mistargeted retinal ganglion cell axons form dense islands of axon terminals in the dorsal lateral geniculate nuclei (dLGN). The formation of these islands of retinal input depends on developmental patterns of spontaneous retinal activity. We reconstructed the microcircuitry of the activity-dependent islands...
Preprint
Full-text available
Person detection methods are used widely in applications including visual surveillance, pedestrian detection, and robotics. However, accurate detection of persons from overhead fisheye images remains an open challenge because of factors including person rotation and small-sized persons. To address the person rotation problem, we convert the fisheye...
Article
Full-text available
There is considerable interest in understanding patterns of β‐diversity that measure the amount of change in species composition through space or time. Most hypotheses for β‐diversity evoke nonrandom processes that generate spatial and temporal within‐species aggregation; however, β‐diversity can also be driven by random sampling processes. Here, w...
Preprint
Full-text available
Autoregressive Transformer models have demonstrated impressive performance in video generation, but their sequential token-by-token decoding process poses a major bottleneck, particularly for long videos represented by tens of thousands of tokens. In this paper, we propose Diagonal Decoding (DiagD), a training-free inference acceleration algorithm...
Article
Full-text available
Em Casa Grande e Senzala (1933), uma das obras fundadoras da análise sobre as relações coloniais e escravistas no Brasil, Gilberto Freyre relata um ditado popular no qual ele afirma que além das convenções sociais sobre a superioridade da mulher branca e a inferioridade da mulher negra, a preferência sexual dos homens pelas mulatas é indiscutível....
Conference Paper
Full-text available
Visual recognition is essential for animals, including humans, to interpret their environment. However, attention plays a key role in filtering visual information, directing brain resources to salient objects or locations. The saliency map model replicates this biological process, predicting where attention and gaze will focus. Recently, models bas...
Preprint
Full-text available
This paper proposes a new approach for stability analysis of multi-input, multi-output (MIMO) feedback systems through Scaled Relative Graphs (SRGs). Unlike traditional methods, such as the Generalized Nyquist Criterion (GNC), which relies on a coupled analysis that requires the multiplication of models, our approach enables the evaluation of syste...
Preprint
Full-text available
Videos, with their unique temporal dimension, demand precise grounded understanding, where answers are directly linked to visual, interpretable evidence. Despite significant breakthroughs in reasoning capabilities within Large Language Models, multi-modal reasoning - especially for videos - remains unexplored. In this work, we introduce VideoMind,...
Preprint
Full-text available
Video Super-Resolution (VSR) reconstructs high-resolution videos from low-resolution inputs to restore fine details and improve visual clarity. While deep learning-based VSR methods achieve impressive results, their centralized nature raises serious privacy concerns, particularly in applications with strict privacy requirements. Federated Learning...
Preprint
Full-text available
Talking head synthesis, also known as speech-to-lip synthesis, reconstructs the facial motions that align with the given audio tracks. The synthesized videos are evaluated on mainly two aspects, lip-speech synchronization and image fidelity. Recent studies demonstrate that GAN-based and diffusion-based models achieve state-of-the-art (SOTA) perform...
Thesis
Full-text available
ABSTRAK Keamanan objek vital seperti bandara merupakan aspek yang sangat penting untuk memastikan kelancaran operasional dan keselamatan penerbangan. Sistem pengawasan konvensional yang masih bergantung pada patroli manual memiliki keterbatasan dalam hal jangkauan, efektivitas, dan efisiensi. Oleh karena itu, penelitian ini bertujuan untuk merancan...
Preprint
Full-text available
Contrastive decoding strategies are widely used to mitigate object hallucinations in multimodal large language models (MLLMs). By reducing over-reliance on language priors, these strategies ensure that generated content remains closely grounded in visual inputs, producing contextually accurate outputs. Since contrastive decoding requires no additio...
Article
Full-text available
Multi-modality image fusion aims to extract complementary features from multiple source images of different modalities, generating a fused image that inherits their advantages. To address challenges in cross-modality shared feature (CMSF) extraction, single-modality specific feature (SMSF) fusion, and the absence of ground truth (GT) images, we pro...
Preprint
Full-text available
Text-to-image diffusion models have made significant advancements in generating high-quality, diverse images from text prompts. However, the inherent limitations of textual signals often prevent these models from fully capturing specific concepts, thereby reducing their controllability. To address this issue, several approaches have incorporated pe...
Article
Full-text available
Penelitian ini bertujuan untuk mendeskripsikan tingkat kemampuan siswa dalam menyelesaikan masalah matematika berdasarkan gaya belajar mereka, yaitu visual, auditorial, dan kinestetik. Analisis dilakukan dengan mengacu pada tahapan pemecahan masalah yang dikemukakan oleh Polya, yang meliputi memahami masalah, membuat rencana penyelesaian, melaksana...
Preprint
Full-text available
Vision-language models (VLMs) encounter considerable challenges when adapting to domain shifts stemming from changes in data distribution. Test-time adaptation (TTA) has emerged as a promising approach to enhance VLM performance under such conditions. In practice, test data often arrives in batches, leading to increasing interest in the transductiv...
Preprint
Full-text available
Video Foundation Models (VFMs) have recently been used to simulate the real world to train physical AI systems and develop creative visual experiences. However, there are significant challenges in training large-scale, high quality VFMs that can generate high-quality videos. We present a scalable, open-source VFM training pipeline with NVIDIA NeMo,...
Preprint
Full-text available
Multimodal large language models (MLLMs) improve performance on vision-language tasks by integrating visual features from pre-trained vision encoders into large language models (LLMs). However, how MLLMs process and utilize visual information remains unclear. In this paper, a shift in the dominant flow of visual information is uncovered: (1) in sha...
Article
Full-text available
Resumo Nossa experiência com telas eletrônicas e digitais pode ser considerada como a pars pro toto desse "dispositivo" (no amplo significado sociocultural atribuído a esse termo por Foucault) que nos envolve. É por isso que considero decisivo elaborar o que eu chamaria de uma antropologia das telas, para a qual acredito que o valor heurístico ofer...
Article
Full-text available
Purpose To investigate the characteristic manifestations of basal cell carcinoma (BCC) under dermoscopy and reflectance confocal microscopy (RCM) and explore the diagnostic value of dermoscopy combined with RCM for BCC. Methods A cohort of 71 patients with the suspected clinical diagnosis of BCC underwent dermoscopy, RCM, and histopathological exa...
Preprint
Full-text available
We present UniFluid, a unified autoregressive framework for joint visual generation and understanding leveraging continuous visual tokens. Our unified autoregressive architecture processes multimodal image and text inputs, generating discrete tokens for text and continuous tokens for image. We find though there is an inherent trade-off between the...
Article
Full-text available
With the increasing demand for high-quality 3D holographic reconstruction, visual clarity and accuracy remain significant challenges in various imaging applications. Current methods struggle for higher image resolution and to resolve such issues as detail loss and checkerboard artifacts. To address these challenges, we propose the model Depthwise S...
Article
Full-text available
En este trabajo nos proponemos abordar la producción discursiva del pasado prehistórico en los discursos interpretativos del patrimonio desde una perspectiva crítica, reflexiva y feminista, dentro del marco conceptual de la Arqueología Pública. El objetivo es identificar los patrones y tendencias discursivas predominantes, tanto en su vertiente tex...
Article
Full-text available
Objective To assess the ability to visually estimate the fractional shortening in dogs and the impact of experience on those assessments. Methods Right parasternal short- and long-axis cine loops from 25 dogs with varying fractional shortening (6.9% to 61.2%) were distributed online to observers with different levels of training in anesthesiology...
Preprint
Full-text available
Accurately localizing audible objects based on audio-visual cues is the core objective of audio-visual segmentation. Most previous methods emphasize spatial or temporal multi-modal modeling, yet overlook challenges from ambiguous audio-visual correspondences such as nearby visually similar but acoustically different objects and frequent shifts in o...
Article
Full-text available
Is Your Baby Racist? Newsweek published an article titled “Is Your Baby Racist?” which explored the concept of whether babies can be racist. The article discussed research indicating that babies as young as six months old show a preference for faces of their own race, suggesting they notice differences in appearance, including skin colour.[2][6][7...
Article
Full-text available
We tested whether naturally occurring visual variability—specifically, typefaces—would help people generalize word learning to typefaces they had never seen before. In Chinese, thousands of unique written characters must be learned item by item, and differentiated from similar-looking characters. Participants ( n = 190) with no previous Chinese exp...
Article
Full-text available
La lumbalgia inespecífica es una de las principales causas de incapacidad laboral a nivel mundial, afectando la productividad y calidad de vida de los trabajadores. Este estudio tuvo como objetivo determinar prevalencia de lumbalgia inespecífica y grado de incapacidad funcional en trabajadores de mantenimiento e intendencia de una institución educa...
Preprint
Full-text available
Often, the needs and visual abilities differ between the annotator group and the end user group. Generating detailed diagram descriptions for blind and low-vision (BLV) users is one such challenging domain. Sighted annotators could describe visuals with ease, but existing studies have shown that direct generations by them are costly, bias-prone, an...
Preprint
Full-text available
Generating images with embedded text is crucial for the automatic production of visual and multimodal documents, such as educational materials and advertisements. However, existing diffusion-based text-to-image models often struggle to accurately embed text within images, facing challenges in spelling accuracy, contextual relevance, and visual cohe...
Preprint
Full-text available
We study Multimodal Large Language Models (MLLMs) with in-context learning for food preparation task planning. In this context, we identify two key challenges: cross-modal distraction and geometric feasibility. Cross-modal distraction occurs when the inclusion of visual input degrades the reasoning performance of a MLLM. Geometric feasibility refer...
Article
Full-text available
This research explores the factors that both facilitate and constrain the process of co-designing alongside visually impaired children (VIC). Understanding the facilitators and constraints is crucial, as it reveals how actors within a co-design project manage their mutual differences, thereby influencing the effectiveness and quality of the co-desi...
Article
Full-text available
Adolescents with visual impairment face numerous challenges, particularly in areas of growth and development such as socialization and physical milestones. This study explores the challenges in play activities and the perceived need for peer support among visually impaired adolescents. A qualitative approach was employed, targeting visually impaire...
Article
Full-text available
Hoaks atau berita palsu telah menjadi tantangan besar dalam era digital, terutama dalam konteks politik di Indonesia. Penelitian ini bertujuan untuk menganalisis lima berita hoaks yang telah terbukti tidak benar serta mengidentifikasi aspek pemalsuan yang terjadi dalam setiap kasus. Berita yang dianalisis mencakup berbagai isu politik, seperti klai...
Article
Full-text available
El artículo versa sobre la reinterpretación y actualización del patrimonio cultural que la institución museo puede realizar a partir de los programas de creación. Una estrategia es la desarrollada por el Museo Universidad de Navarra, cuya forma de vincular las políticas de conservación y creación analizamos en el caso de Soliloquios, coreografía pa...
Preprint
Full-text available
Multimodal systems have highly complex processing pipelines and are pretrained over large datasets before being fine-tuned for specific tasks such as visual captioning. However, it becomes hard to disentangle what the model learns during the fine-tuning process from what it already knows due to its pretraining. In this work, we learn a probabilisti...
Preprint
Full-text available
We investigate complex video question answering via chain-of-evidence reasoning -- identifying sequences of temporal spans from multiple relevant parts of the video, together with visual evidence within them. Existing models struggle with multi-step reasoning as they uniformly sample a fixed number of frames, which can miss critical evidence distri...
Preprint
Full-text available
A fundamental challenge in conditional 3D shape generation is to minimize the information loss and maximize the intention of user input. Existing approaches have predominantly focused on two types of isolated conditional signals, i.e., user sketches and text descriptions, each of which does not offer flexible control of the generated shape. In this...
Article
Full-text available
Image deblurring is a fundamental preprocessing technique aimed at recovering clear and detailed images from blurry inputs. However, existing methods often struggle to effectively integrate multi‐scale feature extraction with frequency enhancement, limiting their ability to reconstruct fine textures, especially in the presence of non‐uniform blur....
Article
Full-text available
Flows are vivid and diversiform physical phenomena that reflect the fluid motion in different conditions. Fluid flow in the fractures is the typically motion mode of the fluid in rocks. This study investigated, using enhanced X-ray imaging digital radiography (XIDR), a special motion mode of fluid flow through the fractures during rock failures. Ou...
Article
Full-text available
In the current study, electroencephalographic (EEG) data was recorded to study the impact of hand and target visibility on neural processing during both the planning and execution of upper limb reaches. Prior to each movement, participants were informed if the hand and/or the target would be available in four conditions: (1) hand and target visible...
Preprint
Full-text available
In recent years, Multimodal Large Language Models (MLLMs) have demonstrated remarkable advancements in tasks such as visual question answering, visual understanding, and reasoning. However, this impressive progress relies on vast amounts of data collected from the internet, raising significant concerns about privacy and security. To address these i...
Article
Full-text available
تتناول هذه الدراسة موضوع الفقد كواحدة من أكثر التجارب الإنسانية عمقًا وتأثيرًا، مستعرضةً كيفية استخدام العين البشرية كرمز بصري للتعبير عن مشاعر الحزن ومراحل تقبل الفقد في الفن المعاصر. تسعى الدراسة إلى تقديم فهم أعمق لدور العين البشرية كرمز بصري و كوسيط تشكيلي يظهر الانفعالات العاطفية المختلفة، من خلال الوقوف على بعض الممارسات التشكيلية المعاصرة. ت...
Presentation
Full-text available
Th e neutron proton and electron are looped waves in the medium of space. This presentation shows the calculation that the proton and neutron are three wavelengths in the loop. Starting from the Balmer formula the energy levels of the Hydrogen atom are derived including the effective radius of each energy level. This presentation explains visually...
Article
Full-text available
Delayed ettringite formation (DEF) is a durability issue that can cause concrete expansion and cracking. While significant efforts have been made to investigate the microscopic mechanisms of DEF expansion, relatively few studies have focused on structural models. Understanding the effect of stress on DEF expansion is crucial for evaluating the perf...
Preprint
Full-text available
Multimodal Large Language Models (MLLMs) have revolutionized video understanding, yet are still limited by context length when processing long videos. Recent methods compress videos by leveraging visual redundancy uniformly, yielding promising results. Nevertheless, our quantitative analysis shows that redundancy varies significantly across time an...
Preprint
Full-text available
Sign language recognition involves modeling complex multichannel information, such as hand shapes and movements while relying on sufficient sign language-specific data. However, sign languages are often under-resourced, posing a significant challenge for research and development in this field. To address this gap, we introduce ISLR101, the first pu...
Preprint
Full-text available
Background : Transfemoral access (TFA) has been the traditional approach for diagnostic cerebral angiography, but it is associated with several limitations and complications, including pain, discomfort, retroperitoneal hemorrhage, pulmonary embolism, and increased hospital admissions. Transradial cerebral angiography (TRA) offers a promising altern...
Preprint
Full-text available
Subjective interpretation and content diversity make predicting whether an image is private or public a challenging task. Graph neural networks combined with convolutional neural networks (CNNs), which consist of 14,000 to 500 millions parameters, generate features for visual entities (e.g., scene and object types) and identify the entities that co...
Article
Full-text available
One of the main characteristics of CubeSats is their economic viability, making them widely used as study tools in universities and space research centers. However, it is clear that development and operating costs can become high in certain circumstances, especially when using the services of private organizations for these missions. This article a...
Article
Full-text available
Stress recognition from speech seeks an humongous attention among the researchers and from the industrial sides like call centres for recognizing the customer’s intension over speech. Recognizing stress using visual is easier when compared with recognition of stress from speech signal since Lombard effect affects the normal speech heavily. In this...