Giovanni Aloisio

Giovanni Aloisio
University of Salento | Unisalento · Dept. Of Innovation Engineering, University Of Salento & CMCC/SCO (Scientific Computing and Operation Division)

About

301
Publications
44,888
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
3,137
Citations
Additional affiliations
August 2004 - present
Centro Euro-Mediterraneo sui Cambiamenti Climatici
Position
  • Managing Director

Publications

Publications (301)
Article
Full-text available
Tropical Cyclones (TCs) are counted among the most destructive phenomena that can be found in nature. Every year, globally an average of 90 TCs occur over tropical waters, and global warming is making them stronger and more destructive. The accurate localization and tracking of such phenomena have become a relevant and interesting area of research...
Preprint
Full-text available
Accurate and precise climate projections are required for climate adaptation and mitigation, but Earth system models still exhibit great uncertainties. Several approaches have been developed to reduce the spread of climate projections and feedbacks, yet those methods cannot capture the non-linear complexity inherent in the climate system. Using a T...
Preprint
Full-text available
Tropical Cyclones (TCs) are counted among the most destructive phenomena that can be found in nature. Every year, globally an average of 90 TCs occur over tropical waters, and global warming is making them stronger, larger and more destructive. The accurate detection and tracking of such phenomena have become a relevant and interesting area of rese...
Article
The increasing volume of data in many scientific fields demands a transformative approach to data management and analysis. The data space concept, i.e., a digital ecosystem promoting sustainable and FAIR data use, has emerged to address key challenges. This paper introduces the ENES Data Space, a domain-specific implementation for climate scientist...
Article
Full-text available
The evolution of High-Performance Computing (HPC) platforms enables the design and execution of progressively larger and more complex workflow applications in these systems. The complexity comes not only from the number of elements that compose the workflows but also from the type of computations they perform. While traditional HPC workflows target...
Article
Full-text available
This paper presents the message passing interface (MPI)-based parallelization of the three-dimensional hydrodynamic model SHYFEM (System of HydrodYnamic Finite Element Modules). The original sequential version of the code was parallelized in order to reduce the execution time of high-resolution configurations using state-of-the-art high-performance...
Preprint
Full-text available
The evolution of High-Performance Computing (HPC) platforms enables the design and execution of progressively larger and more complex workflow applications in these systems. The complexity comes not only from the number of elements that compose the workflows but also from the type of computations they perform. While traditional HPC workflows target...
Article
Full-text available
The evolution of High-Performance Computing (HPC) platforms enables the design and execution of progressively larger and more complex workflow applications in these systems. The complexity comes not only from the number of elements that compose the workflows but also from the type of computations they perform. While traditional HPC workflows target...
Article
Full-text available
Since December 2019, the novel coronavirus disease (COVID-19) has had a considerable impact on the health and socioeconomic fabric of Italy. The effective reproduction number Rt is one of the most representative indicators of the contagion status as it reports the number of new infections caused by an infected subject in a partially immunized popul...
Chapter
Full-text available
O período entre 2018 e 2022 mostrou-nos que o problema dos incêndios à escala global não está a diminuir, antes pelo contrário. Parece que as consequências das alterações climáticas já estão a afectar a ocorrência de incêndios florestais em várias partes do Mundo, de uma forma que só esperaríamos que acontecesse vários anos mais tarde. Em muitos pa...
Article
Full-text available
One of the most important open challenges in climate science is downscaling. It is a procedure that allows making predictions at local scales, starting from climatic field information available at large scale. Recent advances in deep learning provide new insights and modeling solutions to tackle downscaling-related tasks by automatically learning t...
Preprint
Full-text available
This paper presents the MPI-based parallelization of the three-dimensional hydrodynamic model SHYFEM (System of HydrodYnamic Finite Element Modules). The original sequential version of the code was parallelized in order to reduce the execution time of high-resolution configurations using state-of-the-art HPC systems. A distributed memory approach w...
Article
Back in December 2019, the novel coronavirus disease 2019 (Covid-19) started rapidly spreading worldwide, especially in Italy that was among the most affected countries. The geographical distribution of air pollution and Covid-19 mortality in Italy suggested atmospheric pollution as a worsening factor of severe Covid-19 health outcomes. The present...
Article
Full-text available
Compartmental models have long been used in epidemiological studies for predicting disease spread. However, a major issue when using compartmental mathematical models concerns the time-invariant formulation of hyper-parameters that prevent the model from following the evolution over time of the epidemiological phenomenon under investigation. In ord...
Article
Full-text available
Over the last two decades, scientific discovery has increasingly been driven by the large availability of data from a multitude of sources, including high-resolution simulations, observations and instruments, as well as an enormous network of sensors and edge components. In such a dynamic and growing landscape where data continue to expand, advance...
Article
The intensification of extreme events, storm surges and coastal flooding in a climate change scenario increasingly influences human processes, especially in coastal areas where sea-based activities are concentrated. Predicting sea level near the coasts, with a high accuracy and in a reasonable amount of time, becomes a strategic task. Despite the d...
Article
The spread of SARS-CoV-2, the beta coronavirus responsible for the current pneumonia pandemic outbreak, has been speculated to be linked to short-term and long-term atmospheric pollutants exposure. The present work has been aimed at analyzing the atmospheric pollutants concentrations (PM 10 , PM 2.5 , NO 2) and spatio-temporal distribution of cases...
Chapter
The continuous increase in the data produced by simulations, experiments and edge components in the last few years has forced a shift in the scientific research process, leading to the definition of a fourth paradigm in Science, concerning data-intensive computing. This data deluge, in fact, introduces various challenges related to big data volumes...
Article
Full-text available
Smart urban transportation management can be considered as a multifaceted big data challenge. It strongly relies on the information collected into multiple, widespread, and heterogeneous data sources as well as on the ability to extract actionable insights from them. Besides data, full stack (from platform to services and applications) Information...
Article
Full-text available
Analysis of public transportation data in large cities is a challenging problem. Managing data ingestion, data storage, data quality enhancement, modelling and analysis requires intensive computing and a non-trivial amount of resources. In EUBra-BIGSEA (Europe–Brazil Collaboration of Big Data Scientific Research Through Cloud-Centric Applications)...
Chapter
Full-text available
We present a message-passing based parallel algorithm for mining Correlated Heavy Hitters from a two-dimensional data stream. To the best of our knowledge, this is the first parallel algorithm solving the problem. We show, through experimental results, that our algorithm provides very good scalability, whilst retaining the accuracy of its sequentia...
Presentation
Full-text available
Numerical Ocean Models rely on the solution of linear systems. This happens when the models use implicit schemes to solve the equations (e.g. the free surface equation). Krylov Subspace Methods are most commonly used to solve the linear systems, and in the distributed memory case this introduces a communication overhead that becomes the more oner...
Article
Climate and biodiversity systems are closely linked across a wide range of scales. To better understand the mutual interaction between climate change and biodiversity there is a strong need for multidisciplinary skills, scientific tools, and access to a large variety of heterogeneous, often distributed, data sources. Related to that, the EUBrazilCl...
Article
Full-text available
This paper describes the achievements of the H2020 project INDIGO-DATACLOUD. The project has provided e-infrastructures with tools, applications and cloud framework enhancements to manage the demanding requirements of scientific communities, either locally or through enhanced interfaces enabling those infrastructures to become part of a federation...
Article
Full-text available
We deal with the problem of detecting frequent items in a stream under the constraint that items are weighted, and recent items must be weighted more than older ones. This kind of problem naturally arises in a wide class of applications in which recent data is considered more useful and valuable with regard to older, stale data. The weight assigned...
Conference Paper
In the context of the EU H2020 INDIGO-DataCloud project several use case on large scale scientific data analysis regarding different research communities have been implemented. All of them require the availability of large amount of data related to either output of simulations or observed data from sensors and need scientific (big) data solutions t...
Article
Full-text available
Reliable and timely information on the environmental conditions at sea is key to the safety of professional and recreational users as well as to the optimal execution of their activities. The possibility of users obtaining environmental information in due time and with adequate accuracy in the marine and coastal environment is defined as sea situat...
Preprint
Full-text available
Climate and biodiversity systems are closely interlaced across a wide range of scales. To better understand the mutual interaction between climate change and biodiversity there is a strong need for multidisciplinary skills, tools and a large variety of heterogeneous, distributed data sources. In this regard, the EUBrazilCloudConnect project provide...
Preprint
Full-text available
Climate and biodiversity systems are closely interlaced across a wide range of scales. To better understand the mutual interaction between climate change and biodiversity there is a strong need for multidisciplinary skills, tools and a large variety of heterogeneous, distributed data sources. In this regard, the EUBrazilCloudConnect project provide...
Article
Full-text available
An efficient, secure and interoperable data platform solution has been developed in the TESSA project to provide fast navigation and access to the data stored in the data archive, as well as a standard-based metadata management support. The platform mainly targets scientific users and the situational sea awareness high-level services such as the de...
Article
This work describes the introduction of a second level of parallelism based on the OpenMP shared memory paradigm to NEMO, one of the most widely used ocean models in the European climate community. Although the existing parallelisation scheme in NEMO, based on the MPI paradigm, has served it well for many years, it is becoming unsuited to current h...
Article
Full-text available
A climate model represents a multitude of processes on a variety of timescales and space scales: a canonical example of multi-physics multi-scale modeling. The underlying climate system is physically characterized by sensitive dependence on initial conditions, and natural stochastic variability, so very long integrations are needed to extract signa...
Article
In this paper we present the approach proposed by EU H2020 INDIGO-DataCloud project to orchestrate dynamic workflows over a cloud environment. The main focus of the project is on the development of open source Platform as a Service solutions targeted at scientific communities, deployable on multiple hardware platforms, and provisioned over hybrid e...
Conference Paper
A case study on climate models intercomparison data analysis addressing several classes of multi-model experiments is being implemented in the context of the EU H2020 INDIGO-DataCloud project. Such experiments require the availability of large amount of data (multi-terabyte order) related to the output of several climate models simulations as well...
Preprint
Full-text available
A climate model represents a multitude of processes on a variety of time and space scales; a canonical example of multi-physics multi-scale modeling. The underlying climate system is physically characterized by sensitive dependence on initial conditions, and natural stochastic variability, so very long integrations are needed to extract signals of...
Article
Full-text available
An efficient, secure, and interoperable data platform solution has been developed in the TESSA project to provide fast navigation and access to the data stored in the data archive, as well as a standard-based metadata management support. The platform mainly targets scientific users and the Situational Sea Awareness high-level services such as the D...
Article
Full-text available
The provision of reliable and timely information on the environmental conditions at sea to professional and recreational users is of strategic importance for their safety and for the optimal execution of their duties and activities. The capacity of the users of having the environmental information in due time and with the adequate accuracy in the m...
Article
Given an array of n elements and a value 2≤k≤n, a frequent item or k-majority element is an element occurring in more than n/k times. The k-majority problem requires finding all of the k-majority elements. In this paper, we deal with parallel shared-memory algorithms for frequent items; we present a shared-memory version of the Space Saving algorit...
Preprint
Given an array $\mathcal{A}$ of $n$ elements and a value $2 \leq k \leq n$, a frequent item or $k$-majority element is an element occurring in $\mathcal{A}$ more than $n/k$ times. The $k$-majority problem requires finding all of the $k$-majority elements. In this paper we deal with parallel shared-memory algorithms for frequent items; we present a...
Article
Full-text available
The present work aims at evaluating the scalability performance of a high-resolution global ocean biogeochemistry model (PELAGOS025) on massive parallel architectures and the benefits in terms of the time-to-solution reduction. PELAGOS025 is an on-line coupling between the Nucleus for the European Modelling of the Ocean (NEMO) physical ocean model...
Article
Full-text available
In this paper we describe the architecture of a Platform as a Service (PaaS) oriented to computing and data analysis. In order to clarify the choices we made, we explain the features using practical examples, applied to several known usage patterns in the area of HEP computing. The proposed architecture is devised to provide researchers with a unif...
Data
Full-text available
In this paper the use of augmented reality and cloud computing technology to enrich the scenes of sites with a relevant cultural interest is proposed. The main goal is to develop a mobile application to improve the user’s cultural experience during the sightseeing of a city of art through the integration of digital contents related to specific site...
Data
Full-text available
Phytoplankton is a quality element for determining the ecological status of transitional water ecosystems. In routine analysis, bio-volume and surface area of phytoplankton are the most studied morphometric descriptors. Bio-volume can be estimated by comparing the algae with similar three-dimensional geometric forms and determining their volume, by...
Article
We present FDCMSS, a new sketch based algorithm for mining frequent items in data streams. The algorithm cleverly combines key ideas borrowed from forward decay, the Count-Min and the Space Saving algorithms. It works in the time fading model, mining data streams according to the cash register model. We formally prove its correctness and show, thro...
Code
Ophidia is a CMCC Foundation research project addressing big data challenges for eScience. It provides support for data-intensive analysis exploiting advanced parallel computing techniques and smart data distribution methods. The Ophidia analytics framework can be exploited in different scientific domains (e.g. Climate Change, Earth Sciences, Life...
Chapter
Many e-science initiatives are currently investigating the use of cloud computing to support all kinds of scientific activities. The objective of this chapter is to describe the architecture and the deployment of the EUBrazilCC federated e-infrastructure, a Research & Development project that aims at providing a user-centric test bench enabling Eur...
Article
Full-text available
The present work aims at evaluating the scalability performance of a high-resolution global ocean biogeochemistry model (PELAGOS025) on massive parallel architectures and the benefits in terms of the time-to-solution reduction. PELAGOS025 is an on-line coupling between the physical ocean model NEMO and the BFM biogeochemical model. Both the models...
Conference Paper
In this paper the use of augmented reality and cloud computing technology to enrich the scenes of sites with a relevant cultural interest is proposed. The main goal is to develop a mobile application to improve the user’s cultural experience during the sightseeing of a city of art through the integration of digital contents related to specific site...
Conference Paper
The analysis of large volumes of data is key for knowledge discovery in several scientific domains such as climate, astrophysics, life sciences among others. It requires a large set of computational and storage resources, as well as flexible and efficient software solutions able to dynamically exploit the available infrastructure and address issues...
Conference Paper
Full-text available
Phytoplankton is a quality element for determining the ecological status of transitional water ecosystems. In routine analysis, bio-volume and surface area of phytoplankton are the most studied morphometric descriptors. Bio-volume can be estimated by comparing the algae with similar three-dimensional geometric forms and determining their volume, by...
Conference Paper
The present work describes the analysis and optimisation of the PELAGOS025 configuration based on the coupling of the NEMO physic component of the ocean dynamics and the BFM (Biogeochemical Flux Model), a sophisticated biogeochemical model that can simulate both pelagic and benthic processes. The methodology here followed is characterised by the pe...
Conference Paper
The Ophidia project is a research effort addressing big data analytics requirements, issues, and challenges for eScience. We present here the Ophidia analytics framework, which is responsible for atomically processing, transforming and manipulating array-based data. This framework provides a common way to run on large clusters analytics tasks appli...