Jan Martinovic

VŠB-Technical University of Ostrava · IT4Innovations

Ph.D.

About

169
Publications
16,014
Reads
798
Citations
Introduction
Head of the Advanced Data Analysis and Simulation Lab at IT4Innovations. His research focuses on information retrieval, data processing, the design and development of information systems, and disaster management. His work also covers the development of HPC-as-a-Service middleware, which allows HPC infrastructure to be used remotely through a dedicated API. Jan is the coordinator of the H2020 LEXIS project. He has previously coordinated contracted research activities with international and national companies such as Microsoft Corporation (USA) and ArcelorMittal Frýdek-Místek a.s. (Czech Republic) and had responsibility for the
Additional affiliations
January 2006 - present
VŠB-Technical University of Ostrava


Publications (169)
Chapter
The LEXIS (Large-scale EXecution for Industry & Society) H2020 project is building an advanced engineering platform taking advantage of HPC, Cloud solutions and Big Data, leveraging existing HPC infrastructures. In the framework of the LEXIS project, CIMA Research Foundation is running a three nested domain WRF Model with European coverage and rada...
Chapter
Traditional usage models of Supercomputing centres have been extended by High-Throughput Computing (HTC), High-Performance Data Analytics (HPDA) and Cloud Computing. The complexity of current compute platforms calls for solutions to simplify usage and conveniently orchestrate computing tasks. These enable also non-expert users to efficiently execut...
Article
Full-text available
Remote-sensing-driven urban change detection has been studied in many ways for decades for a wide field of applications, such as understanding socio-economic impacts, identifying new settlements, or analyzing trends of urban sprawl. Such kinds of analyses are usually carried out manually by selecting high-quality samples that binds them to small-sc...
Preprint
Full-text available
High-Performance Big Data Analytics (HPDA) applications are characterized by huge volumes of distributed and heterogeneous data that require efficient computation for knowledge extraction and decision making. Designers are moving towards a tight integration of computing systems combining HPC, Cloud, and IoT solutions with artificial intelligence (A...
Poster
Full-text available
LEXIS (Large-scale EXecution for Industry and Society) H2020 project is currently developing an advanced system for Big Data analysis that takes advantage of interacting large-scale geographically-distributed HPC infrastructure and cloud services. More specifically, LEXIS Weather and Climate Large-Scale Pilot workflows ingest data coming from diffe...
Chapter
The LEXIS Weather and Climate Large-scale Pilot will deliver a system for prediction of water-food-energy nexus phenomena and their associated socio-economic impacts. The system will be based on multiple model layers chained together, namely global weather and climate models, high-resolution regional weather models, domain-specific application mode...
Chapter
In the LEXIS project, a tsunami and earthquake large scale pilot is being deployed on top of an innovative HPC/Cloud orchestration layer, combining real-time deadlines, high performance simulations and cloud processing. To handle this, we have relied on a suitable model of computation oriented towards performance and real-time requirements, and use...
Chapter
This paper is concerned with finding near-optimal parameters for an inventory optimization model on a large dataset. It is shown that our proposed method allows for very good model parameter estimation with a great reduction in computation time. The model, developed in cooperation with the company K2 atmitec s.r.o., has four input parameters which must be set b...
Book
This book presents the latest findings in the areas of data management and smart computing, big data management, artificial intelligence and data analytics, along with advances in network technologies. Gathering peer-reviewed research papers presented at the Fourth International Conference on Data Management, Analytics and Innovation (ICDMAI 2020),...
Book
This book presents the latest findings in the areas of data management and smart computing, big data management, artificial intelligence and data analytics, along with advances in network technologies. Gathering peer-reviewed research papers presented at the Fourth International Conference on Data Management, Analytics and Innovation (ICDMAI 2020),...
Article
Full-text available
The Sentinel-1 satellite system continuously observes European countries at a relatively high revisit frequency of six days per orbital track. Given the Sentinel-1 configuration, most areas in Czechia are observed every 1–2 days by different tracks in a moderate resolution. This is attractive for various types of analyses by various research groups...
Preprint
Sentinel-1 satellite system continuously observes European countries in a relatively high revisit frequency of 6 days per orbital track. Given the Sentinel-1 configuration, most areas in Czechia are observed every 1–2 days by different tracks in a moderate resolution. This is attractive for various types of analyses by various research groups. The...
Article
Developing and optimizing software applications for high performance and energy efficiency is a very challenging task, even when considering a single target machine. For instance, optimizing for multicore-based computing systems requires in-depth knowledge about programming languages, application programming interfaces, compilers, performance tunin...
Article
Full-text available
Artificial intelligence (AI) is undergoing a revolution thanks to the breakthroughs of machine learning algorithms in computer vision, speech recognition, natural language processing and generative modelling. Recent works on publicly available pharmaceutical data showed that AI methods are highly promising for Drug Target prediction. Howev...
Chapter
Effective navigation and distribution of traffic flow in large cities has become a hot topic in recent years. The authors have developed an advanced server side routing system which, together with client side navigation systems, is able not only to navigate cars according to their routing requests, but also to distribute traffic flow within a city....
Chapter
The HPC-as-a-Service concept is to provide users with simple and intuitive access to a supercomputing infrastructure without the need to buy and manage their own physical servers or data centers. This article presents the commonly used services and implementations of this concept and introduces our own in-house application framework called High-End...
Chapter
Full-text available
High Performance Computing (HPC) infrastructures (also referred to as supercomputing infrastructures) are at the basis of modern scientific discoveries, and allow engineers to greatly optimize their designs. The large amount of data (Big-Data) to be treated during simulations is pushing HPC managers to introduce more heterogeneity in their architec...
Chapter
Full-text available
Accurate and rapid earthquake loss assessments and tsunami early warnings are critical in modern society to allow for appropriate and timely emergency response decisions. In the LEXIS project, we seek to enhance the workflow of rapid loss assessments and emergency decision support systems by leveraging an orchestrated heterogeneous environment comb...
Chapter
Floating car data (FCD) are one of the most important sources of traffic data. However, their processing requires several steps that may seem trivial but have far-reaching consequences. One such step is map-matching, i.e. assignment of the FCD measurement to the correct road segment. While it can be done very simply by assigning the point of measur...
Chapter
Analysis of the relationship between a large number of sequences is a significant problem in many different applications such as business processes, sport, voting, weblogs, etc. Generally, studying relationship is based on clustering the sequences and creating a network of relationships. Interpretation and validation of such results require a domai...
Article
Incorporating speed probability distribution to the computation of the route planning in car navigation systems guarantees more accurate and precise responses. In this paper, we propose a novel approach for dynamically selecting the number of samples used for the Monte Carlo simulation to solve the Probabilistic Time-Dependent Routing (PTDR) proble...
Article
The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect Oriented Programming (AOP) concepts to allow applications to enforce extra functional properties such as energy-efficiency and performance and to optimize Quality of Service (QoS) in an adaptive way. The DSL approach allows the definition of energy-efficiency, performanc...
Preprint
Full-text available
The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect Oriented Programming (AOP) concepts to allow applications to enforce extra functional properties such as energy-efficiency and performance and to optimize Quality of Service (QoS) in an adaptive way. The DSL approach allows the definition of energy-efficiency, performanc...
Preprint
Incorporating speed probability distribution to the computation of the route planning in car navigation systems guarantees more accurate and precise responses. In this paper, we propose a novel approach for dynamically selecting the number of samples used for the Monte Carlo simulation to solve the Probabilistic Time-Dependent Routing (PTDR) proble...
Chapter
The robot soccer game introduces a variable and dynamic environment for cooperating agents. Coverage of areas such as multi-agent systems, robot control, optimal path planning, real-time image processing and machine learning makes this domain very attractive. This article presents our approach to strategy description of the robot soccer game and a...
Article
Full-text available
A large number of real-world applications have shown that the use of computational methods for distribution process planning produces substantial savings. Many of these applications lead to a problem generally known as the Vehicle Routing Problem. Real-world applications are highly computationally demanding for larger instances. This article aims to...
Article
Full-text available
The quality of an optimal solution of the Vehicle Routing Problem is strongly dependent on the setting of the configuration parameters of the algorithm. The paper is focused on the introduction of hyperparameter search for solving the Vehicle Routing Problem using a HyperLoom platform for defining and executing scientific pipelines in a...
Article
Full-text available
We introduce KaraMIR, a musical project dedicated to karaoke song analysis. Within KaraMIR, we define Kara1k, a dataset composed of 1000 cover songs provided by Recisio Karafun application, and the corresponding 1000 songs by the original artists. Kara1k is mainly dedicated toward cover song identification and singing voice analysis. For both tasks...
Conference Paper
Full-text available
Our automatized interferometric monitoring system, IT4S1, contains a database of Sentinel-1 satellite image bursts that have been preprocessed to the state of a consistent well-coregistered dataset. The coregistration solution introduces a new type of data, an SLC-C (corrected single look complex data). These are SLC images ready for interferometri...
Conference Paper
The growing demand for high-performance capabilities in data centers (DCs) is leading to the adoption of heterogeneous solutions. The advantages of specialised hardware are better support for different types of workloads and reduced power consumption. Among others, FPGAs offer the unique capability to provide hardware specialisation and low power...
Conference Paper
We have developed HyperLoom - a platform for defining and executing scientific workflows in large-scale HPC systems. The computational tasks in such workflows often have non-trivial dependency patterns, unknown execution time and unknown sizes of generated outputs. HyperLoom enables efficient execution of the workflows respecting task requirements...
Conference Paper
Designing and optimizing applications for energy-efficient High Performance Computing systems up to the Exascale era is an extremely challenging problem. This paper presents the toolbox developed in the ANTAREX European project for autotuning and adaptivity in energy efficient HPC systems. In particular, the modules of the ANTAREX toolbox are descr...
Conference Paper
The main goal of this article is to describe the overview of Floreon⁺ system, an online flood monitoring and prediction system, which was primarily developed for the Moravian-Silesian region in the Czech Republic. Moreover, the article specifies the basic processes, which are implemented for running automatic and on-demand simulations that utilize...
Conference Paper
Real-world scientific applications often encompass end-to-end data processing pipelines composed of a large number of interconnected computational tasks of various granularity. We introduce HyperLoom, an open source platform for defining and executing such pipelines in distributed environments and providing a Python interface for defining tasks. Hy...
Data
Poster from IEEE ISM 2017: Kara1k: a karaoke dataset for cover song identification and singing voice analysis. Explaining the new Kara1k dataset for cover song identification and singing voice analysis. Download below.
Conference Paper
Full-text available
We introduce Kara1k, a new musical dataset composed of 2,000 analyzed songs thanks to a partnership with a karaoke company. The dataset is divided into 1,000 cover songs provided by Recisio Karafun application http://www.karafun.com, and the corresponding 1,000 songs by the original artists. Kara1k is mainly dedicated toward cover song identific...
Article
Full-text available
The probabilistic time-dependent vehicle routing problem is presented in this paper. It is a novel variant of the vehicle routing problem. The variant is a problem of finding optimal routes for a fleet of vehicles visiting customers in order to perform delivery or pick-up. All customers must be visited in designated times with given probabilities a...
Conference Paper
Full-text available
Cover song identification has been a popular task within music information retrieval in the 20th century. The task is to identify a different version or performance of a previously recorded song. Unlike audio search for an exact matching song, this task has not yet been popularized among users, due to an ambiguous definition of a cover song and the...
Conference Paper
Hierarchical Data Format (HDF5) is a popular binary storage solution in high performance computing (HPC) and other scientific fields. It has bindings for many popular programming languages, including C++, which is widely used in the HPC field. Its C++ API requires mapping of the native C++ data types to types native to the HDF5 API. This task can b...
Chapter
Data clustering is a basic data mining discipline that has been in center of interest of many research groups. This paper describes the formulation of the basic NP-hard optimization problem in data clustering which is approximated by many heuristic methods. The famous k-means clustering algorithm and its initialization is of a particular interest i...
Chapter
While most well-known modern performance benchmarks for high performance computers focus mainly on the speed of arithmetic operations, an increasing number of today's problems also depend on the speed of memory access. This aspect is becoming crucial for all data driven computations. In this paper, two benchmarks focusing on the speed of memo...
Conference Paper
Computational performance of route planning algorithms has become increasingly important in recent real navigation applications with many simultaneous route requests. Navigation applications should recommend routes as quickly as possible and preferably with some added value. This paper presents a performance evaluation of the main part of probabili...
Conference Paper
We present an automated application of inventory optimization based on sales forecasts. Inventory stock optimization has been in high demand among companies in recent years; however, inventory models are based on sales expectations. Therefore, the problem of optimizing inventory stock is divided into two parts: sales forecasting and setting the optimal inventory....
Conference Paper
Dynamic Time Warping algorithm (DTW) is an effective tool for comparing two sequences which are subject to some kind of distortion. Unlike the standard methods for comparison, it is able to deal with a different length of compared sequences or with reasonable amount of inaccuracy. For this reason, DTW has become very popular and it is widely used i...
Conference Paper
This paper presents an experimental evaluation of probabilistic time-dependent travel time computation. Monte Carlo simulation is used for the computation of travel times and their probabilities. The simulation is utilizing traffic data regarding incidents on roads to compute the probability distribution of travel time on a selected path. Traffic d...
Conference Paper
The ANTAREX project aims at expressing the application self-adaptivity through a Domain Specific Language (DSL) and to runtime manage and autotune applications for green and heterogeneous High Performance Computing (HPC) systems up to Exascale. The DSL approach allows the definition of energy-efficiency, performance, and adaptivity strategies as we...
Chapter
Visualisation of relations between the users is an important part of business process analysis. The authors focused on behavioral graphs to represent relations between the users based on their behavior in the system. The behavior is determined by sequences of activities the users have performed. The proposed method deals with the problem of the beh...
Chapter
One of the main tasks of Intelligent Transportation Systems is to predict state of the traffic from short to medium horizon. This prediction can be used to manage the traffic both to prevent the traffic congestions and to minimize their impact. This information is also useful for route planning. This prediction is not an easy task given that the tr...
Conference Paper
This paper presents an algorithm for dynamic travel time computation along Czech Republic highways. The dynamism is represented by speed profiles used for computation of travel times at specified time. These speed profiles have not only the information about an optimal speed, but also a probability of this optimal speed and the probability of the s...
Conference Paper
Intelligent Transportation Systems are highly dependent on the quality and quantity of road traffic data. The complexity of input data is often crucial for the effectiveness and sufficient reliability of such systems. Recently, the fusion of various data sources has become a topic which attracts the attention of several researchers. The algorithms for data fu...
Conference Paper
Analysis of event logs is a very important discipline used for the evaluation of performance and control-flow issues within systems. This type of analysis is typically used in the process mining sphere, where information systems, for example workflow management systems, enterprise resource planning systems, customer relationship management, supply ch...
Article
Full-text available
This article describes statistical evaluation of the computational model for precipitation forecast and proposes a method for uncertainty modelling of rainfall-runoff models in the Floreon(+) system based on this evaluation. The Monte-Carlo simulation method is used for estimating possible river discharge and provides several confidence intervals t...
Article
Full-text available
Robot Soccer is a very attractive platform in terms of research. It contains a number of challenges in the areas of robot control, artificial intelligence and image analysis. This article presents a look at the overall architecture of the game and describes some results of our experiments in analysis and optimization of strategies using sequence ex...
Conference Paper
A co-author network is a typical example of a dynamic complex network, which evolves and changes over time. One way to capture and describe the dynamics of the network is to determine the Stationarity of the communities detected in the network. In the paper, we have proposed a modified Stationarity, which is focused only on co-authors of a g...
Conference Paper
Robot Soccer is a very attractive platform in terms of research. It contains a number of challenges in the areas of robot control, artificial intelligence and image analysis. This article presents a method to improve the description of the strategy by creating substrategies in strategy and thus ensuring smoother implementation of actions defined by...