
Hiroya MatsubaHitachi, Ltd. · Central Research Laboratory
Hiroya Matsuba
Ph.D (Information Science and Technology)
About
16
Publications
624
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
59
Citations
Citations since 2017
Introduction
Skills and Expertise
Additional affiliations
August 2006 - March 2010
April 2003 - August 2006
Publications
Publications (16)
This paper proposes a system-wide communication library to couple multiple MPI programs for heterogeneous coupling computing called h3-Open-SYS/WaitIO-Socket (WaitIO-Socket for short). WaitIO-Socket provides an inter-program communication environment among MPI programs and supports different MPI libraries with various interconnects and processor ty...
Accurate and up-to-date knowledge of how a cloud tenant's VMs utilize the underlying cloud infrastructure is essential for many cloud management tasks including tenant onboarding, optimized VM placement, performance optimization, and debugging. Unfortunately, existing solutions such as instrumentation at the hypervisors or standard networking proto...
An on-demand file staging system, Catwalk, is proposed. Catwalk is designed so that it can run on any Linux clusters without any special or additional hardware. By having hook functions on the system calls of file operations, a file staging system can be transparent from the view of users, and users can be free from having wrong file staging script...
This paper describes the design and implementation of a seamless MPI runtime environment, called MPI-Adapter, that realizes
MPI program binary portability in different MPI runtime environments. MPI-Adapter enables an MPI binary program to run on
different MPI implementations. It is implemented as a dynamic loadable module so that the module dynamic...
In a cluster of multiple processors or cpu-cores, many processes may run on each compute node. Each process tends to issue contiguous I/O requests for snapshot, checkpointing or so, however, if large num- ber of processes enter the I/O phase at the same time, the requests from the same process may be interrupted by the requests of other processes....
At a cluster of clusters used for parallel computing, it is important to fully utilize the inter-cluster network. Existing MPI implementations for cluster of clusters have two issues: 1) Single point-to-point communication cannot utilize the bandwidth of the high-bandwidth inter-cluster network because a Gigabit Ethernet interface is used at each n...
Multiple processors or multi-core CPUs are now in common, and the number of processes running concurrently is increasing in a cluster. Each process issues contiguous I/O requests individually, but they can be interrupted by the requests of other processes if all the processes enter the I/O phase together. Then, I/O nodes handle these requests as no...
A broadcast-based single IP cluster aims at being both scalable and available. However, existing systems can only employ static traffic assignment based on incoming packets. In this paper we propose FTCS, a new TCP connection dispatching mechanism that enables a single IP cluster to use more flexible load balancing algorithms. In this mechanism, on...
We propose a fault detection system activated by an application when the application recognizes the occurrence of a failure, in order to realize self managing systems that automatically find the source of a failure. In existing detection systems, there are three issues for constructing self managing applications: i) the detection results are not se...
As a method for running multiple operating systems on one machine, we propose a new resource partitioning method we have named "single hardware with independent multiple operating systems" (SHIMOS). In SHIMOS, CPU and memory resources are partitioned by multiple native kernels without any architectural virtualization supports. There is nearly no sl...
A new communication model, called the PlogPT model, is proposed to predict communication performance in a commodity cluster where computing nodes communicate using TCP/IP. This model extends the PlogP model in order to consider the variation of bandwidth brought about by bottleneck links in network switches and the delay of packet retransmission by...
Operating a cluster on a single IP address is required when the cluster is used to provide some Internet services. This paper proposes SAPS, a new method to assign a single IP address to a cluster. The TCP/IP protocol is handled at a single node called the I/O server. The other nodes, called application nodes, provide the socket interface to applic...
FDSM, a software distributed shared memory system, has been designed and implemented to run OpenMP pro- grams on distributed memory computers. FDSM inspects the access pattern of an application at the first iteration of a loop and figures out the communication pattern. FDSM is implemented on both IA-32 and IA-64 architectures and evaluated using th...
This paper describes an OpenMP ready distributed shared memory system called FDSM. FDSM analyzes the access pattern to the shared memory at the flrst itera- tion of a loop and obtain the communication set. By using this information, it reduces the overhead of the coherence maintenance. OpenMP on FDSM is evaluated by using the benchmark application...