Article

A method for efficient execution of bioinformatics workflows.

Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka 565-0871, Japan.
Genome informatics. International Conference on Genome Informatics 10/2009; 23(1):139-48. pp.139-48
Source: PubMed

ABSTRACT Efficient execution of data-intensive workflows has been playing an important role in bioinformatics as the amount of data has been rapidly increasing. The execution of such workflows must take into account the volume and pattern of communication. When orchestrating data-centric workflows, a centralized workflow engine can become a bottleneck to performance. To cope with the bottleneck, a hybrid approach with choreography for data management of workflows is proposed. However, when a workflow includes many repetitive operations, the approach might not gain good performance because of the overheads of its additional mechanism. This paper presents and evaluates an improvement of the hybrid approach for managing a large amount of data. The performance of the proposed method is demonstrated by measuring execution times of example workflows.

0 0
 · 
0 Bookmarks
 · 
48 Views

Full-text

View
1 Download
Available from

Keywords

additional mechanism
 
bioinformatics
 
centralized workflow engine
 
data management
 
data-intensive workflows
 
Efficient execution
 
example workflows
 
execution times
 
hybrid approach
 
large amount
 
orchestrating data-centric workflows
 
paper presents
 
proposed method
 
repetitive operations
 
workflows