About
15
Publications
659
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
35
Citations
Introduction
Skills and Expertise
Current institution
Education
March 2006 - February 2011
Publications
Publications (15)
The Layout-Aware Data Scheduling (LADS) data movement framework optimizes congestion for end-toend data transfers. During data transfer, LADS can avoid congested storage elements by exploiting the underlying storage layout at each endpoint. This improves the I/O bandwidth and hence the data transfer rate across high-speed networks. However, the abs...
Background
A cross-correlation (XCorr) score function is one of the most popular score functions utilized to search peptide identifications in databases, and many computer programs, such as SEQUEST, Comet, and Tide, currently use this score function. Recently, the HiXCorr algorithm was developed to speed up this score function for high-resolution s...
Layout-Aware Data Scheduler (LADS) data transfer tool, identifies and addresses the issues that lead to congestion on the path of an end-to-end data transfer in the terabit network environments. It exploits the underlying storage layout at each endpoint to maximize throughput without negatively impacting the performance of shared storage resources...
As the amount and the type of data for business decision making are rapidly increasing, the importance of big data analytics is gradually critical for making effective business strategy. However, big data analytics based decision making systems basically requires distributed parallel computing capability in order to make timely business strategy re...
Sensor data is structured and generally lacks of meaning by itself, but life-logging data (time, location, etc.) out of sensor data can be utilized to create lots of meaningful information combined with social data from social networks like Facebook and Twitter. There have been many platforms to produce meaningful information and support human beha...
Recently, there is an increasing interest in effectively using big data. It is also thought that the machine learning methods are crucial to effectively extract knowledge from big text data when they are coupled with big data technologies such as MapReduce and Hadoop. For tasks such as the knowledge extraction from huge amount of texts and the reas...
Numerous linguistic resources are readily available in area of expertise due to the development of wireless devices such as smart-phones and the internet. To select useful information from the massive amount of the data, many systems using semantic web technologies have been developed. In order to build those systems, data collection and natural la...
There are a lot of research results in large scale graph analysis on Hadoop. The performance of the graph analysis based on Hadoop is impacted by data partitioning. The effectiveness of data partitioning depends on how the data partitioning maintains data locality in each node of cluster, and this would be different from the problems faced with. On...