Craig A. LeeFederation Partners
Craig A. Lee
Doctor of Philosophy
Federation
About
84
Publications
19,401
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,830
Citations
Publications
Publications (84)
The Internet of Things needs for computing power and storage are expected to remain on the rise in the next decade. Consequently, the amount of data generated by devices at the edge of the network will also grow. While cloud computing has been an established and effective way of acquiring computation and storage as a service to many applications, i...
The Internet of Things needs for computing power and storage are expected to remain on the rise in the next decade. Consequently, the amount of data generated by devices at the edge of the network will also grow. While cloud computing has been an established and effective way of acquiring computation and storage as a service to many applications, i...
There are many different options available to implement information technology (IT) today: enterprise data center legacy approaches or private cloud, public cloud, hybrid cloud, fog, device, mobile edge, and multicloud. Although definitions vary, for a variety of reasons, many companies are using multicloud approaches in a variety of patterns, at v...
With the maturation of the Cloud Computing, the eyes of the scientific community and specialized commercial institutions have turned to research related to the use of multiple clouds. The main reason for this interest is the limitations that many cloud providers individually face to meet all the inherent characteristics of this paradigm. Therefore,...
We present a design for a general federation agent based on the OpenStack Keystone v3 service. We examine and choose an initial approach based on peer-to-peer methods that support the use of arbitrary application-level services in Virtual Organizations. We review general federation management requirements, deployment models, and current work, inclu...
In cloud computing, 'hybrid cloud' and 'community cloud' are established jargon. These concepts, however, actually entail federation, that is, the ability to securely share compute resources. Cloud federation has been investigated at all levels of the system stack - infrastructure as a service (IaaS), platform as a service (PaaS), and software as a...
As distributed, on-line communities are increasingly supported by the global, interconnected computing infrastructure, methods must be developed to securely manage their interactions. The virtual organization (VO) concept provides a security and discovery context whereby collaboration across multiple administrative domains can be enabled while enfo...
As computing devices become increasingly connected, and at a global scale, methods must be developed to flexibly, yet securely, manage their interactions. In the most general sense, this requires federation management. The virtual organization (VO) concept provides a federated, security and discovery context whereby collaboration across multiple ad...
This paper describes approaches for supporting virtual organizations (VOs) in OpenStack. A VO provides a security and discovery context whereby collaboration across multiple sites can be enabled while enforcing joint security policies. VOs were developed in the grid computing arena to manage international scientific collaborations. However, the VO...
This paper presents ongoing research to define the basic models and architecture patterns for federated access control in heterogeneous (multi-provider) multi-cloud and inter-cloud environment. The proposed research contributes to the further definition of Intercloud Federation Framework (ICFF) which is a part of the general Intercloud Architecture...
This paper develops a research and development plan for dynamic service level agreements (SLAs) in Open Stack to support cloud applications that may have changing resources requirements. The fundamental SLA functions are reviewed: Admission Control, Monitoring, SLA Evaluation, and SLA Enforcement - a classic autonomic control cycle. We then identif...
OpenStack is an open source cloud computing project that is enjoying wide. While many cloud deployments may be stand-alone, it is clear that secure federated community clouds, i.e., inter-clouds, are needed. Hence, there must be methods for federated identity management (FIM) that enable authentication and authorisation to be flexibly enforced acro...
This paper investigates the use of proxies to incorporate on-orbit sensors into netcentric environments. Proxies can provide a natural system interface that observes all of the tenets of netcentricity. Proxies can provide support for security, policy enforcement, reliability, mediation, power, performance, and operational management. Proxies can al...
Remote sensing data have become very widespread in recent years, and the exploitation of this technology has gone from developments mainly conducted by government intelligence agencies to those carried out by general users and companies. There is a great deal more to remote sensing data than meets the eye, and extracting that information turns out...
Cloud computing has the potential for tremendous benefits, but wide scale adoption has a range of challenges that must be met. We review these challenges and how they relate to scientific computing. To achieve the portability, interoperability, and economies of scale that clouds offer, it is clear that common design principles must be widely adopte...
Linked to the constantly changing computer infrastructures and the analysis of spatial datasets, there currently exist networks which are capable of processing all types of information online across administrative domains and regulations. The target of such networks (also known as Grids or e-Infrastructures) is to combine complex dataseis, which ma...
The Open Geospatial Consortium and the Open Grid Forum are collaborating to develop open standards that address the distributed computing needs of geospatial applications while accommodating the inevitability of diverse formats, schemas, and processing algorithms. These standards will provide the necessary infrastructure for developing tools, softw...
The advent of affordable high performance computing (HPC) resources, such as computing clusters and grids, has transformed the computational capabilities of many organizations. However, the lack of convenient interfaces to such valuable resources limits their effectiveness, restricting their user base to a small set of highly skilled programmers an...
A vibroacoustic analysis toolkit for launch vehicle telemetry was refactored as a service architecture using Globus Toolkit 4.0. We grid-enabled two tools in the analysis flow, managing their remote execution workflow from a desktop client. We examine the performance of the tools and the processing time for the entire tool chain. The results indica...
This paper explores the use of optimistic computation to improve application performance in wide-area distributed environments. We do so by dening a parametric model of optimistic computation and then running sets of param- eterized experiments to show where, and to what degree, optimistic computation can produce speed-ups. The model is instantiate...
This workshop at ICCBSS07 addresses issues relating to developing interoperable ground systems starting with components that may not originally have been intended to work together. The workshop explores these issues in more depth, shares lessons learned, identifies practical approaches for acquisition and implementation, and highlights research dir...
Since late 2000 we have developed and maintained a general purpose technical and scientific computing cluster running the FreeBSD operating system. In that time we have grown from a cluster of 8 dual In- tel Pentium III systems to our current mix of 64 dual Intel Xeon and 289 dual AMD Opteron systems. This paper looks back on the system architectur...
10.1 Introduction In the last 40 years, remote sensing of the Earth has seen a continuous growth in the capabilities of the instrumentation (satellites, airborne, and ground-based sensors that monitor and measure the environment) that provides the fundamental data sets and an increase in the complexity of the data analyses and modeling that these d...
Over the past five years we have built and operated a computing cluster for company-wide unclassified technical and scientific computing. In this paper we discuss our cluster architecture, some of the architectural decisions we faced in the process, a sampling of applications run by our users, and lessons learned from building and operating this cl...
System management aspects are described for the process of grid-enabling a vibroacoustic analysis application using the Globus Toolkit 3.2.1. This is the first step in a project intended to grid-enable a suite of tools being developed as a service-oriented enterprise architecture for spacecraft telemetry analysis. Many of the applications in the su...
Grid technology has matured considerably over the past few years. Progress in both implementation and standardization is reaching a level of robustness that enables production quality deployments of grid services in the academic research community with heightened interest and early adoption in the industrial community. Despite this progress, grid a...
This paper describes the process of grid-enabling a vibroacoustic analysis application using the Globus Toolkit 3.2.1. This is the first step in a project intended to grid-enable a suite of tools being developed as a service-oriented architecture for spacecraft telemetry analysis. Many of the applications in the suite are compute intensive and woul...
Grid computing represents the common vision of truly general distributed computing across a ubiquitous, open-ended infrastructure supporting a wide range of different application areas. Realizing this vision will require a long-term collaboration of fundamental and applied computer science, industry, commercial infrastructure providers, and many, m...
This chapter examines the integration of content-based event notication systems with worko w management. This is motivated by the need for dynamic, data- driven application systems which can dynamically discover, ingest data from, and interact with other application systems, including physical systems with online sensors and actuators. This require...
After years of development by the high performance computing (HPC) community, grid computing has hit the mainstream as one of the hottest buzzwords in computing technology today. This paper exam-ines the issues involved in integrating FreeBSD with the Globus Toolkit, the de facto standard for grid computing. Particular attention is paid to interac-...
This paper describes the concept of optimistic grid computing. This allows applications to synchronize more loosely and better tolerate the dynamic and heterogeneous bandwidths and latencies that are seen in grid environ-ments. Based on the observed performance of a world-wide grid testbed, we estimate target operating regions for grid applications...
This position paper examines the use of topology-aware communication services to support message-passing in wide-area, distributed
environments, i.e., grids. Grid computing promises great benefits in the flexible sharing of resources but poses equally great
challenges for high-performance computing, that is to say, how to execute large-scale comput...
Summary and Review: 1) Component "Web Service" Architectures with well-known namespace conventions; 2) Topology-Aware Communication Services will become essential; 3) Peer-to-Peer Systems will manage much of this; 4) Program meta-models with grid-aware "back-ends:" 5) Mobile, Ad Hoc, Embedded grids are coming.
In this paper we discuss the design and implementation of Fellowship, a 300+ CPU, general use computing cluster based on FreeBSD. We address the design features including configuration management, network booting of nodes, and scheduling which make this cluster unique and how FreeBSD helped (and hindered) our e#orts to make this design a reality.
This paper investigates the use of advanced communication services in grid environments. Such services can include augmented communication semantics (e.g., filtering), collective operations, content-based and policy-based routing, and managing communication scope to manage feasibility. These services could be implemented and deployed in a variety o...
IntroductionGrid Programming IssuesA Brief Survey of Grid Programming ToolsAdvanced Programming SupportConclusion
References
Contents include the following: Background and motivation. Grid computing concepts. Advanced data grid (ADG) prototype development. ADG requirements and operations concept. ADG architecture. ADG implementation. ADG test plan. ADG schedule. Summary and status.
This paper describes our progress in creating a case study on optimistic computing for the Grid using parallel mesh generation. For the implementation of both methods we will be using a Portable Runtime Environment for Mobile Applications (PREMA) which is extended to provide sup- port for optimistic control using grid performance moni- toring and p...
This paper discusses preliminary work on standardizing and implementing a remote procedure call (RPC) mechanism for grid computing.
The GridRPC API is designed to address the lack of a standardized, portable, and simple programming interface. Our initial
work on GridRPC shows that client access to existing grid computing systems such as NetSolve an...
This paper discusses preliminary work on standardizing and implementing a remote procedure call (RPC) mechanism for grid computing. The GridRPC API is designed to address one of the factors that has hindered widespread acceptance of grid computing -- the lack of a standardized, portable, and simple programming interface. In this paper, we examine t...
This paper investigates time management for distributed simulations in active networks. Time management is essentially the computation of the Lower Bound Time Stamp (LBTS) across federates in a distributed simulation, including in-transit (in-flight) messages. We show that the LBTS computation is an instance of the Distributed Termination Detection...
The advent of a widely interconnected computing infrastructure has raised the question of the kind of advanced communications that can be implemented "in the network". The active networks concept explores advanced capabilities such as content-based routing and multi-mode multicast. A larger issues exists, however, as to how widely distributed appli...
Blue Gene/L (BG/L) is a 65, 536-node massively parallel computer being developed at the IBM Thomas J. Watson Research Center that promises to revolutionize large-scale scientific computing. However, its size alone will make programming BG/L a major challenge, ...
Investigates time management for distributed simulations in active networks. Time management is essentially the computation of the lower-bound time stamp (LBTS) across federates in a distributed simulation, including in-transit (in-flight) messages. We show that LBTS computation is an instance of the distributed termination detection (DTD) problem...
For a grid network performance data set, we estimate propagation distances to get a lower bound on propagation delays. With a model for primary performance factors and assumptions about expected performance trends, we extrapolate to estimate the communication performance of a global grid in ten years time. Communication pipes are getting fatter, bu...
This paper examines the issues surrounding efficient execution in heterogeneous grid environments. The performance of a Linux cluster and a parallel supercomputer is initially compared using both benchmarks and an application. With an understanding of how benchmark and application performance is affected by processor and interconnect speed, a compa...
Examines the issues surrounding efficient execution in heterogeneous grid environments. The performances of a Linux cluster and a parallel supercomputer are initially compared using both benchmarks and an application. With an understanding of how benchmark and application performance is affected by processor and interconnect speed, a comparison is...
This paper examines the integration of grid computing and active networks. Resource management is a fundamental aspect of grid computing and should allow for the integration of new capabilities. Active networks will enable a range of higher-level abstractions that will be captured in middleware. This paper systematically examines the components of...
In grid computing environments, network bandwidth discovery and allocation is a serious issue. Before their applications are running, grid users will need to choose hosts based on available bandwidth. Running applications may need to adapt to a changing set of hosts. Hence, a tool is needed for monitoring network performance that is integral to the...
The potential for faults in distributed computing systems is a significant complicating factor for application developers.
While a variety of techniques exist for detecting and correcting faults, the implementation of these techniques in a particular
context can be difficult. Hence, we propose a fault detection service designed to be incorporated,...
The realization of end-to-end quality of service (QoS) guarantees
in emerging network-based applications requires mechanisms that support
first dynamic discovery and then advance or immediate reservation of
resources that will often be heterogeneous in type and implementation
and independently controlled and administered. We propose the Globus
Arch...
The potential for faults in distributed computing systems is a significant complicating factor for application developers. While a variety of techniques exist for detecting and correcting faults, the implementation of these techniques in a particular context can be difficult. Hence, we propose a fault detection service designed to be incorporated,...
Introduction General computing over a widely distributed set of heterogeneous machines typically called metacomputitg offers definite advantages. In addition to allowing a single application to bring together different types of resources, such as specialized data sources, data bases, and visualization systems, it allows an application to acquire an...
General computing over a widely distributed set of heterogeneous
machines-typically called metacomputing-offers definite advantages. The
notion of quality of service (QoS) for metacomputing is very important.
This paper presents Qualis, the QoS component for the Globus
metacomputing system. We present the Qualis architecture, how it is
integrated i...
Metacomputing combines heterogeneous system elements in a seamless computing service. In this case study, we introduce the elements of metacomputing and describe an application for cloud detection and visualization of infrared and visible-light satellite images. The application processes the satellite images by using Compositional C++ (CC++)-a simp...
Global Virtual Time (GVT) is the fundamental synchronization concept in optimistic simulations. It is defined as the earliest time tag within the set of unprocessed pending events in distributed simulation. A number of techniques for determining GVT have been proposed in recent years, each having their own intrinsic properties. However, most of the...
Global Virtual Time (GVT) is the fundamental synchronization concept in optimistic simulations. It is defined as the earliest time tag within the set of unprocessed pending events in distributed simulation. A number of techniques for determining GVT have been proposed in recent years, each having their own intrinsic properties. However, most of the...
The author introduces the concept of semi-private memory as an
architectural means of load balancing. In distributed, message-passing
systems, each processing element is typically associated with a private
input queue and a private local memory. The semi-private memory concept,
however, allows each processing element to service a small, unique
subs...
The paper identifies two barrier synchronization techniques that
are appropriate for nonuniform memory architectures. The first of these
two, the dissemination barrier, is based on a progressive skip-ring of
flags. The second, called the reflected-tree barrier, is based on
software combining trees. The dissemination barrier is optimal in terms
of t...
The binary predicate execution model (BPEM) is a computational model that combines logic programming, semantic nets, and message-driven computation into a paradigm for the construction of highly parallel knowledge-base systems. Simulation results are presented that demonstrate the ability of BPM to exploit effectively the resources of a loosely cou...
Mapping a problem graph of communicating tasks onto a network
graph of processing elements so that the communication distance is
minimized and the problem graph is evenly distributed over the network,
is an NP-complete problem. The authors demonstrate that an approximation
technique called simulated annealing can be applied to find acceptable
solut...
There is a direct correspondence between semantic networks and a subset of logic programs, restricted only to binary predicates. The advantage of the latter is that it can describe not only the nodes and arcs comprising a semantic net, but also the data-retrieval operations applied to such nets. The main objective of this paper is to present a data...
Guinea pigs were exposed to a 4.5 kHz pure-tone at 104 dB for 10 min during artificial ventilation with either carbogen gas (95% O2/5% CO2) or normal air. Mean N1 response amplitudes to tone bursts at 32 test frequencies extending from 2.1 kHz through 30 kHz were measured at standardized intervals before and after the acoustic overstimulation. All...
A grid computing environment is inherently parallel, dis-tributed, heterogeneous and dynamic, both in terms of the resources involved and their performance. That is to say, grids will present hierarchies of machines, data and net-works that have associated hierarchies of cycles, band-widths and latencies that are beyond the scope of most es-tablish...
The realization of end-to-end quality of service (QoS) guarantees in emerging network- based applications requires mechanisms that support rst dynamic discovery and then ad- vance or immediate reservation of resources that will often be heterogeneous in type and implementation and independently controlled and administered. W ep r o p o s e t h eG l...
A grid computingenvironment is inherently parallel, dis- tributed, heterogeneous and dynamic, both in terms of the resources involved and their performance. That is to say, grids will present hierarchies of machines, data and net- works that have associated hierarchies of cycles, band- widths and latencies that are beyond the scope of most es- tabl...
Abstract GridRPC is an easy-to-use API for Grids based on the established remote procedure call model. In developing the API, we partitioned it into an End-User API and a Middleware API. The End-User API is smaller, simpler, and intended for end-user applications. The Middleware API augments the End-User API to allow more powerful GridRPC-based mid...
Global Virtual Time (GVT) is the fundamental synchronization concept in optimistic simulations. It is defined as the earliest time tag within the set of unprocessed pending events in a distributed simulation. A number of techniques for determining GVT have been proposed in recent years, each having their own intrinsic properties. However, most of t...