Keith Jeffery

Keith Jeffery
Keith G Jeffery Consultants

PhD

About

205
Publications
27,248
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,690
Citations
Introduction
Keith Jeffery currently works as Keith G Jeffery Consultants. Previously he was Director, IT at STFC Rutherford Appleton Laboratory, UK. Keith does research in Information Systems (Scientific and Business Informatics), Metadata, Data Structures and Databases, Cloud computing, Virtual Research Environments. He works currently on projects EPOS, ENVRI, and MELODIC. He reviews for various funding agencies and publishers. He holds 3 honorary/visiting professorships, is a Chartered Engineer, Chartered IT Professional and a Fellow of both the Geological Society of London and the British Computer Society. He is an honorary fellow of the Irish Computer Society.
Additional affiliations
April 2013 - present
Keith G Jeffery Consultants
Position
  • Managing Director
Description
  • Specialising in technical coordination of ICST projects, knowledge and technology transfer, research information systems
April 2006 - March 2013
Rutherford Appleton Laboratory
Position
  • Managing Director
July 1998 - March 2006
Rutherford Appleton Laboratory
Position
  • Managing Director
Education
June 1968 - June 1971
University of Exeter
Field of study
  • Geology (with computing)
October 1965 - June 1968
University of Exeter
Field of study
  • Geology

Publications

Publications (205)
Article
Full-text available
Research infrastructures play an increasingly essential role in scientific research. They provide rich data sources for scientists, such as services and software packages, via catalog and virtual research environments. However, such research infrastructures are typically domain-specific and often not connected. Accordingly, researchers and practiti...
Article
Full-text available
Research infrastructures play an increasingly essential role in scientific research. They provide rich data sources for scientists, such as services and software packages, via catalog and virtual research environments. However, such research infrastructures are typically domain-specific and often not connected. Accordingly, researchers and practiti...
Article
Full-text available
Much research is dependent on Information and Communication Technologies (ICT). Researchers in different research domains have set up their own ICT systems (data labs) to support their research, from data collection (observation, experiment, simulation) through analysis (analytics, visualisation) to publication. However, too frequently the Digital...
Article
Interpretation of the archaeological record is determined by the occurrence and distribution of finds and their (relative and absolute) distribution over space. Comparison and categorization of finds strongly depends on their characteristics and the significance assigned to them. By default this is a very erroneous process leading to debate about i...
Chapter
Full-text available
Environmental research infrastructures aim to provide scientists with facilities, resources and services to enable scientists to effectively perform advanced research. When addressing societal challenges such as climate change and pollution, scientists usually need data, models and methods from different domains to tackle the complexity of the comp...
Chapter
Full-text available
Virtual Research Environments (VREs) are playing an increasingly important role in data centric sciences. Also, the concept is known as Science Gateways in North America where generally the functionality is portal plus workflow deployment and Virtual Laboratories in Australia where the end-user can compose a complete system from the user interface...
Chapter
Full-text available
E-Infrastructures play an increasingly important part in the provision of digital services to environmental researchers and other users. The availability of reliable networks, storage facilities, high performance and high throughput computers and associated middleware and services to ease their utilisation all contribute to enabling research and it...
Chapter
Full-text available
Environmental research infrastructures (RIs) support their respective research communities by integrating large-scale sensor/observation networks with data curation and management services, analytical tools and common operational policies. These RIs are developed as service pillars for intra- and interdisciplinary research; however, comprehension o...
Chapter
Full-text available
Data is a valuable resource. In some scientific disciplines, experiments can be redone to reproduce the data. In environmental sciences, the observations and measurements of the earth and its surroundings commonly can be made only once: each time point records uniquely the state of the many earth processes. This demands that environmental data - st...
Chapter
Full-text available
After a brief reminder on general concepts used in data cataloguing activities, this chapter provides information concerning the architecture and design recommendations for the implementation of catalogue systems for the ENVRIplus community. The main objective of this catalogue is to offer a unified discovery service allowing cross-disciplinary sea...
Article
Full-text available
The FAIR principles have been widely cited, endorsed and adopted by a broad range of stakeholders since their publication in 2016. By intention, the 15 FAIR guiding principles do not dictate specific technological implementations, but provide guidance for improving Findability, Accessibility, Interoperability and Reusability of digital resources. T...
Article
Full-text available
Since 2009 initiatives that were selected for the roadmap of the European Strategy Forum on Research Infrastructures started working to build research infrastructures for a wide range of research disciplines. An important result of the strategic discussions was that distributed infrastructure scenarios were now seen as “complex research facilities”...
Article
Full-text available
Purpose The purpose of this paper is to boost multidisciplinary research by the building of an integrated catalogue or research assets metadata. Such an integrated catalogue should enable researchers to solve problems or analyse phenomena that require a view across several scientific domains. Design/methodology/approach There are two main approach...
Article
Science gateways, virtual laboratories and virtual research environments are all terms used to refer to community-developed digital environments that are designed to meet a set of needs for a research community. Specifically, they refer to integrated access to research community resources including software, data, collaboration tools, workflows, in...
Article
Virtual Research Environments (VREs), also known as science gateways or virtual laboratories, assist researchers in data science by integrating tools for data discovery, data retrieval, workflow management and researcher collaboration, often coupled with a specific computing infrastructure. Recently, the push for better open data science has led to...
Preprint
Full-text available
There is a growing acknowledgement in the scientific community of the importance of making experimental data machine findable, accessible, interoperable, and reusable (FAIR). Recognizing that high quality metadata are essential to make datasets FAIR, members of the GO FAIR Initiative and the Research Data Alliance (RDA) have initiated a series of w...
Article
Over the last years, computer applications have moved away from the single-core, local execution that they were originally intended for. Modern applications have to run distributed, in parallel, on heterogeneous environments, share distributed data etc. Not surprisingly the standard ways of programming do not hold anymore and modern software engine...
Conference Paper
Archaeological finds include remainders of collapsed buildings. Interpreting these finds requires a deeper understanding of the underlying physics. Though physics simulations can help with this process, they are by default irreversible and thus serve more demonstration than explanation purposes. In this article, we examine a method to statistically...
Conference Paper
Governments and publicly-funded research institutions increasingly open up data collected and created through research. One way to share and use data obtained through research is through Virtual Research Environments (VREs). Insights obtained through open data use in VREs can subsequently provide input for policy making. However, this process invol...
Article
Full-text available
Since around 1950, scientific challenges have been an important application domain for computing and thus a significant factor driving developments in hardware and software. Almost 20 years ago, this led to the idea of grid computing, a key progenitor of todays cloud computing. Grid initiatives primarily focused on linking up supercomputing resourc...
Article
The PASTEUR4OA project (Open Access Policy Alignment Strategies for European Union Research) aims to support the European Commission's Recommendation to Member States of July 2012 that they develop and implement policies to ensure Open Access (OA) to all outputs from publicly-funded research.
Article
Full-text available
EPOS is a Research Infrastructure plan that is undertaking the challenge of integrating data from different solid Earth disciplines and of providing a common knowledge-base for the Solid-Earth community in Europe, by implementing and managing a logically centralised catalog based on the CERIF model. The EPOS catalogue will contain the information a...
Conference Paper
The stratigraphic layout of an excavation determines how finds can be interpreted regarding their timeline and relationships with each other. Older excavation reports do not fully record this layout however and reconstructing the relationships is often subject to conjecture. In this paper we present a first approach for reasoning over and visualizi...
Conference Paper
Computer science frequently considers much of humanities and in particular archaeology “trivial”. The “hard challenges” were defined by sciences, such as physics. Yet these “soft” domains have been and are struggling with challenges that still exceed computational capabilities and that cannot be solved with current approaches. On the other hand, cl...
Article
The PaaSage Project (2012-2016) addressed the overall objective to develop a model-based cross-cloud development and deployment platform. The project slogan was "Define your application once, deploy across the full spectrum of CLOUDs";. PaaSage is an open source integrated platform to support both design and deployment of Cloud applications, basing...
Article
Full-text available
The HOLACloud initial 2016 roadmap has been generated by a process similar to that for 2015 using the input to the CLOUD Forward Conference 2016. The analysis of the position papers, and ideas from the scientific papers, provided the synopsis of future R&I (Research and Innovation) topics. These are: advanced systems development method(s) based on...
Article
Purpose – The purpose of this paper is to discuss added value in the context of current research information systems (CRISs) based on metadata enrichment. Design/methodology/approach – This discussion paper uses literature review as well as analysis of CRISs specifications to discuss added value possibilities. Findings – Added value of the CRISs...
Conference Paper
Full-text available
Access full-text at http://hdl.handle.net/11366/507 Proof of Concept at pasteur4oa.is4ri.com --- The PASTEUR4OA project (Open Access Policy Alignment Strategies for European Union Research) aims to support the European Commission’s Recommendation to Member States of July 2012 that they develop and implement policies to ensure Open Access (OA) to a...
Conference Paper
Governments and publicly-funded research organisations increasingly make research data available openly. Researchers can use this data in Virtual Research Environments (VREs) to conduct multidisciplinary data-driven research and to obtain new insights potentially for governmental policy-making. However, the requirements for such a VRE are not yet c...
Presentation
Full-text available
Early excavation reports of around 1950 already employed meticulous recording techniques, yet fail to respect a lot of aspects concerning information that would help in generating relational information about a site’s organization at different times, such as recurring occupation of sites during specific seasons. The interested archaeologists either...
Article
Full-text available
The cloud computing paradigm encompasses several key differentiating elements and technologies, tackling a number of inefficiencies, limitations and problems that have been identified in the distributed and virtualized computing domain. Nonetheless, and as it is the case for all emerging technologies, their adoption led to the presentation of new c...
Conference Paper
Environmental research infrastructures (RIs) support data-intensive research by integrating large-scale sensor/observer networks with dedicated data curation services and analytical tools. However the diversity of scientific disciplines coupled with the lack of an accepted methodology for constructing new RIs inevitably leads to incompatibilities b...
Conference Paper
Environmental research infrastructures (RIs) support their respective research communities by integrating large-scale sensor/observation networks with data curation services, analytical tools and common operational policies. These RIs are developed as pillars of intra-and interdisciplinary research, however comprehension of the complex, pathologica...
Conference Paper
EPOS is an e-Infrastructure for solid Earh science in Europe. It integrates many heterogeneous Research Infrastructures (RIs) using a novel approach based on the harmonization of existing service and component interfaces. EPOS is designed to provide an architectural framework for new Research Infrastructures in the domain, and to interface with inc...
Article
Full-text available
RDA is all about facilitating researchers to use data (including scholarly publications and grey literature used as data). This encompasses data collection, data validation, data management (including preservation/curation), data analysis, data simulation/modelling, data mining, data visualisation and interoperation of data. Metadata are the key to...
Article
The primary IT paradigm has changed from a compute centric to a data centric problem perspective. For decades our whole IT concepts were building around the foundation laid out by von Neumann. As memory wall, parallelism, dynamicity and multi-tenancy become the pre-dominating factors in modern application performance and capabilities, this foundati...
Article
Full-text available
OGD (Open Government Data) is provided from government departments for transparency and to stimulate a market in ICT services for industry and citizens. Research datasets from publicly funded research commonly are associated with the open scholarly publications movement. However, the former world commonly is derived from the latter with generalisat...
Article
Full-text available
The vast majority of research output is grey; white (peer reviewed scholarly publications) forms a minor proportion. Historically, grey material was generated and used within an organisation. However, in recent years some research- relevant grey material is being made openly available. Among grey outputs, research datasets are the largest proportio...
Article
Full-text available
One of the key aspects of the approaching data-intensive science era is integration of data through interoperability of systems providing data products or visualization and processing services. Far from being simple, interoperability requires robust and scalable e-infrastructures capable of supporting it. In this work we present the case of EPOS, a...
Conference Paper
One of the key aspects of the approaching data-intensive science era is integration of data through interoperability of systems providing data products or visualisation and processing services. Far from being simple, interoperability requires robust and scalable e-infrastructures capable of supporting it. In this work we present the case of EPOS, a...
Technical Report
Full-text available
This report summarises the findings of the EC Cloud Expert Group with respect to upcoming challenges in Software Engineering under the EC Framework Programme H2020
Article
Full-text available
The Research Data Alliance (RDA) Metadata Standards Directory Working Group (MSDWG) is building a directory of descriptive, discipline-specific metadata standards. The purpose of the directory is to promote the discovery, access and use of such standards, thereby improving the state of research data interoperability and reducing duplicative standar...
Article
Full-text available
In the context of the wide research environment, we introduce the CERIF Common European Research Information Format data model which a has a richer structure than the usual metadata standards used in research information; b separates base entities from link entities thus providing flexibility in expressing role-based temporal relationships; c defin...
Technical Report
Full-text available
http://ec.europa.eu/information_society/newsroom/cf/dae/document.cfm?action=display&doc_id=6775
Article
Though Cloud Computing has found considerable uptake and usage, the amount of expertise, methodologies and tools for efficient development of in particular distributed Cloud applications is still comparatively little. This is mostly due to the fact that all our methodologies and approaches focus on single users, even single processors, let alone ac...
Article
We define grey as information that is not peer reviewed scholarly publications. In the CRIS (Current Research Information System) domain Grey includes not only non-peer reviewed publications (typically technical papers) but also performance art, art artifacts, design documents, models (e.g. for engineering or architecture) and - increasingly import...
Conference Paper
Resource usage in Clouds can be improved by deploying applications with richer defined requirements. Such "richer requirements" involve wider application / user specific context capture expressed in interrelated models. The use of model based requirements is presented using input from test-beds monitoring resource use in terms of Trust, Risk, Eco-E...
Conference Paper
Full-text available
The concept of CLOUD Computing has been a hot talking point for some years. However, there are as yet relatively few business applications running using CLOUDs although the use of CLOUDs for emailing, sharing office documents and social media is well-established. Cutting through the hype it is possible to classify kinds of CLOUD offerings and exami...
Article
Purpose ‐ The need for a more structured methodology than currently exists for describing the impact of academic research is widely acknowledged. The most widely used research information standard, CERIF, does not currently allow the encoding of research impact in a structured way: this project devised and tested an extension to CERIF to address th...
Conference Paper
In this paper, we provide an overview over the PaaSage project's approach to helping the developer in exploiting cloud environments according to their specific needs and requirements. Classical software engineering methodologies no longer apply in multi-tenant, elastic environments, if the full capabilities for cost reduction and availability are t...
Article
The European Plate Observing System (EPOS) is integrating the diverse, but advanced Research Infrastructures in Europe for solid Earth Science, and will build on new e-science opportunities to monitor and understand the dynamic and complex solid-Earth System. This integration requires a significant coordination between, among others, disciplinary (...
Article
The European Plate Observing System (EPOS) is a European initiative which aims to promote and make possible innovative approaches for a better understanding of the physical processes laying behind natural events and geo-science phenomena (earthquakes, volcanic eruptions, unrest episodes and tsunamis etc.) by integrating existing national and trans-...
Article
The research lifecycle has multiple objectives materialised as outputs, outcomes and impacts. Typical outputs are research publications (including grey literature), patents and products such as research datasets and software, many kinds of art or prototype engineering artifacts. Outcomes include patent licence income, value of a company set up to e...
Article
Full-text available
Public and private organizations increasingly release their data to gain benefits such as transparency and economic growth. The use of these open data can be supported and stimulated by providing considerable metadata (data about the data), including discovery, contextual and detailed metadata. In this paper we argue that metadata are key enablers...
Conference Paper
Full-text available
The opening of data is considered to provide many benefits. However, opening up data by public bodies is a complex and ill-understood activity. Although many public bodies might be willing to open up their data, they lack any systematic guidance. In this paper, guidance is provided by investigating the publishing processes at the Dutch Research and...
Conference Paper
CLOUD Computing is surrounded by hype and commercial supplier claims. However, there is emerging experience of real benefits and alongside a set of challenges for CLOUD Computing to reach wide acceptance. Major problems concern performance, security, privacy and interoperation - all of which have strong quality aspects. CLOUD Computing faces not on...
Article
Full-text available
Research Information Systems (RIS) play a critical role in the sharing of scientific information and provide researchers, professionals and decision makers with the required data for their activities. Existing RIS standards have proposed data models to represent the main entities for storage and exchange. These account for the needs of multiple sta...
Article
Full-text available
CERIF (the Common European Research Information Format) is a formal conceptual model to support the management of Research Information, including the set up of and the interoperation between Research Information Systems. Research Information is information about research entities such as people, projects, organisations, publications, patents, produ...
Technical Report
Full-text available
http://cordis.europa.eu/fp7/ict/ssai/docs/future-cc-2may-finalreport-experts.pdf
Technical Report
Full-text available
http://cordis.europa.eu/fp7/ict/ssai/docs/cloud-expert-group/roadmap-dec2012-vfinal.pdf
Conference Paper
It is known, that research is a key driver of innovation. Not only are governments interested in ongoing activities and outcomes, but also managers, researchers, the media, and increasingly society as such. Research Information is naturally stored in information systems by means of an underlying conceptual model to describe the system entities by t...
Article
GL (Grey literature, interpreted here as grey objects) is very heterogeneous in content, form and quality. Most GL objects evolve through a workflow. Some of these phases involve some form of evaluation or peer review, commonly internal within the management structure of an organisation and possibly involving external advice, including from 'friend...