
Stasa MilojevicIndiana University Bloomington | IUB
Stasa Milojevic
About
74
Publications
75,397
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,639
Citations
Citations since 2017
Introduction
Skills and Expertise
Publications
Publications (74)
What science does, what science could do, and how to make science work? If we want to know the answers to these questions, we need to be able to uncover the mechanisms of science, going beyond metrics that are easily collectible and quantifiable. In this perspective piece, we link metrics to mechanisms by demonstrating how emerging metrics of scien...
What is scientific knowledge, and how is it created, accumulated, transformed, and used? If we want to know the answers to these questions, we need to be able to uncover the structures and mechanisms of science, in addition to the metrics that are easily collectable and quantifiable. In this review article, we link metrics to mechanisms, by demonst...
Significance
Prior work establishes wide-ranging gender inequities in science. Disparities at the level of earnings, support, and promotion indicate that women’s research is not recognized equally to men’s. Since an imbalance in visibility might have consequential downstream effects on citations and awards, the study of online success is critical t...
The number, size and complexity of ‘big science’ projects are growing — as are the size, complexity and value of the data sets and software services they produce. In this context, big data gives a new way to analyse, understand, manage and communicate the inner workings of collaborations that often involve thousands of experts, thousands of scholar...
BACKGROUND . The increasing availability of digital data on scholarly inputs and outputs – from research funding, productivity, and collaboration to paper citations and scientist mobility – offers unprecedented opportunities to explore the structure and evolution of science. The science of science (SciSci) offers a quantitative understanding of the...
Nature, Science, and PNAS are the three most prestigious general-science journals, and Nature and Science are among the most influential journals overall, based on the journal Impact Factor (IF). In this paper we perform automatic classification of ~50,000 articles in these journals (published in the period 2005-2015) into 14 broad areas, to explor...
We propose a new citation model which builds on the existing models that explicitly or implicitly include "direct" and "indirect" (learning about a cited paper's existence from references in another paper) citation mechanisms. Our model departs from the usual, unrealistic assumption of uniform probability of direct citation, in which initial differ...
The citations process for scientific papers has been studied extensively. But while the citations accrued by authors are the sum of the citations of their papers, translating the dynamics of citation accumulation from the paper to the author level is not trivial. Here we conduct a systematic study of the evolution of author citations, and in partic...
Nature, Science, and PNAS are the three most prestigious general-science journals, and Nature and Science are among the most influential journals overall, based on the journal Impact Factor (IF). In this paper we perform automatic classification of ~ 50,000 articles in these journals (published in the period 2005–2015) into 14 broad areas, to explo...
With funding from the National Science Foundation, the Center for Open Science (COS) and Indiana University will create a dynamic, distributed, and heterogeneous data source for the advancement of science of science research. This will be achieved by using, enhancing, and combining the capabilities of the Open Science Framework (OSF) and the Collab...
Classification of bibliographic items into subjects and disciplines in large databases is essential for many quantitative science studies. The Web of Science classification of journals into ~250 subject categories, which has served as a basis for many studies, is known to have some fundamental problems and several practical limitations that may aff...
Classification of bibliographic items into subjects and disciplines in large databases is essential for many quantitative science studies. The Web of Science classification of journals into approximately 250 subject categories, which has served as a basis for many studies, is known to have some fundamental problems and several practical limitations...
The citations process for scientific papers has been studied extensively. But while the citations accrued by authors are the sum of the citations of their papers, translating the dynamics of citation accumulation from the paper to the author level is not trivial. Here we conduct a systematic study of the evolution of author citations, and in partic...
Contemporary science has been characterized by an exponential growth in publications and a rise of team science. At the same time, there has been an increase in the number of awarded PhD degrees, which has not been accompanied by a similar expansion in the number of academic positions. In such a competitive environment, an important measure of acad...
The whys and wherefores of SciSci
The science of science (SciSci) is based on a transdisciplinary approach that uses large data sets to study the mechanisms underlying the doing of science—from the choice of a research problem to career trajectories and progress within a field. In a Review, Fortunato et al. explain that the underlying rationale is...
This panel will present views and critical reflections on peer review, bibliometrics and altmetrics against the background of the latest developments and critique of the scientific system (e.g., replication crisis, DORA, open science, data manipulation). Peer review is the oldest form of monitoring the scientific process and research outcomes and i...
In this paper, we examine the characteristics of titles (average length, proportion of titles with subtitles, proportion of interrogatory, and indicative titles) and how they changed over a substantial period of time (half a century). We consistently analyze core literature in five diverse fields, in order to probe the usage of titles as a discipli...
In this paper we present “citation success index”, a metric for comparing the citation capacity of pairs of journals. Citation success index is the probability that a random paper in one journal has more citations than a random paper in another journal (50% means the two journals do equally well). Unlike the journal impact factor (IF), the citation...
Many internal and environmental triggers of primary headaches have been proposed, but establishing firm evidence for any of them has proved elusive. Geomagnetic storms, the disturbances of Earth’s magnetic field following Solar eruptions, have been proposed as one such trigger. In this study, we utilized a vast amount of self-reported symptoms from...
Aging is considered to be an important factor in a scholar’s propensity to innovate, produce, and collaborate on high quality work. Yet, empirical studies in the area are rare and plagued with several limitations. As a result, we lack clear evidence on the relationship between aging and scholarly communication activities and impact. To this end, we...
Choosing the best scientific venue for the submission of a manuscript, with the aim of maximizing the impact of the future publication, is a frequent task faced by scholars. In this paper, we show that the Impact Factor (IF) of a journal allows to rationally achieve this goal. We take advantage of a comprehensive bibliographic and citation dataset...
The volume of the existing research literature is such it can make it difficult to find highly relevant information and to develop an understanding of how a scientific topic has evolved. Prior research on topic evolution has often leveraged refinements to Latent Dirichlet Allocation (LDA) to identify emerging topics. However, such methods do not an...
We argue that citation is a composed indicator: short-term citations can be considered as currency at the research front, whereas long-term citations can contribute to the codification of knowledge claims into concept symbols. Knowledge claims at the research front are more likely to be transitory and are therefore problematic as indicators of qual...
While the modern science is characterized by an exponential growth in
scientific literature, the increase in publication volume clearly does not
reflect the expansion of the cognitive boundaries of science. Nevertheless,
most of the metrics for assessing the vitality of science or for making funding
and policy decisions are based on productivity. S...
We provide an overview of the common origins of qualitative and quantitative forms of STS, offering a discursive account of this history. We then demonstrate how scientometric techniques can be used to address substantive research questions, and we provide examples relevant both to the origins of STS and its state of the art. Our purpose is not to...
This paper applied LDA and regression analysis to conduct a lead-lag analysis to identify
different topic evolution patterns between preprints and papers from arXiv and Web of Science
(WoS) in astrophysics over the last twenty years (1992-2011). Fifty topics in arXiv and WoS
were generated using an LDA algorithm and then regression models were u...
Somewhere around 1983, maybe late 1982, there was talk beginning about doing something more formal within IEEE that dealt with robotics and automation. Informally, activity was getting started through the Control Society,...also Systems, Man and Cybernetics, which obviously makes a lot of sense with the telerobotics things and a few others. But we...
We analyze the citation impacts (,cited') and citation practices (,citing') of sociological journals which publish mainly in German, and discuss major drawbacks of using the journal impact factor (IF) to assess the quality of these journals. First, sociological literature moves very slowly in terms of citations, whereas journal impact factors (IF)...
The panel explores the theoretical, practical, and policy aspects of self-presentation in academia given the rapidly changing world of knowledge creation, dissemination and consumption. It offers insights into both the potential and challenges of social media in academia and highlights future directions regarding scholarly communication. The goal o...
Described by his peers as the father of Australian robotics, Raymond (Ray) Jarvis is internationally renowned for his scientific contributions to computer vision, intelligent robotics, and path planning. His work has found broad application in the sciences as well as in industry and society. Along with pursuing his academic career, Jarvis spent a s...
Networks have for a long time been used both as a metaphor and as a method for studying science. With the advent of very large data sets and the increase in the computational power, network analysis became more prevalent in the studies of science in general and the studies of science indicators in particular. For the purposes of this chapter scienc...
The movement to increase gender diversity in computing and
computer science is a well-funded and well researched mission.
However, despite this, female representation within the field tends
to lag behind that of other disciplines within the academy, both
among students and faculty. This study sets out to correlate
bibliometric and network meas...
Genre is considered to be an important element in scholarly communication and
in the practice of scientific disciplines. However, scientometric studies have
typically focused on a single genre, the journal article. The goal of this
study is to understand the role that handbooks play in knowledge creation and
diffusion and their relationship with th...
Since its creation in 1991, arXiv has become central to the diffusion of
research in a number of fields. Combining data from the entirety of arXiv and
the Web of Science (WoS), this paper investigates (a) the proportion of papers
across all disciplines that are on arXiv and the proportion of arXiv papers
that are in the WoS, (b) elapsed time betwee...
We have developed a (freeware) routine for “Referenced Publication Years Spectroscopy” (RPYS) and apply this method to the historiography of “iMetrics,” that is, the junction of the journals Scientometrics, Informetrics, and the relevant subset of JASIST (approx. 20%) that shapes the intellectual space for the development of information metrics (bi...
The panel explores the theoretical, methodological, and design aspects of the temporal dimension in the study of knowledge creation and representation and offers insights in both the challenges and the promising future directions. The goal of the panel is to start a collaborative discussion around a number of important themes.
We have developed a (freeware) routine for "referenced publication years
spectroscopy" (RPYS) and apply this method to the historiography of "iMetrics,"
that is, the junction of the journals Scientometrics, Informetrics, and the
relevant subset of JASIST (approx. 20%) that shapes the intellectual space for
the development of information metrics (bi...
There are a number of solutions that perform unsupervised name disambiguation
based on the similarity of bibliographic records or common co-authorship
patterns. Whether the use of these advanced methods, which are often difficult
to implement, is warranted depends on whether the accuracy of the most basic
disambiguation methods, which only use the...
This paper proposes a new framework for Citation Content Analysis (CCA), for
syntactic and semantic analysis of citation content that can be used to better
analyze the rich sociocultural context of research behavior. The framework
could be considered the next generation of citation analysis. This paper
briefly reviews the history and features of co...
Purpose
The purpose of this paper is to introduce the conceptual foundations and motivation for creating a digital archive to display the developments in the field of robotics over the past 50 years.
Design/methodology/approach
The paper proposes that the archive should be based on the conceptualization of science as an “ecology of knowledge”, com...
Miomir Vukobratovic is renowned in the robotics community for his pioneering contributions to both theoretical and applied robotics, particularly in the areas of biped locomotion, anthropomorphic mechanisms, and dynamic robot control. He spent most of his career as the director of the Robotics Laboratory at the Mihajlo Pupin Institute (MPI) in Belg...
The birth and decline of disciplines are critical to science and society. How do scientific disciplines emerge? No quantitative model to date allows us to validate competing theories on the different roles of endogenous processes, such as social collaborations, and exogenous events, such as scientific discoveries. Here we propose an agent-based mod...
Publishing or perishing reflects the fierce competition in scholarly communication. Claiming authorship of innovation is essential in many fields, especially in physics or astrophysics. Over the last decade, Web 2.0 technologies have dramatically changed the style of our scholarship and scholarly communication. People can claim and share their init...
Since its creation in 1991, arXiv has become central to the diffusion of research in a number of fields. Combining data from the entire arXiv and the Web of Science (WoS), this paper investigates (a) the proportion of papers across all disciplines that are on arXiv, (b) Elapsed time between arXiv submission and journal publication, and (c) the agin...
The aging of scholars is considered an important factor in creativity, productivity, and collaborative behaviour. However, the literature lacks in both conceptualizations and operationalizations of aging, and empirical studies show wide variation across disciplines. This study focused on two approaches on aging (by career age and cohort) and examin...
John McCarthy is best known as one of the founding fathers of artificial intelligence (AI), a term he coined in 1955, and much has been written about his pioneering work in computer and cognitive science (Figure 1). Less attention has been given to McCarthy's efforts related to robotics, though his AI research instigated and influenced development...
References are an essential component of research articles and therefore of scientific communication. In this study we investigate referencing (citing) behavior in five diverse fields (astronomy, mathematics, robotics, ecology and economics) based on 213,756 core journal articles. At the macro level we find: (a) a steady increase in the number of r...
"Bibliometrics", "scientometrics", "informetrics", and "webometrics" can all
be considered as manifestations of a single research area with similar
objectives and methods, which we call "information metrics" or iMetrics. This
study explores the cognitive and social distinctness of iMetrics with respect
to the general information science (IS), focus...
George Charles Devol, Jr., opened his first manufacturing company when he was just 19 years old, in the depths of the Great Depression. It was the first step in a lifelong commitment to transforming factory efficiency and safety. With his revolutionary concept of universal automationUnimation he cofounded the first and largest robotics company in t...
The paper provides an overview of the field of scientometrics, that is: the
study of science, technology, and innovation from a quantitative perspective.
We cover major historical milestones in the development of this specialism from
the 1960s to today and discuss its relationship with the sociology of
scientific knowledge, the library and informat...
Editor's Summary Bibliometrics takes center stage for this Bulletin, with a review tracing its historical foundation in the mid-19th century through forecasts of its expanding uses in future research. The scope of bibliometrics has grown from generalized statistical bibliography, the quantitative study of patterns and references in written communic...
This article examines the cognitive evolution and disciplinary diversity of
nanotechnology as expressed through the terminology used in titles of nano
journal articles. The analysis is based on the NanoBank bibliographic database
of 287,106 nano articles published between 1981 and 2004. We perform
multifaceted analyses of title words, focusing on 1...
Research topics and research communities are not disconnected from each other: communities and topics are interwoven and co-evolving. Yet, scientometric evaluations of topics and communities have been conducted independently and synchronically, with researchers often relying on homogeneous unit of analysis, such as authors, journals, institutions,...
Models of science aim to capture the structure and/or dynamics of science itself. Data on the science system – scholars, papers, patents, grants, jobs, etc., and their complex interdependencies and dynamics – are used to validate them. This chapter provides a general introduction to the modeling of science together with a discussion of different mo...
The panel explores the theoretical, metric, algorithmic, and design aspects of state-of-the-art information visualization, and offers insights regarding the challenges and future directions in this field. The goal of the panel is to start a collaborative discussion around a number of important themes.
We propose using community-centered analyses and agent-based models of scientific gatherings such as conferences, symposia and workshops as a way to understand how scientific practices evolve and transition between local, community, and systems levels in science. We suggest using robotics as a case study of global, cross-cultural, interdisciplinary...
This study comprises a suite of analyses of words in article titles in order to reveal the cognitive structure of Library and Information Science (LIS). The use of title words to elucidate the cognitive structure of LIS has been relatively neglected. The present study addresses this gap by performing (a) co-word analysis and hierarchical clustering...
Sponsored by: ASIS&T SIG/MET
This panel aims to inform participants of, and to stimulate interest in, the diverse ways in which the measurement of information (informetrics) is used in real-world applications. Its timeliness is indicated by the recent increase in interest amongst ASIS&T members in informetrics that culminated in the endorsement of...
This chapter discusses the relation between Social Networks and Semantics – two areas that have recently gained a lot of attention from both academia and industry. The authors show how synergies between these two areas can be used to solve concrete problems, and they describe three approaches that demonstrate the potential for interconnecting these...
Data mediation and interoperation have already become one of the central topics of IT for decades. Since the Web appears, this problem has been exploded due to the increasing amount of data and Web users. This paper takes look at the social tagging behavior. It proposes upper tag ontology (UTO) to model tag data from three popular social tagging ne...
We suggest partial logarithmic binning as the method of choice for uncovering the nature of many distributions encountered in information science (IS). Logarithmic binning retrieves information and trends “not visible” in noisy power law tails. We also argue that obtaining the exponent from logarithmically binned data using a simple least square me...
In this paper, we investigate the difference between metadata generated by users and authors. Delicious tags and HTML keyword META tags associated with the same set of web pages on topics related to semantic web are collected, forming two datasets (i.e., Delicious dataset and HTML dataset). Comparisons of the two datasets in micro and macro vocabul...
This article investigates the dynamic features of social tagging vocabularies in Delicious, Flickr and YouTube from 2003 to 2008. It analyzes the evolution of the usage of the most popular tags in each of these three social networks. We find that for different tagging systems, the dynamic features reflect different cognitive processes. At the macro...
One of the main shortcomings of Semantic Web technologies is that there are few user-friendly ways for displaying, browsing
and querying semantic data. In fact, the lack of effective interfaces for end users significantly hinders further adoption
of the Semantic Web. In this paper, we propose the Semantic Web Portal (SWP) as a light-weight platform...
The goal of the study was to determine the underlying processes leading to the observed collaborator distribution in modern scientific fields, with special attention to nonpower-law behavior. Nanoscience is used as a case study of a modern interdisciplinary field and its coauthorship network for 2000–2004 period is constructed from the NanoBank dat...
Data integration and mediation have become central concerns of information technology over the past few decades. With the advent of the Web and the rapid increases in the amount of data and the number of Web documents and users, researchers have focused on enhancing the interoperability of data through the development of metadata schemes. Other res...
We suggest partial logarithmic binning as the method of choice for uncovering
the nature of many distributions encountered in information science (IS).
Logarithmic binning retrieves information and trends "not visible" in noisy
power-law tails. We also argue that obtaining the exponent from logarithmically
binned data using a simple least square me...
New disciplines require the presence of a social infrastructure for the research to become institutionalized. I study the change of structural arrangements to determine the social development of nanotechnology as an emerging discipline. In order to understand structural properties of social infrastructure, I have conducted the empirical study of th...
This study examines the development of nanoscience/nanotechnology over a
35 year period (1970-2004) by mapping its social and cognitive
structures using social network analysis, bibliometrics and document
analysis, and following their changes in time. Mapping is performed
based on 580,000 journal articles, 240,000 patents and 53,000 research
grants...
The seemingly simple task of reusing data for science education relies on the presence of scientific data, scientists willing to share, infrastructure to provide access, and mechanisms to share between the two disparate communities of scientists and science students. What makes sharing between scientists and science students a special case of data...