Frank SchweitzerETH Zurich | ETH Zürich · Department of Management, Technology, and Economics
Frank Schweitzer
Professor
About
432
Publications
69,168
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
12,887
Citations
Publications
Publications (432)
The functional interaction structure of a team captures the preferences with which members of different roles interact. This paper presents a data-driven approach to detect the functional interaction structure for software development teams from traces team members leave on development platforms during their daily work. Our approach considers diffe...
Networks to distribute goods, from raw materials to food and medicines, are the backbone of a functioning economy. They are shaped by several supply relations connecting manufacturers, distributors, and final buyers worldwide. We present a network-based model to describe the mechanisms underlying the emergence and growth of distribution networks. I...
In the field of pharmaceutical supply chains, there is a lack of comprehensive historical data, representing a significant barrier to advancing research. To address this gap, we introduce a high-resolution dataset comprising drug packages distributed to approximately 300,000 pharmacies, hospitals, and practitioners across the US. We reconstruct 375...
This paper introduces Data2Neo, an open-source Python library for converting relational data into knowledge graphs stored in Neo4j databases. With extensive customization options and support for continuous online data integration from various data sources, Data2Neo is designed to be user-friendly, efficient, and scalable to large datasets. The tool...
Legislative bodies generally keep records of their activities. While the digitization wave spurred the availability of archival documents, their processing remains a challenge. The Swiss parliamentary records are no exception.
In this paper we present a supervised pipeline for extracting and structuring of content of archival records. Our pipeline...
Supply chain disruptions may cause shortages of essential goods, affecting millions of individuals. We propose a perspective to address this problem via reroute flexibility. This is the ability to substitute and reroute products along existing pathways, hence without requiring the creation of new connections. To showcase the potential of this appro...
Community smells are negative patterns in software development teams’ interactions that impede their ability to successfully create software. Examples are team members working in isolation, lack of communication and collaboration across departments or sub-teams, or areas of the codebase where only a few team members can work on. Current approaches...
Collectives form nonequilibrium social structures characterized by volatile dynamics. Individuals join or leave. Social relations change quickly. Therefore, unlike engineered or ecological systems, a resilient reference state cannot be defined. We propose a novel resilience measure combining two dimensions: robustness and adaptivity. We demonstrate...
Apart from nodes and links, for many networked systems, we have access to data on paths, i.e., collections of temporally ordered variable-length node sequences that are constrained by the system’s topology. Understanding the patterns in such data is key to advancing our understanding of the structure and dynamics of complex systems. Moreover, the a...
Supply chain disruptions cause shortages of raw material and products. To increase resilience, i.e., the ability to cope with shocks, substituting goods in established supply chains can become an effective alternative to creating new distribution links. We demonstrate its impact on supply deficits through a detailed analysis of the US opioid distri...
The functional interaction structure of a team captures the preferences with which members of different roles interact. This paper presents a data-driven approach to detect the functional interaction structure for software development teams from traces team members leave on development platforms during their daily work. Our approach considers diffe...
Resilience denotes the capacity of a system to withstand shocks and its ability to recover from them. We develop a framework to quantify the resilience of highly volatile, non-equilibrium social organizations, such as collectives or collaborating teams. It consists of four steps: (i) delimitation, i.e. narrowing down the target systems, (ii) concep...
Collectives form non-equilibrium social structures characterised by a volatile dynamics. Individuals join or leave. Social relations change quickly. Therefore, differently from engineered or ecological systems, a resilient reference state cannot be defined. We propose a novel resilience measure combining two dimensions: robustness and adaptivity. W...
Positive and negative relations play an essential role in human behavior and shape the communities we live in. Despite their importance, data about signed relations is rare and commonly gathered through surveys. Interaction data is more abundant, for instance, in the form of proximity or communication data. So far, though, it could not be utilized...
We evaluate the robustness and adaptivity of social groups with heterogeneous agents that are characterized by their binary state, their ability to change this state, their status and their preferred relations to other agents. To define group structures, we operationalize the hexagrams of the I Ching. The relations and properties of agents are used...
In the U.S. Congress, legislators can use active and passive cosponsorship to support bills. We show that these two types of cosponsorship are driven by two different motivations: the backing of political colleagues and the backing of the bill's content. To this end, we develop an Encoder+RGCN based model that learns legislator representations from...
Firms' innovation potential depends on their position in the R&D network. But details on this relation remain unclear because measures to quantify network embeddedness have been controversially discussed. We propose and validate a new measure, coreness, obtained from the weighted k-core decomposition of the R&D network. Using data on R&D alliances,...
We study the effect of group interactions on the emergence of consensus in a spin system. Agents with discrete opinions {0,1} form groups. They can change their opinion based on their group's influence (voter dynamics), but groups can also split and merge (adaptation). In a hypergraph, these groups are represented by hyperedges of different sizes....
Communal roosting in Bechstein’s bat colonies is characterized by the formation of several groups that use different day roosts and that regularly dissolve and re-merge (fission–fusion dynamics). Analysing data from two colonies of different sizes over many years, we find that (i) the number of days that bats stay in the same roost before changing...
Do politicians carry on their interpersonal relations online? We examine how online political support compares to offline support. We overlay two data sets on political support among members of the Swiss National Council to estimate the explanatory power of online endorsements on offline cosponsorship signatures among members, using a gHypEG networ...
We use a data-driven agent-based model to study the core–periphery structure of two collaboration networks, R&D alliances between firms and co-authorship relations between scientists. To characterize the network embeddedness of agents, we introduce a coreness value obtained from a weighted [Formula: see text]-core decomposition. We study the change...
The spontaneous formation and subsequent growth, dissolution, merger, and competition of social groups bears similarities to physical phase transitions in metastable finite systems. We examine three different scenarios, percolation, spinodal decomposition, and nucleation, to describe the formation of social groups of varying size and density. In ou...
By means of a statistical model we study the adoption of Protestantism during the Reformation for 262 territories of the Holy Roman Empire. Our unit of analysis is a territory and the dependent variable indicates whether and when the territorial ruler adopted Protestantism. The independent variables are based on seven theoretical factors that histo...
We study the effect of group interactions on the emergence of consensus in a spin system. Agents with discrete opinions $\{0,1\}$ form groups. They can change their opinion based on their group's influence (voter dynamics), but groups can also split and merge (adaptation). In a hypergraph, these groups are represented by hyperedges of different siz...
Massive data from software repositories and collaboration tools are widely used to study social aspects in software development. One question that several recent works have addressed is how a software project's size and structure influence team productivity, a question famously considered in Brooks' law. Recent studies using massive repository data...
We study the lock-in effect in a network of task assignments. Agents have a heterogeneous fitness for solving tasks and can redistribute unfinished tasks to other agents. They learn over time to whom to reassign tasks and preferably choose agents with higher fitness. A lock-in occurs if reassignments can no longer adapt. Agents overwhelmed with tas...
We study the lock-in effect in a network of task assignments. Agents have a heterogeneous fitness for solving tasks and can redistribute unfinished tasks to other agents. They learn over time to whom to reassign tasks and preferably choose agents with higher fitness. A lock-in occurs if reassignments can no longer adapt. Agents overwhelmed with tas...
We analyze data about the micro-blogging site Twitter using sentiment extraction techniques. From an information perspective, Twitter users are involved mostly in two processes: information creation and subsequent distribution (tweeting), and pure information distribution (retweeting), with pronounced preference to the first. However a rather subst...
The spontaneous formation and subsequent growth, dissolution, merger and competition of social groups bears similarities to physical phase transitions in metastable finite systems. We examine three different scenarios, percolation, spinodal decomposition and nucleation, to describe the formation of social groups of varying size and density. In our...
Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks capturing e.g. collaboration, coordination, or communication from the commit history of projects. Many works in this area studied...
The reputation of firms is largely channeled through their ownership structure. We use this relation to determine reputation spillovers between transnational companies and their participated companies in an ownership network core of 1318 firms. We then apply concepts of network controllability to identify minimum sets of driver nodes (MDS) of 314 f...
We propose an agent-based model of collective opinion formation to study the wisdom of crowds under social influence. The opinion of an agent is a continuous positive value, denoting its subjective answer to a factual question. The wisdom of crowds states that the average of all opinions is close to the truth, i.e., the correct answer. But if agent...
Are offline political relations and endorsements established or maintained with online interactions?We examine the external validity of data on political behavior exhibited online and compare it to offline behavior. We overlay two data sets on political support among members of the Swiss parliament to estimate the explanatory power of online endors...
Systems design utilizes top-down and bottom-up approaches to influence social or economic systems such that a desired outcome is obtained. We characterize different approaches like network controllability, network interventions, nudging and mechanism design and discuss the problems involved. We argue that systems design cannot be reduced to solving...
High skill labour is an important factor underpinning the competitive advantage of modern economies [5, 6, 3]. Therefore, attracting and retaining scientists has become a major concern for migration policy [9]. In this work, we study the migration of scientists on a global scale, by combining two large data sets covering the publications of 3.5 Mio...
Theories of social roles neglect social clusters. However, accounting for clusters is essential because individuals in social networks (e.g., social media) cannot oversee the whole network and have to restrict their interactions to local substructures. Roles that do not account for this cluster formation may lead to misinterpretations of the networ...
The dynamics of collaboration networks of firms follow a life cycle of growth and decline. That does not imply they also become less resilient. Instead, declining collaboration networks may still have the ability to mitigate shocks from firms leaving and to recover from these losses by adapting to new partners. To demonstrate this, we analyze 21.50...
The social percolation model Solomon et al. (2000) considers a 2-dimensional regular lattice. Each site is occupied by an agent with a preference xi sampled from a uniform distribution U[0,1]. Agents transfer the information about the quality q of a movie to their neighbors only if xi≤q. Information percolates through the lattice if q=qc=0.593. – F...
Leading-following behavior as a way of transferring information about the location of resources is wide-spread in many animal societies. It represents active information transfer that allows a given social species to reach collective decisions in the presence of limited information. Although leading-following behavior has received much scientific i...
In our multi-agent model agents generate wealth from repeated interactions for which a prisoner's dilemma payoff matrix is assumed. Their gains are taxed by a government at a rate $\alpha$. The resulting budget is spent to cover administrative costs and to pay a bonus to cooperative agents, which can be identified correctly only with a probability...
The reputation of firms is largely channeled through their ownership structure. We use this relation to determine reputation spillovers between transnational companies and their participated companies in an ownership network core of 1318 firms. We then apply concepts of network controllability to identify minimum sets of driver nodes (MDS) of 314 f...
“Wisdom of crowds” refers to the phenomenon that the average opinion of a group of individuals on a given question can be very close to the true answer. It requires a large group diversity of opinions, but the collective error, the difference between the average opinion and the true value, has to be small. We consider a stochastic opinion dynamics...
The dynamics of collaboration networks of firms follow a life-cycle of growth and decline. That does not imply they also become less resilient. Instead, declining collaboration networks may still have the ability to mitigate shocks from firms leaving, and to recover from these losses by adapting to new partners. To demonstrate this, we analyze 21.5...
In our multi-agent model agents generate wealth from repeated interactions for which a prisoner's dilemma payoff matrix is assumed. Their gains are taxed by a government at a rate α. The resulting budget is spent to cover administrative costs and to pay a bonus to cooperative agents, which can be identified correctly only with a probability p. Agen...
To what extent is the citation rate of new papers influenced by the past social relations of their authors? To answer this question, we present a data-driven analysis of nine different physics journals. Our analysis is based on a two-layer network representation constructed from two large-scale data sets, INSPIREHEP and APS. The social layer contai...
The social percolation model \citep{solomon-et-00} considers a 2-dimensional regular lattice. Each site is occupied by an agent with a preference $x_{i}$ sampled from a uniform distribution $U[0,1]$. Agents transfer the information about the quality $q$ of a movie to their neighbors only if $x_{i}\leq q$. Information percolates through the lattice...
We analyze an agent-based model to estimate how the costs and benefits of users in an online social network (OSN) impact the robustness of the OSN. Benefits are measured in terms of relative reputation that users receive from their followers. They can be increased by direct and indirect reciprocity in following each other, which leads to a core-per...
We investigate a multi-agent model of firms in a Research & Development (R&D) network. Each firm is characterized by its knowledge stock xi(t), which follows a non-linear dynamics. xi(t) grows with the input from other firms, i.e., by knowledge transfer, and decays otherwise. However, maintaining the interactions that increase knowledge stock is co...
We provide a general framework to model the growth of networks consisting of different coupled layers. Our aim is to estimate the impact of one such layer on the dynamics of the others. As an application, we study a scientometric network, where one layer consists of publications as nodes and citations as links, whereas the second layer represents t...
It is known that individual opinions on different policy issues often align to a dominant ideological dimension (e.g., left vs right) and become increasingly polarized. We provide an agent-based model that reproduces alignment and polarization as emergent properties of opinion dynamics in a multi-dimensional space of continuous opinions. The mechan...
We propose an agent-based model of collective opinion formation to study the wisdom of crowds under social influence. The opinion of an agent is a continuous positive value, denoting its subjective answer to a factual question. The wisdom of crowds states that the average of all opinions is close to the truth, i.e. the correct answer. But if agents...
The law of proportionate growth simply states that the time-dependent change of a quantity x is proportional to x. Its applicability to a wide range of dynamic phenomena is based on various assumptions for the proportionality factor, which can be random or deterministic, constant or time dependent. Further, the dynamics can be combined with additio...
"Wisdom of crowds" refers to the phenomenon that the average opinion of a group of individuals on a given question can be very close to the true answer. It requires a large group diversity of opinions, but the collective error, the difference between the average opinion and the true value, has to be small. We consider a stochastic opinion dynamics...
We propose a novel sequence prediction method for sequential data capturing node traversals in graphs. Our method builds on a statistical modelling framework that combines multiple higher-order network models into a single multi-order model. We develop a technique to fit such multi-order models in empirical sequential data and to select the optimal...
Abstract The mobility of scientists between different universities and countries is important to foster knowledge exchange. At the same time, the potential mobility is restricted by geographic and institutional constraints, which leads to temporal correlations in the career trajectories of scientists. To quantify this effect, we extract 3.5 million...
We investigate a multi-agent model of firms in an R\&D network. Each firm is characterized by its knowledge stock $x_{i}(t)$, which follows a non-linear dynamics. It can grow with the input from other firms, i.e., by knowledge transfer, and decays otherwise. Maintaining interactions is costly. Firms can leave the network if their expected knowledge...
Online social networks (OSN) are prime examples of socio-technical systems in which individuals interact via a technical platform. OSN are very volatile because users enter and exit and frequently change their interactions. This makes the robustness of such systems difficult to measure and to control. To quantify robustness, we propose a coreness v...
We provide an agent-based model to explain the emergence of collective opinions not based on feedback between different opinions, but based on emotional interactions between agents. The driving variable is the emotional state of agents, characterized by their valence, quantifying the emotion from unpleasant to pleasant, and their arousal, quantifyi...
We provide an agent-based model to explain the emergence of collective opinions not based on feedback between different opinions, but based on emotional interactions between agents. The driving variable is the emotional state of agents, characterized by their valence, quantifying the emotion from unpleasant to pleasant, and their arousal, quantifyi...
It is known that individual opinions on different policy issues often align to a dominant ideological dimension (e.g. "left" vs. "right") and become increasingly polarized. We provide an agent-based model that reproduces these two stylized facts as emergent properties of an opinion dynamics in a multi-dimensional space of continuous opinions. The m...
We provide a general framework to model the growth of networks consisting of different coupled layers. Our aim is to estimate the impact of one such layer on the dynamics of the others. As an application, we study a scientometric network, where one layer consists of publications as nodes and citations as links, whereas the second layer represents t...
We propose a novel sequence prediction method for sequential data capturing node traversals in graphs. Our method builds on a statistical modelling framework that combines multiple higher-order network models into a single multi-order model. We develop a technique to fit such multi-order models in empirical sequential data and to select the optimal...
We propose an agent-based model of collective opinion formation to study the wisdom of crowds under social influence. The opinion of an agent is a continuous positive value, denoting its subjective answer to a factual question. The wisdom of crowds states that the average of all opinions is close to the truth, i.e. the correct answer. But if agents...
The dynamics of collaboration networks of rms follow a life-cycle of growth and decline. That does not imply they also become less resilient. Instead, declining collaboration networks may still have the ability to mitigate shocks from rms leaving, and to recover from these losses by adapting to new partners. To demonstrate this, we analyze 21.500 R...
We propose a new measure to quantify the impact of a node [Formula: see text] in controlling a directed network. This measure, called “control contribution” [Formula: see text], combines the probability for node [Formula: see text] to appear in a set of driver nodes and the probability for other nodes to be controlled by [Formula: see text]. To cal...
Communal roosting in Bechstein's bat colonies is characterized by the formation of several groups that use different day roosts and that regularly dissolve and re-merge (fission-fusion dynamics). Analyzing data from two colonies of different size over many years, we find that (i) the number of days bats stay in the same roost before changing follow...
Leading-following behaviour in Bechstein's bats transfers information about suitable roost sites from experienced to inexperienced individuals, and thus ensures communal roosting. We analyze 9 empirical data sets about individualized leading-following (L/F) events, to infer rules that likely determine the formation of L/F pairs. To test these rules...
Leading-following behavior as a way of transferring information about the location of resources is widespread in different animal societies. However, it cannot always be observed directly. Here, we develop a general method to infer leading-following events from observational data if only the discrete appearance of individuals is recorded. Our metho...
We consider a multi-layer network with two layers, $\mathcal{L}_{1}$, $\mathcal{L}_{2}$. Their intra-layer topology shows a scale-free degree distribution and a core-periphery structure. A nested structure describes the inter-layer topology, i.e., some nodes from $\mathcal{L}_{1}$, the generalists, have many links to nodes in $\mathcal{L}_{2}$, spe...
Analyzing available FAO data from 176 countries over 21 years, we observe an increase of complexity in the international trade of maize, rice, soy, and wheat. A larger number of countries play a role as producers or intermediaries, either for trade or food processing. In consequence, we find that the trade networks become more prone to failure casc...
Online social networks (OSN) are prime examples of socio-technical systems in which individuals interact via a technical platform. OSN are very volatile because users enter and exit and frequently change their interactions. This makes the robustness of such systems difficult to measure and to control. To quantify robustness, we propose a coreness v...
We propose a novel way to measure the reputation of firms by using information about their ownership structure. Supported by the signalling theory, we argue that ownership relations channel reputation spillovers between shareholders and their invested companies. We model such reputation spillovers by means of a simple dynamics that runs on the owne...
To what extent is the citation rate of new papers influenced by the past social relations of their authors? To answer this question, we present a data-driven analysis of nine different physics journals. Our analysis is based on a two-layer network representation constructed from two large-scale data sets, INSPIREHEP and APS. The social layer contai...
We quantify a social organization’s potentiality, that is, its ability to attain different configurations. The organization is represented as a network in which nodes correspond to individuals and (multi-)edges to their multiple interactions. Attainable configurations are treated as realizations from a network ensemble. To have the ability to encod...
We analyze an agent-based model to estimate how the costs and benefits of users in an online social network (OSN) impact the robustness of the OSN. Benefits are measured in terms of relative reputation that users receives from their followers. They can be increased by direct and indirect reciprocity in following each other, which leads to a core-pe...
The law of proportionate growth simply states that the time dependent change of a quantity $x$ is proportional to $x$. Its applicability to a wide range of dynamic phenomena is based on various assumptions for the proportionality factor, which can be random or deterministic, constant or time dependent. Further, the dynamics can be combined with add...
We provide an agent-based model to explain the emergence of collective opinions not based on feedback between different opinions, but based on emotional interactions between agents. The driving variable is the emotional state of agents, characterized by their valence and their arousal. Both determine their emotional expression, from which collectiv...
Freedoms and limitations. Systems design-the term immediately evokes associations with the freedoms of a divine creator. However, it is precisely this idea that already describes the problem: like any other design, systems design must respect certain boundaries. And that means, rst of all, knowing these limitations. A product designer's creativity...
Freiheiten und Einschränkungen. Systemgestaltung-das weckt sofort Assoziationen an die Freiheiten eines göttlichen Schöpfers. Genau diese Vorstellung bezeichnet aber bereits das Problem: Wie jedes andere Design muss auch das Systemdesign gewisse Randbedingungen re-spektieren. Und das bedeutet zunächst einmal, diese Beschränkungen zu kennen. Ein Pro...
We propose a new measure to quantify the impact of a node $i$ in controlling a directed network. This measure, called `control contribution' $\mathcal{C}_{i}$, combines the probability for node $i$ to appear in a set of driver nodes and the probability for other nodes to be controlled by $i$. To calculate $\mathcal{C}_{i}$, we propose an optimizati...
The unsupervised detection of anomalies in time series data has important applications, e.g., in user behavioural modelling, fraud detection, and cybersecurity. Anomaly detection has been extensively studied in categorical sequences, however we often have access to time series data that contain paths through networks. Examples include transaction s...