Filipi Nascimento SilvaIndiana University Bloomington | IUB · Indiana University Network Science Institute
Filipi Nascimento Silva
PhD
About
115
Publications
26,799
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,442
Citations
Publications
Publications (115)
Random walks find extensive applications across various complex network domains, including embedding generation and link prediction. Despite the widespread utilization of random walks, the precise impact of distinct biases on embedding generation from sequence data and their subsequent effects on link prediction remain elusive. We conduct a compara...
We propose a method to measure the similarity of papers and authors by simulating a literature search procedure on citation networks, which is an information retrieval inspired conceptualization of similarity. This transition probability (TP) based approach does not require a curated classification system, avoids clustering complications, and provi...
The impact of research papers, typically measured in terms of citation counts, depends on several factors, including the reputation of the authors, journals, and institutions, in addition to the quality of the scientific work. In this paper, we present an approach that combines natural language processing and machine learning to predict the impact...
Artistic pieces can be studied from several perspectives, one example being their reception among readers over time. In the present work, we approach this interesting topic from the standpoint of literary works, particularly assessing the task of predicting whether a book will become a best seller. Unlike previous approaches, we focused on the full...
Many real-world systems give rise to a time series of symbols. The elements in a sequence can be generated by agents walking over a networked space so that whenever a node is visited the corresponding symbol is generated. In many situations the underlying network is hidden, and one aims to recover its original structure and/or properties. For examp...
Random walks find extensive application across various complex network domains, including embedding generation and link prediction. Despite the widespread utilization of random walks, the precise impact of distinct biases on embedding generation from sequence data and their subsequent effects on link prediction remain elusive. In this study, we con...
Neuroscience research has expanded dramatically over the past 30 years by advancing standardization and tool development to support rigor and transparency. Consequently, the complexity of the data pipeline has also increased, hindering access to FAIR (Findable, Accessible, Interoperabile, and Reusable) data analysis to portions of the worldwide res...
Recent progress in natural language processing (NLP) enables mining the literature in various tasks akin to knowledge discovery. Obtaining an updated birds-eye view of key research topics and their evolution in a vast, dynamic field such as materials science is challenging even for experienced researchers. In this Perspective paper, we present a la...
Neuroscience research has expanded dramatically over the past 30 years by advancing standardization and tool development to support rigor and transparency. Consequently, the complexity of the data pipeline has also increased, hindering access to FAIR data analysis to portions of the worldwide research community. brainlife.io was developed to reduce...
Science has become more collaborative over the past years, as evidenced by the growing number of authors per publication and the emergence of interdisciplinary research endeavors involving specialists from different fields. In this context, it is not trivial to quantify the individual impact of researchers. To address this issue, we evaluate the ef...
Citation networks can reveal much important information regarding the development of science and the relationship between different areas of knowledge. Frequently, citation networks are created using articles acquired from a set of relevant keywords or queries. Here, we study the robustness of citation networks communities with regards to the keywo...
Many real-world systems give rise to a time series of symbols. The elements in a sequence can be generated by agents walking over a networked space so that whenever a node is visited the corresponding symbol is generated. In many situations the underlying network is hidden, and one aims to recover its original structure and/or properties. For examp...
Artistic pieces can be studied from several perspectives, one example being their reception among readers over time. In the present work, we approach this interesting topic from the standpoint of literary works, particularly assessing the task of predicting whether a book will become a best seller. Dissimilarly from previous approaches, we focused...
In this paper, we present the implementation of a Python library to calculate symmetry, accessibility, and generalized accessibility, measures that are used to characterize the topological properties of complex network nodes. The software is written as a Python C extension and provides both a fast implementation and user friendly API. In order to i...
Sequences arise in many real-world scenarios; thus, identifying the mechanisms behind symbol generation is essential to understanding many complex systems. This paper analyzes sequences generated by agents walking on a networked topology. Given that in many real scenarios, the underlying processes generating the sequence is hidden, we investigate w...
Several interesting situations involve two networks containing the same set of nodes, but which are interconnected in potentially distinct manners. These two networks can refer to: the same network, a network and a respectively modified version, or two related but inherently distinct networks such as a citation and a co-authorship networks that, th...
A basic question in network community detection is how modular a given network is. This is usually addressed by evaluating the quality of partitions detected in the network. The Girvan-Newman (GN) modularity function is the standard way to make this assessment, but it has a number of drawbacks. Most importantly, it is not clearly interpretable, giv...
Poetry and prose are written artistic expressions that help us appreciate the reality we live in. Each of these styles has its own set of subjective properties, such as rhyme and rhythm, which are easily caught by a human reader’s eye and ear. With the recent advances in artificial intelligence, the gap between humans and machines may have decrease...
Gene network analysis is an important tool for studying the changes in steady states that characterize cell functional properties, the genome-environment interplay, and the health-disease transitions. Moreover, gene co-expression and protein–protein interaction (PPI) data can be integrated with clinical, histopathological, and imaging information –...
Several complex systems are characterized by presenting intricate characteristics extending along many scales. These characterizations are used in various applications, including text classification, better understanding of diseases, and comparison between cities, among others. In particular, texts are also characterized by a hierarchical structure...
Science has become more collaborative over the past years, a phenomenon that is related to the increase in the number of authors per paper and the emergence of interdisciplinary works featuring specialists of different fields. In such a environment, it is not trivial to quantify the individual impact of researchers. Here we analyze how the most pro...
Understanding the dynamics of authors is relevant to predict and quantify performance in science. While the relationship between recent and future citation counts is well-known, many relationships between scholarly metrics at the author-level remain unknown. In this context, we performed an analysis of author-level metrics extracted from subsequent...
A basic question in network community detection is how modular a given network is. This is usually addressed by evaluating the quality of partitions detected in the network. The Girvan-Newman (GN) modularity function is the standard way to make this assessment, but it has a number of drawbacks. Most importantly, it is not clearly interpretable, giv...
Climate system teleconnections are crucial for improving climate predictability, but difficult to quantify. Standard approaches to identify teleconnections are often based on correlations between time series. Here we present a novel method leveraging Granger causality, which can infer/detect relationships between any two fields. We compare teleconn...
The number, size and complexity of ‘big science’ projects are growing — as are the size, complexity and value of the data sets and software services they produce. In this context, big data gives a new way to analyse, understand, manage and communicate the inner workings of collaborations that often involve thousands of experts, thousands of scholar...
Free Unified Rendering in pYthon (FURY), is a community-driven, open-source, and highperformance scientific visualization library that harnesses the graphics processing unit (GPU) for improved speed, precise interactivity, and visual clarity. FURY provides an integrated API in Python that allows UI elements and 3D graphics to be programmed together...
Poetry and prose are written artistic expressions that help us to appreciate the reality we live. Each of these styles has its own set of subjective properties, such as rhyme and rhythm, which are easily caught by a human reader's eye and ear. With the recent advances in artificial intelligence, the gap between humans and machines may have decrease...
Citation networks can reveal many important information regarding the development of science and the relationship between different areas of knowledge. Thus, many studies have analyzed the topological properties of such networks. Frequently, citation networks are created using articles acquired from a set of relevant keywords or queries. Here, we s...
With the expansion of electronic publishing, a new dynamics of scientific articles dissemination was initiated. Still substantially important, citations became a longer term effect. Nowadays, many works are widely disseminated even before publication, in the form of preprints. Another important new element concerns the views of published articles....
Principal component analysis (PCA) is often applied for analyzing data in the most diverse areas. This work reports, in an accessible and integrated manner, several theoretical and practical aspects of PCA. The basic principles underlying PCA, data standardization, possible visualizations of the PCA results, and outlier detection are subsequently a...
The relationship between the topology of a network and specific types of dynamics unfolding in networks constitutes a subject of substantial interest. One type of dynamics that has attracted increasing attention because of its several potential implications is opinion formation. A phenomenon of particular importance, known to take place in opinion...
The development of modern electronics was to a large extent related to the advent and popularization of bipolar junction technology. The present work applies science of science concepts and methodologies in order to develop a relatively systematic, quantitative study of the development of electronics from a bipolar-junction perspective. First, we s...
Discovery processes have been an important topic in the network science field. The exploration of nodes can be understood as the knowledge acquisition process taking place in the network, where nodes represent concepts and edges are the semantical relationships between concepts. While some studies have analyzed the performance of the knowledge acqu...
Understanding the dynamics of authors is relevant to predict and quantify performance in science. While the relationship between recent and future citation counts is well-known, many relationships between scholarly metrics at the author-level remain unknown. In this context, we performed an analysis of author-level metrics extracted from subsequent...
Climate system teleconnections, which are far-away climate responses to perturbations or oscillations, are difficult to quantify, yet understanding them is crucial for improving climate predictability. Here we leverage Granger causality in a novel method of identifying teleconnections. Because Granger causality is explicitly defined as a statistica...
The development of modern electronics was to a large extent related to the advent and popularization of bipolar junction technology. The present work applies science of science concepts and methodologies in order to develop a relatively systematic, quantitative study of the development of electronics from a bipolar-junction-centered perspective. Fi...
Discovery processes have been an important topic in the network science field. The exploration of nodes can be understood as the knowledge acquisition process taking place in the network, where nodes represent concepts and edges are the semantical relationships between concepts. While some studies have analyzed the performance of the knowledge acqu...
The citations process for scientific papers has been studied extensively. But while the citations accrued by authors are the sum of the citations of their papers, translating the dynamics of citation accumulation from the paper to the author level is not trivial. Here we conduct a systematic study of the evolution of author citations, and in partic...
With the expansion of electronic publishing, a new dynamics of scientific articles dissemination was initiated. Nowadays, many works become widely disseminated even before publication, in the form of preprints. Another important new element concerns the visualizations of published articles. Thanks to the availability of respective data by some jour...
In this paper, we introduce a network-based methodology to study how political entities evolve over time. We constructed networks of voting data from the Brazilian Chamber of Deputies, where deputies are nodes and edges are represented by voting similarity among deputies. The Brazilian Chamber of deputies is characterized by a multi-party political...
With funding from the National Science Foundation, the Center for Open Science (COS) and Indiana University will create a dynamic, distributed, and heterogeneous data source for the advancement of science of science research. This will be achieved by using, enhancing, and combining the capabilities of the Open Science Framework (OSF) and the Collab...
The citations process for scientific papers has been studied extensively. But while the citations accrued by authors are the sum of the citations of their papers, translating the dynamics of citation accumulation from the paper to the author level is not trivial. Here we conduct a systematic study of the evolution of author citations, and in partic...
The relationship between the topology of a network and specific types of dynamics unfolding on it has been extensively studied in network science. One type of dynamics that has attracted increasing attention because of its several implications is opinion formation. A phenomenon of particular importance that is known to take place in opinion formati...
In this paper, we introduce a network-based methodology to study how clusters represented by political entities evolve over time. We constructed networks of voting data from the Brazilian Chamber of Deputies, where deputies are nodes and edges are represented by voting similarity among deputies. The Brazilian Chamber of deputies is characterized by...
Understanding the way in which human opinion changes along time and space constitutes one of the great challenges in complex systems research. Among the several approaches that have been attempted at studying this problem, the Sznajd model provides some particularly interesting features, such as its simplicity and ability to represent some of the m...
Understanding the way in which human opinion changes along time and space constitutes one of the great challenges in complex systems research. Among the several approaches that have been attempted at studying this problem, the Sznajd model provides some particularly interesting features, such as its simplicity and ability to represent some of the m...
Over the past years, network science emerged as a new scientific field dedicated to represent and model complex systems by means of networks. Visualizing such structures has been proven to be both challenging and fruitful. For instance, by using a force-directed layout many topological characteristics, such as communities and spatiality, clearly su...
Media links for Visualization of Complex Networks (CDT-5)
Most complex networks are not static, but evolve along time. Given a specific configuration of one such changing network, it becomes a particularly interesting issue to quantify the diversity of possible unfoldings of its topology. In this work, we suggest the concept of malleability of a network, which is defined as the exponential of the entropy...
Most complex networks are not static, but evolve along time. Given a specific configuration of one such changing network, it becomes a particularly interesting issue to quantify the diversity of possible unfoldings of its topology. In this work, we suggest the concept of malleability of a network, which is defined as the exponential of the entropy...
A framework integrating information theory and network science is proposed. By incorporating and integrating concepts such as complexity, coding, topological projections and network dynamics, the proposed network-based framework paves the way not only to extending traditional information science, but also to modeling, characterizing and analyzing a...
Real-world dynamics running on networks can be characterized in terms of their respective diversity , or heterogeneity of state values. Spatial networks can be understood as networks exhibiting limited small world characteristics. In the present work we argue that network spatiality can enhance the diversity of respectively unfolding dynamics. This...
Real-world dynamics running on networks can be characterized in terms of their respective diversity , or heterogeneity of state values. Spatial networks can be understood as networks exhibiting limited small world characteristics. In the present work we argue that network spatiality can enhance the diversity of respectively unfolding dynamics. This...
Studies regarding knowledge organization and acquisition are of great importance to understand areas related to science and technology. A common way to model the relationship between different concepts is through complex networks. In such representations, networks’ nodes store knowledge and edges represent their relationships. Several studies that...
Negative feedback is a powerful approach capable of improving several aspects of a system. In linear electronics, it has been critical for allowing invariance to device properties. Negative feedback is also known to enhance linearity in amplification, which is one of the most important foundations of linear electronics. At the same time, thousands...
Principal component analysis (PCA) is often used for analysing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and accessible manner, the basic principles underlying PCA and its applications. Next, we present a systematic, thou...
Principal component analysis (PCA) is often used for analysing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and accessible manner, the basic principles underlying PCA and its applications. Next, we present a systematic, thou...
Bipolar junction transistors (BJTs) have been at the core of linear electronics from its beginnings. Here, we suggest an Early‐based inspired geometric model of BJT devices and its application to derive models of related electronic circuits (more specifically a common‐emitter configuration). The approach involves using a beam of isolines converging...
Studies regarding knowledge organization and acquisition are of great importance to understand areas related to science and technology. A common way to model the relationship between different concepts is through complex networks. In such representations, network's nodes store knowledge and edges represent their relationships. Several studies that...
Statistical techniques that analyse texts, referred to as text analytics, have departed from the use of simple word count statistics towards a new paradigm. Text mining now hinges on a more sophisticated set of methods, including the representations in terms of complex networks. While well-established word-adjacency (co-occurrence) methods successf...
Complex networks have been widely used to model biological systems. The concept of accessibility has been proposed recently as a means to organize the nodes of complex networks as belonging to its border or center. Such an approach paves the way to investigating how the functional and structural properties of nodes vary with their respective positi...
We study the topological organization of several world cities, according to respective representations by complex networks. We use, as a first step, a recently developed methodology that allows the most significant urban region of each city to be identified. Then, we estimate many topological measures and apply multivariate statistics and data anal...
The topological organization of several world cities are studied according to respective representations by complex networks. As a first step, the city maps are processed by a recently developed methodology that allows the most significant urban region of each city to be identified. Then, we estimate many topological measures on the obtained networ...
Complex networks have been found to provide a good representation of knowledge. In this context, the discovery process can be modeled in terms of a dynamic process such as agents moving in a knowledge space. Recent studies proposed more realistic dynamics which can be influenced by the visibility of the agents, or by their memory. However, rather t...
The properties of bipolar junction transistors (BJTs) are known to vary in terms of their parameters. In this work, an experimental approach, including pattern recognition concepts and methods such as principal component analysis (PCA) and linear discriminant analysis (LDA), was used to experimentally investigate the variation among BJTs belonging...
The properties of bipolar junction transistors (BJTs) are known to vary in terms of their parameters. In this work, an experimental approach, including pattern recognition concepts and methods such as principal component analysis (PCA) and linear discriminant analysis (LDA), was used to experimentally investigate the variation among BJTs belonging...
A framework integrating information theory and network science is proposed, giving rise to a potentially new area of network information science. By incorporating and integrating concepts such as complexity, coding, topological projections and network dynamics, the proposed network-based framework paves the way not only to extending traditional inf...
Complex networks have been found to provide a good representation of the structure of knowledge, as understood in terms of discoverable concepts and their relationships. In this context, the discovery process can be modeled as agents walking in a knowledge space. Recent studies proposed more realistic dynamics, including the possibility of agents b...
Bipolar junction transistors (BJTs) have been at the core of linear electronics from its beginnings. Although their properties can be well represented by the transistor equation, design and analysis approaches have, to a good extent, been limited to using current-equispaced horizontal and parallel isolines of the characteristic surface. Here, we re...
Negative feedback is a powerful approach capable of improving several aspects of a system. In linear electronics, it has been critical for allowing invariance to device properties. Negative feedback is also known to enhance linearity in amplification, which is one of the most important foundations of linear electronics. At the same time, thousands...
Science is becoming increasingly more interdisciplinary, giving rise to more diversity in the areas of expertise within research labs and groups. This also have brought changes to the role researchers in scientific works. As a consequence, multi-authored scientific papers have now became a norm for high quality research. Unfortunately, such a pheno...
Science is becoming increasingly more interdisciplinary, giving rise to more diversity in the areas of expertise within research labs and groups. This also have brought changes to the role researchers in scientific works. As a consequence, multi-authored scientific papers have now became a norm for high quality research. Unfortunately, such a pheno...
Linearity is an important and frequently sought property in electronics and instrumentation. Here, we report a method capable of, given a transfer function, identifying the respective most linear region of operation with a fixed width. This methodology, which is based on least squares regression and systematic consideration of all possible regions,...
Linearity is an important and frequently sought property in electronics and instrumentation. Here, we report a method capable of, given a transfer function, identifying the respective most linear region of operation with a fixed width. This methodology, which is based on least squares regression and systematic consideration of all possible regions,...
Statistical techniques that analyze texts, referred to as text analytics, have departed from the use of simple word count statistics towards a new paradigm. Text mining now hinges on a more sophisticated set of methods, including the representations in terms of complex networks. While well-established word-adjacency (co-occurrence) methods successf...
The increasing interest in complex networks research has been a consequence of several intrinsic features of this area, such as the generality of the approach to represent and model virtually any discrete system, and the incorporation of concepts and methods deriving from many areas, from statistical physics to sociology, which are often used in an...
The increasing interest in complex networks research has been a consequence of several intrinsic features of this area, such as the generality of the approach to represent and model virtually any discrete system, and the incorporation of concepts and methods deriving from many areas, from statistical physics to sociology, which are often used in an...
The access to an ever increasing amount of information in the modern world gave rise to the development of many quantitative indicators about urban regions in the globe. Therefore, there is a growing need for a precise definition of how to delimit urban regions, so as to allow proper respective characterization and modeling. Here we present a strai...