About
375
Publications
74,585
Reads
26,612
Citations
Publications (375)
Complex systems, such as economic, social, biological, and ecological systems, usually feature interactions not only between pairs of entities but also among three or more entities. These multi-entity interactions are known as higher-order interactions. Hypergraphs, as a mathematical tool, can effectively characterize higher-order interactions, wher...
Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a critical yet previously ignored risk is that the evaluation metrics for algorithm performance are usually chosen...
Link prediction is one of the most productive branches in network science, aiming to predict links that would have existed but have not yet been observed, or links that will appear during the evolution of the network. Over nearly two decades, the field of link prediction has amassed a substantial body of research, encompassing a plethora of algorit...
Understanding how student peers influence learning outcomes is crucial for effective education management in complex social systems. The complexities of peer selection and evolving peer relationships, however, pose challenges for identifying peer effects using static observational data. Here we use both null-model and regression approaches to exami...
Link prediction aims to predict the potential existence of links between two unconnected nodes within a network based on the known topological characteristics. Evaluation metrics are used to assess the effectiveness of algorithms in link prediction. The discriminating ability of these evaluation metrics is vitally important for accurately evaluatin...
The optimization of urban traffic efficiency and reduction of pollution through minimizing the number of taxis has become a topic of increasing interest. However, the problem of determining the minimum fleet that considers both time and distance efficiency has received limited attention. Furthermore, little research has been done on how this proble...
Describing travel patterns and identifying significant locations is a crucial area of research in transportation geography and social dynamics. Our study aims to contribute to this field by analyzing taxi trip data from Chengdu and New York City. Specifically, we investigate the probability density distribution of trip distance in each city, which...
Revealing different gender roles must precede efforts to reduce gender inequality. This paper analyzes COVID-19 family clusters outside Hubei Province in mainland China during the 2020 outbreak, revealing significant differences in spreading patterns across gender and family roles. Results show that men are more likely to be the imported cases...
Recommender systems have a wide range of applications in an age of information overload. A promising way to design better recommender systems in the presence of ubiquitous social media is to utilize social relationships in recommendation algorithms, an approach known as social recommendation. One critical challenge in social recommendation is how to mine...
The distribution of the lifetime of Chinese dynasties (as well as that of the British Isles and Japan) in a linear Zipf plot is found to consist of two straight lines intersecting at a transition point. This two-section piecewise-linear distribution is different from the power law or the stretched exponent distribution, and is called the Bilinear E...
As an essential mode of travel for city residents, taxis play a significant role in meeting travel demands in an urban city. Understanding the modal characteristics of taxis is vital to addressing many difficulties regarding urban sustainability. The movement trajectory of taxis reflects not only the operating features of taxis themselves but also...
Violations of laws and regulations about food safety, production safety, quality standard and environmental protection, or negative consequences from loan, guarantee and pledge contracts, may result in operating and credit risks of firms. The above illegal or trust-breaking activities are collectively called discreditable activities, and firms with...
Previous studies show that recommendation algorithms based on the historical behaviors of users can provide satisfactory recommendation performance. Many of these algorithms pay attention to the interests of users while ignoring the influence of social relationships on user behaviors. Social relationships not only carry intrinsic information of similar c...
Equal pay is an essential component of gender equality, one of the Sustainable Development Goals of the United Nations. Using resume data of over ten million Chinese online job seekers in 2015, we study the current gender pay gap in China. The results show that on average women only earned 71.57% of what men earned in China. The gender pay gap exi...
The traditional hydrodistillation (HD) and ultrasound-assisted pretreatment extraction (UAPE) methods were proposed to obtain essential oil (EO) from Tribute citrus (TC) peels. The Box-Behnken design was employed to optimize the HD and UAPE procedures. Moreover, gas chromatography-mass spectrometry (GC-MS) and electronic nose (E-nose) were applied...
Link prediction is a fundamental challenge in network science. Among various methods, similarity-based algorithms are popular for their simplicity, interpretability, high efficiency and good performance. In this paper, we show that the most elementary local similarity index Common Neighbor (CN) can be linearly decomposed by eigenvectors of the adja...
Prior research on work experience diversity yields inconsistent findings regarding its effects on employment outcomes: some conclude that experience diversity discounts (e.g., Ferguson & Hasan, 2013; Zuckerman, Kim, Ukanwa, & Rittmann, 2003), whereas some highlight its benefits (e.g., Lazear, 2004; Custodio, Ferreira, & Matos, 2013). Using resume d...
Highlights • We observe that the urban mobility community has different associated characteristics with travel distance and administrative area. • We demonstrate that as the travel distance increases, many adjacent communities gradually merge into large ones in the central area, while the communities remain similar in the suburban area. • We find s...
With the increase of urban population and the expansion of urban scale, understanding the urban structure could provide intellectual support for urban planning, traffic congestion, and even the spread of diseases. Little research has addressed the relationship between urban structure and human mobility. In this study, the community division method...
Link prediction is a fundamental challenge in network science. Among various methods, local similarity indices are widely used for their high cost-performance. However, the performance is less robust: for some networks local indices are highly competitive to state-of-the-art algorithms while for some other networks they are very poor. Inspired by t...
Industrial diversification depends on spillovers from related industries and nearby regions, yet their interaction remains largely unclear. We study economic diversification in China during the period 1990-2015 and present supportive evidence on both spillover channels. We add to the literature by showing that these two channels behave as substitut...
Control measures are necessary to contain the spread of serious infectious diseases such as COVID-19, especially in their early stages. We propose to use the temporal reproduction number, an extension of the effective reproduction number, to evaluate the efficacy of control measures, and establish a Monte-Carlo method to estimate the temporal reproduction numb...
Link prediction is a significant and challenging task in network science. The majority of known methods are similarity-based, which assign similarity indices for node pairs and assume that two nodes of larger similarity have higher probability to be connected by a link. Due to their simplicity, interpretability and high efficiency, similarity-based...
City taxi service systems have been empirically studied by a number of data-driven methods. However, their underlying mechanisms are hard to understand because the present mathematical models neglect to explain a (whole) taxi service process that includes a pair of on-load phase and off-load phase. In this paper, by analyzing a large amount of taxi...
Clustering is a fundamental tool aiming at classifying data points into groups based on their pairwise distances or similarities. It has found successful applications in all natural and social sciences, including biology, physics, economics, chemistry, astronomy, psychology, and so on. Among various types of algorithms, hierarchical clustering is o...
The identification of vital nodes that maintain the network connectivity is a long-standing challenge in network science. In this paper, we propose a so-called reverse greedy method where the least important nodes are preferentially chosen to make the size of the largest component in the corresponding induced subgraph as small as possible. Accordin...
The improvements in data acquisition and processing capabilities, as well as artificial intelligence and statistical mechanics, have rapidly and significantly changed the methodology of social and economic research. The recent paradigm shifting of social science driven by big data and artificial intelligence provides promising and novel data-driven...
Intermediaries are agents that facilitate resource allocation under certain constraint conditions. In a distribution market, buyers and sellers often interact via intermediaries and pay commissions to them. Due to the inadequacy of information spreading, a commodity can only be allocated to some agent who is close to the seller, which not o...
Different from the western education system, Chinese teachers and parents strongly encourage students to have a regular lifestyle. However, due to the lack of large-scale behavioral data, the relation between living patterns and academic performance remains poorly understood. In this chapter, we analyze large-scale behavioral records of 18,960 stud...
Uncovering the structure of socioeconomic systems and timely estimation of socioeconomic status are significant for economic development. The understanding of socioeconomic processes provides foundations to quantify global economic development, to map regional industrial structure, and to infer individual socioeconomic status. In this review, we wi...
Clustering is a fundamental analysis tool aiming at classifying data points into groups based on their similarity or distance. It has found successful applications in all natural and social sciences, including biology, physics, economics, chemistry, astronomy, psychology, and so on. Among numerous existent algorithms, hierarchical clustering algori...
Globalization significantly influences climate change. Ecological modernization theory and world polity theory suggest that globalization reduces carbon dioxide emissions worldwide by facilitating economic, political, social, and cultural homogenization, whereas ecological unequal exchange theory indicates that cumulative economic and political dis...
Detecting abnormal behaviors of students in time and providing personalized intervention and guidance at the early stage is important in educational management. Academic performance prediction is an important building block to enabling this pre-intervention and guidance. Most of the previous studies are based on questionnaire surveys and self-repor...
Novel data has been leveraged to estimate socioeconomic status in a timely manner; however, direct comparison of the use of social relations and talent movements remains rare. In this letter, we estimate the regional economic status based on the structural features of two networks. One is the online information flow network built on the followi...
The enrichment of data resources and the innovation of analytic methods are gradually facilitating the transformation of socioeconomics into a data-driven and quantitative discipline. As a part of quantitative human resources, the investigation of salary has a significant role on social and economic development. However, previous studies are mainly...
Novel data has been leveraged to estimate socioeconomic status in a timely manner; however, direct comparison of the use of social relations and talent movements remains rare. In this letter, we estimate the regional economic status based on the structural features of two networks. One is the online information flow network built on the followi...
Accurate perception of socioeconomic status and timely identification of emergencies are critical to smart social governance; however, traditional public sector data and statistical analysis methods cannot meet the accuracy and real-time requirements. Recently, large-scale data accumulated by the private sector, with many advantages including low a...
The heterogeneous nature of human behaviors contributes to the complexity of human-activated systems. Empirical observations and theoretical models reveal the temporal and spatial heterogeneity of many aspects of human behaviors, including social connections and geographic movements, while little is known whether and how human individual's behavior...
Quantitative understanding of the relationships between students' behavioral patterns and academic performance is a significant step towards personalized education. In contrast to previous studies that were mainly based on questionnaire surveys, in this paper, we collect behavioral records from 18,960 undergraduate students' smart cards and propose a novel...
In high dimensional data, many dimensions are irrelevant to each other and clusters are usually hidden under noise. As an important extension of the traditional clustering, subspace clustering can be utilized to simultaneously cluster the high dimensional data into several subspaces and associate the low-dimensional subspaces with the corresponding...
Recently, Antonioni and Cardillo proposed a coevolutionary model based on the intertwining of oscillator synchronization and evolutionary game theory [Phys. Rev. Lett. 118, 238301 (2017)], in which each Kuramoto oscillator can decide whether to interact or not with its neighbors, and all oscillators can receive some benefits from the local...
Recently, Antonioni and Cardillo proposed a coevolutionary model based on the intertwining of oscillator synchronization and evolutionary game theory [Phys. Rev. Lett. 118, 238301 (2017)], in which each Kuramoto oscillator can decide whether to interact or not with its neighbors, and all oscillators can receive some benefits from the local synchron...
In an economic market, sellers, infomediaries and customers constitute an economic network. Each seller has her own customer group and the seller's private customers are unobservable to other sellers. Therefore, a seller can only sell commodities among her own customers unless other sellers or infomediaries share her sale information to their custo...
Repeated game has long been the touchstone model for agents' long-run relationships. Previous results suggest that it is particularly difficult for a repeated game player to exert an autocratic control on the payoffs since they are jointly determined by all participants. This work discovers that the scale of a player's capability to unilaterally in...
This paper studies an auction design problem for a seller to sell a commodity in a social network, where each individual (the seller or a buyer) can only communicate with her neighbors. The challenge to the seller is to design a mechanism to incentivize the buyers, who are aware of the auction, to further propagate the information to their neighbor...
Drug-target interaction (DTI) prediction plays a very important role in drug development and drug discovery. Biochemical experiments or in vitro methods are very expensive, laborious and time-consuming. Therefore, in silico approaches including docking simulation and machine learning have been proposed to solve this problem. In pa...
Link prediction is an elemental challenge in network science, which has already found applications in guiding laboratorial experiments, digging out drug targets, recommending friends in social networks, probing mechanisms in network evolution, and so on. With a simple assumption that the likelihood of the existence of a link between two nodes can b...
Location recommendation plays an essential role in helping people find attractive places. Though recent research has studied how to recommend locations with social and geographical information, few of them addressed the cold-start problem of new users. Because mobility records are often shared on social networks, semantic information can be leverag...
Human behaviors exhibit ubiquitous correlations in many aspects, such as individual and collective levels, temporal and spatial dimensions, and content, social and geographical layers. With rich Internet data on online behaviors becoming available, there is growing academic interest in exploring human mobility similarity from the perspective of social networ...
Coordination is regarded as the result of interindividual interaction within natural gregarious animal groups. However, revealing the underlying interaction rules and decision-making strategies governing highly coordinated motion in bird flocks is still a long-standing challenge. Based on analysis of high spatial-temporal resolution GPS data of...
Drug-target interaction (DTI) prediction plays a very important role in drug development. Biochemical experiments or in vitro methods to identify such interactions are very expensive, laborious and time-consuming. Therefore, in silico approaches including docking simulation and machine learning have been proposed to solve this problem. In particula...
In the wake of large-scale retraction scandals, we urge scientific publishers to be more proactive in stamping out fake peer-reviewing practices. They should work with editors, authors and research institutes to implement an effective system of precautions and penalties.
Fraudulent peer review can arise when editors rely on authors' recommended r...
Many time series produced by complex systems are empirically found to follow power-law distributions with different exponents α. By permuting the independently drawn samples from a power-law distribution, we present nontrivial bounds on the memory strength (first-order autocorrelation) as a function of α, which are markedly different from the ordin...
In this paper, we study the evolution of cooperation in structured populations (individuals are located on either a regular lattice or a scale-free network) in the context of repeated games by involving three types of strategies, namely, unconditional cooperation, unconditional defection, and extortion. The strategy updating of the players is ruled...
Quantitative understanding of the relationships between students' behavioral patterns and academic performance is a significant step towards personalized education. In contrast to previous studies that were mainly based on questionnaire surveys, in this paper, we collect behavioral records from 18,960 undergraduate students' smart cards and propose a novel...
Inspired by the practical importance of social networks, economic networks, biological networks and so on, studies on large and complex networks have attracted a surge of attention in recent years. Link prediction is a fundamental issue in understanding the mechanisms by which new links are added to networks. We introduce the method of robust pri...
Recommender systems benefit us in tackling the problem of information overload by predicting our potential choices among diverse niche objects. So far, a variety of personalized recommendation algorithms have been proposed and most of them are based on similarities, such as collaborative filtering and mass diffusion. Here, we propose a novel vertex...
To explore the fascinating inter-individual interaction mechanism governing the abundant biological grouping behaviors, more and more efforts have been devoted to collective motion investigation in recent years. Therein, bird flocking is one of the most intensively studied behaviors. A previous study (Nagy M. et al., Nature, 464 (2010) 890.) claims...
Real networks are heterogeneous in nature, with nodes playing very different roles in structure and function. Identifying vital nodes is thus very significant, allowing us to control the outbreak of epidemics, to conduct advertisements for e-commercial products, to predict popular scientific publications, and so on. The vital nodes identification at...
Zero-determinant strategies, which can unilaterally define a linear relationship between two individuals’ long-term payoff, have drawn much attention to comprehend the emergence of cooperation among individuals with repeated interactions. A subset of zero-determinant strategies, extortion strategy, can let an extortioner’s surplus exceed her oppone...
Real network data is often incomplete and noisy, so link prediction algorithms and spurious link identification algorithms can be applied. Thus far, a general method for transforming network organizing mechanisms into link prediction algorithms has been lacking. Here we use an algorithmic framework where a network’s probability is calculated according to a pr...