
Junming HuangPrinceton University | PU · Center on Contemporary China
Junming Huang
Bachelor of Engineering
About
37
Publications
14,646
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,200
Citations
Citations since 2017
Introduction
My research interests lie in causal inference, science of science, statistics analysis and social networks. I'm an Associate Research Scientist at the Paul and Marcia Wythes Center on Contemporary China, Princeton University. I was a Postdoctoral Research Associate at Center for Complex Network Research, Northeastern University during 2016 - 2018, and an assistant researcher at University of Electronic Science and Technology of China.
Additional affiliations
April 2015 - August 2018
September 2014 - September 2019
Publications
Publications (37)
The US leadership in science and technology has greatly benefitted from immigrants from other countries, most notably from China in the recent decades. However, feeling the pressure of potential federal investigation since the 2018 launch of the China Initiative under the Trump administration, Chinese-origin scientists in the US now face higher inc...
The declaration of COVID-19 as a pandemic has largely amplified the spread of related information on social platforms, such as Twitter, Facebook and WeChat. In this work, we investigate how the disease and information co-evolve in the population. We focus on COVID-19 and its information during the period when the disease was widely spread in China,...
Past research has studied social determinants of attitudes toward foreign countries. Confounded by potential endogeneity biases due to unobserved factors or reverse causality, the causal impact of these factors on public opinion is usually difficult to establish. Using social media data, we leverage the suddenness of the COVID-19 pandemic to examin...
Do mass media influence people’s opinions of other countries? Using BERT, a deep neural network-based natural language processing model, this study analyzes a large corpus of 267,907 China-related articles published by The New York Times since 1970. The output from The New York Times is then compared to a longitudinal data set constructed from 101...
The declaration of COVID-19 as a pandemic has largely amplified the spread of related information on social media, such as Twitter, Facebook, and WeChat.Unlike the previous studies which focused on how to detect the misinformation or fake news related toCOVID-19, we investigate how the disease and information co-evolve in the population. We focus o...
Millions of people are surveyed every year regarding their attitudes toward various topics. Together these surveys have produced a large corps of data that document how people think collectively toward various aspects of contemporary social life.The wealth of the attitude surveys has promoted scholars to move beyond the single-survey analysis. Howe...
Modern science is dominated by scientific productions from teams. A recent finding shows that teams of both large and small sizes are essential in research, prompting us to analyze the extent to which a country’s scientific work is carried out by big or small teams. Here, using over 26 million publications from Web of Science, we find that China’s...
Do mass media influence people's opinion of other countries? Using BERT, a deep neural network-based natural language processing model, we analyze a large corpus of 267,907 China-related articles published by The New York Times since 1970. We then compare our output from The New York Times to a longitudinal data set constructed from 101 cross-secti...
Modern science is dominated by scientific productions from teams. Large teams have demonstrated a clear advantage over small teams in applying for research funding, performing complicated research tasks and producing research works with high impact. Recent research, however, shows that both large and small teams have their own merits. Small teams t...
Significance
Empirical evidence suggests significant gender differences in the total productivity and impact of academic careers across science, technology, engineering, and mathematics (STEM) fields. Paradoxically, the increase in the number of women academics over the past 60 years has increased these gender differences. Yet, we find that men and...
There is extensive, yet fragmented, evidence of gender differences in academia suggesting that women are under-represented in most scientific disciplines, publish fewer articles throughout a career, and their work acquires fewer citations. Here, we offer a comprehensive picture of longitudinal gender discrepancies in performance through a bibliomet...
The heterogeneous nature of human behaviors contributes to the complexity of human-activated systems. Empirical observations and theoretical models reveal the temporal and spatial heterogeneity of many aspects of human behaviors, including social connections and geographic movements, while little is known whether and how human individual's behavior...
Reading remains the preferred leisure activity for most individuals, continuing to offer a unique path to knowledge and learning. As such, books remain an important cultural product, consumed widely. Yet, while over 3 million books are published each year, very few are read widely and less than 500 make it to the New York Times bestseller lists. An...
Micro-blogging systems have become one of the most important ways for information sharing. Network structure and users’ interactions such as forwarding behaviors have aroused considerable research attention, while mention, as a key feature in micro-blogging platforms which can improve the visibility of a message and direct it to a particular user b...
Response ratio r(k) versus the average degree of be-mentioned users 〈d(um)〉.
We classify all messages into two categories: 〈d(um)〉 ∈ [1, 100), and (〈d(um)〉 ∈ [10000, +∞). We observe a peak in response ratio at 2 mentions and then a slow drop in both categories. Moreover, we find that the response ratio of 〈d(um)〉 ∈ [1, 100) is higher than that of (...
The variation of response probability for different kinds of messages.
(A) response probability p(k) versus mention count k for messages with and without embedded Events. (B) response probability p(k) versus mention count k for messages with and without embedded URLs.
(EPS)
Human behaviors exhibit ubiquitous correlations in many aspects, such as individual and collective levels, temporal and spatial dimensions, content, social and geographical layers. With rich Internet data of online behaviors becoming available, it attracts academic interests to explore human mobility similarity from the perspective of social networ...
In recent years, tagging system has become a building block o summarize the content of items for further functions like retrieval or personalized recommendation in various web applications. One nontrivial requirement is to precisely deliver a list of suitable items when users interact with the systems via inputing a specific tag (i.e. a query term)...
The Ebola virus in West Africa has infected almost 30,000 and killed over 11,000 people. Recent models of Ebola Virus Disease (EVD) have often made assumptions about how the disease spreads, such as uniform transmissibility and homogeneous mixing within a population. In this paper, we test whether these assumptions are necessarily correct, and offe...
Previous works indicated that pairwise methods are state-of- the-art approaches to fit users’ taste from implicit feedback. In this paper, we argue that constructing item pairwise samples for a fixed user is insufficient, because taste differences between two users with respect to a same item can not be explicitly distinguished. Moreover, the rank...
Data sparsity is a long-standing challenge for recommender systems based on collaborative filtering. A promising solution for this problem is multi-context recommendation, i.e., leveraging users' explicit or implicit feedback from multiple contexts. In multi-context recommendation, various types of interactions between entities (users and items) ar...
Interactions between search and recommendation have recently attracted
significant attention, and several studies have shown that many potential
applications involve with a joint problem of producing recommendations to users
with respect to a given query, termed $Collaborative$ $Retrieval$ (CR).
Successful algorithms designed for CR should be poten...
Link prediction plays an important role in understanding intrinsic evolving
mechanisms of networks. With the belief that the likelihood of the existence of
a link between two nodes is strongly related with their similarity, many
methods have been proposed to calculate node similarity based on node
attributes and/or topological structures. Among a l...
Influence maximization, fundamental for word-of-mouth marketing and viral
marketing, aims to find a set of seed nodes maximizing influence spread on
social network. Early methods mainly fall into two paradigms with certain
benefits and drawbacks: (1)Greedy algorithms, selecting seed nodes one by one,
give a guaranteed accuracy relying on the accura...
For the study of information propagations, one fundamental problem is
uncovering universal laws governing the process of information propagations.
This problem, in microscopic perspective, is formulated as estimating the
propagation probability that one piece of information is propagated by an
individual to another. A propagation probability depend...
Food occupies a central position in every culture and it is therefore of great interest to understand the evolution of food culture. The advent of the World Wide Web and online recipe repositories have begun to provide unprecedented opportunities for data-driven, quantitative study of food culture. Here we harness an online database documenting rec...
Predicting the popularity of content is important for both the host and users
of social media sites. The challenge of this problem comes from the inequality
of the popularity of con- tent. Existing methods for popularity prediction are
mainly based on the quality of content, the interface of social media site to
highlight contents, and the collecti...
The rapid development of Internet technology enables human explore the web
and record the traces of online activities. From the analysis of these
large-scale data sets (i.e. traces), we can get insights about dynamic behavior
of human activity. In this letter, the scaling behavior and complexity of human
activity in the e-commerce, such as music, b...
Influence maximization, defined as a problem of finding a set of seed nodes to trigger a maximized spread of influence, is crucial to viral marketing on social networks. For practical viral marketing on large scale social networks, it is required that influence maximization algorithms should have both guaranteed accuracy and high scalability. Howev...
As an important tool for information filtering in the era of socialized web,
recommender systems have witnessed rapid development in the last decade. As
benefited from the better interpretability, neighborhood-based collaborative
filtering techniques, such as item-based collaborative filtering adopted by
Amazon, have gained a great success in many...
Word-of-mouth has proven an effective strategy for promot- ing products through social relations. Particularly, exist- ing studies have convincingly demonstrated that word-of- mouth recommendations can boost users' prior expectation and hence encourage them to adopt a certain innovation, such as buying a book or watching a movie. However, less atte...
Social recommendation, that an individual recommends an item to another, has gained popularity and success in web applications such as online sharing and shopping services. It is largely different from a traditional recommendation where an automatic system recommends an item to a user. In a social recommendation, the interpersonal influence plays a...
Projects
Project (1)