About
153
Publications
34,269
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,387
Citations
Introduction
I am a professor at ischool of NJUST. I received his PhD degree of Information Science from Nanjing University, China. My current research interests include scientific text mining, knowledge entity extraction and evaluation, social media mining. I serves as Editorial Board Member and Managing Guest Editor for 10 international journals and PC members of several international conferences in fields of natural language process and scientometrics. My website is : https://chengzhizhang.github.io/
Additional affiliations
March 2010 - March 2011
October 2013 - December 2013
July 2007 - present
Publications
Publications (153)
Since the 1990s, advancements in big data and information technology have increasingly driven data-centric research in the field of Library and Information Science (LIS). To assess the influence of this data-driven research paradigm on the LIS discipline, this study conducts a fine-grained analysis to uncover the evolutionary trends of research met...
Up to this point, keyword extraction task typically relies solely on textual data. Neglecting visual details and audio features from image and audio modalities leads to deficiencies in information richness and overlooks potential correlations, thereby constraining the model's ability to learn representations of the data and the accuracy of model pr...
Keywords facilitate rapid comprehension of academic papers for scholars, enhancing research efficiency. As some papers lack author‐assigned keywords, automated keyword extraction becomes crucial. Addressing the limited utilization of external paper information beyond titles and abstracts in prior studies, this research proposed leveraging the highl...
Objective
This paper aims to understand vaccine hesitancy in the post-epidemic era by analyzing texts related to vaccine reviews and public attitudes toward three prominent vaccine brands: Sinovac, AstraZeneca, and Pfizer, and exploring the relationship of vaccine hesitancy with the prevalence of epidemics in different regions.
Methods
We collecte...
Purpose
This study aims to analyze the distribution of novelty among scholarly papers in the field of library and information science (LIS) in China. Specifically, this study explores the distribution of novelty of papers in various journals, research topics and different periods. It is possible to understand the characteristics of LIS research in...
The Joint Workshop of the 5th Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE2024; https://eeke-workshop.github.io/) and the 4th AI + Informetrics (AII2024; https://ai-informetrics.github.io/) was held in Changchun, China and online, co-located with the iConference2024. The two workshop series are designed to activel...
Purpose
The composition of author teams is a significant factor affecting the novelty of academic papers. Existing research lacks studies focusing on institutional types and measures of novelty remained at a general level, making it difficult to analyse the types of novelty in papers and to provide a detailed explanation of novelty. This study aims...
Peer review is a critical process used in academia to assess the quality and validity of research articles. Top-tier conferences in the field of artificial intelligence (e.g. ICLR and ACL et al.) require reviewers to provide confidence scores to ensure the reliability of their review reports. However, existing studies on confidence scores have negl...
Introduction
Mental health issues bring a heavy burden to individuals and societies around the world. Recently, the large language model ChatGPT has demonstrated potential in depression intervention. The primary objective of this study was to ascertain the viability of ChatGPT as a tool for aiding counselors in their interactions with patients whil...
Billions of scientific papers lead to the need to identify essential parts from the massive text. Scientific research is an activity from putting forward problems to using methods. To learn the main idea from scientific papers, we focus on extracting problem and method sentences. Annotating sentences within scientific papers is labor-intensive, res...
Billions of scientific papers lead to the need to identify essential parts of the massive text. Scientific research is an activity from putting forward problems to using methods. To learn the main idea from scientific papers, we focus on extracting problem and method sentences. Annotating sentences in scientific papers is labor-intensive, resulting...
Purpose
In the era of artificial intelligence (AI), algorithms have gained unprecedented importance. Scientific studies have shown that algorithms are frequently mentioned in papers, making mention frequency a classical indicator of their popularity and influence. However, contemporary methods for evaluating influence tend to focus solely on indivi...
The present study analyzed over 26,000 research articles published between 1991 and 2021 in twenty-one major LIS (Library and Information Science) journals, using the machine learning (ML) approach to categorize the research methods used by LIS scholars. The findings of this study are significant. Firstly, there has been a shift in the research str...
The Joint Workshop of the 4th Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE2023; https://eeke-workshop.github.io/) and the 3rd AI + Informetrics (AII2023; https://ai-informetrics.github.io/) was held at Santa Fe, New Mexico, USA and online, co-located with the ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2...
Objective:
The goal of this study is to use summary generation and topic modeling to identify factors contributing to vaccine attitudes for three different vaccine brands, with the aim of generalizing these factors across different regions.
Methods:
A total of 5562 tweets about three vaccine brands (Sinovac, AstraZeneca, and Pfizer) were collect...
This paper explores the effect of publishing a data paper in the Open Access journal Data in Brief (DIB) on the citation counts of the related research paper. Using regression analysis, citation content analysis and a survey method, we investigate whether research papers with a related data paper have higher citation counts and the potential reason...
The global development of Library and Information Science (LIS) is influenced by various factors such as the economy, society, culture, discipline, tradition, and more. Consequently, the research methods of LIS vary greatly among countries. To better understand these differences, we conducted a study of 5281 research papers from 81 countries publis...
The increasingly mature artificial intelligence technologies, such as big data, deep learning, and natural language processing, provide technical support for research on automatic text understanding and bring development opportunities for innovative measurement of scientific communication. Innovation measurement in scientific communication is a cha...
Future work sentences (FWS) are the particular sentences in academic papers that contain the author's description of their proposed follow-up research direction. This paper presents methods to automatically extract FWS from academic papers and classify them according to the different future directions embodied in the paper's content. FWS recognitio...
As the unique academic culture in Chinese philosophy and social sciences, researches on method framework provide an opportunity for understanding the thinking model and value orientation of the ancient eastern civilization. The field of information science has achieved fruitful results in the method framework research closely related to the unique...
Peer review can evaluate the quality of academic articles involving the evaluation of some aspects, e.g., methodology, experiment, and aspects are usually the critical sections, substance and properties of the article concerned by reviewers. Previous research on content mining of peer review did not distinguish the round of reviews. Detecting diffe...
[Purpose] To understand the meaning of a sentence, humans can focus on important words in the sentence, which reflects our eyes staying on each word in different gaze time or times. Thus, some studies utilize eye-tracking values to optimize the attention mechanism in deep learning models. But these studies lack to explain the rationality of this ap...
In scientific research, the method is an indispensable means to solve scientific problems and a critical research object. With the advancement of sciences, many scientific methods are being proposed, modified, and used in academic literature. The authors describe details of the method in the abstract and body text, and key entities in academic lite...
[Purpose] To better understand the online reviews and help potential consumers, businessmen, and product manufacturers effectively obtain users' evaluation on product aspects, this paper explores the distribution regularities of user attention and sentiment toward product aspects from the temporal perspective of online reviews. [Design/methodology/...
Purpose The purpose of this paper is to explore which structures of academic articles referees would pay more attention to, what specific content referees focus on, and whether the distribution of PRC is related to the citations. Design/methodology/approach Firstly, utilizing the feature words of section title and hierarchical attention network mod...
[Purpose] The purpose of this paper is to explore which structures of academic articles referees would pay more attention to, what specific content referees focus on, and whether the distribution of PRC is related to the citations.
[Design/methodology/approach] Firstly, utilizing the feature words of section title and hierarchical attention networ...
COVID-19 has had a profound impact on the lives of all human beings. Emerging technologies have made significant contributions to the fight against the pandemic. An extensive review of the application of technology will help facilitate future research and technology development to provide better solutions for future pandemics. In contrast to the ex...
The 3rd Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE 2022) was held online at the ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2022. The goal of this workshop series (https://eekeworkshop.github.io/) is to engage the related communities in open problems in the extraction and evaluation of know...
Purpose
To understand the meaning of a sentence, humans can focus on important words in the sentence, which reflects our eyes staying on each word in different gaze time or times. Thus, some studies utilize eye-tracking values to optimize the attention mechanism in deep learning models. But these studies lack to explain the rationality of this appr...
In scientific research, the method is an indispensable means to solve scientific problems and a critical research object. With the advancement of sciences, many scientific methods are being proposed, modified, and used in academic literature. The authors describe details of the method in the abstract and body text, and key entities in academic lite...
With the development of Internet technology, the phenomenon of information overload is becoming more and more obvious. It takes a lot of time for users to obtain the information they need. However, keyphrases that summarize document information highly are helpful for users to quickly obtain and understand documents. For academic resources, most exi...
With the enrichment of literature resources, researchers are facing the growing problem of information explosion and knowledge overload. To help scholars retrieve literature and acquire knowledge successfully, clarifying the semantic structure of the content in academic literature has become the essential research question. In the research on ident...
COVID-19 has had a profound impact on the lives of all human beings. Emerging technologies have
made significant contributions to the fight against the pandemic. An extensive review of the application of tech-
nology will help facilitate future research and technology development to provide better solutions for future pan-
demics. In contrast to th...
With the enrichment of literature resources, researchers are facing the growing problem of information explosion and knowledge overload. To help scholars retrieve literature and acquire knowledge successfully, clarifying the semantic structure of the content in academic literature has become the essential research question. In the research on ident...
With the development of Internet technology, the phenomenon of information overload is becoming more and more obvious. It takes a lot of time for users to obtain the information they need. However, keyphrases that summarize document information highly are helpful for users to quickly obtain and understand documents. For academic resources, most exi...
The outbreak of coronavirus disease 2019 (COVID-19) has had a significant repercussion on the health, economy, politics and environment, making coronavirus-related issues more complicated and difficult to solve adequately by relying on a single field. Interdisciplinary research can provide an effective solution to complex issues in the related fiel...
Purpose
To better understand the online reviews and help potential consumers, businessmen and product manufacturers effectively obtain users’ evaluation on product aspects, this paper aims to explore the distribution regularities of users’ attention and sentiment on product aspects from the temporal perspective of online reviews.
Design/methodolog...
Research trend detection is an important topic for scientific researchers. Future work sentences (FWS), as direct descriptions of future research, aren't fully utilized in research trend detection. Therefore, this article uses FWS to investigate research trends of different tasks in a particular domain. Taking the conference papers in the natural l...
The surge in the number of books published makes the manual evaluation methods difficult to efficiently evaluate books. The use of books' citations and alternative evaluation metrics can assist manual evaluation and reduce the cost of evaluation. However, most existing evaluation research was based on a single evaluation source with coarse-grained...
Algorithms play an increasingly important role in scientific work, especially in data-driven research. Investigating the mention of algorithms in full-text paper helps us understand the use and development of algorithms in a specific domain. Current research on the mention of algorithms is limited to the academic papers in one language, which is ha...
Since the end of 2019, the ongoing of COVID-19 outbreak worldwide not only challenges the management capacity of governments on the public health emergency, but also tests the management capacity of governments on the public opinion and the governance capacity of dealing with social emergencies. To understand the impact on public emotion over COVID...
Multidisciplinary cooperation is now common in research since social issues inevitably involve multiple disciplines. In research articles, reference information, especially citation content, is an important representation of communication among different disciplines. Analyzing the distribution characteristics of references from different discipline...
The global spread of COVID-19 has caused pandemics to be widely discussed. This is evident in the large number of scientific articles and the amount of user-generated content on social media. This paper aims to compare academic communication and social communication about the pandemic from the perspective of communication preference differences. It...
Fine-grained sentiment analysis of social platforms like Twitter and Facebook nowadays becomes increasingly important, as it can reflect public opinions towards target entities such as politicians. Entity-Level Sentiment Analysis (ELSA) is an important fine-grained SA task, aiming to identify the sentiment over each entity mentioned in a sentence....
Citation recommendation is an important task to assist scholars in finding candidate literature to cite. Traditional studies focus on static models of recommending citations, which do not explicitly distinguish differences between papers that are caused by temporal variations. Although, some researchers have investigated chronological citation reco...
Citation recommendation is an important task to assist scholars in finding candidate literature to cite. Traditional studies focus on static models of recommending citations, which do not explicitly distinguish differences between papers that are caused by temporal variations. Although, some researchers have investigated chronological citation reco...
Multidisciplinary cooperation is now common in research since social issues inevitably involve multiple disciplines. In research articles, reference information, especially citation content, is an important representation of communication among different disciplines. Analyzing the distribution characteristics of references from different discipline...
Research on the construction of traditional information science methodology taxonomy is mostly conducted manually. From the limited corpus, researchers have attempted to summarize some of the research methodology entities into several abstract levels (generally three levels); however, they have been unable to provide a more granular hierarchy. More...
With the increasing abundance of literature resources, how to acquire knowledge elements efficiently and accurately is the key to achieving accurate literature retrieval and utilization of available literature resources. The identification of the structure function of academic documents is a fundamental work to meet the above requirements. In this...
Coronavirus disease 2019 (COVID-19) pandemic-related information are flooded on social media, and analyzing this information from an occupational perspective can help us to understand the social implications of this unprecedented disruption. In this study, using a COVID-19-related dataset collected with the Twitter IDs, we conduct topic and sentime...
During the outbreak of novel coronavirus pneumonia, the number of confirmed cases and deaths in Hubei province of China increased sharply, and the situation in Hubei was more severe than that in non-Hubei, so we do a research on psychological health status evaluation of the public in Hubei and non-Hubei areas. In this paper, we adopt textual analys...
Research on the construction of traditional information science methodology taxonomy is mostly conducted manually. From the limited corpus, researchers have attempted to summarize some of the research methodology entities into several abstract levels (generally three levels); however, they have been unable to provide a more granular hierarchy. More...
This paper mainly introduces our methods for Task 1A and Task 1B of CL-SciSumm 2020. Task 1A is to identify reference text in reference paper. Traditional machine learning models and MLP model are used. We evaluate the performances of these models and submit the final results from the optimal model. Compared with previous work, we optimize the rati...
In the era of big data, the advancement, improvement, and application of algorithms in academic research have played an important role in promoting the development of different disciplines. Academic papers in various disciplines, especially computer science, contain a large number of algorithms. Identifying the algorithms from the full-text content...
The premise of manual keyphrase annotation is to read the corresponding content of an annotated object. Intuitively, when we read, more important words will occupy a longer reading time. Hence, by leveraging human reading time, we can find the salient words in the corresponding content. However, previous studies on keyphrase extraction ignore human...
Purpose
With the growth in popularity of academic social networking sites, evaluating the quality of the academic information they contain has become increasingly important. Users' evaluations of this are based on predefined criteria, with external factors affecting how important these are seen to be. As few studies on these influences exist, this...
Citations are commonly used to measure academic impacts of scientific publications, including books. However, citation frequencies of books are single numerical evaluation metrics. It neglects details about books (e.g. contents), which may lead to the decline in comprehensiveness of evaluation results. Hence, fine-grained mining on books’ citation...
The goal of this workshop is to engage the related communities in open problems in the extraction and evaluation of knowledge entities from scientific documents. This workshop entitles this cutting-edge and cross-disciplinary direction Extraction and Evaluation of Knowledge Entity (EEKE), highlighting the development of intelligent methods for iden...
Sentences about future work (FWS) mentioned in the academic papers are very important, which contain valuable information and can provide researchers with new research topics or directions. At present, researchers' analysis of academic papers mainly fo-cuses on the content of citations, bibliographic information, etc., and little attention is paid...
As the basic work of knowledge mining and service based on full-text of articles, recognizing the categories of section in academic articles can help us to understand the function of content in different parts of the article. There is no existing a large-scale annotated corpus of section categories which can be used to classify the sections of the...
http://ceur-ws.org/Vol-2658/
EEKE 2020: 1st Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents
Compared with journal articles, books can provide broader, deeper and more comprehensive information, and often have higher expertise and academic depth. However, most researches on book assessment focus on measuring academic value of books (e.g. citations analysis) or identifying attitudes of readers (e.g. book review mining), depth and breadth re...