
Yaxin Fan- Soochow University
Yaxin Fan
- Soochow University
About
25
Publications
1,257
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
83
Citations
Introduction
Current institution
Publications
Publications (25)
Multiparty dialog discourse parsing (MDDP) aims to identify the links between pairs of utterances and recognize their discourse relations. Previous research has attempted to address data sparsity in discourse parsing through multitask learning, but these efforts often relied on manually annotated fine-grained information, limiting their practical a...
Although existing fashionable generation methods on Incomplete Utterance Rewriting (IUR) can generate coherent utterances, they often result in the inclusion of irrelevant and redundant tokens in rewritten utterances due to their inability to focus on critical tokens in dialogue context. Furthermore, the limited size of the training datasets also c...
Previous work on Incomplete Utterance Rewriting (IUR) has primarily focused on generating rewritten utterances based solely on dialogue context, ignoring the widespread phenomenon of coreference and ellipsis in dialogues. To address this issue, we propose a novel framework called TEO (\emph{Two-stage approach on Editing Operation}) for IUR, in whic...
Most previous studies on discourse parsing have utilized discriminative models to construct tree structures. However, these models tend to overlook the global perspective of the tree structure as a whole during the step-by-step top-down or bottom-up parsing process. To address this issue, we propose DP-GF, a macro Discourse Parser based on Generati...
Grammatical error correction aims to correct ungrammatical sentences automatically. Recently, some work has demonstrated the excellent capabilities of closed-source Large Language Models (LLMs, e.g., ChatGPT) in grammatical error correction. However, the potential of open-source LLMs remains unexplored. In this paper, we introduced GrammarGPT, an o...
The unparalleled performance of closed-sourced ChatGPT has sparked efforts towards its democratization, with notable strides made by leveraging real user and ChatGPT conversations, as evidenced by Vicuna. However, while current endeavors like Baize and UltraChat aim to auto-generate conversational data due to challenges in gathering human participa...
Discourse parsing aims to comprehend the structure and semantics of a document. Some previous studies have taken multiple levels of granularity methods to parse documents while disregarding the connection between granularity levels. Additionally, almost all the Chinese discourse parsing approaches concentrated on a single granularity due to lacking...
Dialogue topic shift detection is to detect whether an ongoing topic has shifted or should shift in a dialogue, which can be divided into two categories, i.e., response-known task and response-unknown task. Currently, only a few investigated the latter, because it is still a challenge to predict the topic shift without the response information. In...
Discourse functional pragmatics recognition focuses on identifying the functions of discourse paragraphs, which is a significant research direction in natural language processing. To obtain better paragraph representation and alleviate the issue of imbalanced data distribution, we propose a Chinese discourse Functional Pragmatics Recognition model...
The goal of dialogue topic shift detection is to identify whether the current topic in a conversation has changed or needs to change. Previous work focused on detecting topic shifts using pre-trained models to encode the utterance, failing to delve into the various levels of topic granularity in the dialogue and understand dialogue contents. To add...
Grammatical error correction aims to correct ungrammatical sentences automatically. Recently, some work has demonstrated the excellent capabilities of closed-source Large Language Models (LLMs, e.g., ChatGPT) in grammatical error correction. However, the potential of open-source LLMs remains unexplored. In this paper, we introduced GrammarGPT, an o...
The goal of dialogue topic shift detection is to identify whether the current topic in a conversation has changed or needs to change. Previous work focused on detecting topic shifts using pre-trained models to encode the utterance, failing to delve into the various levels of topic granularity in the dialogue and understand dialogue contents. To add...
Large Language Models (LLMs) like ChatGPT have proven a great shallow understanding of many traditional NLP tasks, such as translation, summarization, etc. However, its performance on high-level understanding, such as dialogue discourse analysis task that requires a higher level of understanding and reasoning, remains less explored. This study inve...
Dialogue topic shift detection is to detect whether an ongoing topic has shifted or should shift in a dialogue, which can be divided into two categories, i.e., response-known task and response-unknown task. Currently, only a few investigated the latter, because it is still a challenge to predict the topic shift without the response information. In...
Most existing studies construct a discourse structure tree following two popular methods: top-down or bottom-up strategy. However, they often suffered from cascading errors because they can not switch the strategy of building a structure tree to avoid mistakes caused by uncertain decision-making. Moreover, due to the different basis of top-down and...
The macro-level discourse parsing, as a fundamental task of macro discourse analysis, mainly focuses on converting a document into a hierarchical discourse tree at paragraph level. Most existing methods follow micro-level studies and suffer from the issues of semantic representation and the semantic interaction of the larger macro discourse units....
Hierarchically constructing micro (i.e., intra-sentence or inter-sentence) discourse structure trees using explicit boundaries (e.g., sentence and paragraph boundaries) has been proved to be an effective strategy. However, it is difficult to apply this strategy to document-level macro (i.e., inter-paragraph) discourse parsing, the more challenging...