Yueran Zu’s research while affiliated with Beihang University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (9)


Adaptive Beam Search Decoding for Discrete Keyphrase Generation
  • Article

May 2021

·

6 Reads

·

6 Citations

Proceedings of the AAAI Conference on Artificial Intelligence

Xiaoli Huang

·

Tongge Xu

·

Lvan Jiao

·

[...]

·

Youmin Zhang

Keyphrase Generation compresses a document into some highly-summative phrases, which is an important task in natural language processing. Most state-of-the-art adopt greedy search or beam search decoding methods. These two decoding methods generate a large number of duplicated keyphrases and are time-consuming. Moreover, beam search only predicts a fixed number of keyphrases for different documents. In this paper, we propose an adaptive generation model-AdaGM, which is mainly inspired by the importance of the first words in keyphrase generation. In AdaGM, a novel reset state training mechanism is proposed to maximize the difference in the predicted first words. To ensure the discreteness and get an appropriate number of keyphrases according to the content of the document adaptively, we equip beam search with a highly effective filter mechanism. Experiments on five public datasets demonstrate the proposed model can generate marginally less duplicated and more accurate keyphrases. The codes of AdaGM are available at: https://github.com/huangxiaolist/adaGM.




Semantic and Morphological Information Guided Chinese Text Classification

December 2019

·

53 Reads

Lecture Notes in Computer Science

Recently proposed models such as BERT, perform well in many text processing tasks. They get context-sensitive features, which is a good semantic for word sense disambiguation, through deeper layer and a large number of texts. But, for Chinese text classification, majority of datasets are crawled from social networking sites, these datasets are semantically complex and variable. How much data is needed to pre-train these models in order for them to grasp semantic features and understand context is a question. In this paper, we propose a novel shallow layer language model, which uses sememe information to guide model to grasp semantic information without a large number of pre-trained data. Then, we use the Chinese character representations generated from this model to do text classification. Furthermore, in order to make Chinese as easy to initialize as English, we employ convolution neural networks over Chinese strokes to get Chinese character structure initialization for our model. This model pre-trains on a part of the Chinese Wikipedia dataset, and we use the representations generated by this pre-trained model to do text classification. Experiments on text classification datasets show our model outperforms other state-of-arts models by a large margin. Also, our model is superior in terms of interpretability due to the introduction of semantic and morphological information.


Comparison of state-of-art optical flow estimation methods. Our method performs well in boundary preserving for fast-moving objects(like the face structure of the person and the bar of the arrow). The area under sleeve is too occluded to match well
Small objects from Middlebury test dataset. The first and second columns are the input image I1and image I2. The third column shows the results of the CPM-Flow [18] and the last column shows the flow results of our method (CAM). The first row shows the edge around hands are better preserved. The second row denotes that the head contour is better. And the third row presents that CAM can get the optical flow of the right fast moving foot of the girl
Observations of this paper, the first row is ground truth, the second and three rows are results of CPM [18] and Flowfiled [1] methods respectively, the third row is the end point error maps between the ground truth (GT) and CPM results. The error values are bigger in brighter areas. The errors are obvious around edges and small fast moving objects(like leg and hand in dashed red circle)
Overview of the proposed CAM algorithm. Two adjacent frames I1 and I2 are the input images. The red box in I1 is the current pixel, the blue boxes are the neighbors of it. The candidates which are more similar with the current one are chosen (the red and two pink ones), and the dashed red circles means the search scope. During the matching, different features are selected
The red bounding box in I1is the patch to be matched, in I2the green bounding boxes are ground truth, the yellow and carmine boxes are the candidates found by sift descriptor. The magnified boxes show that it is not visual similar for the optical flow correspondence

+5

Context-adaptive matching for optical flow
  • Article
  • Publisher preview available

January 2019

·

363 Reads

·

1 Citation

Modern sparse-to-dense optical flow estimation algorithms usually achieve state-of-art performance. Those algorithms need two steps: matching and interpolation. Matching is often unreliable for very large displacement optical flow due to illumination changes, deformations and occlusion etc. Moreover, conspicuous errors around motion discontinuities still keep serious as most methods consider edge only at interpolation step. The context-adaptive matching (CAM) is proposed for optical flow which is better at large displacement and edge preserving. The CAM is selective in feature extraction, adaptive in flow propagation and search radius adjusting. Selective features are proposed to consider edge preserving in matching step. Except for the usually used SIFT descriptor, the local directional pattern flow (LDPF) is introduced to keep more edge structure, and the oriented fast and rotated brief (ORB) is utilized to select out several most similar candidates. Unlike coarse-to-fine matching, which proposed a propagation step with only neighbors, we propose adaptive propagation to extend the matching candidates in order to improve the possibility of getting right correspondences. Furthermore, guided by prior knowledge and taking advantage of upper layers results, adaptive radius instead of constrained radius are proposed at finer layers. The CAM interpolated by EpicFlow is fast and robust for large displacements especially for fast moving objects and also preserves the edge structure well. Extensive experiments show that our algorithm is on par with the state-of-art optical flow methods on MPI-Sintel, KITTI and Middlebury.

View access options



A Combined Local-Global Match for Optical Flow: 13th Conference on Image and Graphics Technologies and Applications, IGTA 2018, Beijing, China, April 8–10, 2018, Revised Selected Papers

August 2018

·

15 Reads

Communications in Computer and Information Science

Optical flow estimation is still an open question in computer vision. Matching is the initialization of the final optical flow results. A good matching is important for the flow. In this paper, a combined local-global matching method is proposed. The local matching method and the global method are integrated together to make a trade-off between the large displacement and local consistency of optical flow. Extensive experiments on state-of-art challenging datasets MPI-Sintel show that the proposed method is efficient and effective.


Citations (4)


... [38] proposed separate mechanisms to deal with present and absent keyphrase generation. [39] presented an AdaGM method to increase the discreteness of keyphrase generation, and [40] proposed an one2set method for generating diverse keyphrases as a set. Other works, including [41,42], and [43], focused on jointly learning extraction and generation for keyphrase prediction. ...

Reference:

ChatGPT vs state-of-the-art models: a benchmarking study in keyphrase generation task
Adaptive Beam Search Decoding for Discrete Keyphrase Generation
  • Citing Article
  • May 2021

Proceedings of the AAAI Conference on Artificial Intelligence

... Unlike English, the Chinese language exhibits a distinctive characteristic in which the demarcation between words lacks clarity, along with an intricate grammatical structure and numerous synonyms [1,3,7]. To avoid word segmentation errors, some approaches [8,9] perform Chinese NER directly at the character level. ...

An Encoding Strategy Based Word-Character
  • Citing Conference Paper
  • January 2019

... Their method performs a competitive result on some public databases. Moreover, Zu et al. [53] adopted a context-adaptive matching scheme to improve the accuracy of the matching result under illumination changes, deformations and occlusions. Furthermore, Zhang et al. [54] constructed a large displacement flow field estimation approach by using similarity transformation based dense correspondence. ...

Context-adaptive matching for optical flow