Wen Zheng’s research while affiliated with China University of Mining and Technology - Beijing and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (26)


基于对齐遗忘机制的信息不平衡图像翻译
  • Article

November 2022

·

12 Reads

Scientia Sinica Informationis

·

·

Qiang Li

·

[...]

·

Yongjin Liu

Snowflake Point Deconvolution for Point Cloud Completion and Generation With Skip-Transformer

October 2022

·

9 Reads

·

72 Citations

IEEE Transactions on Pattern Analysis and Machine Intelligence

Most existing point cloud completion methods suffer from the discrete nature of point clouds and the unstructured prediction of points in local regions, which makes it difficult to reveal fine local geometric details. To resolve this issue, we propose SnowflakeNet with snowflake point deconvolution (SPD) to generate complete point clouds. SPD models the generation of point clouds as the snowflake-like growth of points, where child points are generated progressively by splitting their parent points after each SPD. Our insight into the detailed geometry is to introduce a skip-transformer in the SPD to learn the point splitting patterns that can best fit the local regions. The skip-transformer leverages attention mechanism to summarize the splitting patterns used in the previous SPD layer to produce the splitting in the current layer. The locally compact and structured point clouds generated by SPD precisely reveal the structural characteristics of the 3D shape in local patches, which enables us to predict highly detailed geometries. Moreover, since SPD is a general operation that is not limited to completion, we explore its applications in other generative tasks, including point cloud auto-encoding, generation, single image reconstruction, and upsampling. Our experimental results outperform state-of-the-art methods under widely used benchmarks.



Improving Robustness for Pose Estimation via Stable Heatmap Regression

April 2022

·

9 Reads

·

3 Citations

Neurocomputing

Deep learning methods have achieved excellent performance in pose estimation, but the lack of robustness causes the keypoints to change drastically between similar images. In view of this problem, a stable heatmap regression method is proposed to alleviate network vulnerability to small perturbations. We utilize the correlation between different rows and columns in a heatmap to alleviate the multi-peaks problem, and design a highly differentiated heatmap regression to make a keypoint discriminative from surrounding points. A maximum stability training loss is used to simplify the optimization difficulty when minimizing the prediction gap of two similar images. The proposed method achieves a significant advance in robustness over state-of-the-art approaches on four benchmark datasets and maintains high performance.


PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-step Point Moving Paths

March 2022

·

11 Reads

·

134 Citations

IEEE Transactions on Pattern Analysis and Machine Intelligence

Point cloud completion concerns to predict missing part for incomplete 3D shapes. A common strategy is to generate complete shape according to incomplete input. However, unordered nature of point clouds will degrade generation of high-quality 3D shapes, as detailed topology and structure of unordered points are hard to be captured during the generative process using an extracted latent code. We address this problem by formulating completion as point cloud deformation process. Specifically, we design a novel neural network, named PMP-Net++, to mimic behavior of an earth mover. It moves each point of incomplete input to obtain a complete point cloud, where total distance of point moving paths (PMPs) should be the shortest. Therefore, PMP-Net++ predicts unique PMP for each point according to constraint of point moving distances. The network learns a strict and unique correspondence on point-level, and thus improves quality of predicted complete shape. Moreover, since moving points heavily relies on per-point features learned by network, we further introduce a transformer-enhanced representation learning network, which significantly improves completion performance of PMP-Net++. We conduct comprehensive experiments in shape completion, and further explore application on point cloud up-sampling, which demonstrate non-trivial improvement of PMP-Net++ over state-of-the-art point cloud completion/up-sampling methods.


PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-step Point Moving Paths
  • Preprint
  • File available

February 2022

·

22 Reads

Point cloud completion concerns to predict missing part for incomplete 3D shapes. A common strategy is to generate complete shape according to incomplete input. However, unordered nature of point clouds will degrade generation of high-quality 3D shapes, as detailed topology and structure of unordered points are hard to be captured during the generative process using an extracted latent code. We address this problem by formulating completion as point cloud deformation process. Specifically, we design a novel neural network, named PMP-Net++, to mimic behavior of an earth mover. It moves each point of incomplete input to obtain a complete point cloud, where total distance of point moving paths (PMPs) should be the shortest. Therefore, PMP-Net++ predicts unique PMP for each point according to constraint of point moving distances. The network learns a strict and unique correspondence on point-level, and thus improves quality of predicted complete shape. Moreover, since moving points heavily relies on per-point features learned by network, we further introduce a transformer-enhanced representation learning network, which significantly improves completion performance of PMP-Net++. We conduct comprehensive experiments in shape completion, and further explore application on point cloud up-sampling, which demonstrate non-trivial improvement of PMP-Net++ over state-of-the-art point cloud completion/up-sampling methods.

Download

Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-Transformer

February 2022

·

67 Reads

Most existing point cloud completion methods suffered from discrete nature of point clouds and unstructured prediction of points in local regions, which makes it hard to reveal fine local geometric details. To resolve this issue, we propose SnowflakeNet with Snowflake Point Deconvolution (SPD) to generate the complete point clouds. SPD models the generation of complete point clouds as the snowflake-like growth of points, where the child points are progressively generated by splitting their parent points after each SPD. Our insight of revealing detailed geometry is to introduce skip-transformer in SPD to learn point splitting patterns which can fit local regions the best. Skip-transformer leverages attention mechanism to summarize the splitting patterns used in previous SPD layer to produce the splitting in current SPD layer. The locally compact and structured point clouds generated by SPD precisely reveal the structure characteristic of 3D shape in local patches, which enables us to predict highly detailed geometries. Moreover, since SPD is a general operation, which is not limited to completion, we further explore the applications of SPD on other generative tasks, including point cloud auto-encoding, generation, single image reconstruction and upsampling. Our experimental results outperform the state-of-the-art methods under widely used benchmarks.


Review of light field technologies

December 2021

·

329 Reads

·

28 Citations

Visual Computing for Industry Biomedicine and Art

Light fields are vector functions that map the geometry of light rays to the corresponding plenoptic attributes. They describe the holographic information of scenes by representing the amount of light flowing in every direction through every point in space. The physical concept of light fields was first proposed in 1936, and light fields are becoming increasingly important in the field of computer graphics, especially with the fast growth of computing capacity as well as network bandwidth. In this article, light field imaging is reviewed from the following aspects with an emphasis on the achievements of the past five years: (1) depth estimation, (2) content editing, (3) image quality, (4) scene reconstruction and view synthesis, and (5) industrial products because the technologies of lights fields also intersect with industrial applications. State-of-the-art research has focused on light field acquisition, manipulation, and display. In addition, the research has extended from the laboratory to industry. According to these achievements and challenges, in the near future, the applications of light fields could offer more portability, accessibility, compatibility, and ability to visualize the world.


BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation

October 2021

·

397 Reads

Generative Adversarial Networks (GANs) have made a dramatic leap in high-fidelity image synthesis and stylized face generation. Recently, a layer-swapping mechanism has been developed to improve the stylization performance. However, this method is incapable of fitting arbitrary styles in a single model and requires hundreds of style-consistent training images for each style. To address the above issues, we propose BlendGAN for arbitrary stylized face generation by leveraging a flexible blending strategy and a generic artistic dataset. Specifically, we first train a self-supervised style encoder on the generic artistic dataset to extract the representations of arbitrary styles. In addition, a weighted blending module (WBM) is proposed to blend face and style representations implicitly and control the arbitrary stylization effect. By doing so, BlendGAN can gracefully fit arbitrary styles in a unified model while avoiding case-by-case preparation of style-consistent training images. To this end, we also present a novel large-scale artistic face dataset AAHQ. Extensive experiments demonstrate that BlendGAN outperforms state-of-the-art methods in terms of visual quality and style diversity for both latent-guided and reference-guided stylized face synthesis.



Citations (12)


... We identify two key challenges in this task: First, mmWave point clouds exhibit inter-frame heterogeneity, in contrast to the more consistent data from LiDAR or RGB-D sensors. Second, conventional point cloud completion methods are primarily designed for static objects or autonomous driving scenes [15], [21], whereas our focus is on dynamic human bodies, which necessitates accounting for temporal changes and motion. To address these issues, we propose a multi-stage mmWave point cloud enhancement method that leverages 2D human mask information from single-view images as supervision during the training phase. ...

Reference:

mmDEAR: mmWave Point Cloud Density Enhancement for Accurate Human Body Reconstruction
Snowflake Point Deconvolution for Point Cloud Completion and Generation With Skip-Transformer
  • Citing Article
  • October 2022

IEEE Transactions on Pattern Analysis and Machine Intelligence

... If the chosen regions are not representative or insufficient to capture the image's key features, it could adversely affect the learning outcomes. Considering that a set of pixel-wise features contains more semantic and structure information, set similarity (SetSim) [20] generalizes pixelwise similarity learning to set-wise similarity. This method effectively improves robustness through set similarity between views, using attention features to establish corresponding sets, filtering out noise backgrounds, and addressing misleading pixel-level features and semantic inconsistencies. ...

Exploring Set Similarity for Dense Self-supervised Representation Learning
  • Citing Conference Paper
  • June 2022

... The main limitation of using manual features is that such features cannot cover complex point cloud features. Most deep learning [18][19][20] based methods for point cloud completion are grid or voxel based methods [21,22] and point based methods [23,24], due to the disordered nature of point clouds. Both gridand voxel-based methods have common shortcomings such as lack of detailed features, inability to produce high-resolution output, high memory requirements, complex models that are difficult to train, and difficulty in fine-tuning the model [25,26]. ...

PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-step Point Moving Paths
  • Citing Article
  • March 2022

IEEE Transactions on Pattern Analysis and Machine Intelligence

... Zhang and Yang's SA-Net [21] integrates a self-attention mechanism to improve the retention of local features, while Wang et al.'s SoftPoolNet [22] replaces traditional max pooling with SoftPool to preserve more fine-grained details. Xiang et al. [23] proposed SnowflakeNet, which likens point cloud generation to snowflake-like growth, enhancing the richness of the output point cloud details. However, the use of only a global attention mechanism leads to the neglect of the correlation of local details in the final result. ...

SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer
  • Citing Conference Paper
  • October 2021

... In the early stages of development of light field technology, the field was largely dominated by digital refocusing, as proposed by Ng et al. [9], and projection-based super-resolution algorithms, as reviewed by Zhou et al [10]. Shroff and Berkner then marked a significant advancement in light field deconvolution, reformulating the problem as a minimization task and leveraging an iterative nonlinear least squares algorithm for object space estimation [7]. ...

Review of light field technologies

Visual Computing for Industry Biomedicine and Art

... To demonstrate the effectiveness of ReJSHand, we conducted a comparison with other hand pose estimation methods, including I2L-MeshNet [34], CMR [35], I2UV-HandNet [34], MobRecon [8], FastViT [32], Sim-pleHand [22], and transformer-based approaches such as METRO [19], MeshGraphomer [20], FastMETRO [37], Deformer [38], as well as the method proposed by Tang et al. [36]. The results are presented in Table I. ...

Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration
  • Citing Conference Paper
  • June 2021

... To complete point clouds, prior works [4,5,15,23,36,37] have attempted to fill-in the partial point clouds using supervised, unsupervised, or self-supervised learning methods. Supervised learning rely on the presence of groundtruth and labeled input-output pairs. ...

Cycle4Completion: Unpaired Point Cloud Completion using Cycle Transformation with Missing Region Coding
  • Citing Conference Paper
  • June 2021

... Among them, PCN [49] is the first work that directly generates high-resolution complete point clouds in a coarseto-fine manner for point cloud completion. A similar generation strategy is also adopted in a series of following works [18,23,36,37,39]. Transformer [35] has also been leveraged in recent works. ...

PMP-Net: Point Cloud Completion by Learning Multi-step Point Moving Paths
  • Citing Conference Paper
  • June 2021

... Recent works, such as that by Bhat et al., propose a classification-based formulation for distance prediction. Tian et al. [45] integrated attention blocks into the decoder, and Transformer-based architectures gained traction [46], [47]. The estimation of depth is a pivotal component in understanding geometric relations within a scene. ...

Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets
  • Citing Chapter
  • November 2020

Lecture Notes in Computer Science