The study is aimed at solving the problem of large measurement errors caused by the binocular camera in traditional 3D art design, which leads to inaccurate 3D information of the target. The contour information extraction in the process of human motion pose reconstruction is easily affected by the noise in the image. Therefore, a binocular stereo vision system is built first and it integrates image acquisition, camera calibration, and image processing. The dedistortion method is used to process the image because it can reduce errors. Second, a three-dimensional human motion pose reconstruction model is implemented, the Gaussian template is used to remove the noise in the image frame, and the change detection template (CDM) is used to solve the problem of background “exposure” and “occlusion.” Finally, simulation experiments are designed to verify the system and model designed. Since the research on the application of pose estimation based on visual sensing technology in art design is still blank, such research has great significance and provides a reference for the research in the field. The literature analysis is used to expound and analyze the application of pose estimation based on visual sensing technology in visual communication design and environmental art design: (1) although the binocular stereo vision system causes some errors in the measurement, the overall error is controlled within 2% and the accuracy is high, which proves that it can be applied to the acquisition of three-dimensional information of the target in art design; (2) there is a high degree of fitting between the video sequence data created by the three-dimensional human motion pose reconstruction model designed and the real motion data, which indicates that this method has high accuracy in processing image sequences and the feasibility of applying it to human pose reconstruction in three-dimensional art design is high; (3) through the analysis of the existing literature, it is found that most of the current visual-based attitude assessment studies are carried out by using network cameras combined with computers, and the quality of the obtained images is low. The combination of binocular stereo sensor and attitude estimation technology can be applied to the design of advertising, animation, games, and packaging, making the behavior of virtual characters in animation and games more vivid. The combination provides convenience for the collection of environmental spatial information and object attitude information, the formulation of a design scheme, and real-time monitoring of construction in environmental art design. The purpose of this study is to provide an important theoretical basis for the technical upgrading of art design.
1. Introduction
Art belongs to the social superstructure, and it is an important part of people’s spiritual life after their needs of material life are met. If the economy is more prosperous, the higher living standards are and the greater demand for art is becoming [1]. The art design is a process in which artists express their inspiration, experience, and feelings through artworks and communicate with the public. The traditional art design process needs to go through tedious design steps and takes a lot of time, which cannot meet the practical needs of today’s society. In response to the problem, the research of art design combined with science and technology attracts more and more attention of people [2]. No matter what type of art design, it needs to be conveyed through vision, such as color, pattern, and text on clothing; the quality and 3D effect of animation; the composition of product packaging; and the shape and size of landscape [3]. With the development of society, people have higher and higher requirements for artistic products. The quality of 2D photos or video images taken by ordinary cameras is poor, and they are unable to meet practical needs. The equipment specially used for shooting ultra-high-quality films, television dramas, and animations are generally bulky, expensive, and not suitable for people to use in artistic design work.
Visual sensor based on visual sensing technology has a series of advantages, such as small size, low price, and long life. It can draw the surface signal of the object after computer processing and present it in front of the researchers. For example, the most popular binocular stereo vision sensor at present is widely used in three-dimensional modeling, three-dimensional measurement, intelligent monitoring, and other research fields [4]. Visual sensors have great advantages in image processing compared with ordinary cameras, but there are also some shortcomings, such as being vulnerable to complex background and color, occlusion, and irregular movement of the target [5]. Pose estimation refers to the estimation of the position and attitude of the target to be tested through the detection and tracking of key points. Combined with deep learning, 3D pose estimation can be realized without the interference of background and color [6]. However, traditional human motion pose reconstruction methods are susceptible to image noise when contour information is extracted. After the literature is reviewed, it is found that the current research on pose assessment based on visual sensing technology mainly focuses on human pose assessment and UAV (Unmanned Aerial Vehicle) pose estimation, but there is little literature on its application in art design.
Based on the above problems, a binocular stereo vision system and a 3D human motion pose reconstruction model are built first. Second, simulation experiments are designed to verify the system and model designed. Finally, the application of pose estimation based on visual sensing technology in visual communication design and environmental art design is analyzed. The purpose of this study is to provide an important theoretical basis for the technical upgrading of art design.
2. Materials and Methods
2.1. Analysis of Visual Sensing Technology
Vision is the most important feeling of human beings. Through vision, the size, color, and action of objects can be perceived to obtain information about the surrounding environment. However, human visual perception is vulnerable to emotional, physical, and light, and it has certain restrictions [7]. In recent years, with the development of science and technology, visual sensors gradually replace human beings and are used in various fields, solving the problems existing in the human visual. Here, product detection is taken as an example to compare the human vision and visual sensing technology, as shown in Table 1.
Human vision
Visual sensing technology
Accuracy
Need magnifying glass or microscope assistance and the accuracy is low
Accuracy can reach one-thousandth of an inch without physical constraints
Reproducibility
Easy to fatigue and has some subtle differences when testing identical products. The repeatability is poor
Using the same method and parameters to test the product. It will not fatigue and has good repeatability
Speed
Slow detection speed, especially when the product has a certain speed displacement
Faster and better detection of high-speed moving objects on production lines to improve production efficiency
Objectivity
The subjective judgment which is easily influenced by emotion when testing products
The test results are objective and reliable
Cost
Fatigue, illness, or other subjective factors may lead to lower productivity and higher costs
No illness, no need to stop, high efficiency, production efficiency, cost savings