A preview of this full-text is provided by Springer Nature.
Content available from Neural Processing Letters
This content is subject to copyright. Terms and conditions apply.
Neural Processing Letters (2022) 54:101–123
https://doi.org/10.1007/s11063-021-10622-7
Proposal-Based Graph Attention Networks for Workflow
Detection
Min Zhang1,2 ·Haiyang Hu1·Zhongjin Li1·Jie Chen1
Accepted: 5 August 2021 / Published online: 13 August 2021
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021
Abstract
In the process of “Industry 4.0”, video analysis plays a vital role in a variety of industrial
applications. Video-based action detection has obtained promising performance in computer
vision community. However, in complex factory environment, how to detect workflow of
both machines and workers in production process is not well resolved. To solve this issue,
we propose a generic proposal based Graph Attention Networks for workflow detection.
Specifically, an efficient and effective action proposal method is firstly employed to generate
workflow proposals. Then, these proposals and their relations are exploited for proposal graph
construction. Here, two types of relationships are considered for identifying the workflow
phases, which are contextual and surrounding relations to capture context information and
characterize the correlations between different workflow instances. To improve the recog-
nition accuracy, within-category and between-category attention are incorporated to learn
long-range and dynamic dependencies respectively. Thus, the capability of feature represen-
tation for workflow detection can be greatly enhanced. Experimental results verify that the
proposed approach is considerably improved upon the state-of-the-arts on THUMOS’14 and
a practical workflow dataset, achieving 6.7% and 3.9% absolute improvement compared to
the advanced GTAN detector at tIoU threshold 0.4, respectively. Moreover, augmentation
experiments are carried out on ActivityNet1.3 to prove the effectiveness of performance
improvement by modeling workflow proposal relationships.
Keywords Workflow detection ·Graph attention networks ·Temporal action localization
1 Introduction
With the prominent achievement of deep learning, a growing number of cameras are installed
for intelligent monitoring. Many practical applications require cameras to record scene videos
for activity detection in real-time, such as smart surveillance [1], autonomous driving [2]and
human behavior analysis [3]. In large scale factory, the essential ingredients of production
BHaiyang Hu
huhaiyang@hdu.edu.cn
1School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
2Department of Design and Art, Zhejiang Industry Polytechnic College, Shaoxing, China
123
Content courtesy of Springer Nature, terms of use apply. Rights reserved.