Kai Jiang’s research while affiliated with Shanghai University of Engineering Science and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (2)


A cell can be viewed as a directed acyclic graph (DAG), where each intermediate node is a latent representation and each directed edge (i,j) in the DAG represents the application of an operation o between two nodes while the different colors of the edges represent different operations
The macro-structure of the network that contains 8 cells, where we set Reduction Cell in the 3rd, 5th, 7th of cells
Triplet loss is described by a 3-tuple (a,p,n), where a indicates an anchor sample, p is a positive sample that has the same ID as a, and n indicates a negative sample. Before learning, anchor sample a and positive sample p are a pair of positive samples with huge distance during the feature space while negative sample n is near to anchor sample a. The features of the pictures with the same ID gradually form clusters in the feature space by reducing the distance between a and p and increasing the distance between a and n after training with the triplet loss
An instance of the architectures that were searched by using the DARTS method during our experiments, which contains lots of skip-connection operations
Architecture parameters α is considered as the edges of the DAG. At each decision epoch, we then decide the candidate’s pool of the edges according to the topological order and then choose one edge (i⁺,j⁺) from the candidates by using the Decision Criterion. Using greedy optimal choice we then select the edge that determines the operation by replacing ô(i.j)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$ \hat {o}^{(i.j)} $\end{document} with o(i.j)=argmaxo∈Oαo(i+,j+)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$ {o}^{(i.j)} = argmax_{o \in O}\alpha ^{(i^{+},j^{+})}_{o} $\end{document}. After making the greedy decision, we prune the unchosen weights from w and remove α(i+,j+)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$ \alpha ^{(i^{+},j^{+})} $\end{document} from α

+1

TGAS-ReID: Efficient architecture search for person re-identification via greedy decisions with topological order
  • Article
  • Publisher preview available

July 2022

·

43 Reads

·

3 Citations

Applied Intelligence

Shengbo Chen

·

Kai Jiang

·

Xianrui Liu

·

[...]

·

Person Re-Identification (Re-ID) technology is being developed rapidly due to the successful application of deep convolutional neural networks. However, the prevailing Re-ID models are usually built upon manually design backbones. In this paper, we propose using the TGAS-ReID which is automatically designed convolutional network backbones for Re-ID to substitute the backbones originally designed for classification such as ResNet and VGG. In the Re-ID tasks to search for a cell structure, greedy decisions are made instead of deriving the architecture after comprehensive training. In other words, at each decision epoch, according to the topological order, we first decide the candidates’ pool of the edges to progressively reduce the coupling of the internal nodes of the DAG. An edge is then selected based on edge importance, edge certainty, and selection stability. We then make a greedy optimal choice for the selected edge and prune the relevant parameters. To further improve the backbone’s representation capability of the features, we further introduce the triplet loss with batch hard mining as the retrieval loss. Extensive experiments demonstrate that the searched structure of the backbones reaches a performance level close to the previous work with a 20.8% shorter searching time. The proposed method also prevents the final CNNs network from suffering the well-known performance collapse by avoiding aggregation of the skip-connections.

View access options

Figure 2. An overview of DARTS. First, initialize a supernet with random network weights w and unknown operations on the edges. Then, there is continuous relaxation of the search space by placing a mixture of candidate operations and jointly learning architecture α and the network weights w by finding α * that minimizes the validation loss. Lastly, derive the final architecture based on the learned α.
We investigated the effect of each component of the KDAS-ReID. All candidates are trained in the same strategy. In addition, no candidates used pre-training on ImageNet except the teacher model. We use the official ResNet-50 network which pre-trains on the ImageNet as the backbone of the teacher model. We have boldfaced the best results in the table.
The current state-of-the-art model is compared with our model on the Market-1501 dataset. We compare each model in terms of parameters, R-1 (Rank-1) accuracy, and mAP. We have boldfaced the best results in the table.
Comparison of accuracy and mAP with the state-of-art reID models on MSMT17. We have boldfaced the best results in the table.
KDAS-ReID: Architecture Search for Person Re-Identification via Distilled Knowledge with Dynamic Temperature

April 2021

·

304 Reads

·

3 Citations

Person re-Identification(Re-ID) based on deep convolutional neural networks (CNNs) achieves remarkable success with its fast speed. However, prevailing Re-ID models are usually built upon backbones that manually design for classification. In order to automatically design an effective Re-ID architecture, we propose a pedestrian re-identification algorithm based on knowledge distillation, called KDAS-ReID. When the knowledge of the teacher model is transferred to the student model, the importance of knowledge in the teacher model will gradually decrease with the improvement of the performance of the student model. Therefore, instead of applying the distillation loss function directly, we consider using dynamic temperatures during the search stage and training stage. Specifically, we start searching and training at a high temperature and gradually reduce the temperature to 1 so that the student model can better learn from the teacher model through soft targets. Extensive experiments demonstrate that KDAS-ReID performs not only better than other state-of-the-art Re-ID models on three benchmarks, but also better than the teacher model based on the ResNet-50 backbone.

Citations (2)


... Their work emphasizes the importance of learning shared information between different modalities in existing representation learning methods, which primarily aim to improve feature extraction. Chen et al. [50] introduced a NAS approach for person reID task. This method involved searching for an optimal cell structure by making greedy decisions during the search process. ...

Reference:

MNASreID: grasshopper optimization based neural architecture search for motorcycle re-identification
TGAS-ReID: Efficient architecture search for person re-identification via greedy decisions with topological order

Applied Intelligence

... Zhang et al. [33] proposed the use of neural architecture search (NAS) methods to automatically generate high-performance ReID networks that address the limitations of fixed architectures and manual designs. Lei et al. [34] proposed KDAS-ReID, a method that combines knowledge distillation with neural architecture search for pedestrian re-identification. By gradually transferring knowledge from a teacher model to a student model, the performance of the student model improved, while the importance of the teacher model's knowledge decreased. ...

KDAS-ReID: Architecture Search for Person Re-Identification via Distilled Knowledge with Dynamic Temperature