In this project, a novel framework is used from the reference mentioned in the README, which successfully encodes both geometric local features and global representations to distinguish vehicle instances, optimized only by the supervision from official ID labels. Specifically, given the insight that objects in ReID share similar geometric charac…

Rubo12345/Directed-Research-on-Vehicle-Re-Identification

Directed-Research-on-Vehicle-Re-Identification

Self-Supervised Geometric Features Discovery via Interpretable Attention

This project is based on the following paper:

@inproceedings{li2021self,
  title={Self-Supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond},
  author={Li, Ming and Huang, Xinming and Zhang, Ziming},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={194--204},
  year={2021}
}

Framework Branches:

  1. Global Branch (GB): Encodes robust global feature codes from an input image. (Architecture - ResNet50)
  2. Self-Supervised Learning Branch (SLB): Performs the auxiliary self-supervised representation learning. (Architecture - ResNet18)
  3. Geometric Features Branch (GFB): By sharing its encoder with SLB, GFB extracts discriminative features from automatically discovered geometric locations without corresponding supervision. (Architecture - ResNet18, and later Conv4_x, Conv5_x of ResNet50)

Problem Setup: The input is a query image. The output is a ranked list of all gallery images, ordered by their similarity to the query image. The similarity score is computed with cosine similarity.
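The ranking step can be sketched as follows. This is a minimal, dependency-free illustration, not the project's code: it assumes each image has already been encoded into a feature vector, and the names `cosine_similarity` and `rank_gallery` as well as the toy 2-D vectors are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # assumption: treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def rank_gallery(query_feat, gallery_feats):
    """Return gallery indices sorted from most to least similar, plus the scores."""
    scores = [cosine_similarity(query_feat, g) for g in gallery_feats]
    order = sorted(range(len(gallery_feats)), key=lambda i: scores[i], reverse=True)
    return order, scores


# Toy features (hypothetical): one query vector and three gallery vectors.
query = [1.0, 0.0]
gallery = [[0.0, 1.0], [2.0, 0.1], [-1.0, 0.0]]
ranking, scores = rank_gallery(query, gallery)
print(ranking)
```

Because cosine similarity ignores vector length, the second gallery vector ranks first even though its norm differs from the query's.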

Self-supervised learning for highlighting geometric features: Self-supervised learning is equivalent to optimizing a deep network under the supervision of machine-generated pseudo labels. The pretext task used here is image rotation prediction: an image is rotated by a randomly chosen angle and a classifier is trained to predict that angle. Vehicle ReID can be regarded as an instance-level classification problem, i.e., all images contain the same species but many different instances. Thus, the salient object in each image has similar geometric properties, e.g., shape, outline, and skeleton. Training a network to predict the rotation of a randomly rotated vehicle image encourages it to focus on these reliable, shared geometric properties, because they make the rotation of an object easy to recognize.
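Generating the rotation pseudo labels might look like the sketch below. It is an illustration under stated assumptions rather than the project's data pipeline: images are plain nested lists, rotation is counter-clockwise, the label `r` in 0..3 stands for the four angles used in this README (0, 90, 180, 270 degrees), and the function names are hypothetical.

```python
def rotate90(image, k):
    """Rotate a 2D image (list of rows) counter-clockwise by k * 90 degrees."""
    rotated = [list(row) for row in image]
    for _ in range(k % 4):
        # Transposing and then reversing the row order is a 90-degree CCW turn.
        rotated = [list(row) for row in zip(*rotated)][::-1]
    return rotated


def make_rotation_dataset(images):
    """Pair every image with each of its four rotations and the rotation label."""
    dataset = []
    for image in images:
        for r in range(4):  # r = 0, 1, 2, 3 -> 0, 90, 180, 270 degrees
            dataset.append((rotate90(image, r), r))
    return dataset


# One tiny 2x2 "image" (hypothetical data) yields four labelled samples.
images = [[[1, 2], [3, 4]]]
d_sl = make_rotation_dataset(images)
for sample, label in d_sl:
    print(label, sample)
```

The labels come for free from the transformation itself, which is what makes the task self-supervised: no ID annotation is involved.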

Steps:

  1. Rotate each image $X_i$ from the dataset by 0, 90, 180, and 270 degrees to generate a new dataset $D_{sl} = \{X_{i,r}, Y_r\}$.
  2. Feed the image $X_{i,r}$ into a shared encoder, ResNet18 (Orange).
  3. To predict the rotation class, the high-level representations from ResNet18 need to be condensed further. To do this, another subnetwork consisting of two basic ResNet blocks is appended.
  4. A high-dimensional embedding vector is obtained, where GAP denotes global average pooling: $F_{sl}(X_{i,r}) = \mathrm{GAP}[f_{se}(f_{ae}(X_{i,r}; \theta_{ae}); \theta_{se})]$
  5. To generate more compact clusters in the embedding space, a cosine classifier (CC) is employed to assign the rotation class.
  6. The learnable parameters of CC are $W_{cc} = [w_1, \dots, w_j, \dots, w_b]$, with $b = 4$.
  7. The probabilities of assigning the input image to each class can be represented as $P(X_{i,r}) = [p_1, p_2, \dots, p_b]$, where each element is $p_j = \mathrm{softmax}[\cos(F_{sl}(X_{i,r}), w_j)]$.
  8. Cross-entropy loss: $\mathrm{CE}(P(X_{i,r}), Y_r)$
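The cosine classifier and its loss from the steps above can be sketched in a few lines. This is a simplified illustration, not the project's implementation: the embedding is a hand-picked 2-D vector instead of a ResNet output, the class weights are fixed rather than learned, and the `scale` parameter is an assumption added here (left at 1.0 so the code matches the formula as written).

```python
import math


def cosine(a, b, eps=1e-12):
    """Cosine similarity; eps guards against division by zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / max(norm_a * norm_b, eps)


def cosine_classifier(feature, class_weights, scale=1.0):
    """Softmax over the cosine similarities between a feature and each class weight."""
    logits = [scale * cosine(feature, w) for w in class_weights]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Cross-entropy between predicted probabilities and a single integer label."""
    return -math.log(probs[label])


# Hypothetical embedding and b = 4 class weight vectors (one per rotation class).
feature = [1.0, 0.0]
w_cc = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
probs = cosine_classifier(feature, w_cc)
loss = cross_entropy(probs, label=0)
print(probs, loss)
```

Since only the angle between the embedding and each weight vector matters, the classifier cannot lower the loss by simply growing feature norms, which fits the stated goal of more compact clusters.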
