ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement

Dongli Tan*13,
Jiangjiang Liu*23,
Xingyu Chen3,
Chao Chen3,
Ruixin Zhang3,
Yunhang Shen3,
Shouhong Ding3,
Rongrong Ji1
1Xiamen University 2Nankai University 3Youtu Lab, Tencent Co.,Ltd

Abstract

Modeling sparse and dense image matching within a unified functional model has recently attracted increasing research interest.However, existing efforts mainly focus on improving matching accuracy while ignoring its efficiency, which is crucial for real-world applications.

Pipeline of ECO-TR

In this paper, we propose an efficient structure named Efficient Correspondence Transformer ECO-TR by finding correspondences in a coarse-to-fine manner, which significantly improves the efficiency of functional model method. To achieve this, multiple transformer blocks are stage-wisely connected to gradually refine the predicted coordinates upon a shared multi-scale feature extraction network.All the correspondences are predicted within a single feed-forward pass, given a pair of images and for arbitrary query coordinates.We further propose an adaptive query-clustering strategy and an uncertainty-based outlier detection module to cooperate with the proposed framework for faster and better predictions.

Comparison of the inference time between the proposed ECO-TR and COTR

Experimental comparisons on various sparse and dense matching tasks demonstrate the superiority of our method on both efficiency and effectiveness against existing functional matching methods. are evaluated on a single Tesla V100 GPU.

Matches across different density of queries

Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.

Matches under different conditions

Interpolate start reference image.
day night
Interpolate start reference image.
sketch
Interpolate start reference image.
NIR(Near Infrared) image

Dense alignment

Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.

Optical flow estimation

Reconstruction From two views

Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.

IMC(Image matching challenge)2022

Interpolate start reference image.
Interpolate start reference image.
ECO-TR consists of multiple refinement modules of different levels, which can be used in a plug-and-play manner to refine the matches of various existing methods. Our solution got a golden medal (5th place out of 642 teams) in IMC2022. More details of our solution can be found here.

Online demo

We provide an online demo in colab.

BibTeX

@article{
  update later
  
}