References

pribor

Известия высших учебных заведений. Приборостроение

Journal of Instrument Engineering

0021-34542500-0381

Национальный исследовательский университет ИТМО

10.17586/0021-3454-2025-68-9-781-791

pribor-410

Research Article

СИСТЕМНЫЙ АНАЛИЗ, УПРАВЛЕНИЕ И ОБРАБОТКА ИНФОРМАЦИИ

SYSTEM ANALYSIS, MANAGEMENT AND INFORMATION PROCESSING

Применение дескрипторного подхода и трехмерного гауссова расщепления для визуальной локализации в динамическом окружении внутри и вне помещений

Grounding Keypoint Descriptors into 3D-Gaussian Splatting for Visual Localization in Dynamic Indoor/Outdoor Environments

Мохрат

Mohrat

Малик Мохрат — аспирант; факультет систем управления и робототехники

Санкт-Петербург

Malik Mohrat — Post-Graduate Student; Faculty of Control Systems and Robotics

St. Petersburg

mmohrat@itmo.ru

Сидоров

Г. К.

Sidorov

G. K.

Геннадий Константинович Сидоров — магистрантфакультет систем управления и робототехники

Санкт-Петербург

и робототехники; E-mail: gksidorov@itmo.ru

Gennady K. Sidorov — Graduate Student; IFaculty of Control Systems and Robotics

St. Petersburg

gksidorov@itmo.ru

Гридусов

Д. Д.

Gridusov

D. D.

Денис Дмитриевич Гридусов — бакалавр; факультет систем управления и робототехники

Санкт-Петербург

Denis D. Gridusov — Bachelor Student; Faculty of Control Systems and Robotics

St. Petersburg

ddgridusov@itmo.ru

Колюбин

С. А.

Kolyubin

S. A.

Сергей Алексеевич Колюбин — д-р техн. наук, профессор; факультет систем управления и робототехники

Санкт-Петербург

Sergey A. Kolyubin — Dr. Sci., Professor; ITMO University, Faculty of Control Systems and Robotics

St. Petersburg

s.kolyubin@itmo.ru

Университет ИТМОITMO University

2025

29102025

689781791

2025

Национальный исследовательский университет ИТМО

https://pribor.ifmo.ru/jour/about/submissions#copyrightNotice

https://pribor.ifmo.ru/jour/article/view/410

Робастная визуальная локализация в реальных условиях остается сложной задачей, особенно в присутствии динамических объектов и временных дистракторов. Несмотря на то, что нейронные представления сцен, такие как 3D Gaussian Splatting (3DGS) и NeRF, обеспечивают компактное кодирование геометрии и внешнего вида сцены, они чувствительны к предположению о статичности мира из-за зависимости от фотометрической согласованности. Представлен робастный фреймворк визуальной локализации, использующий 3DGS с семантически-осведомленной маскировкой для повышения точности в динамических сценах. Предлагаемый подход основан на GSplatLoc и представляет собой двухэтапный конвейер: на первом этапе плотные и легковесные дескрипторы ключевых точек, полученные из сети XFeat, интегрируются в представление 3DGS, что позволяет эффективно выполнять 2D-3D сопоставление для грубой оценки позы. Для снижения влияния динамических дистракторов используются семантические маски, сгенерированные предварительно обученными диффузионными моделями, для исключения непоследовательных областей при построении 3D-сцены. На втором этапе начальная поза уточняется с использованием фотометрической функции выравнивания на основе рендеринга. Эксперименты на динамических наборах данных в помещениях и на открытом воздухе демонстрируют, что предложенный метод превосходит базовое решение в сложных динамических условиях.

Robust visual localization in real-world conditions remains a challenging task, particularly in the presence of dynamic objects and transient distractors. While neural scene representations such as 3D Gaussian Splatting (3DGS) or NeRF offer compact encoding of scene geometry and appearance, they are sensitive to static world assumption due to their reliance on photometric consistency. In this work, we present a robust visual localization framework that leverage 3DGS with semantic-aware masking strategy to improve accuracy in dynamic scenes. Our approach extends GSplatLoc, which is a two-stage pipeline: first integrate dense and lightweight keypoint descriptors from the XFeat network into the 3DGS representation, enabling efficient 2D-3D matching for coarse pose estimation. To mitigate the impact of dynamic distractors, we incorporate semantic masks generated from a classifier that utilizes a pre-trained diffusion model to exclude inconsistent regions during 3D modeling. In the second stage, the initial pose is refined using a rendering-based photometric alignment loss. Experiments on both indoor and outdoor dynamic benchmarks demonstrate that our method achieves superior performance compared to baseline method in challenging dynamic environments.

локализациягауссово расщеплениенейросетевая модель

Visual LocalizationNovel View Synthesis3D Gaussian Splatting (3DGS)Robust OptimizationSemantic-Aware MaskingFeature Field SLAMFeature Clustering

Данное исследование выполнено при поддержке в рамках инициативы по научным проектам в области искусственного интеллекта (RPAII) Университет ИТМО

This research was supported by ITMO University Research Projects in AI Initiative (RPAII).

References1

Dong Z., Zhang G., Jia J.. Bao H.. Keyframe-based real-time camera tracking // IEEE 12th Intern. Conf. on Computer Vision. Sept. 2009. P. 1538–1545. DOI: 10.1109/ICCV.2009.5459273.

Dong Z., Zhang G., Jia J., and Bao H. IEEE 12th International Conference on Computer Vision, Sep. 2009, pp. 1538– 1545, DOI: 10.1109/ICCV.2009.5459273.

Heng L. et al. Project AutoVision: Localization and 3D Scene Perception for an Autonomous Vehicle with a MultiCamera Syste // Intern. Conf. on Robotics and Automation (ICRA), May 2019. P. 4695–4702. DOI: 10.1109/ICRA.2019.8793949.

Heng L. et al. International Conference on Robotics and Automation (ICRA), May 2019, pp. 4695–4702, DOI: 10.1109/ICRA.2019.8793949.

Mildenhall B., Srinivasan P. P., Tancik M., Barron J. T., Ramamoorthi R., Ng R. NeRF: representing scenes as neural radiance fields for view synthesis // Commun. ACM. 2022. Vol. 65, N 1. P. 99–106. DOI: 10.1145/3503250.

Mildenhall B., Srinivasan P.P., Tancik M., Barron J.T., Ramamoorthi R., and Ng R. Commun. ACM, 2022, no. 1(65), pp. 99–106, DOI: 10.1145/3503250.

Kerbl B., Kopanas G., Leimkühler T., and Drettakis G. 3d gaussian splatting for real-time radiance field rendering // ACM Trans Graph. 2023. Vol. 42, N 4. P. 139–1, 2023.

Kerbl B., Kopanas G., Leimkühler T., and Drettakis G. ACM Trans Graph, 2023, no. 4(42), pp. 139.

Sabour S., Vora S., Duckworth D., Krasin I., Fleet D. J., Tagliasacchi A. Robustnerf: Ignoring distractors with robust losses // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2023. P. 20626–20636. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2023/html/Sabour_RobustNeRF_Ignoring_Distractors_With_Robust_Losses_CVPR_2023_paper.html, 19.05.2025.

Sabour S., Vora S., Duckworth D., Krasin I., Fleet D.J., and Tagliasacchi A. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023, pp. 20626–20636, http://openaccess.thecvf.com/content/CVPR2023/html/Sabour_RobustNeRF_Ignoring_Distractors_With_Robust_Losses_CVPR_2023_paper.html.

Tang L.,Jia M., Wang Q., Phoo C. P., Hariharan B. Emergent correspondence from image diffusion // Adv. Neural Inf. Process. Syst. 2023. Vol. 36. P. 1363–1389.

Tang L., Jia M., Wang Q., Phoo C.P., and Hariharan B. Adv. Neural Inf. Process. Syst., 2023, vol. 36, pp. 1363–1389.

Martin-Brualla R., Radwan N., Sajjadi M. S., Barron J. T., Dosovitskiy A., Duckworth D. Nerf in the wild: Neural radiance fields for unconstrained photo collections // Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. P. 7210–7219. [Электронный ресурс]: https://openaccess.thecvf.com/content/CVPR2021/html/Martin-Brualla_NeRF_in_the_Wild_Neural_Radiance_Fields_for_Unconstrained_Photo_CVPR_2021_paper.html?ref=labelbox.ghost.io.

Martin-Brualla R., Radwan N., Sajjadi M.S., Barron J.T., Dosovitskiy A., and Duckworth D. Proceedings of the IEEE/ CVF conference on computer vision and pattern recognition, 2021, pp. 7210–7219, https://openaccess.thecvf.com/content/CVPR2021/html/Martin-Brualla_NeRF_in_the_Wild_Neural_Radiance_Fields_for_Unconstrained_Photo_CVPR_2021_paper.html?ref=labelbox.ghost.io.

Ren W., Zhu Z., Sun B., Chen J., Pollefeys M., Peng S. Nerf on-the-go: Exploiting uncertainty for distractor-free nerfs in the wild // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 8931–8940. [Электронный ресурс]: https://openaccess.thecvf.com/content/CVPR2024/html/Ren_NeRF_On-the-go_Exploiting_Uncertainty_for_Distractor-free_NeRFs_in_the_Wild_CVPR_2024_paper.html

Ren W., Zhu Z., Sun B., Chen J., Pollefeys M., and Peng S. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 8931–8940, https://openaccess.thecvf.com/content/CVPR2024/html/Ren_NeRF_On-the-go_Exploiting_Uncertainty_for_Distractor-free_NeRFs_in_the_Wild_CVPR_2024_paper.html.

Oquab M. et al. DINOv2: Learning Robust Visual Features without Supervision. Feb. 02, 2024, arXiv: arXiv:2304.07193. DOI: 10.48550/arXiv.2304.07193.

Oquab M. et al. arXiv: arXiv:2304.07193, Feb. 02, 2024, DOI: 10.48550/arXiv.2304.07193.

Dahmani H., Bennehar M., Piasco N., Roldão L., Tsishkou D. SWAG: Splatting in the Wild Images with AppearanceConditioned Gaussians // Computer Vision — ECCV 2024; Lecture Notes in Computer Science. 2025. Vol. 15134. P. 325–340. DOI: 10.1007/978-3-031-73116-7_19.

Dahmani H., Bennehar M., Piasco N., Roldão L., and Tsishkou D. Computer Vision — ECCV 2024, Lecture Notes in Computer Science, Cham, Springer Nature Switzerland, 2025, vol. 15134, pp. 325–340, DOI: 10.1007/978-3-03173116-7_19.

Zhang D., Wang C., Wang W., Li P., Qin M., Wang H. Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections // Computer Vision — ECCV 2024; Lecture Notes in Computer Science. 2025. Vol. 15134. P. 341–359. DOI: 10.1007/978-3-031-73116-7_20.

Zhang D., Wang C., Wang W., Li P., Qin M., and Wang H. Computer Vision — ECCV 2024, Lecture Notes in Computer Science, Cham, Springer Nature Switzerland, 2025, vol. 15134, pp. 341–359, DOI: 10.1007/978-3-031-73116-7_20.

Wang Y., Wang J., and Qi Y. WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections. arXiv: arXiv:2406.02407. DOI: 10.48550/arXiv.2406.02407.

Wang Y., Wang J., and Qi Y. arXiv: arXiv:2406.02407, Jun. 04, 2024, DOI: 10.48550/arXiv.2406.02407.

Zhou Q., Maximov M., Litany O., Leal-Taixé L. The NeRFect Match: Exploring NeRF Features for Visual Localization // Computer Vision — ECCV 2024; Lecture Notes in Computer Science. 2025. Vol. 15082. P. 108–127. DOI: 10.1007/978-3-031-72691-0_7.

Zhou Q., Maximov M., Litany O., and Leal-Taixé L. Computer Vision — ECCV 2024, Lecture Notes in Computer Science, Cham, Springer, Nature Switzerland, 2025, vol. 15082, pp. 108–127, DOI: 10.1007/978-3-031-72691-0_7.

Sabour S. et al. SpotLessSplats: Ignoring Distractors in 3D Gaussian Splatting // ACM Trans. Graph. 2025. Vol. 44, N 2. P. 1–11. DOI: 10.1145/3727143.

Sabour S. et al. ACM Trans. Graph., 2025, no. 2(44), pp. 1–11, DOI: 10.1145/3727143.

Chen S., Li X., Wang Z., Prisacariu V. A. DFNet: Enhance Absolute Pose Regression with Direct Feature Matching // Computer Vision — ECCV 2022; Lecture Notes in Computer Science. 2022. Vol. 13670. P. 1–17. DOI: 10.1007/9783-031-20080-9_1.

Chen S., Li X., Wang Z., and Prisacariu V.A. Computer Vision — ECCV 2022, Lecture Notes in Computer Science, Cham, Springer Nature Switzerland, 2022, vol. 13670, pp. 1–17, DOI: 10.1007/978-3-031-20080-9_1.

Chen S. et al. Neural refinement for absolute pose regression with feature synthesis // Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 20987–20996. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2024/html/Chen_Neural_Refinement_for_Absolute_Pose_Regression_with_Feature_Synthesis_CVPR_2024_paper.html

Chen S. et al. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 20987–20996, http://openaccess.thecvf.com/content/CVPR2024/html/Chen_Neural_Refinement_for_Absolute_Pose_Regression_with_Feature_Synthesis_CVPR_2024_paper.html.

Yen-Chen L., Florence P., Barron J. T., Rodriguez A., Isola P., Lin T.-Y. Inerf: Inverting neural radiance fields for pose estimation // IEEE/RSJ Intern. Conf. on Intelligent Robots and Systems (IROS), IEEE. 2021. P. 1323–1330. [Электронный ресурс]: https://ieeexplore.ieee.org/abstract/document/9636708/

Yen-Chen L., Florence P., Barron J.T., Rodriguez A., Isola P., and Lin T.-Y. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2021, pp. 1323–1330, https://ieeexplore.ieee.org/abstract/document/9636708/.

Kobayashi S., Matsumoto E., Sitzmann V. Decomposing nerf for editing via feature field distillation // Adv. Neural Inf. Process. Syst. 2022. Vol. 35. P. 23311–23330.

Kobayashi S., Matsumoto E., and Sitzmann V. Adv. Neural Inf. Process. Syst., 2022, vol. 35, pp. 23311–23330.

Tschernezki V., Laina I., Larlus D., Vedaldi A. Neural feature fusion fields: 3d distillation of self-supervised 2d image representations // Intern. Conf. on 3D Vision (3DV), IEEE. 2022. P. 443–453. [Электронный ресурс]: https://ieeexplore.ieee.org/abstract/document/10044452/

Tschernezki V., Laina I., Larlus D., and Vedaldi A. International Conference on 3D Vision (3DV), IEEE, 2022, pp. 443–453, https://ieeexplore.ieee.org/abstract/document/10044452/.

Zhao B., Yang L., Mao M., Bao H., Cui Z. PNeRFLoc: Visual localization with point-based neural radiance fields // Proceedings of the AAAI Conf. on Artificial Intelligence. 2024. P. 7450–7459. [Электронный ресурс]: https://ojs.aaai.org/index.php/AAAI/article/view/28576

Zhao B., Yang L., Mao M., Bao H., and Cui Z. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, pp. 7450–7459, https://ojs.aaai.org/index.php/AAAI/article/view/28576.

Sun Y. et al. iComMa: Inverting 3D Gaussian Splatting for Camera Pose Estimation via Comparing and Matching // arXiv: arXiv:2312.09031. DOI: 10.48550/arXiv.2312.09031.

Sun Y. et al. arXiv: arXiv:2312.09031, Mar. 20, 2024, DOI: 10.48550/arXiv.2312.09031.

Botashev K., Pyatov V., Ferrer G., Lefkimmiatis S. GSLoc: Visual Localization with 3D Gaussian Splatting // IEEE/ RSJ Intern. Conf. on Intelligent Robots and Systems (IROS), IEEE. 2024. P. 5664–5671. [Электронный ресурс]: https://ieeexplore.ieee.org/abstract/document/10801919/

Botashev K., Pyatov V., Ferrer G., and Lefkimmiatis S. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2024, pp. 5664–5671, https://ieeexplore.ieee.org/abstract/document/10801919/.

DeTone D., Malisiewicz T., Rabinovich A. Superpoint: Self-supervised interest point detection and description // Proc. of the IEEE conf. on computer vision and pattern recognition workshops. 2018. P. 224–236. [Электронный ресурс]: https://openaccess.thecvf.com/content_cvpr_2018_workshops/w9/html/DeTone_SuperPoint_Self-Supervised_Interest_CVPR_2018_paper.html

DeTone D., Malisiewicz T., and Rabinovich A. Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 2018, pp. 224–236, https://openaccess.thecvf.com/content_cvpr_2018_workshops/w9/html/DeTone_SuperPoint_Self-Supervised_Interest_CVPR_2018_paper.html.

Dusmanu M. et al. D2-net: A trainable cnn for joint description and detection of local features // Proc. of the ieee/cvf conf. on computer vision and pattern recognition. 2019. P. 8092–8101. [Электронный ресурс]: http://openaccess.thecvf.com/content_CVPR_2019/html/Dusmanu_D2-Net_A_Trainable_CNN_for_Joint_Description_and_Detection_of_CVPR_2019_paper.html

Dusmanu M. et al. Proceedings of the ieee/cvf conference on computer vision and pattern recognition, 2019, pp. 8092–8101, http://openaccess.thecvf.com/content_CVPR_2019/html/Dusmanu_D2-Net_A_Trainable_CNN_for_Joint_Description_and_Detection_of_CVPR_2019_paper.html.

Revaud J., De Souza C., Humenberger M., Weinzaepfel P. R2d2: Reliable and repeatable detector and descriptor // Adv. Neural Inf. Process. Syst. 2019. Vol. 32. [Электронный ресурс]: https://proceedings.neurips.cc/paper/2019/hash/3198dfd0aef271d22f7bcddd6f12f5cb-Abstract.html

Revaud J., De Souza C., Humenberger M., and Weinzaepfel P. Adv. Neural Inf. Process. Syst., 2019, vol. 32, https://proceedings.neurips.cc/paper/2019/hash/3198dfd0aef271d22f7bcddd6f12f5cb-Abstract.html.

Lindenberger P., Sarlin P.-E., Pollefeys M. Lightglue: Local feature matching at light speed // Proc. of the IEEE/CVF Intern. Conf. on Computer Vision. 2023. P. 17627–17638. [Электронный ресурс]: http://openaccess.thecvf.com/content/ICCV2023/html/Lindenberger_LightGlue_Local_Feature_Matching_at_Light_Speed_ICCV_2023_paper.html

Lindenberger P., Sarlin P.-E., and Pollefeys M. Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 17627–17638, http://openaccess.thecvf.com/content/ICCV2023/html/Lindenberger_LightGlue_Local_Feature_Matching_at_Light_Speed_ICCV_2023_paper.html.

Sun J., Shen Z., Wang Y., Bao H., Zhou X. LoFTR: Detector-free local feature matching with transformers // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2021. P. 8922–8931. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2021/html/Sun_LoFTR_Detector-Free_Local_Feature_Matching_With_Transformers_CVPR_2021_paper.html

Sun J., Shen Z., Wang Y., Bao H., and Zhou X. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 8922–8931, http://openaccess.thecvf.com/content/CVPR2021/html/Sun_LoFTR_Detector-Free_Local_Feature_Matching_With_Transformers_CVPR_2021_paper.html.

Potje G., Cadar F., Araujo A., Martins R., Nascimento E. R. Xfeat: Accelerated features for lightweight image matching // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 2682–2691. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2024/html/Potje_XFeat_Accelerated_Features_for_Lightweight_Image_Matching_CVPR_2024_paper.html

Potje G., Cadar F., Araujo A., Martins R., and Nascimento E.R. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 2682–2691, http://openaccess.thecvf.com/content/CVPR2024/html/otje_XFeat_Accelerated_Features_for_Lightweight_Image_Matching_CVPR_2024_paper.html.

Lindenberger P., Sarlin P.-E., Larsson V., Pollefeys M. Pixel-perfect structure-from-motion with featuremetric refinement // Proc. of the IEEE/CVF Intern. Conference on Computer Vision. 2021. P. 5987–5997. [Электронный ресурс]: http://openaccess.thecvf.com/content/ICCV2021/html/Lindenberger_Pixel-Perfect_Structure-From-Motion_With_Featuremetric_Refinement_ICCV_2021_paper.html

Lindenberger P., Sarlin P.-E., Larsson V., and Pollefeys M. Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 5987–5997, http://openaccess.thecvf.com/content/ICCV2021/html/Lindenberger_Pixel-Perfect_Structure-From-Motion_With_Featuremetric_Refinement_ICCV_2021_paper.html.

Sidorov G., Mohrat M., Gridusov D., Rakhimov R., Kolyubin S. GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization. arXiv: arXiv:2409.16502. DOI: 10.48550/arXiv.2409.16502.

Sidorov G., Mohrat M., Gridusov D., Rakhimov R., and Kolyubin S. arXiv: arXiv:2409.16502, Mar. 20, 2025, DOI: 10.48550/arXiv.2409.16502.

Zhou S. et al. Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields // Proc. of the IEEE/ CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 21676–21685. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2024/html/Zhou_Feature_3DGS_Supercharging_3D_Gaussian_Splatting_to_Enable_Distilled_Feature_CVPR_2024_paper.html

Zhou S. et al. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 21676–21685, http://openaccess.thecvf.com/content/CVPR2024/html/Zhou_Feature_3DGS_Supercharging_3D_Gaussian_Splatting_to_Enable_Distilled_Feature_CVPR_2024_paper.html.

Liu H.-T. D., Williams F., Jacobson A., Fidler S., Litany O. Learning Smooth Neural Functions via Lipschitz Regularization // Special Interest Group on Computer Graphics and Interactive Techniques. Conf. Proc., Vancouver, Canada, Aug. 2022. P. 1–13. DOI: 10.1145/3528233.3530713.

Liu H.-T.D., Williams F., Jacobson A., Fidler S., and Litany O. Special Interest Group on Computer Graphics and Interactive Techniques Conference Proceedings, Vancouver BC Canada: ACM, Aug. 2022, pp. 1–13, DOI: 10.1145/3528233.3530713.

Shavit Y., Ferens R., Keller Y. Learning multi-scene absolute pose regression with transformers // Proc. of the IEEE/ CVF Intern. Conf. on Computer Vision. 2021. P. 2733–2742. [Электронный ресурс]: http://openaccess.thecvf.com/content/ICCV2021/html/Shavit_Learning_Multi-Scene_Absolute_Pose_Regression_With_Transformers_ICCV_2021_paper.html

Shavit Y., Ferens R., and Keller Y. Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 2733–2742, http://openaccess.thecvf.com/content/ICCV2021/html/Shavit_Learning_Multi-Scene_Absolute_Pose_Regression_With_Transformers_ICCV_2021_paper.html.

The authors declare that there are no conflicts of interest present.