<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">pribor</journal-id><journal-title-group><journal-title xml:lang="ru">Известия высших учебных заведений. Приборостроение</journal-title><trans-title-group xml:lang="en"><trans-title>Journal of Instrument Engineering</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">0021-3454</issn><issn pub-type="epub">2500-0381</issn><publisher><publisher-name>Национальный исследовательский университет ИТМО</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.17586/0021-3454-2025-68-9-781-791</article-id><article-id custom-type="elpub" pub-id-type="custom">pribor-410</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>СИСТЕМНЫЙ АНАЛИЗ, УПРАВЛЕНИЕ И ОБРАБОТКА ИНФОРМАЦИИ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>SYSTEM ANALYSIS, MANAGEMENT AND INFORMATION PROCESSING</subject></subj-group></article-categories><title-group><article-title>Применение дескрипторного подхода и трехмерного гауссова расщепления для визуальной локализации в динамическом окружении внутри и вне помещений</article-title><trans-title-group xml:lang="en"><trans-title>Grounding Keypoint Descriptors into 3D-Gaussian Splatting for Visual Localization in Dynamic Indoor/Outdoor Environments</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Мохрат</surname><given-names>M.</given-names></name><name name-style="western" xml:lang="en"><surname>Mohrat</surname><given-names>M.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Малик Мохрат — аспирант; факультет систем управления и робототехники</p><p>Санкт-Петербург</p></bio><bio xml:lang="en"><p>Malik Mohrat — Post-Graduate Student; Faculty of Control Systems and Robotics</p><p>St. Petersburg</p></bio><email xlink:type="simple">mmohrat@itmo.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Сидоров</surname><given-names>Г. К.</given-names></name><name name-style="western" xml:lang="en"><surname>Sidorov</surname><given-names>G. K.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Геннадий Константинович Сидоров — магистрантфакультет систем управления и робототехники</p><p>Санкт-Петербург</p><p> </p><p>и робототехники; E-mail: gksidorov@itmo.ru</p></bio><bio xml:lang="en"><p>Gennady K. Sidorov — Graduate Student; IFaculty of Control Systems and Robotics</p><p>St. Petersburg</p></bio><email xlink:type="simple">gksidorov@itmo.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Гридусов</surname><given-names>Д. Д.</given-names></name><name name-style="western" xml:lang="en"><surname>Gridusov</surname><given-names>D. D.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Денис Дмитриевич Гридусов — бакалавр; факультет систем управления и робототехники</p><p>Санкт-Петербург</p></bio><bio xml:lang="en"><p>Denis D. Gridusov — Bachelor Student; Faculty of Control Systems and Robotics</p><p>St. Petersburg</p><p> </p></bio><email xlink:type="simple">ddgridusov@itmo.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Колюбин</surname><given-names>С. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Kolyubin</surname><given-names>S. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Сергей Алексеевич Колюбин — д-р техн. наук, профессор; факультет систем управления и робототехники</p><p>Санкт-Петербург</p></bio><bio xml:lang="en"><p>Sergey A. Kolyubin — Dr. Sci., Professor; ITMO University, Faculty of Control Systems and Robotics</p><p>St. Petersburg</p><p> </p></bio><email xlink:type="simple">s.kolyubin@itmo.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Университет ИТМО</institution></aff><aff xml:lang="en"><institution>ITMO University</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2025</year></pub-date><pub-date pub-type="epub"><day>29</day><month>10</month><year>2025</year></pub-date><volume>68</volume><issue>9</issue><fpage>781</fpage><lpage>791</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Национальный исследовательский университет ИТМО, 2025</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="ru">Национальный исследовательский университет ИТМО</copyright-holder><copyright-holder xml:lang="en">Национальный исследовательский университет ИТМО</copyright-holder><license xlink:href="https://pribor.ifmo.ru/jour/about/submissions#copyrightNotice" xlink:type="simple"><license-p>https://pribor.ifmo.ru/jour/about/submissions#copyrightNotice</license-p></license></permissions><self-uri xlink:href="https://pribor.ifmo.ru/jour/article/view/410">https://pribor.ifmo.ru/jour/article/view/410</self-uri><abstract><p>Робастная визуальная локализация в реальных условиях остается сложной задачей, особенно в присутствии динамических объектов и временных дистракторов. Несмотря на то, что нейронные представления сцен, такие как 3D Gaussian Splatting (3DGS) и NeRF, обеспечивают компактное кодирование геометрии и внешнего вида сцены, они чувствительны к предположению о статичности мира из-за зависимости от фотометрической согласованности. Представлен робастный фреймворк визуальной локализации, использующий 3DGS с семантически-осведомленной маскировкой для повышения точности в динамических сценах. Предлагаемый подход основан на GSplatLoc и представляет собой двухэтапный конвейер: на первом этапе плотные и легковесные дескрипторы ключевых точек, полученные из сети XFeat, интегрируются в представление 3DGS, что позволяет эффективно выполнять 2D-3D сопоставление для грубой оценки позы. Для снижения влияния динамических дистракторов используются семантические маски, сгенерированные предварительно обученными диффузионными моделями, для исключения непоследовательных областей при построении 3D-сцены. На втором этапе начальная поза уточняется с использованием фотометрической функции выравнивания на основе рендеринга. Эксперименты на динамических наборах данных в помещениях и на открытом воздухе демонстрируют, что предложенный метод превосходит базовое решение в сложных динамических условиях.</p></abstract><trans-abstract xml:lang="en"><p>Robust visual localization in real-world conditions remains a challenging task, particularly in the presence of dynamic objects and transient distractors. While neural scene representations such as 3D Gaussian Splatting (3DGS) or NeRF offer compact encoding of scene geometry and appearance, they are sensitive to static world assumption due to their reliance on photometric consistency. In this work, we present a robust visual localization framework that leverage 3DGS with semantic-aware masking strategy to improve accuracy in dynamic scenes. Our approach extends GSplatLoc, which is a two-stage pipeline: first integrate dense and lightweight keypoint descriptors from the XFeat network into the 3DGS representation, enabling efficient 2D-3D matching for coarse pose estimation. To mitigate the impact of dynamic distractors, we incorporate semantic masks generated from a classifier that utilizes a pre-trained diffusion model to exclude inconsistent regions during 3D modeling. In the second stage, the initial pose is refined using a rendering-based photometric alignment loss. Experiments on both indoor and outdoor dynamic benchmarks demonstrate that our method achieves superior performance compared to baseline method in challenging dynamic environments.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>локализация</kwd><kwd>гауссово расщепление</kwd><kwd>нейросетевая модель</kwd></kwd-group><kwd-group xml:lang="en"><kwd>Visual Localization</kwd><kwd>Novel View Synthesis</kwd><kwd>3D Gaussian Splatting (3DGS)</kwd><kwd>Robust Optimization</kwd><kwd>Semantic-Aware Masking</kwd><kwd>Feature Field SLAM</kwd><kwd>Feature Clustering</kwd></kwd-group><funding-group><funding-statement xml:lang="ru">Данное исследование выполнено при поддержке в рамках инициативы по научным проектам в области искусственного интеллекта (RPAII) Университет ИТМО</funding-statement><funding-statement xml:lang="en">This research was supported by ITMO University Research Projects in AI Initiative (RPAII).</funding-statement></funding-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Dong Z., Zhang G., Jia J.. Bao H.. Keyframe-based real-time camera tracking // IEEE 12th Intern. Conf. on Computer Vision. Sept. 2009. P. 1538–1545. DOI: 10.1109/ICCV.2009.5459273.</mixed-citation><mixed-citation xml:lang="en">Dong Z., Zhang G., Jia J., and Bao H. IEEE 12th International Conference on Computer Vision, Sep. 2009, pp. 1538– 1545, DOI: 10.1109/ICCV.2009.5459273.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Heng L. et al. Project AutoVision: Localization and 3D Scene Perception for an Autonomous Vehicle with a MultiCamera Syste // Intern. Conf. on Robotics and Automation (ICRA), May 2019. P. 4695–4702. DOI: 10.1109/ICRA.2019.8793949.</mixed-citation><mixed-citation xml:lang="en">Heng L. et al. International Conference on Robotics and Automation (ICRA), May 2019, pp. 4695–4702, DOI: 10.1109/ICRA.2019.8793949.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Mildenhall B., Srinivasan P. P., Tancik M., Barron J. T., Ramamoorthi R., Ng R. NeRF: representing scenes as neural radiance fields for view synthesis // Commun. ACM. 2022. Vol. 65, N 1. P. 99–106. DOI: 10.1145/3503250.</mixed-citation><mixed-citation xml:lang="en">Mildenhall B., Srinivasan P.P., Tancik M., Barron J.T., Ramamoorthi R., and Ng R. Commun. ACM, 2022, no. 1(65), pp. 99–106, DOI: 10.1145/3503250.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Kerbl B., Kopanas G., Leimkühler T., and Drettakis G. 3d gaussian splatting for real-time radiance field rendering // ACM Trans Graph. 2023. Vol. 42, N 4. P. 139–1, 2023.</mixed-citation><mixed-citation xml:lang="en">Kerbl B., Kopanas G., Leimkühler T., and Drettakis G. ACM Trans Graph, 2023, no. 4(42), pp. 139.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Sabour S., Vora S., Duckworth D., Krasin I., Fleet D. J., Tagliasacchi A. Robustnerf: Ignoring distractors with robust losses // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2023. P. 20626–20636. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2023/html/Sabour_RobustNeRF_Ignoring_Distractors_With_Robust_Losses_CVPR_2023_paper.html, 19.05.2025.</mixed-citation><mixed-citation xml:lang="en">Sabour S., Vora S., Duckworth D., Krasin I., Fleet D.J., and Tagliasacchi A. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023, pp. 20626–20636, http://openaccess.thecvf.com/content/CVPR2023/html/Sabour_RobustNeRF_Ignoring_Distractors_With_Robust_Losses_CVPR_2023_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Tang L.,Jia M., Wang Q., Phoo C. P., Hariharan B. Emergent correspondence from image diffusion // Adv. Neural Inf. Process. Syst. 2023. Vol. 36. P. 1363–1389.</mixed-citation><mixed-citation xml:lang="en">Tang L., Jia M., Wang Q., Phoo C.P., and Hariharan B. Adv. Neural Inf. Process. Syst., 2023, vol. 36, pp. 1363–1389.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Martin-Brualla R., Radwan N., Sajjadi M. S., Barron J. T., Dosovitskiy A., Duckworth D. Nerf in the wild: Neural radiance fields for unconstrained photo collections // Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. P. 7210–7219. [Электронный ресурс]: https://openaccess.thecvf.com/content/CVPR2021/html/Martin-Brualla_NeRF_in_the_Wild_Neural_Radiance_Fields_for_Unconstrained_Photo_CVPR_2021_paper.html?ref=labelbox.ghost.io.</mixed-citation><mixed-citation xml:lang="en">Martin-Brualla R., Radwan N., Sajjadi M.S., Barron J.T., Dosovitskiy A., and Duckworth D. Proceedings of the IEEE/ CVF conference on computer vision and pattern recognition, 2021, pp. 7210–7219, https://openaccess.thecvf.com/content/CVPR2021/html/Martin-Brualla_NeRF_in_the_Wild_Neural_Radiance_Fields_for_Unconstrained_Photo_CVPR_2021_paper.html?ref=labelbox.ghost.io.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Ren W., Zhu Z., Sun B., Chen J., Pollefeys M., Peng S. Nerf on-the-go: Exploiting uncertainty for distractor-free nerfs in the wild // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 8931–8940. [Электронный ресурс]: https://openaccess.thecvf.com/content/CVPR2024/html/Ren_NeRF_On-the-go_Exploiting_Uncertainty_for_Distractor-free_NeRFs_in_the_Wild_CVPR_2024_paper.html</mixed-citation><mixed-citation xml:lang="en">Ren W., Zhu Z., Sun B., Chen J., Pollefeys M., and Peng S. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 8931–8940, https://openaccess.thecvf.com/content/CVPR2024/html/Ren_NeRF_On-the-go_Exploiting_Uncertainty_for_Distractor-free_NeRFs_in_the_Wild_CVPR_2024_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Oquab M. et al. DINOv2: Learning Robust Visual Features without Supervision. Feb. 02, 2024, arXiv: arXiv:2304.07193. DOI: 10.48550/arXiv.2304.07193.</mixed-citation><mixed-citation xml:lang="en">Oquab M. et al. arXiv: arXiv:2304.07193, Feb. 02, 2024, DOI: 10.48550/arXiv.2304.07193.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Dahmani H., Bennehar M., Piasco N., Roldão L., Tsishkou D. SWAG: Splatting in the Wild Images with AppearanceConditioned Gaussians // Computer Vision — ECCV 2024; Lecture Notes in Computer Science. 2025. Vol. 15134. P. 325–340. DOI: 10.1007/978-3-031-73116-7_19.</mixed-citation><mixed-citation xml:lang="en">Dahmani H., Bennehar M., Piasco N., Roldão L., and Tsishkou D. Computer Vision — ECCV 2024, Lecture Notes in Computer Science, Cham, Springer Nature Switzerland, 2025, vol. 15134, pp. 325–340, DOI: 10.1007/978-3-03173116-7_19.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Zhang D., Wang C., Wang W., Li P., Qin M., Wang H. Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections // Computer Vision — ECCV 2024; Lecture Notes in Computer Science. 2025. Vol. 15134. P. 341–359. DOI: 10.1007/978-3-031-73116-7_20.</mixed-citation><mixed-citation xml:lang="en">Zhang D., Wang C., Wang W., Li P., Qin M., and Wang H. Computer Vision — ECCV 2024, Lecture Notes in Computer Science, Cham, Springer Nature Switzerland, 2025, vol. 15134, pp. 341–359, DOI: 10.1007/978-3-031-73116-7_20.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Wang Y., Wang J., and Qi Y. WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections. arXiv: arXiv:2406.02407. DOI: 10.48550/arXiv.2406.02407.</mixed-citation><mixed-citation xml:lang="en">Wang Y., Wang J., and Qi Y. arXiv: arXiv:2406.02407, Jun. 04, 2024, DOI: 10.48550/arXiv.2406.02407.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Zhou Q., Maximov M., Litany O., Leal-Taixé L. The NeRFect Match: Exploring NeRF Features for Visual Localization // Computer Vision — ECCV 2024; Lecture Notes in Computer Science. 2025. Vol. 15082. P. 108–127. DOI: 10.1007/978-3-031-72691-0_7.</mixed-citation><mixed-citation xml:lang="en">Zhou Q., Maximov M., Litany O., and Leal-Taixé L. Computer Vision — ECCV 2024, Lecture Notes in Computer Science, Cham, Springer, Nature Switzerland, 2025, vol. 15082, pp. 108–127, DOI: 10.1007/978-3-031-72691-0_7.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Sabour S. et al. SpotLessSplats: Ignoring Distractors in 3D Gaussian Splatting // ACM Trans. Graph. 2025. Vol. 44, N 2. P. 1–11. DOI: 10.1145/3727143.</mixed-citation><mixed-citation xml:lang="en">Sabour S. et al. ACM Trans. Graph., 2025, no. 2(44), pp. 1–11, DOI: 10.1145/3727143.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Chen S., Li X., Wang Z., Prisacariu V. A. DFNet: Enhance Absolute Pose Regression with Direct Feature Matching // Computer Vision — ECCV 2022; Lecture Notes in Computer Science. 2022. Vol. 13670. P. 1–17. DOI: 10.1007/9783-031-20080-9_1.</mixed-citation><mixed-citation xml:lang="en">Chen S., Li X., Wang Z., and Prisacariu V.A. Computer Vision — ECCV 2022, Lecture Notes in Computer Science, Cham, Springer Nature Switzerland, 2022, vol. 13670, pp. 1–17, DOI: 10.1007/978-3-031-20080-9_1.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Chen S. et al. Neural refinement for absolute pose regression with feature synthesis // Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 20987–20996. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2024/html/Chen_Neural_Refinement_for_Absolute_Pose_Regression_with_Feature_Synthesis_CVPR_2024_paper.html</mixed-citation><mixed-citation xml:lang="en">Chen S. et al. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 20987–20996, http://openaccess.thecvf.com/content/CVPR2024/html/Chen_Neural_Refinement_for_Absolute_Pose_Regression_with_Feature_Synthesis_CVPR_2024_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Yen-Chen L., Florence P., Barron J. T., Rodriguez A., Isola P., Lin T.-Y. Inerf: Inverting neural radiance fields for pose estimation // IEEE/RSJ Intern. Conf. on Intelligent Robots and Systems (IROS), IEEE. 2021. P. 1323–1330. [Электронный ресурс]: https://ieeexplore.ieee.org/abstract/document/9636708/</mixed-citation><mixed-citation xml:lang="en">Yen-Chen L., Florence P., Barron J.T., Rodriguez A., Isola P., and Lin T.-Y. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2021, pp. 1323–1330, https://ieeexplore.ieee.org/abstract/document/9636708/.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Kobayashi S., Matsumoto E., Sitzmann V. Decomposing nerf for editing via feature field distillation // Adv. Neural Inf. Process. Syst. 2022. Vol. 35. P. 23311–23330.</mixed-citation><mixed-citation xml:lang="en">Kobayashi S., Matsumoto E., and Sitzmann V. Adv. Neural Inf. Process. Syst., 2022, vol. 35, pp. 23311–23330.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Tschernezki V., Laina I., Larlus D., Vedaldi A. Neural feature fusion fields: 3d distillation of self-supervised 2d image representations // Intern. Conf. on 3D Vision (3DV), IEEE. 2022. P. 443–453. [Электронный ресурс]: https://ieeexplore.ieee.org/abstract/document/10044452/</mixed-citation><mixed-citation xml:lang="en">Tschernezki V., Laina I., Larlus D., and Vedaldi A. International Conference on 3D Vision (3DV), IEEE, 2022, pp. 443–453, https://ieeexplore.ieee.org/abstract/document/10044452/.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Zhao B., Yang L., Mao M., Bao H., Cui Z. PNeRFLoc: Visual localization with point-based neural radiance fields // Proceedings of the AAAI Conf. on Artificial Intelligence. 2024. P. 7450–7459. [Электронный ресурс]: https://ojs.aaai.org/index.php/AAAI/article/view/28576</mixed-citation><mixed-citation xml:lang="en">Zhao B., Yang L., Mao M., Bao H., and Cui Z. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, pp. 7450–7459, https://ojs.aaai.org/index.php/AAAI/article/view/28576.</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Sun Y. et al. iComMa: Inverting 3D Gaussian Splatting for Camera Pose Estimation via Comparing and Matching // arXiv: arXiv:2312.09031. DOI: 10.48550/arXiv.2312.09031.</mixed-citation><mixed-citation xml:lang="en">Sun Y. et al. arXiv: arXiv:2312.09031, Mar. 20, 2024, DOI: 10.48550/arXiv.2312.09031.</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Botashev K., Pyatov V., Ferrer G., Lefkimmiatis S. GSLoc: Visual Localization with 3D Gaussian Splatting // IEEE/ RSJ Intern. Conf. on Intelligent Robots and Systems (IROS), IEEE. 2024. P. 5664–5671. [Электронный ресурс]: https://ieeexplore.ieee.org/abstract/document/10801919/</mixed-citation><mixed-citation xml:lang="en">Botashev K., Pyatov V., Ferrer G., and Lefkimmiatis S. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2024, pp. 5664–5671, https://ieeexplore.ieee.org/abstract/document/10801919/.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">DeTone D., Malisiewicz T., Rabinovich A. Superpoint: Self-supervised interest point detection and description // Proc. of the IEEE conf. on computer vision and pattern recognition workshops. 2018. P. 224–236. [Электронный ресурс]: https://openaccess.thecvf.com/content_cvpr_2018_workshops/w9/html/DeTone_SuperPoint_Self-Supervised_Interest_CVPR_2018_paper.html</mixed-citation><mixed-citation xml:lang="en">DeTone D., Malisiewicz T., and Rabinovich A. Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 2018, pp. 224–236, https://openaccess.thecvf.com/content_cvpr_2018_workshops/w9/html/DeTone_SuperPoint_Self-Supervised_Interest_CVPR_2018_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit24"><label>24</label><citation-alternatives><mixed-citation xml:lang="ru">Dusmanu M. et al. D2-net: A trainable cnn for joint description and detection of local features // Proc. of the ieee/cvf conf. on computer vision and pattern recognition. 2019. P. 8092–8101. [Электронный ресурс]: http://openaccess.thecvf.com/content_CVPR_2019/html/Dusmanu_D2-Net_A_Trainable_CNN_for_Joint_Description_and_Detection_of_CVPR_2019_paper.html</mixed-citation><mixed-citation xml:lang="en">Dusmanu M. et al. Proceedings of the ieee/cvf conference on computer vision and pattern recognition, 2019, pp. 8092–8101, http://openaccess.thecvf.com/content_CVPR_2019/html/Dusmanu_D2-Net_A_Trainable_CNN_for_Joint_Description_and_Detection_of_CVPR_2019_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit25"><label>25</label><citation-alternatives><mixed-citation xml:lang="ru">Revaud J., De Souza C., Humenberger M., Weinzaepfel P. R2d2: Reliable and repeatable detector and descriptor // Adv. Neural Inf. Process. Syst. 2019. Vol. 32. [Электронный ресурс]: https://proceedings.neurips.cc/paper/2019/hash/3198dfd0aef271d22f7bcddd6f12f5cb-Abstract.html</mixed-citation><mixed-citation xml:lang="en">Revaud J., De Souza C., Humenberger M., and Weinzaepfel P. Adv. Neural Inf. Process. Syst., 2019, vol. 32, https://proceedings.neurips.cc/paper/2019/hash/3198dfd0aef271d22f7bcddd6f12f5cb-Abstract.html.</mixed-citation></citation-alternatives></ref><ref id="cit26"><label>26</label><citation-alternatives><mixed-citation xml:lang="ru">Lindenberger P., Sarlin P.-E., Pollefeys M. Lightglue: Local feature matching at light speed // Proc. of the IEEE/CVF Intern. Conf. on Computer Vision. 2023. P. 17627–17638. [Электронный ресурс]: http://openaccess.thecvf.com/content/ICCV2023/html/Lindenberger_LightGlue_Local_Feature_Matching_at_Light_Speed_ICCV_2023_paper.html</mixed-citation><mixed-citation xml:lang="en">Lindenberger P., Sarlin P.-E., and Pollefeys M. Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 17627–17638, http://openaccess.thecvf.com/content/ICCV2023/html/Lindenberger_LightGlue_Local_Feature_Matching_at_Light_Speed_ICCV_2023_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit27"><label>27</label><citation-alternatives><mixed-citation xml:lang="ru">Sun J., Shen Z., Wang Y., Bao H., Zhou X. LoFTR: Detector-free local feature matching with transformers // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2021. P. 8922–8931. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2021/html/Sun_LoFTR_Detector-Free_Local_Feature_Matching_With_Transformers_CVPR_2021_paper.html</mixed-citation><mixed-citation xml:lang="en">Sun J., Shen Z., Wang Y., Bao H., and Zhou X. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 8922–8931, http://openaccess.thecvf.com/content/CVPR2021/html/Sun_LoFTR_Detector-Free_Local_Feature_Matching_With_Transformers_CVPR_2021_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit28"><label>28</label><citation-alternatives><mixed-citation xml:lang="ru">Potje G., Cadar F., Araujo A., Martins R., Nascimento E. R. Xfeat: Accelerated features for lightweight image matching // Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 2682–2691. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2024/html/Potje_XFeat_Accelerated_Features_for_Lightweight_Image_Matching_CVPR_2024_paper.html</mixed-citation><mixed-citation xml:lang="en">Potje G., Cadar F., Araujo A., Martins R., and Nascimento E.R. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 2682–2691, http://openaccess.thecvf.com/content/CVPR2024/html/otje_XFeat_Accelerated_Features_for_Lightweight_Image_Matching_CVPR_2024_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit29"><label>29</label><citation-alternatives><mixed-citation xml:lang="ru">Lindenberger P., Sarlin P.-E., Larsson V., Pollefeys M. Pixel-perfect structure-from-motion with featuremetric refinement // Proc. of the IEEE/CVF Intern. Conference on Computer Vision. 2021. P. 5987–5997. [Электронный ресурс]: http://openaccess.thecvf.com/content/ICCV2021/html/Lindenberger_Pixel-Perfect_Structure-From-Motion_With_Featuremetric_Refinement_ICCV_2021_paper.html</mixed-citation><mixed-citation xml:lang="en">Lindenberger P., Sarlin P.-E., Larsson V., and Pollefeys M. Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 5987–5997, http://openaccess.thecvf.com/content/ICCV2021/html/Lindenberger_Pixel-Perfect_Structure-From-Motion_With_Featuremetric_Refinement_ICCV_2021_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit30"><label>30</label><citation-alternatives><mixed-citation xml:lang="ru">Sidorov G., Mohrat M., Gridusov D., Rakhimov R., Kolyubin S. GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization. arXiv: arXiv:2409.16502. DOI: 10.48550/arXiv.2409.16502.</mixed-citation><mixed-citation xml:lang="en">Sidorov G., Mohrat M., Gridusov D., Rakhimov R., and Kolyubin S. arXiv: arXiv:2409.16502, Mar. 20, 2025, DOI: 10.48550/arXiv.2409.16502.</mixed-citation></citation-alternatives></ref><ref id="cit31"><label>31</label><citation-alternatives><mixed-citation xml:lang="ru">Zhou S. et al. Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields // Proc. of the IEEE/ CVF Conf. on Computer Vision and Pattern Recognition. 2024. P. 21676–21685. [Электронный ресурс]: http://openaccess.thecvf.com/content/CVPR2024/html/Zhou_Feature_3DGS_Supercharging_3D_Gaussian_Splatting_to_Enable_Distilled_Feature_CVPR_2024_paper.html</mixed-citation><mixed-citation xml:lang="en">Zhou S. et al. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 21676–21685, http://openaccess.thecvf.com/content/CVPR2024/html/Zhou_Feature_3DGS_Supercharging_3D_Gaussian_Splatting_to_Enable_Distilled_Feature_CVPR_2024_paper.html.</mixed-citation></citation-alternatives></ref><ref id="cit32"><label>32</label><citation-alternatives><mixed-citation xml:lang="ru">Liu H.-T. D., Williams F., Jacobson A., Fidler S., Litany O. Learning Smooth Neural Functions via Lipschitz Regularization // Special Interest Group on Computer Graphics and Interactive Techniques. Conf. Proc., Vancouver, Canada, Aug. 2022. P. 1–13. DOI: 10.1145/3528233.3530713.</mixed-citation><mixed-citation xml:lang="en">Liu H.-T.D., Williams F., Jacobson A., Fidler S., and Litany O. Special Interest Group on Computer Graphics and Interactive Techniques Conference Proceedings, Vancouver BC Canada: ACM, Aug. 2022, pp. 1–13, DOI: 10.1145/3528233.3530713.</mixed-citation></citation-alternatives></ref><ref id="cit33"><label>33</label><citation-alternatives><mixed-citation xml:lang="ru">Shavit Y., Ferens R., Keller Y. Learning multi-scene absolute pose regression with transformers // Proc. of the IEEE/ CVF Intern. Conf. on Computer Vision. 2021. P. 2733–2742. [Электронный ресурс]: http://openaccess.thecvf.com/content/ICCV2021/html/Shavit_Learning_Multi-Scene_Absolute_Pose_Regression_With_Transformers_ICCV_2021_paper.html</mixed-citation><mixed-citation xml:lang="en">Shavit Y., Ferens R., and Keller Y. Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 2733–2742, http://openaccess.thecvf.com/content/ICCV2021/html/Shavit_Learning_Multi-Scene_Absolute_Pose_Regression_With_Transformers_ICCV_2021_paper.html.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
