Post

Chapter 1 - IROS 2024 Paper Accepted! πŸŽ‰

The first paper within my PhD has been accepted for oral presentation at IROS in Abu Adhabi πŸ”₯

Chapter 1 - IROS 2024 Paper Accepted! πŸŽ‰

🦜🌍 BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation πŸ“‘πŸ—ΊοΈ

Tavis Shore Simon Hadfield Oscar Mendez

Centre for Vision, Speech, and Signal Processing (CVSSP)

University of Surrey, Guildford, GU2 7XH, United Kingdom

πŸ““ Description

Cross-view image matching for geo-localisation is a challenging problem due to the significant visual difference between aerial and ground-level viewpoints. The method provides localisation capabilities from geo-referenced images, eliminating the need for external devices or costly equipment. This enhances the capacity of agents to autonomously determine their position, navigate, and operate effectively in GNSS-denied environments. Current research employs a variety of techniques to reduce the domain gap such as applying polar transforms to aerial images or synthesising between perspectives. However, these approaches generally rely on having a 360Β° field of view, limiting real-world feasibility. We propose BEV-CV, an approach introducing two key novelties with a focus on improving the real-world viability of cross-view geo-localisation. Firstly bringing ground-level images into a semantic Birds-Eye-View before matching embeddings, allowing for direct comparison with aerial image representations. Secondly, we adapt datasets into application realistic format - limited Field-of-View images aligned to vehicle direction. BEV-CV achieves state-of-the-art recall accuracies, improving Top-1 rates of 70Β° crops of CVUSA and CVACT by 23% and 24% respectively. Also decreasing computational requirements by reducing floating point operations to below previous works, and decreasing embedding dimensionality by 33% - together allowing for faster localisation capabilities.

πŸ“Š Benchmark Results

ModelOrientation
Aware
R@1R@5R@10R@1%R@1R@5R@10R@1\%
CVUSA 90Β°CVUSA 70Β°
CVM❌2.7610.1116.7455.492.629.3015.0621.77
CVFT❌4.8014.8423.1861.233.7912.4419.3355.56
DSM❌16.1931.4439.8571.138.7819.9027.3061.20
L2LTR❌26.9250.4960.4186.8813.9533.0743.8677.65
TransGeo❌30.1254.1863.9689.1816.4337.2848.0280.75
GeoDTR❌18.8143.3657.9488.1414.8438.0351.2788.17
BEV-CV❌15.1733.9145.3382.5314.0332.3243.2581.48
GALβ‰ˆ22.5444.3654.1784.5915.2032.8642.0675.21
DSMβœ…33.6651.7059.6882.4620.8836.9944.7071.10
L2LTRβœ…25.2151.9063.5491.1622.2046.7158.9989.37
TransGeoβœ…21.9645.3556.4986.8017.2738.9549.4481.34
GeoDTRβœ…15.2139.3252.2788.7214.0035.2847.7786.39
BEV-CVβœ…32.1158.3669.0692.9927.4052.9464.4790.94
CVACT 90Β°CVACT 70Β°
CVM❌1.475.709.6438.051.244.988.4234.74
CVFT❌1.856.2810.5439.251.495.138.1934.59
DSM❌18.1133.3440.9468.658.2920.7227.1357.08
L2LTR❌13.0730.3841.0076.076.6715.9423.4549.37
TransGeo❌10.7528.2237.5170.157.0119.4427.5062.19
GeoDTR❌26.5353.2664.5991.1316.8740.2253.1387.92
BEV-CV❌4.1414.4622.6461.183.9213.5020.5359.34
GALβ‰ˆ26.0549.2359.2685.6014.1732.9643.2477.49
DSMβœ…31.1751.4460.0582.9018.4435.8744.3971.97
L2LTRβœ…33.6246.2858.2178.6228.6553.5965.0290.48
TransGeoβœ…28.1634.4441.5467.1524.0542.6855.4780.72
GeoDTRβœ…26.7653.6565.3592.1215.3837.0949.4086.38
BEV-CVβœ…45.7975.8583.9796.7637.8569.0078.5295.03

🍝 SpaGBOL: Spatial-Graph-Based Orientated Localisation

Β Β Β Β Β  arxiv Conference GitHub License

βœ’οΈ Citation

If you find BEV-CV useful for your work please cite:

1
2
3
4
5
6
7
@INPROCEEDINGS{bevcv,
    author={Shore, Tavis and Hadfield, Simon and Mendez, Oscar },
    booktitle={2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)}, 
    title={BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation}, 
    year={2024},
    pages={11047-11054},
}

⭐ Star History

star_dark star

This post is licensed under CC BY 4.0 by the author.