Post

Chapter 2 - WACV 2025 Paper Accepted! πŸŽ‰

The PhD's second chapter has been accepted into WACV 2025! This chapter builds upon the first, aiming to progress the whole field towards practicality.

Chapter 2 - WACV 2025 Paper Accepted! πŸŽ‰

πŸŒπŸš— SpaGBOL: Spatial-Graph-Based Orientated Localisation πŸ“‘πŸ—ΊοΈ

Tavis Shore Oscar Mendez Simon Hadfield

Centre for Vision, Speech, and Signal Processing (CVSSP)

University of Surrey, Guildford, GU2 7XH, United Kingdom

πŸ““ Description

Cross-View Geo-Localisation within urban regions is challenging in part due to the lack of geo-spatial structuring within current datasets and techniques. We propose utilising graph representations to model sequences of local observations and the connectivity of the target location. Modelling as a graph enables generating previously unseen sequences by sampling with new parameter configurations. To leverage this newly available information, we propose a GNN-based architecture, producing spatially strong embeddings and improving discriminability over isolated image embeddings.

We release 🍝 SpaGBOL 🍝, the first graph-based CVGL dataset, consisting of 10 city centre graph networks across the globe. This densely sampled structured dataset will progress the CVGL field towards real-world viability.

πŸ’Ύ SpaGBOL: Graph-Based CVGL Dataset

SpaGBOL contains 98,855 panoramic streetview images across different seasons, and 19,771 corresponding satellite images from 10 mostly densely populated international cities. This translates to 5 panoramic images and one satellite image per graph node. Downloading instructions below.

The map below shows the cities contained in SpaGBOL v1, with the breadth and density being increased in subsequent versions.

πŸ“ City Locations πŸ‡¬πŸ‡§πŸ‡§πŸ‡ͺπŸ‡ΊπŸ‡ΈπŸ‡­πŸ‡°πŸ‡ΈπŸ‡¬πŸ‡―πŸ‡΅

plot_world

🧬 City Graph Representations

Here are a few of the city centre graph networks, where nodes represent road junctions, and edges represent the roads between junctions.

London Graph Manhattan Graph Tokyo Graph
City of LondonManhattan CentreTokyo Centre

πŸ“Έ Image Pair Examples

At each graph node, streetview and satellite images are collected at a ratio of 5:1 to improve training generalisation, here are some examples from across the globe.

image pair image pair image pair

🚢 Exhaustive / Random Depth-First Walk Generation

Image Graph networks can be traversed using Breadth-First Search
(BFS) or Depth-First Search (DFS). BFS explores level by level,
visiting all neighbors of a node before moving deeper, using a
queue. DFS dives into a branch fully before backtracking,
often using a stack or recursion. BFS is ideal for shortest
paths, while DFS suits tasks like cycle detection or exploring
all paths.
DFS relates to a vehicle’s movement by mimicking how a
vehicle explores routes sequentially. This approach is
useful for navigating unmapped areas or exploring all
possible routes systematically. Reference sets contain
exhaustive sampling of each node, retrieving any one of
these random walks is deemed correct.

🧰 SpaGBOL: Benchmarking

🚧 Under Construction

🐍 Environment Setup

1
    conda env create -f requirements.yaml

🏭 Data Download

1

β˜• Submodule Pretraining

1

πŸ‘Ÿ SpaGBOL Training

1

🧐 SpaGBOL Evaluation

1

πŸ“Š SpaGBOL: Benchmark Results

FOV360Β°180Β°90Β°
ModelTop-1Top-5Top-10Top-1%Top-1Top-5Top-10Top-1%Top-1Top-5Top-10Top-1%
CVM2.8712.9621.5128.332.689.8315.1220.231.025.8710.1514.81
CVFT4.0213.0220.2927.192.498.7414.6119.911.215.7410.0213.53
DSM5.8210.2114.1318.623.339.7414.661.595.8710.1116.24
L2LTR11.2331.2742.5049.525.9418.3228.5335.236.1318.7027.9534.08
GeoDTR+17.4940.2752.0159.419.0625.4635.6743.335.5517.0424.3131.78
SAIG-D25.6551.4462.2968.2215.1235.5545.6353.107.4021.7631.1437.14
Sample4Geo50.8074.2279.9682.3237.5264.5271.9276.396.5120.6130.3136.12
SpaGBOL56.4877.4783.8587.2440.8863.7972.8878.2818.6343.2054.0561.20
SpaGBOL+B64.0186.5492.0994.6452.0182.2089.4793.62---
SpaGBOL+YB76.1395.2197.9698.9866.8292.6996.3897.30----

βœ’οΈ Citation

If you find SpaGBOL useful for your work please cite:

1
2
3
4
5
6
7
@InProceedings{Shore_2025_WACV,
    author    = {Shore, Tavis and Mendez, Oscar and Hadfield, Simon},
    title     = {SpaGBOL: Spatial-Graph-Based Orientated Localisation},
    booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month     = {February},
    year      = {2025}
}

🦜 BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation

Β Β Β Β Β  arxiv Conference GitHub License

⭐ Star History

star_dark star

This post is licensed under CC BY 4.0 by the author.