Master Thesis

Fusion of GPS and Visual SLAM
to improve localization of autonomous vehicles
in urban environments.

by Adam Kalisz

Agenda

Last time (Recap)
This time
Comparison of various VSLAM Algorithms
Papers on VSLAM and GPS Fusion
Interesting 3D Reconstruction ideas
Conclusion / Feeling

Last time

Demo: DSO (Direct Sparse Odometry)

This time

VSLAM
Comparison time!

Blender 3D Motion Tracker:
Libmv multiview reconstruction
and tracking library.

https://developer.blender.org/tag/libmv/

Pros and Cons:

Compact, field-tested and robust solution
MIT license
Research benefits a huge community
Maintained by Blender Devs (updates!)
Personal contact to Keir Mierle (Dev from BCon17)
Feature-based (i.e. SURF)
Sparse reconstruction
No ROS-integration

CMVS / PMVS2:
Clustering Views for Multi-view Stereo (CMVS)
Patch-based Multi-view Stereo Software (PMVS)

http://www.di.ens.fr/cmvs/

Pros and Cons:

Dense reconstruction
GPL license
Used by ILM, Weta and Google
Feature-based (Based on SfM output via SIFT)
No ROS-integration
Crashed during my tests on Windows
Last update: 7 years ago

PTAM-GPL:
PTAM (Parallel Tracking and Mapping)
re-released under GPLv3.

http://www.robots.ox.ac.uk/~gk/PTAM/

Pros and Cons:

GPL license
Feature-based (FAST-10, machine-learned D-Tree)
Sparse reconstruction
Not robust in any environment
No ROS-integration
Last update: 3 years ago

ORB-SLAM2: Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

https://github.com/raulmur/ORB_SLAM2

Pros and Cons:

Feature-rich solution (Mono, Stereo, RGB-D)
GPLv3 license
last PR-commit: 1 month ago
ROS-integration available (optional)
Feature-based (ORB, combined FAST+BRIEF)
Sparse reconstruction
292 issues on GitHub!

LSD Slam:
Large-Scale Direct Monocular SLAM

https://vision.in.tum.de/research/vslam/lsdslam

Pros and Cons:

Mature and feature-rich solution
GPLv3 license
ROS-integration available (optional, rosmake)
Direct (no features!)
Semi-dense reconstruction in real-time
176 issues on GitHub!
last PR-commit: 3 years ago

DSO:
Direct Sparse Odometry

https://vision.in.tum.de/research/vslam/dso

Pros and Cons:

Feature-rich solution
GPLv3 license
last PR-commit: 1 month ago
ROS-integration available (optional)
Direct (no features!)
A lot of great documentation (Youtube!)
Semi-dense reconstruction
Stereo-DSO (dense) not yet published

Papers on VSLAM and GPS Fusion

A hybrid bundle adjustment/pose-graph approach to VSLAM/GPS fusion for low-capacity platforms
(Salehi, Gay-bellile, Bourgeois, Chausse (2017))
Vision-based differential GPS : Improving VSLAM / GPS fusion in urban environment with 3D building models
(Gay-Bellile, Bourgeois, Dhome (2014))
A Multi-Sensorial Simultaneous Localization and Mapping (SLAM) System for Low-Cost Micro Aerial Vehicles in GPS-Denied Environments
(López, García, Barea et al. (2017))
GPS-Supported Visual SLAM with a Rigorous Sensor Model for a Panoramic Camera in Outdoor Environments
(Shi, Ji, Shibasaki et al. (2012))

Interesting 3D Reconstruction ideas

Variational methods for dense 3D reconstruction

Do not tell the algorithm how to do reconstruction, but rather what result it should deliver.

Class of optimization methods using mathematical models instead of a series of processing steps to produce a result.

Mathematical analysis of cost function. Example: Perturbation of a sphere to get a complex object (Blender demo)

Variational methods for 3D reconstruction

$ E(u) = \int\limits_{\Omega}^{} $$(u - f)^2$ + $\lambda | \nabla u | dx $
$ E(u) = \int\limits_{\Omega}^{} $$I_0(x) - I_i($$\pi$($g_\xi$($u$ $\cdot$ $x$$)) )$ + $\lambda | \nabla u | dx $

$\pi$ = projection; $g_\xi$ = translation, rotation;
u = distance; x = 2D image point (homogenous).

Data term: local assignment costs (photometric error)

Regularization term: length of interface

Basic idea: We take the sum over the whole image given some constraints and try to minimize it

Imposing Silhouette Consistency
[Cremers, Kolev, PAMI 2011]

Constrained optimization problem

Raycasting through pixel of image into voxelgrid, to find intersection with geometry

Integral of voxels along ray >= 1 if surface, 0 else

Implicit or explicit representation of mesh surface

Problem: Silhouette! Difficult to determine against busy background

Conclusion

Try out direct method with own calibration and Langwasser dataset to get rather dense result?
Do tests with point cloud for 3D Segmentation in order to detect street signs?
Begin reading suitable literature thoroughly?
PLACEHOLDER - ADD SMART IDEAS

Master Thesis

Fusion of GPS and Visual SLAM to improve localization of autonomous vehicles in urban environments.

Agenda

Last time

Demo: DSO (Direct Sparse Odometry)

This time

VSLAMComparison time!

Blender 3D Motion Tracker:Libmv multiview reconstructionand tracking library.

Pros and Cons:

CMVS / PMVS2:Clustering Views for Multi-view Stereo (CMVS)Patch-based Multi-view Stereo Software (PMVS)

Pros and Cons:

PTAM-GPL:PTAM (Parallel Tracking and Mapping)re-released under GPLv3.

Pros and Cons:

ORB-SLAM2: Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

Pros and Cons:

LSD Slam:Large-Scale Direct Monocular SLAM

Pros and Cons:

DSO:Direct Sparse Odometry

Pros and Cons:

Papers on VSLAM and GPS Fusion

Interesting 3D Reconstruction ideas

Variational methods for dense 3D reconstruction

Variational methods for 3D reconstruction

Imposing Silhouette Consistency[Cremers, Kolev, PAMI 2011]

Conclusion

Feeling

Thank you!

Fusion of GPS and Visual SLAM
to improve localization of autonomous vehicles
in urban environments.

VSLAM
Comparison time!

Blender 3D Motion Tracker:
Libmv multiview reconstruction
and tracking library.

CMVS / PMVS2:
Clustering Views for Multi-view Stereo (CMVS)
Patch-based Multi-view Stereo Software (PMVS)

PTAM-GPL:
PTAM (Parallel Tracking and Mapping)
re-released under GPLv3.

LSD Slam:
Large-Scale Direct Monocular SLAM

DSO:
Direct Sparse Odometry

Imposing Silhouette Consistency
[Cremers, Kolev, PAMI 2011]