The Community for Technology Leaders
Green Image
ISSN: 0162-8828
Andreas Geiger , MPI for Intelligent Systems in Tübingen and Karlsruhe Institute of Technology, Karlsruhe
Martin Lauer , Karlsruhe Institute of Technology, Karlsruhe
Christian Wojek , Carl Zeiss Corporate Research, Saarbrücken
Christoph Stiller , Karlsruhe Institute of Technology, Karlsruhe
Raquel Urtasun , Toyota Technological Institute at Chicago, Chicago
In this paper, we present a novel probabilistic generative model for multi-object traffic scene understanding from movable platforms which reasons jointly about the 3D scene layout as well as the location and orientation of objects in the scene. In particular, the scene topology, geometry and traffic activities are inferred from short video sequences. Inspired by the impressive driving capabilities of humans, our model does not rely on GPS, lidar or map knowledge. Instead, it takes advantage of a diverse set of visual cues in the form of vehicle tracklets, vanishing points, semantic scene labels, scene flow and occupancy grids. For each of these cues we propose likelihood functions that are integrated into a probabilistic generative model. We learn all model parameters from training data using contrastive divergence. Experiments conducted on videos of 113 representative intersections show that our approach successfully infers the correct layout in a variety of very challenging scenarios. To evaluate the importance of each feature cue, experiments using different feature combinations are conducted. Furthermore, we show how by employing context derived from the proposed method we are able to improve over the state-of-the-art in terms of object detection and object orientation estimation in challenging and cluttered urban environments.
Roads, Vehicles, Layout, Three-dimensional displays, Semantics, Splines (mathematics), Hidden Markov models, Robotics, Autonomous vehicles, Scene Analysis, Image Processing and Computer Vision

C. Stiller, C. Wojek, M. Lauer, A. Geiger and R. Urtasun, "3D Traffic Scene Understanding from Movable Platforms," in IEEE Transactions on Pattern Analysis & Machine Intelligence.
271 ms
(Ver 3.3 (11022016))