Epipolar geometry

Epipolar geometry

Epipolar geometry refers to the geometry of stereo vision. When two cameras view a 3D scene from two distinct positions, there are a number of geometric relations between the 3D points and their projections onto the 2D images that lead to constraints between the image points. These relations are derived based on the assumption that the cameras can be approximated by the pinhole camera model.

The figure below depicts two pinhole cameras looking at point X. In real cameras, the image plane is actually behind the focal point, and produces a rotated image. Here, however, the projection problem is simplified by placing a "virtual image plane" in front of the focal point of each camera to produce an unrotated image. OL and OR represent the focal points of the two cameras. X represents the point of interest in both cameras. Points xL and xR are the projections of point X onto the image planes.

Each camera captures a 2D image of the 3D world. This conversion from 3D to 2D is referred to as a perspective projection and is described by the pinhole camera model. It is common to model this projection operation by rays that emanate from the camera, passing through its focal point. Note that each emanating ray corresponds to a single point in the image.

Epipole

Since the two focal points of the cameras are distinct, each focal point projects onto a distinct point into the other camera's image plane. These two image points are denoted by EL and ER and are called "epipoles". Both epipoles EL and ER and both focal points OL and OR lie on a single line.

Epipolar line

The line OL-X is seen by the left camera as a point because it is directly in line with that camera's focal point. However, the right camera sees this line as a line in its image plane. That line (ER-xR) in the right camera is called an "epipolar line". Symmetrically, the line OR-X seen by the right camera as a point is seen as epipolar line EL-xLby the left camera.

Epipolar plane

As an alternative visualization, consider the points X, OL & OR that form a plane called the "epipolar plane". The epipolar plane intersects each camera's image plane where it forms lines - the epipolar lines. All epipolar lines intersect the epipole regardless of where X is located.

Epipolar constraint and triangulation

If the relative translation and rotation of the two cameras is known, the corresponding epipolar geometry leads to two important observations

* If the projection point xL is known, then the epipolar line ER - xR is known and the point X projects into the right image, on a point xR which must lie on this particular epipolar line. This means that for each point observed in one image the same point must be observed in the other image on a known epipolar line. This provides an "epipolar constraint" which corresponding image points must satisfy and it means that it is possible to test if two points really correspond to the same 3D point. Epipolar constraints can also be described by the essential matrix or the fundamental matrix between the two cameras.

* If the points xL and xR are known, their projection lines are also known. If the two image points correspond to the same 3D point X the projection lines must intersect precisely at X. This means that X can be calculated from the coordinates of the two image points, a process called "triangulation".

Simplified cases

The epipolar geometry is simplified if the two camera image planes coincide. In this case, the epipolar lines also coincide (EL-xL = ER-xR). Furthermore, the epipolar lines are parallel to the line OL - OR between the focal points, and can in practice be aligned with the horizontal axes of the two images. This means that for each point in one image, its corresponding point in the other image can be found by looking only along a horizontal line. If the cameras cannot be positioned in this way, the image coordinates from the cameras may be transformed to emulate having a common image plane. This process is called image rectification.

References

*cite book
author=Richard Hartley and Andrew Zisserman
title=Multiple View Geometry in computer vision
publisher=Cambridge University Press
year=2003
id=ISBN 0-521-54051-8

*cite web
author=Quang-Tuan Luong
title=Learning Epipolar Geometry
url=http://www.ai.sri.com/~luong/research/Meta3DViewer/EpipolarGeo.html
accessdate=2007-03-04

*cite web
author=Robyn Owens
title=Epipolar geometry
url=http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/OWENS/LECT10/node3.html
accessdate=2007-03-04

*cite book
author=Linda G. Shapiro and George C. Stockman
title=Computer Vision
publisher=Prentice Hall
year=2001
id=ISBN 0-13-030796-3
pages=395-403


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Fundamental matrix (computer vision) — In computer vision, the fundamental matrix mathbf{F} is a 3 imes 3 matrix of rank 2 which relates corresponding points in stereo images. In epipolar geometry, with homogeneous image coordinates mathbf{y 1} and mathbf{y 2} of corresponding points… …   Wikipedia

  • Kernstrahlgeometrie — Zwei Kameras nehmen eine Szene auf. Die Epipolargeometrie beschreibt die Beziehung zwischen den beiden Bildern. Die Epipolargeometrie (selten auch Kernstrahlgeometrie) ist ein mathematisches Modell aus der Geometrie, das die geometrischen… …   Deutsch Wikipedia

  • Stereoanalyse — Zwei Kameras nehmen eine Szene auf. Die Epipolargeometrie beschreibt die Beziehung zwischen den beiden Bildern. Die Epipolargeometrie (selten auch Kernstrahlgeometrie) ist ein mathematisches Modell aus der Geometrie, das die geometrischen… …   Deutsch Wikipedia

  • Image rectification — is a transformation process used to project multiple images onto a common image surface. It is used to correct a distorted image into a standard coordinate system. *It is used in computer stereo vision to simplify the problem of finding matching… …   Wikipedia

  • Triangulation (computer vision) — In computer vision triangulation refers to the process of determining a point in 3D space given its projections onto two, or more, images. In order to solve this problem it is necessary to know the parameters of the camera projection function… …   Wikipedia

  • Epipolargeometrie — Zwei Kameras nehmen von unterschiedlichen Standpunkten eine Szene auf. Die Epipolargeometrie beschreibt die Beziehung zwischen den beiden Bildern. Die Epipolargeometrie (selten auch Kernstrahlgeometrie) ist ein mathematisches Modell aus der… …   Deutsch Wikipedia

  • Cross product — This article is about the cross product of two vectors in three dimensional Euclidean space. For other uses, see Cross product (disambiguation). In mathematics, the cross product, vector product, or Gibbs vector product is a binary operation on… …   Wikipedia

  • Maximally stable extremal regions — Feature detection Output of a typical corner detection algorithm …   Wikipedia

  • Stereopsis — (from meaning solidity, and opsis meaning vision or sight) is the process in visual perception leading to the sensation of depth from the two slightly different projections of the world onto the retinas of the two eyes. The differences in the two …   Wikipedia

  • Essential matrix — In computer vision, the essential matrix is a 3 imes 3 matrix mathbf{E} , with some additional properties, which relates corresponding points in stereo images assuming that the cameras satisfy the pinhole camera model.FunctionMore specifically,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”