Pose (computer vision)

In computer vision and in robotics, a typical task is to identify specific objects in an image and to determine each object's position and orientation relative to some coordinate system. This information can then be used, for example, to allow a robot to manipulate an object or to avoid moving into the object. The combination of "position" and "orientation" is referred to as the pose of an object, even though this concept is sometimes used only to describe the orientation. "Exterior orientation" and "Translation" are also used as a synonym to pose.

The image data from which the pose of an object is determined can be either a single image, a stereo image pair, or an image sequence where, typically, the camera is moving with a known speed. The objects which are considered can be rather general, including a living being or parts of a living being, e.g., a head or hands. The methods which are used for determining the pose of an object, however, are usually specific for a class of objects and cannot be expected to work well for other types of objects.

The pose can be described by means of a rotation and translation transformation which brings the object from a reference pose to the observed pose. This rotation transformation can be represented in different ways, e.g., as a rotation matrix or a quaternion.

Pose Estimation

The specific task of determining the pose of an object in an image (or stereo images, image sequence) is referred to as "pose estimation". The pose estimation problem can be solved in different ways depending on the image sensor configuration, and choice of methodology. Two classes of methodologies can be distinguished:

* "Analytic or geometric methods". Given that the image sensor (camera) is calibrated the mapping from 3D points in the scene and 2D points in the image is known. If also the geometry of the object is known, it means that the projected image of the object on the camera image is a well-known function of the object's pose. Once a set of control points on the object, typically corners or other feature points, has been identified it is then possible to solve the pose transformation from a set of equations which relate the 3D coordinates of the points with their 2D image coordinates.

* "Learning based methods". These methods use artificial learning-based system which learn the mapping from 2D image features to pose transformation. In short, this means that a sufficiently large set of images of the object, in different poses, must be presented to the system during a learning phase. Once the learning phase is completed, the system should be able to present an estimate of the object's pose given an image of the object.

ee also

*homography
*camera calibration
*3D Pose Estimation

Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

Computer vision — is the field concerned with automated imaging and automated computer based processing of images to extract and interpret information. It is the science and technology of machines that see. Here see means the machine is able to extract information … Wikipedia
Object recognition (computer vision) — Feature detection Output of a typical corner detection algorithm … Wikipedia
List of computer vision topics — This is a list of computer vision and image processing topics Contents 1 Image enhancement 2 Transformations 3 Filtering, Fourier and wavelet transforms and image compression … Wikipedia
Pose (disambiguation) — Pose can refer to:*To position oneself (ex. magazine photos). See also, human position.Computers and technology*Pose (computer vision) *Palm OS Emulator, abbreviated as POSEMusic*Pose (Daddy Yankee song), a song by Daddy Yankee, from the album… … Wikipedia
Computer security compromised by hardware failure — is a branch of computer security applied to hardware. The objective of computer security includes protection of information and property from theft, corruption, or natural disaster, while allowing the information and property to remain accessible … Wikipedia
computer — computerlike, adj. /keuhm pyooh teuhr/, n. 1. Also called processor. an electronic device designed to accept data, perform prescribed mathematical and logical operations at high speed, and display the results of these operations. Cf. analog… … Universalium
Articulated body pose estimation — Articulated body pose estimation, in computer vision, is the study of algorithms and systems that recover the pose of an articulated body, which consists of joints and rigid parts using image based observations. It is one of longest lasting… … Wikipedia
3D computer graphics — This article is about the process of creating 3D computer graphics. For information on the study of computer graphics, see Computer graphics. 3D computer graphics … Wikipedia
Brain–computer interface — Neuropsychology Topics Brain computer interface … Wikipedia
List of computer-animated films — A computer animated film commonly refers to feature films that have been computer animated to appear three dimensional on a movie screen. While traditional 2D animated films are now done primarily with the help of computers, the technique to… … Wikipedia

Academic Dictionaries and Encyclopedias

Pose (computer vision)

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Pose (computer vision)

Look at other dictionaries:

Share the article and excerpts

Direct link