The most commonly accepted hypothesis states that a 3D-point M is projected with a perspective projection onto an image plane on a 2D-point . Choosing a reference frame attached to the camera, the projection equation is:
This model can be refined, by taking optical distortions into account [18,4,7]. In this paper, we will consider that the needed corrections have been done as a preprocessing.
Two approximations have been proposed in the literature :
The perspective projection model may be approximated [2,15,12,13,14] to its first order with respect to the 3D coordinates. This is equivalent to approximate the perspective projection in two steps: (i) a projection parallel to the gaze direction onto an auxiliary plane which is parallel to the image plane and passes through the scene center followed by (ii) a perspective projection onto the image plane. This so called para-perspective model yields linear equations (2).
Equation 2 corresponds to the most general case of para-perspective projection although more simple expressions have been proposed [16].
The zero-order development with respect to the 3D depth consists in a rougher approximation. It is also equivalent to another two steps approximation: (i) an orthogonal projection onto the auxiliary plane followed by (ii) a perspective projection onto the image plane. This approximation, called the orthographic model (4), is well adapted to foveal attention and is characterized by linear equations without any new parameter. It is an approximation of the para-perspective model when the observed objects are in the fovea, i.e. close to the optical axis :
projection case | ||
perspective projection | 1 | 1 |
orthographic projection | 0 | 0 |
para-perspective projection | 1 | 0 |
Let I1 and I2 denote two images. In the general case, there exists a fundamental relation between points in I2 and points in I1 : where is called the fundamental matrix [9].
However, this relation is not defined in some singular cases. For example, it is well known that, in the perspective projection case, if the displacement is a pure rotation or, if the scene is planar, the relation between points is homographic : where is called the homographic matrix.
Thus .
Remember that and
Let
and .
Previous equations lead to :
This relation is homographic if and only if or if there exists a (33) matrix such as . The first condition induces a displacement constraint. It leads to the simple equation meaning that the rotation axis is parallel to the gaze direction. The second condition induces a geometric relation on the 3D point : Z1 is an affine function of X1 and Y1, meaning that the 3D points must belong to a plane , which cannot contain the optical axis and the gaze direction (see [13] for a demonstration).
The orthographic case is a particular case of para-perspective projection for which the gaze direction is the optical axis. Following a demonstration similar to the para-perspective case, we also obtain two constraints; the displacement constraint states that the rotation axis must be parallel to the optical axis, and the geometric constraint states that the 3D-points must belong to the same plane which does not contain the optical axis. All constraints on displacement and scene geometry for homographic relations are summarized in the following table :
projection | displacement constraint | geometric constraint |
perspective | plane | |
para-perspective | plane Z=f(X,Y) | |
orthographic | plane Z=f(X,Y) |