Computer Graphics Tutorial on 2D Transformation

transformation means changing some graphics into something else by applying rules. we can have various types of transformations such as translation, scaling up or down, rotation, shearing, etc. when a transformation takes place on a 2d plane, it is called 2d transformation.

transformations play an important role in computer graphics to reposition the graphics on the screen and change their size or orientation.

homogenous coordinates

to perform a sequence of transformation such as translation followed by rotation and scaling, we need to follow a sequential process −

  • translate the coordinates,
  • rotate the translated coordinates, and then
  • scale the rotated coordinates to complete the composite transformation.

to shorten this process, we have to use 3×3 transformation matrix instead of 2×2 transformation matrix. to convert a 2×2 matrix to 3×3 matrix, we have to add an extra dummy coordinate w.

in this way, we can represent the point by 3 numbers instead of 2 numbers, which is called homogenous coordinate system. in this system, we can represent all the transformation equations in matrix multiplication. any cartesian point p(x, y) can be converted to homogenous coordinates by p’ (xh, yh, h).

translation

a translation moves an object to a different position on the screen. you can translate a point in 2d by adding translation coordinate (tx, ty) to the original coordinate (x, y) to get the new coordinate (x’, y’).

translation

from the above figure, you can write that −

x’ = x + tx

y’ = y + ty

the pair (tx, ty) is called the translation vector or shift vector. the above equations can also be represented using the column vectors.

$p = \frac{[x]}{[y]}$ p' = $\frac{[x']}{[y']}$t = $\frac{[t_{x}]}{[t_{y}]}$

we can write it as −

p’ = p + t

rotation

in rotation, we rotate the object at particular angle θ (theta) from its origin. from the following figure, we can see that the point p(x, y) is located at angle φ from the horizontal x coordinate with distance r from the origin.

let us suppose you want to rotate it at the angle θ. after rotating it to a new location, you will get a new point p’ (x’, y’).

rotation

using standard trigonometric the original coordinate of point p(x, y) can be represented as −

$x = r \, cos \, \phi ...... (1)$

$y = r \, sin \, \phi ...... (2)$

same way we can represent the point p’ (x’, y’) as −

${x}'= r \: cos \: \left ( \phi \: + \: \theta \right ) = r\: cos \: \phi \: cos \: \theta \: − \: r \: sin \: \phi \: sin \: \theta ....... (3)$

${y}'= r \: sin \: \left ( \phi \: + \: \theta \right ) = r\: cos \: \phi \: sin \: \theta \: + \: r \: sin \: \phi \: cos \: \theta ....... (4)$

substituting equation (1) & (2) in (3) & (4) respectively, we will get

${x}'= x \: cos \: \theta − \: y \: sin \: \theta $

${y}'= x \: sin \: \theta + \: y \: cos \: \theta $

representing the above equation in matrix form,

$$[x' y'] = [x y] \begin{bmatrix} cos\theta & sin\theta \\ −sin\theta & cos\theta \end{bmatrix}or $$

p’ = p . r

where r is the rotation matrix

$$r = \begin{bmatrix} cos\theta & sin\theta \\ −sin\theta & cos\theta \end{bmatrix}$$

the rotation angle can be positive and negative.

for positive rotation angle, we can use the above rotation matrix. however, for negative angle rotation, the matrix will change as shown below −

$$r = \begin{bmatrix} cos(−\theta) & sin(−\theta) \\ -sin(−\theta) & cos(−\theta) \end{bmatrix}$$

$$=\begin{bmatrix} cos\theta & −sin\theta \\ sin\theta & cos\theta \end{bmatrix} \left (\because cos(−\theta ) = cos \theta \; and\; sin(−\theta ) = −sin \theta \right )$$

scaling

to change the size of an object, scaling transformation is used. in the scaling process, you either expand or compress the dimensions of the object. scaling can be achieved by multiplying the original coordinates of the object with the scaling factor to get the desired result.

let us assume that the original coordinates are (x, y), the scaling factors are (sx, sy), and the produced coordinates are (x’, y’). this can be mathematically represented as shown below −

x' = x . sx and y' = y . sy

the scaling factor sx, sy scales the object in x and y direction respectively. the above equations can also be represented in matrix form as below −

$$\binom{x'}{y'} = \binom{x}{y} \begin{bmatrix} s_{x} & 0\\ 0 & s_{y} \end{bmatrix}$$

or

p’ = p . s

where s is the scaling matrix. the scaling process is shown in the following figure.

before scaling after scaling

if we provide values less than 1 to the scaling factor s, then we can reduce the size of the object. if we provide values greater than 1, then we can increase the size of the object.

reflection

reflection is the mirror image of original object. in other words, we can say that it is a rotation operation with 180°. in reflection transformation, the size of the object does not change.

the following figures show reflections with respect to x and y axes, and about the origin respectively.

reflection reflection line

shear

a transformation that slants the shape of an object is called the shear transformation. there are two shear transformations x-shear and y-shear. one shifts x coordinates values and other shifts y coordinate values. however; in both the cases only one coordinate changes its coordinates and other preserves its values. shearing is also termed as skewing.

x-shear

the x-shear preserves the y coordinate and changes are made to x coordinates, which causes the vertical lines to tilt right or left as shown in below figure.

x-shear

the transformation matrix for x-shear can be represented as −

$$x_{sh} = \begin{bmatrix} 1& shx& 0\\ 0& 1& 0\\ 0& 0& 1 \end{bmatrix}$$

y' = y + shy . x

x’ = x

y-shear

the y-shear preserves the x coordinates and changes the y coordinates which causes the horizontal lines to transform into lines which slopes up or down as shown in the following figure.

y-shear

the y-shear can be represented in matrix from as −

$$y_{sh} \begin{bmatrix} 1& 0& 0\\ shy& 1& 0\\ 0& 0& 1 \end{bmatrix}$$

x’ = x + shx . y

y’ = y

composite transformation

if a transformation of the plane t1 is followed by a second plane transformation t2, then the result itself may be represented by a single transformation t which is the composition of t1 and t2 taken in that order. this is written as t = t1∙t2.

composite transformation can be achieved by concatenation of transformation matrices to obtain a combined transformation matrix.

a combined matrix −

[t][x] = [x] [t1] [t2] [t3] [t4] …. [tn]

where [ti] is any combination of

  • translation
  • scaling
  • shearing
  • rotation
  • reflection

the change in the order of transformation would lead to different results, as in general matrix multiplication is not cumulative, that is [a] . [b] ≠ [b] . [a] and the order of multiplication. the basic purpose of composing transformations is to gain efficiency by applying a single composed transformation to a point, rather than applying a series of transformation, one after another.

for example, to rotate an object about an arbitrary point (xp, yp), we have to carry out three steps −

  • translate point (xp, yp) to the origin.
  • rotate it about the origin.
  • finally, translate the center of rotation back where it belonged.