Ray Tracing
Rendering is a process that takes as inputs a set of objects and produces an array of pixels. It can be organized in 2 ways
- Object-order rendering: Where the objects are processed one by one
- Image-order rendering: Where the pixels are processed one by one
Ray tracing is an image-order algorithm
Basic Algorithm
A basic ray tracer consists of three parts
- Ray generation : Computes origin and direction of each pixel's viewing ray based on camera geometry
- Ray intersection : Finds the intersection of the viewing ray with the scene geometry
- Lighting : Computes the color of the pixel based on the result of ray-object intersection
for each pixel in the image:
compute viewing ray
find first object hit by the ray and its surface normal n
shade the pixel using hit-point, light and n
Ray Generation or Computing Viewing Ray
- basic tools for ray generation are the viewpoint and the image plane
- Ray representation \(p(t) = e + t (s - e)\), where \(e\) is eye and \(s\) is a point on the image plane
- Point \(e\) is ray's origin and (s - e) is ray's direction
- Ray generation takes place from the camera frame
- position \(e\) (eye point)
- \(\mathbf{u}, \mathbf{v}, \mathbf{w}\) are basis vectors
- Choose \(-w\) as view direction and an up vector, using which we construct the basis vectors
Orthographic Views
- All rays will have direction \(- \mathbf{w}\)
- A viewpoint is not require but viewing rays can start from the plane containing the camera, so we know when object is behind the camera.
- Viewing rays start on the image plane and are parallel to each other, and cab be defined by point \(e\) and vectors \(\mathbf{u}\) and \(\mathbf{v}\)
- \(l\) and \(r\) are left and right limits of the image plane (measured along \(\mathbf{u}\)) \(l < 0 < r\)
- \(b\) and \(t\) are bottom and top limits of the image plane (measured along \(\mathbf{v}\)) \(b < 0 < t\)
Pixel spacing for \(n_x \times n_y\) image is
- Horizontal : \(\Large \frac{r - l}{n_x}\)
- Vertical : \(\Large \frac{t - b}{n_y}\)
Therefore, pixel position \((i, j)\) in the raster image is
- \(u = l + \Large \frac{(r - l)(i + 0.5)}{n_x}\)
- \(v = b + \Large \frac{(t - b)(j + 0.5)}{n_y}\)
Where \(u, v\) are the coordinates of the pixel in the image plane w.r.t origin \(e\) and basis vectors \(\mathbf{u} , \mathbf{v}\)
Therefore, Ray parameters are
- Origin: \(e + u * \mathbf{u} + v * \mathbf{v}\)
- Direction : \(-\mathbf{w}\)
Perspective Views
- All rays will have different directions for different pixels
- All rays will have origin at the eye point
- Image plane positioned at distance \(d\) from the eye point \(e\)
- Ray parameters are
- Origin: \(e\)
- Direction: \(-d * \mathbf{w} + u * \mathbf{u} + v * \mathbf{v}\)
Where \(u, v\) are the coordinates of the pixel in the image plane w.r.t origin \(e\) and basis vectors \(\mathbf{u} , \mathbf{v}\)
Note
In some sense, the origin in orthographic view is direction in perspective view.
Ray intersection
- Given a generated ray \(p(t) = e + t \cdot d\), find intersection (first hit) with the scene geometry such that \(t > 0\)
- Given a ray \(p(t) = e + t \cdot d\), and an implicit surface \(f(p) = 0\), the intersection point is found by solving \(f(e + t \cdot d) = 0\)
- There can be multiple solutions for \(t\), but we are interested in the smallest positive solution
For Sphere
- Parametric equation for any point \(p\) on a sphere with center \(c\) and radius \(r\) is
- For finding points of intersection of a ray with a sphere, we substitute \(p = e + t \cdot d\) in the above equation and solve for \(t\)
This gives the solution
Normal for Sphere
- The normal vector at \(\mathbf{p}\) on the implicit surface \(f(p)\) is given by
- For sphere, the normal at point \(p\) is
For Triangle
- Parametric equation for a triangle is
-
3 unknowns \(t, u, v\) and 3 equations \((x, y, z)\) for the triangle. We can solve for \(t, u, v\) and check if \(u, v\) are within the range of the triangle.
-
For a parametric plane, the parametric surface can be represented in terms of any three points in the plane. Utilize barycentric coordinates for ray-triangle test
-
For a triangle with vertices \(\mathbf{a}, \mathbf{b}, \mathbf{c}\) intersection will occur when
- Intersection is inside the triangle if \(\beta, \gamma \geq 0\) and \(\beta + \gamma \leq 1\)
- To solve for \(t, \beta, \gamma\), we can write the above equation as
This can be written in matrix form as
-
Let \(A\) be the matrix on the left, \(B\) be the matrix on the right, then using Cramer's rule, we can solve for \(\beta, \gamma, t\) as follows
-
Let \(A_i\) be the matrix obtained by replacing \(i^{th}\) column of \(A\) with the column of \(B\), then values of \(\beta, \gamma, t\) are
Ray-Polygon Intersection
Given \(m\) vertices \(p_1, p_2, \ldots, p_m\) of a polygon and surface normal \(n\) - Compute the intersection point between ray \(\mathbf{e} + t \mathbf{d}\) and the plane containing the polygon \((p - p_1) \cdot n = 0\)
- If \(\mathbf{p}\) is inside the polygon, then the ray hits it.
Point-in-Polygon Test
Ray Polygon Intersection or Crossing Number Algorithm
- Send a 2D ray out from \(p\) and count the number of intersections between the ray and the polygon
- If the intersection count is odd, then \(p\) is inside the polygon
Warning
It's a ray, so only one way
- Easily done by checking the sign of the cross product of the vectors from \(p\) to the vertices of the polygon
Winding Number Algorithm
- Winding number of a closed curve in the plane around a given point is an integer representing the total number of times that curve travels counterclockwise around the point
- If the winding number is non-zero, then the point is inside the polygon
Shading and Lighting
Intersection with Multiple Objects
Typically, a scene will contain multiple objects. To find the first object hit by the ray, we need to find the smallest positive \(t\) value
t_min = infinity
firstSurface = None
for each object in the surfaceList:
hitSurface, t = object.intersect(ray)
if hitSurface != None and t < t_min:
t_min = t
firstSurface = hitSurface
return firstSurface, t_min
Shading
We have already read about the Phong reflection model in the Lighting section. We can use the same model to shade the pixel, so our algorithm becomes as follows
def Scene::trace(ray, t_min, t_max):
surface, t = surfaces.intersect(ray, t_min, t_max)
if surface == None:
return background_color
else:
return surface.shade(ray, t, light)
def Surface::shade(ray, t, light):
p = ray.origin + t * ray.direction
n = surface_normal(p)
v = normalize(ray.origin - p)
l = normalize(light.position - p)
h = normalize(v + l)
# compute ambient, diffuse and specular components
The shading algorithm can be changed for multiple light sources as well
Casting Shadows
- Surface is illuminated if nothing blocks the view of the light source
- To check if a point is in shadow, we can cast a ray from the point to the light source and check if it intersects any object
- As many rays need to be cast as there are light sources
- Ideally test \(t \in [0, \infty]\), but to account for floating point errors, test \(t \in [0 + \epsilon, \infty]\) where \(\epsilon\) is a small positive number
def Surface::shade(ray, t, lights):
p = ray.origin + t * ray.direction
n = surface_normal(p)
v = normalize(ray.origin - p)
for each light in lights:
l = normalize(light.position - p)
h = normalize(v + l)
shadowRay = Ray(p, l)
if inShadow(shadowRay):
# point is in shadow
return ambient
else:
# compute ambient, diffuse and specular components
return shading
Specular Reflections
- Mirror reflections can be added by shading reflected rays \(r = d - 2(d \cdot n)n\)
- Some energy is lost in each reflection, so we can recursively
- \(k_m\) is specular RGB color for mirror reflection
- Trace reflected rays for materials that are specular
- Recursion depth can be limited to avoid infinite recursion
Refraction and Transparency
- Compute refracted ray as following
Total Internal Reflection
- Happens when light travels from denser to rarer medium at an angle greater than the critical angle
- Critical angle \(\cos (\theta_c) = \frac{n_2}{n_1}\)
Schlick's Approximation
- Approximates the Fresnel equations for reflection and refraction
Beer's Law
- For homogeneous impurities in a dielectric, a light carrying ray's intensity attenuates as per Beer's law
- Loss of intensity in the medium \(dI = -C I dx\)
- Solution: \(I = k' + k e^{-C x}\) with boundary conditions \(I(s) = I_0 e^{s\ln(a)}\)
- \(s\) is distance from interface, and \(a\) is attenuation constant
Putting it all together
def Surface::shade(ray, point, normal, lights):
if point is on a dielectric:
r = reflect(ray, normal)
if dot(ray, normal) < 0: # entering a dielectric
c = -dot(ray, normal)
k_r = k_g = k_b = 1
else:
# apply Beer's law
k_r = expo(-a_r * t)
k_g = expo(-a_g * t)
k_b = expo(-a_b * t)
if refract(ray, -normal, 1/eta, t):
c = dot(t, normal)
else:
return k * Scene::trace(r, point, epsilon, infinity)
R_0 = ((eta - 1) / (eta + 1)) ** 2
R = R_0 + (1 - R_0) * (1 - c) ** 5
return k * (
R * Scene::trace(r, point, epsilon, infinity) + \
(1 - R) * Scene::trace(t, point, epsilon, infinity)
)