Ray-sphere Intersection

We've seen how to form a ray following the trajectory of the mouse, but that's only half the solution to making a renderer interactive. We must also see what that ray hits and where. This is collision detection. We could try intersecting the ray with each triangle in the scene, but modern games on ultra settings can render 10 million triangles per frame. Detecting collisions with that many is far too expensive.

A faster method is to collide not with a detailed trimesh but with simpler proxy geometry like a bounding box or a bounding sphere. Let's examine intersecting a bounding sphere first, because it's less involved.

First, we observe the fact that a point $\mathbf{p}$ is on a sphere of radius $r$ if it satisfies the implicit equation of a sphere:

$$p_x^2 + p_y^2 + p_z^2 = r^2$$

Note that the sum on the left-hand side is a dot product. We can therefore rewrite the equation like so:

$$\mathbf{p} \cdot \mathbf{p} = r^2$$

These equations apply only if the sphere is centered at the origin. If it's positioned elsewhere, we must subtract away the center in the equation:

$$\begin{align} (p_x - c_x)^2 + (p_y - c_y)^2 + (p_z - c_z)^2 &= r^2 \\ (\mathbf{p} - \mathbf{c}) \cdot (\mathbf{p} - \mathbf{c}) &= r^2 \end{align}$$

Second, we observe that a point $\mathbf{p}$ on the ray that starts at $\mathbf{o}$ and points in direction $\mathbf{d}$ must satisfy this parametric equation:

$$\mathbf{p} = \mathbf{o} + t \times \mathbf{d}$$

We want to find a point that is both on the ray and on the sphere, so we combine the two equations:

$$(\mathbf{o} + t \times \mathbf{d} - \mathbf{c}) \cdot (\mathbf{o} + t \times \mathbf{d} - \mathbf{c}) = r^2$$

Before we can act on this, we need to establish a property of the dot product. We've seen binomial expansion applied to the multiplication of sums of scalars. Does it also apply to the dot product of sums of vectors? In other words, is this equation true?

$$\begin{align} (\mathbf{a} + \mathbf{b}) \cdot (\mathbf{a} + \mathbf{b}) &= \mathbf{a} \cdot \mathbf{a} + 2 \mathbf{a} \cdot \mathbf{b} + \mathbf{b} \cdot \mathbf{b} \\ \end{align}$$

Let's expand the dot product into scalars using only its definition and then regroup the terms:

$$\begin{align} (\mathbf{a} + \mathbf{b}) \cdot (\mathbf{a} + \mathbf{b}) &= (a_x + b_x)^2 + (a_y + b_y)^2 + (a_z + b_z)^2 \\ &= a_x^2 + 2 a_x b_x + b_x^2 + a_y^2 + 2 a_y b_y + b_y^2 + a_z^2 + 2 a_z b_z + b_z^2 \\ &= (a_x^2 + a_y^2 + a_z^2) + 2 (a_x b_x + a_y b_y + a_z b_z) + (b_x^2 + b_y^2 + b_z^2) \\ &= \mathbf{a} \cdot \mathbf{a} + 2 \mathbf{a} \cdot \mathbf{b} + \mathbf{b} \cdot \mathbf{b} \\ \end{align}$$

The dot product does indeed distribute over vector addition. To expand out our combined equation, we must sum these combinations of dot products:

	$\mathbf{o}$	$t \times \mathbf{d}$	$-\mathbf{c}$
$\mathbf{o}$	$\mathbf{o} \cdot \mathbf{o}$	$\mathbf{o} \cdot \mathbf{d} \times t$	$-\mathbf{c} \cdot \mathbf{o}$
$t \times \mathbf{d}$	$\mathbf{o} \cdot \mathbf{d} \times t$	$\mathbf{d} \cdot \mathbf{d} \times t^2$	$-\mathbf{c} \cdot \mathbf{d} \times t$
$-\mathbf{c}$	$-\mathbf{c} \cdot \mathbf{o}$	$-\mathbf{c} \cdot \mathbf{d} \times t$	$\mathbf{c} \cdot \mathbf{c}$

Let's group these terms: one includes $t^2$, four include $t$, and four are constants. This grouping lets us rewrite the sum as a quadratic equation of $t$:

$$\begin{align} a &= \mathbf{d} \cdot \mathbf{d} \\ b &= 2 (\mathbf{o} \cdot \mathbf{d} - \mathbf{c} \cdot \mathbf{d}) \\ &= 2 (\mathbf{o} - \mathbf{c}) \cdot \mathbf{d} \\ c &= \mathbf{o} \cdot \mathbf{o} - 2 \mathbf{c} \cdot \mathbf{o} + \mathbf{c} \cdot \mathbf{c} - r^2 \\ &= (\mathbf{o} - \mathbf{c}) \cdot (\mathbf{o} - \mathbf{c}) - r^2 \\ a t^2 + b t + c &= 0 \\ \end{align}$$

We've moved $r^2$ into $c$ to put the equation in standard form. The only unknown is $t$. We solve for it using the quadratic equation:

$$t = \frac{b \pm \sqrt{b^2 - 4ac}}{2a}$$

The value $t$—if there is one—is a scalar that tells us how far along the ray the intersection occurs. The ray may never hit the sphere, may hit it exactly once, or may hit it twice. We examine the discriminant $b^2 - 4ac$ to determine the number of intersections. If it's negative, there are none. If it's 0, there's one. If it's positive, there are two.

Cast a ray in this renderer and rotate the scene to see how the ray intersects the sphere. Achieving only a single intersection is nearly impossible because making a ray tangent requires more precision than we can achieve with the mouse.

All that remains is to turn this math into a reusable function with more helpful variable names.

Place this function in lib/intersect.ts.

← Raycasting Ray-Box Intersection →

	\(\mathbf{o}\)	\(t \times \mathbf{d}\)	\(-\mathbf{c}\)
\(\mathbf{o}\)	\(\mathbf{o} \cdot \mathbf{o}\)	\(\mathbf{o} \cdot \mathbf{d} \times t\)	\(-\mathbf{c} \cdot \mathbf{o}\)
\(t \times \mathbf{d}\)	\(\mathbf{o} \cdot \mathbf{d} \times t\)	\(\mathbf{d} \cdot \mathbf{d} \times t^2\)	\(-\mathbf{c} \cdot \mathbf{d} \times t\)
\(-\mathbf{c}\)	\(-\mathbf{c} \cdot \mathbf{o}\)	\(-\mathbf{c} \cdot \mathbf{d} \times t\)	\(\mathbf{c} \cdot \mathbf{c}\)

How to 3D

Ray-sphere Intersection