In a previous post I’ve introduced the Gaussian distribution and how it is commonly found in the vast majority of natural phenomenon. It can be used to dramatically improve some aspect of your game, such as procedural terrain generation, enemy health and attack power, etc. Despite being so ubiquitous, very few gaming frameworks offer functions to generate numbers which follow such distribution. Unity developers, for instance, heavily rely on Random.Range
which generates uniformly distributed numbers (in blue). This post will show how to generate Gaussian distributed numbers (in red) in C#.
I’ll be explaining the Maths behind it, but there is no need to understand it to use the function correctly. You can download the RandomGaussian
Unity script here.
Step 1: From Gaussian to uniform
Many gaming frameworks only include functions to generate continuous uniformly distributed numbers. In the case of Unity3D, for instance, we have Random.Range(min, max)
which samples a random number from min
and max
. The problem is to create a Gaussian distributed variable out of a uniformly distributed one.
Sample two Gaussian distributed values
Let’s imagine we already have two independent, normally distributed variables:
from which we sampled two values, and , respectively. Sampling several of these points in the Cartesian plane will generate a roundly shaped cloud centred at .
Calculate their joint probability
The probability of having a certain is defined as the probability sampling from , times the probability of sampling from . This is called joint probability and since samplings from and are independent from each other:
Switch to polar coordinates
The point can be represented in the plane using polar coordinates, with an angle and its distance from the origin :
Now the original point can be expressed as a function of and :
Rewrite the joint probability
We can now rewrite the joint probability :
which is the product of the two probability distributions:
Expanding the exponential distribution
While is already in a form which can be expressed using a uniformly distributed variable, a little bit more work is necessary for . Remembering the definition of the exponential distribution:
Now both and , which are coordinates of a point generated from two independent Gaussian distributions, can be expressed using two uniformly distributed values.
๐ฐ Ad Break
Step 2: From uniform to Gaussian
We can now reverse the procedure done in Step 1 to derive a simple algorithm:
- Generate two random numbers
- Use them to create the radius and the angle
- Convert from polar to Cartesian coordinates:
This is know as the Box-Muller transform. The image below (from Wikipedia) shows how the uniformly distributed points from the unit square are re-mapped by the Box-Muller transform onto the Cartesian plane, in a Gaussian fashion.
Step 3: The Marsaglia polar method
The Box-Muller transform has a problem: it uses trigonometric functions which are notoriously slow. To avoid that, a slightly different technique exists, called the Marsaglia polar method. Despite being similar, it stars from an uniformly distributed point in the interval . This point must fit within the unit circle and shouldn’t be the origin ; if it doesn’t, another one has to be chosen.
public static float NextGaussian() { float v1, v2, s; do { v1 = 2.0f * Random.Range(0f,1f) - 1.0f; v2 = 2.0f * Random.Range(0f,1f) - 1.0f; s = v1 * v1 + v2 * v2; } while (s >= 1.0f || s == 0f); s = Mathf.Sqrt((-2.0f * Mathf.Log(s)) / s); return v1 * s; }
Approximately 21% of the points will be rejected with this method.
Step 4: Mapping to arbitrary Gaussian curves
The algorithm described in Step 3 provides a way to sample from . We can transform that into any arbitrary like this:
public static float NextGaussian(float mean, float standard_deviation) { return mean + NextGaussian() * standard_deviation; }
There is yet another problem: Gaussian distributions have the nasty habit to generate numbers which can be quite far from the mean. However, clamping a Gaussian variable between a min
and a max
can have quite catastrophic results. The risk is to squash the left and right tails and having a rather bizarre function with three very likely values: the mean, the min and the max. The most common technique to avoid this is to take another sample if it falls outside its range:
public static float NextGaussian (float mean, float standard_deviation, float min, float max) { float x; do { x = NextGaussian(mean, standard_deviation); } while (x < min || x > max); retun x; }
Another solution changes the parameter of the curve so that min
and max
are at 3.5 standard deviations from the mean (which should contain more than 99.9% of the sampled points).
๐ Recommended Books
Conclusion
This tutorial shows how to sample Gaussian distributed numbers starting from uniformly distributed ones. You can download the complete RandomGaussian
script for Unity here.
Other resources
- Probability and Games: Damage Rolls: a very detailed explanation of how dices can be used to sample from different distributions;
- ProjectRhea: the principle of how to generate a Gaussian random variable;
- Sampling From the Normal Distribution: a similar tutorial;
- Box-Muller transforms: a more Maths-y tutorial.
- Part 1: Understanding the Gaussian distribution
- Part 2: How to generate Gaussian distributed numbers
Leave a Reply