Studying for Exam LTAM, Part 1.1

Where do we begin? Assuming the reader’s knowledge of the fundamentals of (calculus-based) probability theory, the best place to start is with continuous survival models. This is also a common initial chapter in the textbooks on the subject.
As I mentioned in the post “Studying for Exam LTAM Series“, the textbook resources that I have and will make use of are: (1) “Actuarial Mathematics“, 2nd Edition, by Bowers, Gerber, Hickman, Jones, and Nesbitt; (2) “Models for Quantifying Risk“, 4th Edition, by Cunningham, Herzog, and London; and (3) “Actuarial Mathematics for Life Contingent Risks“, 2nd Edition, by Dickson, Hardy, and Waters.
Reference (3) is the one I will be referring to most often since it is the reference listed in the syllabus on the Exam LTAM page of the Society of Actuaries website in 2019. Most of the time I will use notation and ideas consistent with this reference, but sometimes I will deviate from it.
Non-negative Continuous (Lifetime) Random Variables
A continuous random variable is a quantity whose value along a “continuum” is determined “by chance”. The word “continuum” is essentially just shorthand for an interval along the real line
But what does it mean for such a variable to be determined “by chance”? This is actually a difficult question to answer because it is hard to define what “random” means. For the purposes of modeling, however, we assume that probabilities related to the values of can be calculated by integrating an appropriate probability density function (PDF)
over an appropriate interval.
For the values of where the values of the PDF
are large or small, the “probability density” is large or small, respectively. When integrating over a given constant-length
-interval over which
is large or small, we therefore expect relatively large or small probabilities, respectively.
In the abstract setting, we typically assume that is defined for all
(for all real
). Our key conditions for
to be a PDF in this situation are: (1)
for all
and (2)
These two conditions correspond to axioms (2) and (1), respectively, on probability set functions in my article on axiomatic probability theory.
The probability of taking on a value in an interval of the form
or
is taken to be the value of
We would write, for instance,
The probability of taking on any particular value is zero, since
for any number
But what about if the random variable of interest never takes on negative values, for instance? Do we really need to define when
in such situations? And when might such a situation occur?
Let us answer the last question first. Such a situation occurs, quite often, when the random variable of interest is an amount of time. Of special interest in actuarial science is when the amount of time represents the lifetime of an individual. Of course, this is not the only situation where a variable will be non-negative. But it is common enough that many people call such a quantity a (continuous) lifetime random variable.
Let be a continuous lifetime random variable. Then the conditions for a function
to be a PDF for
are usually written as: (1)
for all real
and (2)
Probabilities are found in the same way as above, though we would assume
Cumulative Distribution Functions and Survival Functions
For any kind of random variable , the cumulative distribution function (CDF) is defined by
for any
This represents the probability that the random variable
is less than the number
If is a lifetime random variable, we would write
for
When
is continuous, then this can be calculated by doing the integral
Suppose is the remaining lifetime of a person, in years. Then
represents their likelihood of dying in the next
years. Since this is a “negative” way of looking at life, actuaries typically change their perspective. They typically focus on the probability of the person surviving at least another
years. This is the value of
and it has a name: the survival function (SF). This function is initially represented as
though we will introduce a different notation later.
Note that the properties of and
are “complementary” with each other. By definition,
for all
In particular,
But
(the life must live “at least a millisecond”). Therefore,
Also note that Hence,
Furthermore, it should make sense that
is non-decreasing while
is non-increasing.
Example: Uniform Distribution (Linear Survival), a.k.a. De Moivre’s Law
The simplest kind of probability distribution, continuous or discrete, is a uniform distribution. Let be a continuous lifetime random variable, representing the time until death of an individual. If
has a uniform distribution, the PDF would be
for
in some interval of the form
for some
In fact, since we want the integral over this interval to be 1, we have
for
For such a uniform distribution, the probability of dying during any particular time interval of constant length is constant. In fact, the constant is
To be technical, since the PDF is defined for all the PDF is actually the piecewise-defined (discontinuous) function
Because of this, the CDF is And the SF is
Of course, quantities of great interest here are the mean, variance, and standard deviation of .
The mean, also called the expected value, is
Doing this integral gives This makes intuitive sense because it says that the “center of mass” of the graph of
which is a horizontal line over
is halfway between 0 and
To find the variance and standard deviation, we first find the “second moment”:
Evaluation leads to Therefore, the variance is
This implies that the standard deviation is
There is a rule of thumb that says “almost all” the probability in any distribution will be within two standard deviations of the mean. In the case of a uniform distribution, two standard deviations is within about Since this is greater than
in fact, “all of the probability” will be within two standard deviations of the mean at
(recall that the PDF is only nonzero on the interval
The following animation shows the graph of the PDF, as well as the locations of and
, as
increases from 50 to 100.

For those who are interested, here is a picture of the Mathematica code that created this animation.

The Family of RVs for a Given Survival RV
For concreteness, let us indeed imagine we are modeling human lifetimes. Suppose a baby has just been born a fraction of a second ago. No one but God knows how long that baby will live. From a human perspective, the baby’s length of life will be a continuous lifetime random variable (continuous because we imagine we are measuring to the nearest millisecond). Let us also assume time is measured in years.
Let be the continuous lifetime random variable that represents how long that newborn baby will live, in years. Suppose
and, at birth, we make the assumption that the baby will live to at least age
years. Based on this assumption, after age
years, the baby will have
further years to live. Call this quantity
In other words, define
when the newborn baby is assumed to make it to age
We have thus generated an infinite “family” of random variables, We now determine how the PDF, CDF, and SF depend on
We also introduce our first bit of (tricky) actuarial notation.
Given survival to age the SF, as a function of a future amount of time
is defined to be
Since we are assuming survival to age
this can be computed as a conditional probability based on the distribution of
In other words, we assume/define that
Since, in general, we can write
It may not be strictly true that an adult who is currently age has the same probability of living at least another 10 years as the probability that a newborn baby who is assumed to live to age
will live at least another 10 years after that. However, for simplicity, we will often assume this is true.
If represents, for
the SF of
then we can write the equation
as
Also note that
which represents the general multiplication rule
Now for the tricky notation. Most typically, actuaries denote the function by the symbol
Furthermore, the CDF
is most commonly denoted by the symbol
Based on this symbolism, we can write the identity
By convention we also write for
and
for
The PDF will be denoted in the “usual” way, however. It is
The equation which can also be written as
has an important graphic interpretation. It shows that, to obtain the survival function for the remaining lifetime of a newborn baby when you assume that baby makes it to age
you need to translate the graph of
to the left by
units and then rescale it by multiplying by
This rescaling is done so that
Back to the Uniform Distribution Example
For the uniform distribution example when we have
and
Therefore, for an assumed attained age we get
and
(for
in both cases). From this, we also have
for
A person might object that
is not differentiable at
and
for this example, but we will not worry about this.
These calculations confirm that, if is uniform over
then
is uniform over
Note that the mean would then be
and the standard deviation would be
The animation below shows, for how the graphs of
and
change as
increases from 0 to 50.

The Mathematica code for this animation is shown in the figure below. Note that and
have been made large via keyboard shortcuts and are colored red and blue through Mathematica‘s “Writing Assistant” palette, rather than using the built-in functions Text and Style.

In blog posts to come shortly, we will dive into more examples related to these ideas.