Studying for Exam LTAM, Part 1.1
Where do we begin? Assuming the reader’s knowledge of the fundamentals of (calculus-based) probability theory, the best place to start is with continuous survival models. This is also a common initial chapter in the textbooks on the subject.
As I mentioned in the post “Studying for Exam LTAM Series“, the textbook resources that I have and will make use of are: (1) “Actuarial Mathematics“, 2nd Edition, by Bowers, Gerber, Hickman, Jones, and Nesbitt; (2) “Models for Quantifying Risk“, 4th Edition, by Cunningham, Herzog, and London; and (3) “Actuarial Mathematics for Life Contingent Risks“, 2nd Edition, by Dickson, Hardy, and Waters.
Reference (3) is the one I will be referring to most often since it is the reference listed in the syllabus on the Exam LTAM page of the Society of Actuaries website in 2019. Most of the time I will use notation and ideas consistent with this reference, but sometimes I will deviate from it.
Non-negative Continuous (Lifetime) Random Variables
A continuous random variable is a quantity whose value along a “continuum” is determined “by chance”. The word “continuum” is essentially just shorthand for an interval along the real line
But what does it mean for such a variable to be determined “by chance”? This is actually a difficult question to answer because it is hard to define what “random” means. For the purposes of modeling, however, we assume that probabilities related to the values of can be calculated by integrating an appropriate probability density function (PDF) over an appropriate interval.
For the values of where the values of the PDF are large or small, the “probability density” is large or small, respectively. When integrating over a given constant-length -interval over which is large or small, we therefore expect relatively large or small probabilities, respectively.
In the abstract setting, we typically assume that is defined for all (for all real ). Our key conditions for to be a PDF in this situation are: (1) for all and (2) These two conditions correspond to axioms (2) and (1), respectively, on probability set functions in my article on axiomatic probability theory.
The probability of taking on a value in an interval of the form or is taken to be the value of We would write, for instance,
The probability of taking on any particular value is zero, since for any number
But what about if the random variable of interest never takes on negative values, for instance? Do we really need to define when in such situations? And when might such a situation occur?
Let us answer the last question first. Such a situation occurs, quite often, when the random variable of interest is an amount of time. Of special interest in actuarial science is when the amount of time represents the lifetime of an individual. Of course, this is not the only situation where a variable will be non-negative. But it is common enough that many people call such a quantity a (continuous) lifetime random variable.
Let be a continuous lifetime random variable. Then the conditions for a function to be a PDF for are usually written as: (1) for all real and (2) Probabilities are found in the same way as above, though we would assume
Cumulative Distribution Functions and Survival Functions
For any kind of random variable , the cumulative distribution function (CDF) is defined by for any This represents the probability that the random variable is less than the number
If is a lifetime random variable, we would write for When is continuous, then this can be calculated by doing the integral
Suppose is the remaining lifetime of a person, in years. Then represents their likelihood of dying in the next years. Since this is a “negative” way of looking at life, actuaries typically change their perspective. They typically focus on the probability of the person surviving at least another years. This is the value of and it has a name: the survival function (SF). This function is initially represented as though we will introduce a different notation later.
Note that the properties of and are “complementary” with each other. By definition, for all In particular, But (the life must live “at least a millisecond”). Therefore,
Also note that Hence, Furthermore, it should make sense that is non-decreasing while is non-increasing.
Example: Uniform Distribution (Linear Survival), a.k.a. De Moivre’s Law
The simplest kind of probability distribution, continuous or discrete, is a uniform distribution. Let be a continuous lifetime random variable, representing the time until death of an individual. If has a uniform distribution, the PDF would be for in some interval of the form for some In fact, since we want the integral over this interval to be 1, we have for
For such a uniform distribution, the probability of dying during any particular time interval of constant length is constant. In fact, the constant is
To be technical, since the PDF is defined for all the PDF is actually the piecewise-defined (discontinuous) function
Because of this, the CDF is And the SF is
Of course, quantities of great interest here are the mean, variance, and standard deviation of .
The mean, also called the expected value, is
Doing this integral gives This makes intuitive sense because it says that the “center of mass” of the graph of which is a horizontal line over is halfway between 0 and
To find the variance and standard deviation, we first find the “second moment”:
Evaluation leads to Therefore, the variance is
This implies that the standard deviation is
There is a rule of thumb that says “almost all” the probability in any distribution will be within two standard deviations of the mean. In the case of a uniform distribution, two standard deviations is within about Since this is greater than in fact, “all of the probability” will be within two standard deviations of the mean at (recall that the PDF is only nonzero on the interval
The following animation shows the graph of the PDF, as well as the locations of and , as increases from 50 to 100.
For those who are interested, here is a picture of the Mathematica code that created this animation.
The Family of RVs for a Given Survival RV
For concreteness, let us indeed imagine we are modeling human lifetimes. Suppose a baby has just been born a fraction of a second ago. No one but God knows how long that baby will live. From a human perspective, the baby’s length of life will be a continuous lifetime random variable (continuous because we imagine we are measuring to the nearest millisecond). Let us also assume time is measured in years.
Let be the continuous lifetime random variable that represents how long that newborn baby will live, in years. Suppose and, at birth, we make the assumption that the baby will live to at least age years. Based on this assumption, after age years, the baby will have further years to live. Call this quantity In other words, define when the newborn baby is assumed to make it to age
We have thus generated an infinite “family” of random variables, We now determine how the PDF, CDF, and SF depend on We also introduce our first bit of (tricky) actuarial notation.
Given survival to age the SF, as a function of a future amount of time is defined to be Since we are assuming survival to age this can be computed as a conditional probability based on the distribution of In other words, we assume/define that
Since, in general, we can write
It may not be strictly true that an adult who is currently age has the same probability of living at least another 10 years as the probability that a newborn baby who is assumed to live to age will live at least another 10 years after that. However, for simplicity, we will often assume this is true.
If represents, for the SF of then we can write the equation as Also note that which represents the general multiplication rule
Now for the tricky notation. Most typically, actuaries denote the function by the symbol Furthermore, the CDF is most commonly denoted by the symbol Based on this symbolism, we can write the identity
By convention we also write for and for
The PDF will be denoted in the “usual” way, however. It is
The equation which can also be written as has an important graphic interpretation. It shows that, to obtain the survival function for the remaining lifetime of a newborn baby when you assume that baby makes it to age you need to translate the graph of to the left by units and then rescale it by multiplying by This rescaling is done so that
Back to the Uniform Distribution Example
For the uniform distribution example when we have and
Therefore, for an assumed attained age we get and (for in both cases). From this, we also have for A person might object that is not differentiable at and for this example, but we will not worry about this.
These calculations confirm that, if is uniform over then is uniform over Note that the mean would then be and the standard deviation would be
The animation below shows, for how the graphs of and change as increases from 0 to 50.
The Mathematica code for this animation is shown in the figure below. Note that and have been made large via keyboard shortcuts and are colored red and blue through Mathematica‘s “Writing Assistant” palette, rather than using the built-in functions Text and Style.
In blog posts to come shortly, we will dive into more examples related to these ideas.