Studying for Exam LTAM, Part 1.5
An important principle in mathematical modeling is to start simple. For example, if you have a situation where a linear function matches the trend in your data, go ahead and use a linear function rather than, for example, a quadratic function.
If a simple model is not accurate enough, then you will want to use a more complicated model.
For example, a linear function is definitely not accurate enough as a representation of the distance fallen by an object under the influence of the Earth’s gravity. A quadratic function must be used in this situation.
So far in our discussions about continuous survival random variables (in this series on studying for actuarial exam LTAM), we have focused on two relatively simple models.
- Uniform Distribution (De Moivre’s Law): the probability density function (PDF) is piecewise constant (and constant over the time of life).
- Constant Force of Mortality (Exponential Distribution): the force of mortality is constant, resulting in a PDF and a survival function (SF) that are exponential decay functions.
In this post, we will discuss another simple model that has the potential to be somewhat more accurate for human lifetimes: triangular distributions (also see https://learnandteachstatistics.files.wordpress.com/2013/07/notes-on-triangle-distributions.pdf). These are continuous random variables whose PDFs have graphs that look like the triangle in the figure above. They have the potential to be somewhat accurate models of human lifetimes because they can model the following facts:
- Not many people die at very young or very old ages (where the values of the PDF are low).
- Most commonly, people die in the age-range of 50 to 90 years (where the values of the PDF are large — near the high “peak of the tent”).
One aspect of human mortality that such models do not account for is relatively high infant mortality. Very young humans (< 2 months old) are more susceptible to death than, for example, young humans between the ages of 1 and 10 years old.
Formula for the PDF in Terms of Two Parameters
Let be a continuous survival random variable with a triangular distribution. For some “limiting age” , this means we are assuming that the PDF takes on positive values for . It also means that the graph of looks like the top part (two legs) of a triangle whose base is on the horizontal axis (see the image at the start of this post).
We assume that there is some number where has a maximum value (technically this would be called the “mode” of the continuous distribution). It will be most accurate for human lifetimes to assume that , though the cases where or could be separately considered.
In order to be a PDF, the area under this graph, which is the area of the triangle, must be equal to 1. If is the height of the triangle (maximum value of ), then so that .
This means the slope of the first (left-most) piece of this graph is . And the slope of the second (right-most) piece is
Therefore, over the interval the formula for can be written in piecewise form as
In this formula, and would be labeled as “parameters”, while is the true “variable”. A better notation for this (family of) function(s) would be . However, we will avoid such complication in our writing. If you want to use a program such as Mathematica to deal with this in its full generality, however, you will need to take the facts that and are arbitrary into account.
In the case of the graph at the top of this post, and , so the formula becomes .
Survival Function(s)
The survival function (SF) for is the complement of the cumulative distribution function (CDF): If , we get .
If , the formula is . (Think about this!)
After a bit of tricky algebra, when , you should check that this simplifies to .
Hence, .
When and , the graph of is as shown below. Note the inflection point at Actually, this is a point where the function is not twice-differentiable, since fails to be differentiable at (and ).
Assuming survival to age , let be the corresponding remaining-life random variable.
In the case where , the corresponding survival function for is .
In the case where , the corresponding survival function for is for .
The following animation shows the how the graph of changes as increases from 0 to 60, still in the case where and .
Note that, for any fixed , the graph of is obtained from the graph of by a horizontal shift to the left by units combined with a vertical stretch by a factor of . The resulting graph is, in a sense, “similar” to a piece of the original, at least with respect to these two transformations.
Force(s) of Mortality (Hazard Rate Function(s))
The force of mortality (FM) for a newborn is . You should take the time to check that this definition gives . Do not worry about the fact that is undefined at .
The FM for an individual of attained age is then taken to be .
The following animation shows the how the graph of changes as increases from 0 to 60, in the case where and .
Note that there is a vertical asymptote at and a point of non-differentiability at the location of the dot . In fact, as mentioned above, is undefined at , so is undefined at . The dot really represents a “hole” in the graph.
Complete Expectation of Life
The complete expectation of life is, for this example with limiting age , equal to
When and , I will leave it to you as an exercise to determine a formula for (separate it into cases where versus ).
Below is a static graph of when and . I have also plotted the graph of .
It turns out that this second function is always an increasing function of , no matter what the situation is (no matter what continuous survival model we are considering). You may also want to see if you can figure out why this is true.
You should make sure you understand the real-life meaning of the complete expectation of life. In this model, it turns out that . This is the expected lifetime of a newborn baby. Note that it is less than the mode . That should not be too surprising when you realize the distribution is left-skewed (you might also see if you can determine the median lifetime to be about 69.3).
As another example, for a person who is currently age 80, the model predicts an expected remaining lifetime of .
You might also wonder whether anything special is happening for this graph at . For example: is differentiable there, or not? I do not know the answer off-hand. You can send me a note if you think you figure it out.
One final animation is worth doing. What happens to the graph of as, for example, the mode increases?
From this animation, we see that the mean remaining lifetimes are getting larger as the mode increases, but they go down at a faster rate. The derivative becomes more and more negative as increases. This should make intuitive sense.
As a final challenge, you might want to see if you can determine how depends on .
Mathematica Code
Below, for your reference, are pictures of the Mathematica code for the three animations above.