Sukrit Mittal Franklin Templeton Investments
Variance treats upside and downside symmetrically; Investors do not.
Losses hurt more than gains feel good—this asymmetry is fundamental to human psychology.
It measures total volatility—both upside and downside. \[ \text{Portfolio risk} = \sigma_p^2 = w^\top \Sigma w \]
Example: Consider two one-year return distributions:
Both have the same mean and variance:
\[ \mathbb{E}[R_A]=\mathbb{E}[R_B]=0,\qquad \mathrm{Var}(R_A)=\mathrm{Var}(R_B)=0.01 \]
So variance can label them as equally risky, even though B has a much more severe left tail.
Risk management addresses specific questions:
How much can we lose?
With what probability?
Over what horizon?
These questions require downside risk measures, not total volatility.
Real-world applications:
All of these focus on left-tail risk—extreme losses, not extreme gains.
Late 1980s-early 1990s: Financial institutions sought a unified risk metric.
Problem:
Solution: JP Morgan developed RiskMetrics (1994)
Result:
Despite known flaws, VaR persists because:
But it fails catastrophically in crises.
Portfolio optimization
Risk management
Key difference:
This is not a contradiction—they’re complementary perspectives.
Risk aversion can be characterized by a concave utility function \(u(W)\), where \[ u(\mathbb{E}[W]) > \mathbb{E}[u(W)] \]
Key insight: The concavity of \(u\) is strongest in the loss region.
For CRRA utility \(u(W) = \frac{W^{1-\gamma}}{1-\gamma}\): \[ u'(W) = W^{-\gamma} \]
Interpretation: The first dollar lost hurts much more than the last dollar gained feels good.
This asymmetry motivates downside risk measures.
Risk premium revisited: \[ RP = \mathbb{E}[W] - CE \]
The risk premium reflects willingness to pay to avoid the downside.
VaR attempts to quantify this downside directly.
Let \(R_p\) denote the portfolio return over a fixed horizon \(T\) (e.g., 1 day, 10 days). Define loss as: \[ L = -R_p \]
Sign convention: * Positive \(L\) = Loss * Negative \(L\) = Gain
The distribution of \(L\) encodes all risk information.
For a portfolio with initial value \(V_0\): \[ \text{Profit/Loss} = V_0 R_p = -V_0 L \]
Example: If \(R_p \sim N(0.001, 0.01^2)\) (daily), then \[ L \sim N(-0.001, 0.01^2) \]
The left tail of \(R_p\) corresponds to the right tail of \(L\).
For a random variable \(X\) with CDF \(F_X(x) = \mathbb{P}(X \le x)\), the \(\alpha\)-quantile is:
\[ q_\alpha = \inf\{x : F_X(x) \ge \alpha\} = F_X^{-1}(\alpha) \]
Interpretation:
With probability \(\alpha\), the outcome does not exceed \(q_\alpha\).
Common quantiles:
For the standard normal \(Z \sim N(0,1)\):

Figure: Loss distribution with VaR at 95% and 99% confidence levels. The shaded regions show the probability mass beyond each VaR threshold. VaR captures a single quantile but ignores the severity of losses beyond that point. (\(\mu=0.1\%\), \(\sigma=1\%\))
Key observation:
VaR tells us where the \(\alpha\)-quantile is, but not how bad losses can be beyond that point.
This is a fundamental limitation.
The Value at Risk at confidence level \(\alpha\) over horizon \(T\) is:
\[ \text{VaR}_\alpha(L) = \inf\{x : \mathbb{P}(L \le x) \ge \alpha\} \]
Alternative formulation: For continuous distributions,
\[ \mathbb{P}(L \le \text{VaR}_\alpha) = \alpha \]
In words:
VaR is the worst loss not exceeded with probability \(\alpha\).
Or equivalently:
VaR is exceeded with probability \(1-\alpha\).
Standard choices:
Example statement:
“The 1-day 99% VaR is $2 million.”
Meaning:
Two sign conventions:
We use the loss convention: \(\text{VaR}_\alpha(L) > 0\) means potential loss.
If we work with returns \(R_p\) instead of losses \(L = -R_p\):
\[ \text{VaR}_\alpha = -F_R^{-1}(1-\alpha) \]
where \(F_R\) is the CDF of returns.
Example: For \(\alpha = 95\%\):
\[ \text{VaR}_{0.95} = -q_{0.05}^R \]
where \(q_{0.05}^R\) is the 5th percentile of the return distribution.
For dollar VaR:
\[ \text{VaR}_\alpha^{\$} = V_0 \times \text{VaR}_\alpha \]
where \(V_0\) is the portfolio value.
VaR can be connected to utility through the certainty equivalent concept.
Recall from Lecture 09:
\[ u(CE) = \mathbb{E}[u(W)] \]
VaR is related but simpler—it ignores utility curvature and focuses solely on the quantile.
Key difference:
VaR is easier to compute but loses information about preferences.
Example: Two investors with different \(u\) will have different CEs for the same lottery, but identical VaR.
VaR is preference-free—it describes the distribution, not the investor’s attitude toward risk.
We now examine VaR’s formal properties and see where it fails to be a “coherent” risk measure.
For any \(\lambda > 0\):
\[ \text{VaR}_\alpha(\lambda L) = \lambda \, \text{VaR}_\alpha(L) \]
Proof: If \(\mathbb{P}(L \le x) = \alpha\), then \(\mathbb{P}(\lambda L \le \lambda x) = \alpha\).
Therefore, \(\text{VaR}_\alpha(\lambda L) = \lambda x = \lambda \, \text{VaR}_\alpha(L)\). âś“
Interpretation: Doubling position size doubles VaR.
This is intuitive and desirable.
For any constant \(c\):
\[ \text{VaR}_\alpha(L + c) = \text{VaR}_\alpha(L) + c \]
Proof: \(\mathbb{P}(L + c \le x) = \mathbb{P}(L \le x - c) = \alpha\) when \(x - c = \text{VaR}_\alpha(L)\).
So \(\text{VaR}_\alpha(L + c) = \text{VaR}_\alpha(L) + c\). âś“
Interpretation: Adding a certain loss \(c\) increases VaR by exactly \(c\).
This is also intuitive.
If \(L_1 \le L_2\) almost surely, then:
\[ \text{VaR}_\alpha(L_1) \le \text{VaR}_\alpha(L_2) \]
Proof: For any \(x\):
\[ \{L_1 \le x\} \supseteq \{L_2 \le x\} \]
So \(\mathbb{P}(L_1 \le x) \ge \mathbb{P}(L_2 \le x)\).
This implies \(\text{VaR}_\alpha(L_1) \le \text{VaR}_\alpha(L_2)\). âś“
Interpretation: A portfolio that always has smaller losses has lower VaR.
Reasonable.
Subadditivity would require:
\[ \text{VaR}_\alpha(L_1 + L_2) \le \text{VaR}_\alpha(L_1) + \text{VaR}_\alpha(L_2) \]
Interpretation: Combining two portfolios should not increase total risk.
This is the diversification principle.
Bad news: VaR is not subadditive in general.
Counterexample: See next slide.
Consider two independent bonds, each with:
Individual VaR at 95%:
Each bond has:
\[ \mathbb{P}(\text{loss} = 0) = 0.96, \quad \mathbb{P}(\text{loss} = 100) = 0.04 \]
Since \(\mathbb{P}(\text{loss} \le 0) = 0.96 > 0.95\):
\[ \text{VaR}_{0.95}(\text{one bond}) = 0 \]
Portfolio VaR at 95%:
For two independent bonds, the joint distribution is:
Since \(\mathbb{P}(\text{loss} \le 0) = 0.9216 < 0.95\), we need the next threshold: \[ \mathbb{P}(\text{loss} \le 100) = 0.9216 + 0.0768 = 0.9984 > 0.95 \]
So: \[ \text{VaR}_{0.95}(\text{two bonds}) = 100 \]
Subadditivity fails: \[ 100 = \text{VaR}_{0.95}(L_1 + L_2) > \text{VaR}_{0.95}(L_1) + \text{VaR}_{0.95}(L_2) = 0 + 0 = 0 \]
Interpretation: Diversifying (holding two bonds instead of one) increases VaR!
This is paradoxical and violates the diversification principle.
The problem is that VaR ignores the tail beyond the quantile.
In the bond example:
VaR sees only that you’re more likely to exceed the threshold, not that the worst outcome is still bounded.
Key insight: VaR measures “probability of exceeding a threshold” but not “severity of exceedance.”
This is why we need Expected Shortfall —it captures tail severity.
VaR is subadditive under specific conditions:
Theorem: If returns are jointly elliptically distributed (e.g., multivariate normal), then VaR is subadditive.
Proof sketch: For elliptical distributions, portfolio VaR can be written as:
\[ \text{VaR}_\alpha(w^\top R) = w^\top \mu + z_\alpha \sqrt{w^\top \Sigma w} \]
By the triangle inequality for norms:
\[ \sqrt{(w_1 + w_2)^\top \Sigma (w_1 + w_2)} \le \sqrt{w_1^\top \Sigma w_1} + \sqrt{w_2^\top \Sigma w_2} \]
So subadditivity holds for normal returns. âś“
But: Real returns are not normal, especially in the tails.
So VaR fails subadditivity in realistic settings.
| Property | VaR Satisfies? | Implication |
|---|---|---|
| Positive homogeneity | âś“ Yes | Scales correctly with position size |
| Translation invariance | âś“ Yes | Adding cash affects VaR correctly |
| Monotonicity | ✓ Yes | More loss → higher VaR |
| Subadditivity | âś— No | May penalize diversification |
The failure of subadditivity is a critical flaw.
It means VaR can:
This motivated the development of coherent risk measures.
There are three main approaches to computing VaR:
We’ll cover each in detail.
Assume portfolio returns are normally distributed: \[ R_p \sim N(\mu, \sigma^2) \]
Then losses \(L = -R_p\) are also normal: \[ L \sim N(-\mu, \sigma^2) \]
For \(L \sim N(-\mu, \sigma^2)\), the \(\alpha\)-quantile is: \[ \text{VaR}_\alpha = -\mu + \sigma \cdot z_\alpha \]
where \(z_\alpha = \Phi^{-1}(\alpha)\) is the \(\alpha\)-quantile of the standard normal \(N(0,1)\).
For reference:
| Confidence \(\alpha\) | \(z_\alpha\) | Interpretation |
|---|---|---|
| 90% | 1.282 | Exceeded 10% of the time |
| 95% | 1.645 | Exceeded 5% of the time |
| 99% | 2.326 | Exceeded 1% of the time |
| 99.5% | 2.576 | Exceeded 0.5% of the time |
| 99.9% | 3.090 | Exceeded 0.1% of the time |
Note: These are for the upper tail of \(N(0,1)\).
For the lower tail (relevant for returns), we use \(z_{1-\alpha} = -z_\alpha\).
For returns \(R_p \sim N(\mu, \sigma^2)\): \[ \boxed{\text{VaR}_\alpha = \sigma z_\alpha - \mu} \]
Often \(\mu\) is small relative to \(\sigma z_\alpha\), so: \[ \text{VaR}_\alpha \approx \sigma z_\alpha \]
Dollar VaR: \[ \text{VaR}_\alpha^{\$} = V_0(\sigma z_\alpha - \mu) \]
Approximation for small \(\mu\): \[ \text{VaR}_\alpha^{\$} \approx V_0 \sigma z_\alpha \]
This is the form most commonly used in practice.
Consider a portfolio invested in a single stock:
Compute 1-day 95% VaR:
\[ \text{VaR}_{0.95} = \sigma z_{0.95} - \mu = 0.02 \times 1.645 - 0.0005 = 0.0329 - 0.0005 = 0.0324 = 3.24\% \]
Dollar VaR:
\[ \text{VaR}_{0.95}^{\$} = 1,000,000 \times 0.0324 = \$32,400 \]
Interpretation: We expect to lose more than $32,400 on 5% of days (roughly 1 day per month).
Consider a portfolio with two assets:
Step 1: Calculate portfolio variance
\[ \sigma_p^2 = w_1^2 \sigma_1^2 + w_2^2 \sigma_2^2 + 2w_1 w_2 \rho_{12} \sigma_1 \sigma_2 \]
\[ = (0.6)^2(0.15)^2 + (0.4)^2(0.25)^2 + 2(0.6)(0.4)(0.4)(0.15)(0.25) \]
\[ = 0.0081 + 0.01 + 0.0036 = 0.0217 \]
\[ \sigma_p = \sqrt{0.0217} = 14.73\% \]
Step 2: Calculate VaR
\[ \text{VaR}_{0.99} = \sigma_p z_{0.99} - \mu_p = 0.1473 \times 2.326 - 0.08 = 0.3427 - 0.08 = 0.2627 = 26.27\% \]
Step 3: Dollar VaR
\[ \text{VaR}_{0.99}^{\$} = 5,000,000 \times 0.2627 = \$1,313,500 \]
Interpretation: With 99% confidence, annual losses will not exceed $1.31M.
Or: We expect to lose more than $1.31M in 1 out of 100 years.
A common (but imperfect) approximation: VaR scales with the square root of time.
For a \(T\)-period horizon:
\[ \text{VaR}_\alpha(T) = \sqrt{T} \times \text{VaR}_\alpha(1) \]
Assumptions:
Example: Scale 1-day VaR to 10-day VaR:
\[ \text{VaR}_{10} = \sqrt{10} \times \text{VaR}_1 \approx 3.16 \times \text{VaR}_1 \]
Warning: This scaling is approximate and breaks down for:
Better approach: Compute VaR directly for the desired horizon using the full model.
For a portfolio with weights \(w \in \mathbb{R}^n\) and return covariance matrix \(\Sigma\):
\[ \sigma_p = \sqrt{w^\top \Sigma w} \]
\[ \text{VaR}_\alpha = \sigma_p z_\alpha - w^\top \mu \]
Marginal VaR: Contribution of asset \(i\) to portfolio VaR:
\[ \frac{\partial \text{VaR}_\alpha}{\partial w_i} = z_\alpha \frac{(\Sigma w)_i}{\sigma_p} - \mu_i \]
This measures how VaR changes when we increase allocation to asset \(i\).
Component VaR: How much of total VaR is attributable to asset \(i\):
\[ \text{CVaR}_i = w_i \times \frac{\partial \text{VaR}_\alpha}{\partial w_i} \]
Property: Component VaRs sum to total VaR:
\[ \sum_{i=1}^n \text{CVaR}_i = \text{VaR}_\alpha \]
This decomposition is useful for risk attribution and portfolio management.
Idea: Use the empirical distribution of past returns to estimate VaR.
Assumptions:
Collect historical returns: \(r_1, r_2, \ldots, r_T\) (e.g., daily returns over past 250 days)
Convert to losses: \(\ell_t = -r_t\) for \(t = 1, \ldots, T\)
Sort losses in ascending order: \(\ell_{(1)} \le \ell_{(2)} \le \cdots \le \ell_{(T)}\)
Find the \(\alpha\)-quantile: \(\text{VaR}_\alpha = \ell_{(\lceil \alpha T \rceil)}\)
where \(\lceil x \rceil\) denotes rounding up to the nearest integer.
Example: For 95% VaR with \(T = 250\) days:
\[ \lceil 0.95 \times 250 \rceil = 238 \]
So \(\text{VaR}_{0.95}\) is the 238th worst loss out of 250 observations.
Suppose we have 100 days of historical returns for a portfolio. Here are the 10 worst returns:
| Rank | Return | Loss |
|---|---|---|
| 1 | -5.2% | 5.2% |
| 2 | -4.8% | 4.8% |
| 3 | -4.1% | 4.1% |
| 4 | -3.7% | 3.7% |
| 5 | -3.5% | 3.5% |
| 6 | -3.2% | 3.2% |
| 7 | -2.9% | 2.9% |
| 8 | -2.7% | 2.7% |
| 9 | -2.5% | 2.5% |
| 10 | -2.3% | 2.3% |
95% VaR: \(\lceil 0.95 \times 100 \rceil = 95\)th worst loss
Looking at our sorted data, the 95th observation (5th worst return) gives:
\[ \text{VaR}_{0.95} = 3.5\% \]
99% VaR: \(\lceil 0.99 \times 100 \rceil = 99\)th worst loss
\[ \text{VaR}_{0.99} = 4.8\% \]
For dollar VaR with \(V_0 = \$2,000,000\):
\[ \text{VaR}_{0.95}^{\$} = 2,000,000 \times 0.035 = \$70,000 \]
\[ \text{VaR}_{0.99}^{\$} = 2,000,000 \times 0.048 = \$96,000 \]
Advantages:
Disadvantages:
Improvements:
Idea: Simulate many paths of portfolio returns from a specified model, then compute VaR from the simulated distribution.
Specify a model for return dynamics (e.g., geometric Brownian motion, GARCH, jump-diffusion)
Estimate parameters from historical data
Simulate \(N\) paths of returns over horizon \(T\): \[ r_1^{(1)}, r_2^{(1)}, \ldots, r_N^{(1)} \]
Compute losses: \(\ell_i = -r_i^{(1)}\) for \(i = 1, \ldots, N\)
Sort and extract quantile: As in historical simulation
Typical choice: \(N = 10,000\) to \(100,000\) simulations
Assume the portfolio value follows:
\[ dS_t = \mu S_t dt + \sigma S_t dW_t \]
Over a small time step \(\Delta t\), the return is approximately:
\[ r_t \approx \mu \Delta t + \sigma \sqrt{\Delta t} Z_t, \quad Z_t \sim N(0,1) \]
Algorithm:
Set parameters: \(\mu = 0.08\), \(\sigma = 0.2\), \(\Delta t = 1/252\) (1 trading day)
Simulate \(N = 10,000\) returns: \[ r_i = 0.08 \times \frac{1}{252} + 0.2 \times \sqrt{\frac{1}{252}} \times Z_i, \quad Z_i \sim N(0,1) \]
Convert to losses: \(\ell_i = -r_i\)
Sort: \(\ell_{(1)} \le \ell_{(2)} \le \cdots \le \ell_{(10000)}\)
Extract 95% VaR: \(\text{VaR}_{0.95} = \ell_{(9500)}\)
Result: With these parameters, Monte Carlo simulation gives approximately the same result as the parametric method (since we assumed normality).
Monte Carlo is more powerful when the model is non-normal or when the portfolio contains options.
Motivation: Empirical returns often have heavier tails than normal (more extreme events).
Model: Use a Student’s t-distribution with \(\nu\) degrees of freedom:
\[ r_t \sim t_\nu(\mu, \sigma^2) \]
For \(\nu < \infty\), this has fatter tails than normal. As \(\nu \to \infty\), it converges to normal.
Typical values: \(\nu = 4\) to \(10\) for financial returns
Simulation:
Generate \(Z_i \sim t_\nu\)
Compute returns: \(r_i = \mu + \sigma Z_i\)
Proceed as before
Result: VaR estimates are larger with fat-tailed distributions.
Example: For \(\mu = 0\), \(\sigma = 2\%\) daily:
Fat tails increase VaR by about 45% in this example.
Advantages:
Disadvantages:
Rule of thumb: To estimate the \(\alpha\)-quantile with reasonable accuracy, need at least \(N \sim 100/(1-\alpha)\) simulations.
| Method | Assumptions | Advantages | Best Use Case |
|---|---|---|---|
| Parametric | Normal returns | Fast, analytical | Simple portfolios, stable markets |
| Historical | Past = future | No model needed | When concerned about model risk |
| Monte Carlo | Specified model | Very flexible | Complex portfolios, options, stress testing |
Practical advice:
In practice, sophisticated institutions use all three methods and compare results.
VaR is the most widely used risk measure, but it has serious limitations.
VaR tells us the threshold, but not what happens beyond it.
Example: Two portfolios with the same VaR:
VaR treats these identically, but Portfolio B is far riskier.
Solution: Use Expected Shortfall (ES), which averages losses beyond VaR.
As we’ve seen, VaR can discourage diversification.
Real-world consequence:
Regulatory response: Basel III moved toward Expected Shortfall for this reason.
Parametric VaR assumes normality.
But returns have:
Example: During the 2008 financial crisis, many banks exceeded their 99% daily VaR multiple times per week.
If VaR were correct, this should happen once per 100 days (2-3 times per year).
Implication: Parametric VaR can severely underestimate risk during crises.
VaR is procyclical—it amplifies market cycles.
Mechanism:
This happened in:
VaR-based risk management can destabilize markets during stress.
All VaR methods rely on models or assumptions:
Model risk is the risk that the model is wrong.
Example: In August 2007, Goldman Sachs’ quantitative equity funds lost money on multiple days that their models said were “25-sigma events” (should happen once per 10^135 years).
The models were wrong.
Mitigation:
VaR can be manipulated by:
Regulatory arbitrage: Banks optimize portfolios to minimize regulatory capital (VaR-based), not actual risk.
Consequence: VaR can create a false sense of security.
Before the crisis:
During the crisis:
Post-crisis consensus:
Quote from Nassim Taleb:
“VaR is like an airbag that works in a crash, except when you need it most.”
Why does VaR remain the industry standard?
Modern best practice:
VaR is a useful risk metric, but a dangerous risk management strategy.
A portfolio has:
Assume returns are normally distributed.
Tasks:
Consider a portfolio with two assets:
Tasks:
You have the following 20 daily returns (in %):
\[ 1.2, 0.8, -0.5, 2.1, -1.3, 0.3, -2.8, 1.5, 0.7, -0.9, \]
\[ 1.8, -1.7, 0.4, -3.5, 2.3, -0.6, 1.1, -2.1, 0.9, -1.0 \]
Tasks:
A portfolio has a 1-day 95% VaR of $50,000.
Tasks:
Consider two independent lotteries:
Tasks:
A portfolio has daily returns that follow a Student’s t-distribution with:
For \(t_5\), the 99th percentile is approximately \(z_{0.99} \approx 3.36\).
Tasks:
You have computed a 95% daily VaR for a portfolio. Over the past 250 trading days, your portfolio exceeded the VaR on 20 days.
Tasks:
Hint: Under the null, the number of exceedances follows \(\text{Binomial}(n=250, p=0.05)\).
A portfolio consists of:
Tasks:
VaR is a quantile-based risk measure
Three methods to compute VaR
Parametric VaR formula: \[ \text{VaR}_\alpha = \sigma z_\alpha - \mu \approx \sigma z_\alpha \] for normal returns with small mean
VaR has critical limitations
VaR satisfies some desirable properties
Despite flaws, VaR remains widely used
Best practices:
Key insight: VaR answers “How much can I lose?” but not “How bad can it get beyond that point?”
Next lecture: Coherent risk measures—Expected Shortfall and the axiomatic foundations of risk measurement.
Because regulators learned the hard way that VaR is not enough.