In this blog we will be looking at the Recipe for Pricing.
The recipe starts by mixing together a risky asset and a risk-free asset. Then we take that mixture and feed it into the Itô’s Lemma machine. Out of that machine pops the stochastic differential equation for the mixture. Then we throw that mixture in the Girsanov oven, together with the original real-world probability measure $\mathbb{P}$. Let it cook for 30 minutes at 180 degrees Celsius and out comes a new risk-neutral probability measure $\mathbb{Q}$, along with the cooked mixture and a new Wiener process $\tilde{W}_t$.
With these cooked ingredients we then cool it by placing it in the Martingale Representation Theorem. When it has cooled thoroughly, the cooked mixture reduces to a driftless process. Then all we need to do is put the icing on it by substituting everything back into the original stochastic differential equation for the risky asset, but now with a new, cooked-up Wiener process $\tilde{W}_t$.
So let’s see how this recipe works.
The Black-Scholes Pricing Framework
Risky assets are assets whose value changes randomly. From one instant to the next, the price of a risky asset might move up some amount, or down some amount. It could even stay the same. The direction and the amount of movement are random.
So how do we price such risky assets?
Well, we need to introduce some assumptions. Different sets of assumptions define different so-called pricing frameworks. One very popular framework is the Black-Scholes (BS) pricing framework (also known as the Black-Scholes Model), but there are others. Under the BS framework there are 7 assumptions. We won’t go into them here; suffice it to say that one of the assumptions is: the log return of a risky asset price is a random walk with drift. Another way to say this is: risky asset prices follow a geometric Brownian motion (GBM).
It turns out that this is a pretty good assumption because GBMs exhibit quite a few similarities with what is observed in real life. For example, and this one is fairly simple: a GBM only assumes positive values, just like real (risky) stock prices! Another similarity is that the amount of ‘roughness’ seen in the graph of a GBM is about the same as the amount of ‘roughness’ seen in the graph of a risky stock price.
However, GBMs are not a perfect replication of reality. For example, the volatility of risky stock prices in real life changes over time, even randomly. But the volatility of a GBM is constant over time. Moreover, real life stock prices exhibit jumps caused by unpredictable events, but GBMs are continuous.
Nevertheless, the Black-Scholes model is a very popular model in quantitative finance and can be used to price a risky (random) asset; in this article we will see just how this is done in practice.
First of all, we must operate under the so-called Black-Scholes-Merton framework (or the BSM model). We certainly don’t have to use the BSM model and, in fact, there are a great many other frameworks we could use. Some frameworks (like the Black-76 framework) contain different assumptions that might better suit some other risky asset that refuses to play by the rules of the Black-Scholes framework (for example, commodity futures). But there are risky assets which do play by the Black-Scholes-Merton rules quite well, and it is these which we will focus our attention on in this article.
The BSM Model assumes that the price of a risky asset follows a geometric Brownian motion with drift. This means that the change in price of a risky asset is equal to some amount of drift (up or down) plus some amount of randomness (the size of the up or down movement). The combination of the drift and the amount of randomness can be captured, or represented, by the following equation:
$$\text{change in price} = \text{drift} + \text{randomness}$$
But what good is knowing the change of the price? Isn’t it better to know what the actual price is?
That’s a good question, but defining actual prices is a little too specific. We would prefer a generalised approach to pricing and not tie ourselves to any sort of value magnitude or type (like different currencies). You see, a relative change in price is a dimensionless ratio and is much easier to manipulate and apply to an asset (by simply multiplying it, for instance). Plus, using some sophisticated tools, we will be able to convert a change in price to an actual price anyway.
Let’s put some standard mathematical symbols to our heuristic equation above. I have coloured this equation red to indicate that it is for the risky asset:
$$dS_t = \mu S_t\,dt + \sigma S_t\,dW_t$$
Here, $t$ denotes the passage of time, $S_t$ denotes the risky asset price, $\mu$ is the drift percentage and $\sigma$ is the volatility percentage. What makes this equation geometric is the factor of $S_t$ in the drift and the volatility.
Sometimes you’ll see this equation written divided through by the spot price like so:
$$\frac{dS_t}{S_t} = \mu\,dt + \sigma\,dW_t$$
but the former is more common, mostly to indicate its geometric property.
This equation looks like your typical differential equation except for that $dW_t$ at the end there. This is the increment (informally, the rate of change) of Brownian motion, hence the name; and it makes the equation a stochastic differential equation, or SDE for short.
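To make the SDE a little more tangible, here is a minimal sketch that steps a geometric Brownian motion forward with the Euler-Maruyama scheme. The function name `simulate_gbm_path` and all parameter values (8% drift, 20% volatility, 252 daily steps) are assumptions chosen purely for illustration, and the scheme is only an approximation of the true process.

```python
import math
import random

def simulate_gbm_path(s0, mu, sigma, horizon, n_steps, rng):
    """Euler-Maruyama approximation of dS = mu*S*dt + sigma*S*dW on [0, horizon]."""
    dt = horizon / n_steps
    path = [s0]
    for _ in range(n_steps):
        # A Brownian increment over dt is normal with mean 0 and variance dt.
        dw = rng.gauss(0.0, math.sqrt(dt))
        s = path[-1]
        # Drift part plus random part, both scaled by the current price.
        path.append(s + mu * s * dt + sigma * s * dw)
    return path

rng = random.Random(42)  # fixed seed so the run is repeatable
path = simulate_gbm_path(s0=100.0, mu=0.08, sigma=0.2, horizon=1.0, n_steps=252, rng=rng)
print(f"start = {path[0]:.2f}, end = {path[-1]:.2f}, steps = {len(path) - 1}")
```

Because both the drift and the random term are multiplied by the current price, the size of each move scales with the price level, which is the geometric property mentioned above.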
As mentioned before, the change in the price of the risky asset $dS_t$, although general, doesn’t do us much good when it comes to finding the actual physical price. We need the actual price, denoted $S_t$; and the actual price today would be denoted with a $0$, i.e. by $S_0$.
It is very difficult to take a randomly evolving thing and create something specific (like an actual price) from it. But it turns out we can with the help of a few mathematical tools and another asset: the risk-free asset.
Combining a risk-free asset with a risky asset to annihilate randomness was a key insight of the Black-Scholes-Merton framework of 1973.
So the theme for the rest of the article is to derive an actual price given just the 7 assumptions of the BSM model and the stochastic differential equation expressing the rate of change of a risky asset.
What else do we need to assume?
The next assumption we need from the BSM model to be able to make any progress in our quest is the one that states that the rate of return of a risk-free asset is constant, a reasonable assumption. This has a differential equation representation too, and I’ve coloured this one blue to indicate that it is risk free:
$$dB_t = r B_t\,dt$$
where $B_t$ is the value of the risk-free asset at time $t$, and $r$ is the so-called risk-free interest rate. Note that this equation has no random component, i.e. there is no $dW_t$ part.
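Since the risk-free equation has no random part, it can be solved with ordinary calculus. As a short worked step, writing $B_t$ for the risk-free asset value, $r$ for the constant rate and $B_0$ for the starting value (symbols assumed here), separating the variables and integrating gives:

$$
\begin{aligned}
\frac{dB_t}{B_t} &= r\,dt \\
\int_{B_0}^{B_t} \frac{dB}{B} &= \int_0^t r\,du \\
\ln\frac{B_t}{B_0} &= rt \\
B_t &= B_0\,e^{rt}
\end{aligned}
$$

So a constant rate of return is the same thing as exponential growth, and dividing by $e^{rt}$ undoes that growth.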
As it turns out, defining such an object in the BSM model allows us to discount the risky asset price from any future time back to today by the risk-free rate. This is referred to as discounting by the risk-free rate. Furthermore, this assumption allows us to utilise continuous discounting, which means we can define
$$B_t = e^{rt}$$
(taking $B_0 = 1$), with $e^{-rt}$ as the corresponding discount factor.
Continuous discounting and the exponential
Why is continuous discounting represented by the exponential $e^{rt}$, you may wonder?
Suppose you begin with an amount of money $M_0$ at time $t = 0$ and there is some length of time $T$ that you hold that money for, during which time it accrues interest.
One thing you could do is to divide that length of time up into smaller pieces (a method known as discretisation), each with a length of time $\Delta t$; and over each one of these smaller time periods we will apply the interest rate $r$, causing a small increase $\Delta M$ in the amount of money you have.
How many small intervals of $\Delta t$ are there? Well, that is equal to the total length $T$ divided by the length of the smaller intervals $\Delta t$, thus the number of small intervals is:
$$N = \frac{T}{\Delta t}$$
What interest rates do is this: they create a change in the amount of money by taking an interest rate $r$, scaling it by the time interval $\Delta t$, and then multiplying all of that by the amount of money you have. The formula for this growth action on the initial amount of money is, very simply, the product of all these little actions:
$$\Delta M = r\,\Delta t\,M_0$$
Thus, an initial amount of money $M_0$ will grow by an amount $\Delta M$, i.e.
$$M_1 = M_0 + r\,\Delta t\,M_0$$
factoring out the initial amount of money gives
$$M_1 = M_0\,(1 + r\,\Delta t)$$
Then, for each little time step we are simply going to keep performing this mapping of taking the money from the previous time step, and multiplying it by a factor of $(1 + r\,\Delta t)$, over and over again, precisely $N$ times, until we hit $T$.
Since multiplication commutes, this means that for $N$ time steps, we will have $N$ factors of $(1 + r\,\Delta t)$, thus:
$$M_T = M_0\,(1 + r\,\Delta t)^N$$
We already know how $\Delta t$ and $N$ are related (above), so we can substitute $\Delta t = T/N$ in:
$$M_T = M_0\left(1 + \frac{rT}{N}\right)^N$$
Now we need to un-discretise time, and the way we do this is to apply the limit as $N \to \infty$, i.e. see what happens to $M_T$ as we increase the number of time steps to infinity:
$$M_T = \lim_{N \to \infty} M_0\left(1 + \frac{rT}{N}\right)^N$$
If we make the substitution $x = rT$, then we get
$$M_T = \lim_{N \to \infty} M_0\left(1 + \frac{x}{N}\right)^N$$
and then a second substitution $n = N/x$ (with $r > 0$, so that $n \to \infty$ as $N \to \infty$), then we get
$$M_T = \lim_{n \to \infty} M_0\left(1 + \frac{1}{n}\right)^{nx}$$
then using the property of exponents to split the exponent up, we get
$$M_T = M_0\left[\lim_{n \to \infty}\left(1 + \frac{1}{n}\right)^{n}\right]^{x}$$
and now, the part in brackets, together with the limit, is well-known to be $e$:
$$M_T = M_0\,e^{x}$$
Finally, we need to reverse our substitutions:
$$M_T = M_0\,e^{rT}$$
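The limit argument above can be checked numerically. The sketch below compounds an initial amount over an increasing number of steps and compares the result with the exponential; the helper name `compound` and the values (100 units, 5% rate, 2 years) are illustrative assumptions only.

```python
import math

def compound(m0, r, horizon, n_steps):
    """Grow m0 by a factor (1 + r*dt) once per step, for n_steps steps of length dt."""
    dt = horizon / n_steps
    return m0 * (1.0 + r * dt) ** n_steps

m0, r, horizon = 100.0, 0.05, 2.0
continuous = m0 * math.exp(r * horizon)  # the claimed limit

for n_steps in (1, 12, 365, 1_000_000):
    discrete = compound(m0, r, horizon, n_steps)
    print(f"N = {n_steps:>9}: discrete = {discrete:.6f}, gap to limit = {continuous - discrete:.6f}")
```

As the number of steps grows, the gap between the discretely compounded amount and the exponential shrinks towards zero, which is the un-discretising step carried out in the derivation.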
Combining a Risky Asset with a Risk-Free Asset
By continuously discounting at the risk-free interest rate we have essentially defined a new function as the product of the risk-free discount factor and the risky asset, which I’ll show as green to indicate that it is a mixture of the two:
$$Z_t = e^{-rt} S_t$$
As written, this function is also random and contains drift (both properties of risky things) because it contains a factor of $S_t$ and hence contains Brownian motion; and, as much as we’d like to, we can’t just write:
$$S_0 = e^{-rt} S_t$$
as this gives us no information because the right-hand side still contains the random $S_t$.
As before, let’s look at the change in $Z_t$, i.e. let’s see if we can derive $dZ_t$.
It’s difficult to take the derivative of a random function. But luckily we can use Itô’s Lemma (which is basically the random, stochastic version of the chain rule) to perform the mapping
$$Z_t \mapsto dZ_t$$
So let’s look at how we do that…
Practical Itô’s Lemma
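Once you have a composite function like $Z_t = f(t, S_t) = e^{-rt} S_t$ you can apply Itô’s lemma to derive its differential $dZ_t$. The formula for doing this is:
$$dZ_t = \frac{\partial f}{\partial t}\,dt + \frac{\partial f}{\partial S}\,dS_t + \frac{1}{2}\frac{\partial^2 f}{\partial t^2}\,(dt)^2 + \frac{1}{2}\frac{\partial^2 f}{\partial S^2}\,(dS_t)^2$$
which is really just a sum of four parts and is derived from the Taylor series expansion of $f$ which looks like this:
$$df = \frac{\partial f}{\partial t}\,dt + \frac{\partial f}{\partial S}\,dS_t + \frac{1}{2}\frac{\partial^2 f}{\partial t^2}\,(dt)^2 + \frac{\partial^2 f}{\partial t\,\partial S}\,dt\,dS_t + \frac{1}{2}\frac{\partial^2 f}{\partial S^2}\,(dS_t)^2 + \cdots$$
(the mixed term drops out because $dt\,dS_t$ is of higher order than $dt$).
Both of these expansions are worth memorising if you are going to be doing a lot of these kinds of calculations.
But now we have to do some maths…
The first part involves the calculation of the partial derivative of $f$ with respect to time $t$. But first, let us make the notation a little easier to handle by defining the function $R(t)$ of $t$ as the exponent:
$$R(t) = \int_0^t r\,du, \qquad f(t, S_t) = e^{-R(t)} S_t$$
Then, recalling the fundamental theorem of calculus, we immediately have that
$$\frac{dR}{dt} = r$$
which will be a useful result for us in just a second.
Right, let’s jump in and perform this partial derivative. The only explicit $t$ appears in the exponent and the variable $S_t$ is considered constant with respect to this partial derivative, hence:
$$\frac{\partial f}{\partial t} = -\frac{dR}{dt}\,e^{-R(t)} S_t = -r e^{-rt} S_t$$
The second part is much easier. We just take the partial derivative of $f$ with respect to $S_t$. There is only one explicit $S_t$ and the exponential is considered constant, hence:
$$\frac{\partial f}{\partial S} = e^{-rt}$$
The third part is always trivial because
$$(dt)^2 = 0$$
The final and fourth part involves calculation of the second partial derivative of $f$ with respect to $S_t$. In this case, we can take the result of the 2nd part and just take the partial derivative once more to obtain:
$$\frac{\partial^2 f}{\partial S^2} = 0$$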
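The partial derivatives of the discounted price function can be sanity-checked with finite differences. This is only a rough numerical sketch: it assumes the function is $f(t, S) = e^{-rt} S$, and the rate `R`, the step sizes and the evaluation point are arbitrary illustrative choices.

```python
import math

R = 0.05  # assumed constant risk-free rate, for illustration only

def f(t, s):
    """Discounted price f(t, S) = exp(-R*t) * S."""
    return math.exp(-R * t) * s

def d_dt(t, s, h=1e-6):
    """Central-difference estimate of the partial derivative in t (S held fixed)."""
    return (f(t + h, s) - f(t - h, s)) / (2 * h)

def d_ds(t, s, h=1e-4):
    """Central-difference estimate of the partial derivative in S (t held fixed)."""
    return (f(t, s + h) - f(t, s - h)) / (2 * h)

def d2_ds2(t, s, h=1e-2):
    """Central-difference estimate of the second partial derivative in S."""
    return (f(t, s + h) - 2 * f(t, s) + f(t, s - h)) / (h * h)

t, s = 1.5, 120.0
print("df/dt    numeric:", d_dt(t, s), " analytic -R*f:", -R * f(t, s))
print("df/dS    numeric:", d_ds(t, s), " analytic exp(-R*t):", math.exp(-R * t))
print("d2f/dS2  numeric:", d2_ds2(t, s), " analytic: 0")
```

The numerical estimates should agree with the analytic expressions up to small rounding errors, and the second derivative in $S$ vanishes because $f$ is linear in $S$.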
Combining Parts for Ito’s Lemma
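Combining the four parts gives $dZ_t = -r e^{-rt} S_t\,dt + e^{-rt}\,dS_t$. Now we need to substitute back in our original equation for $dS_t$:
$$dZ_t = -r e^{-rt} S_t\,dt + e^{-rt}\left(\mu S_t\,dt + \sigma S_t\,dW_t\right)$$
We can’t go any further until we rearrange $Z_t = e^{-rt} S_t$ into
$$S_t = e^{rt} Z_t$$
and notice that the negative exponent has now become a positive one. Now,
$$dZ_t = -r Z_t\,dt + \mu Z_t\,dt + \sigma Z_t\,dW_t$$
and, collecting like terms to factor out the $dt$, we obtain:
$$dZ_t = (\mu - r) Z_t\,dt + \sigma Z_t\,dW_t$$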
Rearrange to get dW Alone
The next step is to get the above equation into the right form.
Think about what we are going to do next: we are going to use Girsanov’s theorem to make an argument about $dW_t$. Thus, $dW_t$ should be isolated and by itself, with nothing out the front of it. In this step we ensure this by factoring out any coefficient of $dW_t$. In our case we have this factor of $\sigma Z_t$, so we factor it out:
$$dZ_t = \sigma Z_t\left(\frac{\mu - r}{\sigma}\,dt + dW_t\right)$$
and now the $dW_t$ component is isolated and we can implement Girsanov’s theorem.
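Before changing measure, it may help to see the drift of the discounted price numerically. The sketch below is a Monte Carlo check under the real-world measure; it relies on the standard closed-form solution of geometric Brownian motion, $S_T = S_0 \exp\left((\mu - \tfrac{1}{2}\sigma^2)T + \sigma W_T\right)$, which is not derived in this post, and every name and parameter value in it is an assumption made for illustration.

```python
import math
import random

def discounted_terminal_mean(s0, mu, sigma, r, horizon, n_paths, seed=0):
    """Monte Carlo estimate of E_P[exp(-r*T) * S_T] for a geometric Brownian motion."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_paths):
        w_t = rng.gauss(0.0, math.sqrt(horizon))  # W_T ~ N(0, T) under P
        # Closed-form GBM solution (a standard result, assumed here).
        s_t = s0 * math.exp((mu - 0.5 * sigma ** 2) * horizon + sigma * w_t)
        total += math.exp(-r * horizon) * s_t
    return total / n_paths

s0, mu, sigma, r, horizon = 100.0, 0.08, 0.2, 0.03, 1.0
estimate = discounted_terminal_mean(s0, mu, sigma, r, horizon, n_paths=200_000)
expected = s0 * math.exp((mu - r) * horizon)  # growth at the excess rate mu - r
lam = (mu - r) / sigma                        # the dt coefficient after factoring

print(f"Monte Carlo mean = {estimate:.3f}, S0*exp((mu - r)T) = {expected:.3f}, lambda = {lam:.3f}")
```

Under these assumptions the average discounted price drifts away from $S_0$ at the rate $\mu - r$, so it is not a martingale under the original measure.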
Using Girsanov’s Theorem
Girsanov’s theorem is a theorem about $dW_t$ and about $\lambda$. What is $\lambda$? Well, $\lambda$ is the coefficient of $dt$ that occurs when you do the re-arranging step we did above. It’s easy to define once $dW_t$ is isolated by factoring, and for our example it is just
$$\lambda = \frac{\mu - r}{\sigma}$$
What about the $\sigma Z_t$ common to both $dt$ and to $dW_t$?
Well, it’s not really about the pieces common to both. The parameter $\lambda$ is just the coefficient relating to the $dt$ part, that’s it. I guess you could just divide both sides by that $\sigma Z_t$ and it really doesn’t feature on the right-hand side anymore.
So, once we have found lambda we can implement Girsanov’s Theorem. How do we do that?
Well, it is pretty much just writing down the same paragraph. Every time you do this step, you’ll be stating the same thing over and over again without ever changing anything, so it is worth memorising it. I have, and now I know that when I get to this stage, I just blurt it out, word-for-word, without thinking. It’s needed though, and it’s important, because you can’t proceed to the next step unless you have clearly stated it. So let’s do it:
By Girsanov’s Theorem there exists an equivalent martingale measure (EMM) $\mathbb{Q}$ on the filtration $\mathcal{F}_T$ defined by the Radon-Nikodym derivative
$$\frac{d\mathbb{Q}}{d\mathbb{P}} = \exp\left(-\int_0^T \lambda\,dW_s - \frac{1}{2}\int_0^T \lambda^2\,ds\right)$$
such that under the measure $\mathbb{Q}$, the stochastic process
$$\tilde{W}_t = W_t + \int_0^t \lambda\,ds$$
is a $\mathbb{Q}$-standard Wiener process and is
$$d\tilde{W}_t = dW_t + \lambda\,dt$$
…by differentiating both sides with respect to $t$.
That’s it. This theorem only requires that we have defined precisely what $\lambda$ is (which is why we went to all the trouble of factoring everything out so we could find it!), the filtration $\mathcal{F}_t$ is given right at the beginning when we defined our probability space, and this theorem simply produces a new probability measure $\mathbb{Q}$, and a new Wiener process $\tilde{W}_t$ on the new measure (it’s obviously not a Wiener process on the original probability measure $\mathbb{P}$).
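One way to get a feel for Girsanov’s theorem is to treat the Radon-Nikodym derivative as a weight on samples drawn under the original measure. The sketch below assumes a constant $\lambda$ (so the derivative reduces to $\exp(-\lambda W_T - \tfrac{1}{2}\lambda^2 T)$) and checks that the shifted variable $W_T + \lambda T$ has mean 0 and variance $T$ once the weights are applied. The function name, the sample size and the value $\lambda = 0.25$ are illustrative assumptions.

```python
import math
import random

def girsanov_check(lam, horizon, n_paths, seed=1):
    """Reweight P-samples of W_T by dQ/dP and return moments of W~_T = W_T + lam*T."""
    rng = random.Random(seed)
    sum_weight = sum_p = sum_q = sum_q_sq = 0.0
    for _ in range(n_paths):
        w = rng.gauss(0.0, math.sqrt(horizon))                 # W_T sampled under P
        weight = math.exp(-lam * w - 0.5 * lam * lam * horizon)  # dQ/dP for constant lam
        w_tilde = w + lam * horizon                            # candidate Q-Wiener value
        sum_weight += weight
        sum_p += w_tilde                  # plain average: expectation under P
        sum_q += weight * w_tilde         # weighted average: expectation under Q
        sum_q_sq += weight * w_tilde ** 2
    n = float(n_paths)
    return sum_weight / n, sum_p / n, sum_q / n, sum_q_sq / n

mean_weight, p_mean, q_mean, q_second = girsanov_check(lam=0.25, horizon=1.0, n_paths=200_000)
print(f"E_P[dQ/dP]   = {mean_weight:.4f}  (should be near 1)")
print(f"E_P[W~_T]    = {p_mean:.4f}  (near lam*T, so not centred under P)")
print(f"E_Q[W~_T]    = {q_mean:.4f}  (near 0)")
print(f"E_Q[W~_T^2]  = {q_second:.4f}  (near T)")
```

The weights average to one, which is what makes the new measure a probability measure, and the shifted process only looks like a centred Wiener process after reweighting.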
The Martingale Representation Theorem
Now that we have been handed the new Wiener process $\tilde{W}_t$ by Girsanov, what do we do with it?
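Ultimately, we have a stochastic process $Z_t$, and we know $dZ_t$ by virtue of Itô’s lemma. But these processes are driven by a Wiener process under the original probability measure $\mathbb{P}$. We need the equivalent dynamics driven by $\tilde{W}_t$, which is Wiener under the new probability measure $\mathbb{Q}$!
The martingale representation theorem comes to the rescue, because it provides the stochastic process $\phi_t$ multiplying $d\tilde{W}_t$, so long as you have a driftless process in the first place.
OK, so what do we have? Our stochastic process (under $\mathbb{P}$) looks like this so far:
$$dZ_t = (\mu - r) Z_t\,dt + \sigma Z_t\,dW_t$$
This is certainly not driftless. Just look at that almighty drift as the coefficient of the $dt$ term.
OK, let’s see what this stochastic process looks like under $\mathbb{Q}$ using the results from Girsanov’s theorem. We begin by substituting in $\lambda = (\mu - r)/\sigma$,
$$dZ_t = \sigma Z_t\left(\lambda\,dt + dW_t\right)$$
Let’s throw in what $dW_t$ equals, namely $dW_t = d\tilde{W}_t - \lambda\,dt$:
$$dZ_t = \sigma Z_t\left(\lambda\,dt + d\tilde{W}_t - \lambda\,dt\right)$$
The lambdas cancel out, and we are left with:
$$dZ_t = \sigma Z_t\,d\tilde{W}_t$$
which is driftless! … no $dt$ component!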
We then define
$$\phi_t = \sigma Z_t$$
…to be the coefficient of $d\tilde{W}_t$, which we will use in the next step.
OK, so now we have a driftless stochastic process, albeit one under the measure $\mathbb{Q}$. Which means we can use the Martingale Representation Theorem. Again, it’s a bit of a word-fest:
Under the $\mathbb{Q}$-measure, the stochastic process $Z_t$ is driftless, so by the Martingale Representation Theorem there exists a $\phi_t$, adapted to the filtration $\mathcal{F}_t$, such that
$$dZ_t = \phi_t\,d\tilde{W}_t$$
OK, so we take that formula, and plug in our value for $\phi_t$, giving us:
$$dZ_t = \sigma Z_t\,d\tilde{W}_t$$
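In the final step, we take Girsanov’s Theorem results:
$$dW_t = d\tilde{W}_t - \lambda\,dt, \qquad \lambda = \frac{\mu - r}{\sigma}$$
and substitute them into the original stochastic differential equation:
$$dS_t = \mu S_t\,dt + \sigma S_t\left(d\tilde{W}_t - \frac{\mu - r}{\sigma}\,dt\right) = r S_t\,dt + \sigma S_t\,d\tilde{W}_t$$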
…and we are done!
We have successfully mixed the risky asset $S_t$, with drift $\mu$, with the risk-free asset $B_t$, with risk-free interest rate $r$, to produce a new risky asset where the drift $\mu$ has been replaced by the risk-free interest rate $r$.
Under the risk-neutral probability measure $\mathbb{Q}$, with Wiener process $\tilde{W}_t$, the drift of the risky asset becomes the risk-free interest rate $r$.
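The end result can also be checked by simulation. The sketch below samples the risky asset directly under the risk-neutral measure, using the standard closed-form solution of a geometric Brownian motion with drift $r$ (a known result that is assumed here, not derived in this post), and confirms that the discounted price averages back to today’s price at several horizons. All names and parameter values are hypothetical.

```python
import math
import random

def discounted_mean_under_q(s0, r, sigma, horizon, n_paths, rng):
    """Monte Carlo estimate of exp(-r*T) * E_Q[S_T] when dS = r*S*dt + sigma*S*dW~."""
    total = 0.0
    for _ in range(n_paths):
        w_tilde = rng.gauss(0.0, math.sqrt(horizon))  # W~_T ~ N(0, T) under Q
        # Closed-form solution with the drift mu replaced by r.
        s_t = s0 * math.exp((r - 0.5 * sigma ** 2) * horizon + sigma * w_tilde)
        total += s_t
    return math.exp(-r * horizon) * total / n_paths

s0, r, sigma = 100.0, 0.03, 0.2
rng = random.Random(7)
results = {}
for horizon in (0.5, 1.0, 2.0):
    results[horizon] = discounted_mean_under_q(s0, r, sigma, horizon, 100_000, rng)
    print(f"T = {horizon}: discounted mean = {results[horizon]:.3f} (S0 = {s0})")
```

Note that the real-world drift $\mu$ does not appear anywhere in this sketch: under the risk-neutral measure only $r$ and $\sigma$ matter, and the discounted mean stays close to $S_0$ whatever the horizon.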
In this blog, we considered a market consisting of just one risky asset and one risk-free asset. We assumed that the price dynamics of these two assets were driven by stochastic differential equations: the risky one by a Wiener process, and the risk-free one by a deterministic ordinary differential equation.
We then formed a new stochastic process by continuously discounting the risky one by the risk-free rate. This new process was not a martingale under the original measure, because when we used Itô’s lemma to find the price dynamics of the new process, we found that it had drift. Scaled by the volatility, this drift is $\lambda = (\mu - r)/\sigma$, a quantity known as the market price of risk.
Then we implemented Girsanov’s theorem to find a new probability measure such that the discounted price process is a martingale.
And finally, we found the equivalent risky asset price dynamics under the new probability measure, whose discounted value is indeed a martingale under that measure.
If the risky asset is, say, the price of stock, then what we have just shown is that the discounted stock price is a martingale under the risk-neutral measure.
This allowed us to find a probability measure and Wiener process such that the drift is replaced by the risk-free interest rate.
- Define a risky stochastic process, $dS_t$.
- Define a risk-free stochastic process, $dB_t$.
- Form some function or composition of the two: $Z_t = f(t, S_t)$.
- Use Itô’s lemma (or its product or quotient rule) to find $dZ_t$.
- Rearrange and factor stuff to get $dW_t$ by itself.
- Note the coefficient of $dt$ as lambda: $\lambda$.
- Implement Girsanov’s theorem and insert $\lambda$ into the equation for $d\mathbb{Q}/d\mathbb{P}$. Get $\mathbb{Q}$.
- State the results of Girsanov’s theorem, get $\tilde{W}_t$.
- Differentiate w.r.t. $t$ and rearrange to find $dW_t$ in terms of $d\tilde{W}_t$.
- Form a new stochastic process under $\mathbb{Q}$, and show that it should be driftless.
- Implement the Martingale Representation Theorem. State equation of $dZ_t$ in terms of $d\tilde{W}_t$.
- Substitute $dW_t$ back into the original risky stochastic differential equation $dS_t$. Rearrange, cancel stuff, factor, and get the final SDE under $\mathbb{Q}$, whose discounted solution is a $\mathbb{Q}$-martingale.
- Itô, Kiyosi, “Stochastic Integral” Proc. Imperial Acad. Tokyo 20, 519-524, (1944)