Longstaff–Schwartz Methods

A short-rate model, in the context of interest rate derivatives, is a mathematical model that describes the future evolution of interest rates by describing the future evolution of the short rate, usually written $r_{t}\,$ .

The short rate

Under a short rate model, the stochastic state variable is taken to be the instantaneous spot rate.^[1] The short rate, $r_{t}\,$ , then, is the (continuously compounded, annualized) interest rate at which an entity can borrow money for an infinitesimally short period of time from time $t$ . Specifying the current short rate does not specify the entire yield curve. However, no-arbitrage arguments show that, under some fairly relaxed technical conditions, if we model the evolution of $r_{t}\,$ as a stochastic process under a risk-neutral measure $Q$ , then the price at time $t$ of a zero-coupon bond maturing at time $T$ with a payoff of 1 is given by

P(t,T)=\operatorname {E} ^{Q}\left[\left.\exp {\left(-\int _{t}^{T}r_{s}\,ds\right)}\right|{\mathcal {F}}_{t}\right],

where ${\mathcal {F}}$ is the natural filtration for the process. The interest rates implied by the zero coupon bonds form a yield curve, or more precisely, a zero curve. Thus, specifying a model for the short rate specifies future bond prices. This means that instantaneous forward rates are also specified by the usual formula

f(t,T)=-{\frac {\partial }{\partial T}}\ln(P(t,T)).
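
The two formulas above can be checked numerically. The following is a minimal sketch, not a production pricer: it assumes the mean-reverting dynamics $dr_{t}=a(b-r_{t})\,dt+\sigma \,dW_{t}$ (the Vasicek model described later in this article) purely as an example, estimates $P(0,T)$ by averaging $\exp(-\int _{0}^{T}r_{s}\,ds)$ over Euler-discretized paths, and compares the result with that model's known closed-form bond price. The function names and parameter values are illustrative assumptions.

```python
import math
import random


def vasicek_bond_mc(r0, a, b, sigma, T, n_paths=4000, n_steps=100, seed=0):
    """Monte Carlo estimate of P(0,T) = E^Q[exp(-int_0^T r_s ds)]."""
    rng = random.Random(seed)
    dt = T / n_steps
    sqrt_dt = math.sqrt(dt)
    total = 0.0
    for _ in range(n_paths):
        r = r0
        integral = 0.0
        for _ in range(n_steps):
            # Euler step for dr = a(b - r) dt + sigma dW
            r_next = r + a * (b - r) * dt + sigma * sqrt_dt * rng.gauss(0.0, 1.0)
            integral += 0.5 * (r + r_next) * dt  # trapezoidal rule for int r ds
            r = r_next
        total += math.exp(-integral)
    return total / n_paths


def vasicek_bond_exact(r0, a, b, sigma, T):
    """Closed-form Vasicek price P(0,T) = A(T) * exp(-B(T) * r0)."""
    B = (1.0 - math.exp(-a * T)) / a
    ln_A = (b - sigma**2 / (2.0 * a**2)) * (B - T) - sigma**2 * B**2 / (4.0 * a)
    return math.exp(ln_A - B * r0)


def forward_rate(bond_price, T, h=1e-4):
    """f(0,T) = -d/dT ln P(0,T), approximated by a central difference."""
    return -(math.log(bond_price(T + h)) - math.log(bond_price(T - h))) / (2.0 * h)


# Illustrative parameter values (assumptions, not taken from the article).
params = dict(r0=0.03, a=0.5, b=0.04, sigma=0.01)
mc_price = vasicek_bond_mc(T=5.0, **params)
exact_price = vasicek_bond_exact(T=5.0, **params)
fwd = forward_rate(lambda T: vasicek_bond_exact(T=T, **params), 5.0)
print(f"MC price {mc_price:.4f}, closed form {exact_price:.4f}, f(0,5) {fwd:.4f}")
```

The forward rate is differentiated from the closed-form price rather than from the Monte Carlo estimate, because a finite difference of a noisy estimate would amplify the sampling error.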

Short rate models are often classified as endogenous or exogenous. Endogenous short rate models are those in which the term structure of interest rates, or of zero-coupon bond prices $T\mapsto P(0,T)$ , is an output of the model, so it is "inside the model" (endogenous) and is determined by the model parameters. Exogenous short rate models are those in which the term structure is an input: the model involves time-dependent functions or shifts that allow a given market term structure to be fed in, so that the term structure comes from outside (exogenous).^[2]

Particular short-rate models

Throughout this section $W_{t}\,$ represents a standard Brownian motion under a risk-neutral probability measure and $dW_{t}\,$ its differential. Where the model is lognormal, a variable $X_{t}$ is assumed to follow an Ornstein–Uhlenbeck process and $r_{t}\,$ is assumed to follow $r_{t}=\exp {X_{t}}\,$ .

One-factor short-rate models

Following are the one-factor models, where a single stochastic factor, the short rate, determines the future evolution of all interest rates. Other than Rendleman–Bartter and Ho–Lee, which do not capture the mean reversion of interest rates, these models can be thought of as specific cases of Ornstein–Uhlenbeck processes.

The Vasicek, Rendleman–Bartter and CIR models are endogenous models with only a finite number of free parameters, so the parameter values can be chosen to make the model coincide with only a few observed market prices ("calibration"), typically of zero-coupon bonds or linear products such as forward rate agreements or swaps; alternatively, a best fit to these linear products gives the endogenous model parameters that are closest to the market prices. This leaves no room for fitting options such as caps, floors and swaptions, because the parameters have already been used to fit the linear instruments. The problem is overcome by allowing the parameters to vary deterministically with time,^[3]^[4] or by adding a deterministic shift to the endogenous model.^[5] In this way, exogenous models such as Ho–Lee and its successors can be calibrated to market data, meaning that they exactly return the prices of the bonds comprising the yield curve, while the remaining parameters can be used for calibration to options.

Implementation is usually via a (binomial) short-rate tree^[6] or simulation; see Lattice model (finance) § Interest rate derivatives and Monte Carlo methods for option pricing. Some short-rate models, however, have closed-form solutions for zero-coupon bonds, and even for caps or floors, which eases the calibration task considerably. We list the endogenous models first.

Merton's model (1973) explains the short rate as $r_{t}=r_{0}+at+\sigma W_{t}^{*}$ , where $W_{t}^{*}$ is a one-dimensional Brownian motion under the spot martingale measure.^[7] In this approach, the short rate follows an arithmetic Brownian motion.
The Vasicek model (1977) models the short rate as $dr_{t}=(\theta -\alpha r_{t})\,dt+\sigma \,dW_{t}$ ; it is often written $dr_{t}=a(b-r_{t})\,dt+\sigma \,dW_{t}$ .^[8] The second form is the more common and makes the interpretation of the parameters more direct: $a$ is the speed of mean reversion, $b$ is the long-term mean, and $\sigma$ is the instantaneous volatility. In this model the short rate follows an Ornstein–Uhlenbeck process. The model allows for negative rates, because the probability distribution of the short rate is Gaussian. It also admits closed-form solutions for the bond price and for bond options and caps/floors, and, using Jamshidian's trick, a formula for swaptions.^[2]
The Rendleman–Bartter model (1980)^[9] or Dothan model (1978)^[10] explains the short rate as $dr_{t}=\theta r_{t}\,dt+\sigma r_{t}\,dW_{t}$ . In this model the short rate follows a geometric Brownian motion. This model does not have closed-form formulas for options and is not mean reverting. Moreover, it suffers from an infinite expected bank account after a short time, a problem shared by all lognormal short rate models.^[2]
The Cox–Ingersoll–Ross model (1985) supposes $dr_{t}=(\theta -\alpha r_{t})\,dt+{\sqrt {r_{t}}}\,\sigma \,dW_{t}$ ; it is often written $dr_{t}=a(b-r_{t})\,dt+{\sqrt {r_{t}}}\,\sigma \,dW_{t}$ . The $\sigma {\sqrt {r_{t}}}$ factor precludes (generally) the possibility of negative interest rates.^[11] The interpretation of the parameters in the second formulation is the same as in the Vasicek model. The Feller condition $2ab>\sigma ^{2}$ ensures strictly positive short rates. The short rate follows a Feller square-root process and is non-negative, and the model admits closed-form solutions for the bond price and for bond options and caps/floors, and, using Jamshidian's trick, a formula for swaptions. Both this model and the Vasicek model are called affine models, because the continuously compounded spot rate at time $t$ for a finite maturity $T$ is an affine function of $r_{t}$ .^[2]
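
As an illustration of the CIR dynamics and the Feller condition described above, the following sketch simulates the terminal short rate and compares the sample mean with the known conditional mean $b+(r_{0}-b)e^{-aT}$ . It uses a full-truncation Euler scheme, which is one common discretization choice and an assumption of this example rather than something prescribed by the model; the function names and parameter values are likewise illustrative.

```python
import math
import random


def feller_condition_holds(a, b, sigma):
    """True when 2ab > sigma^2, i.e. the short rate stays strictly positive."""
    return 2.0 * a * b > sigma**2


def simulate_cir_terminal(r0, a, b, sigma, T, n_paths=4000, n_steps=100, seed=0):
    """Simulate r_T under dr = a(b - r) dt + sigma sqrt(r) dW.

    Full-truncation Euler: the state x may dip below zero because of the
    discretization, but only its positive part enters the drift and the
    diffusion, and the reported rate is max(x, 0).
    """
    rng = random.Random(seed)
    dt = T / n_steps
    sqrt_dt = math.sqrt(dt)
    terminal = []
    for _ in range(n_paths):
        x = r0
        for _ in range(n_steps):
            x_pos = max(x, 0.0)
            x += a * (b - x_pos) * dt + sigma * math.sqrt(x_pos) * sqrt_dt * rng.gauss(0.0, 1.0)
        terminal.append(max(x, 0.0))
    return terminal


# Illustrative parameter values (assumptions): 2ab = 0.04 > sigma^2 = 0.01.
r0, a, b, sigma, T = 0.03, 0.5, 0.04, 0.1, 2.0
rates = simulate_cir_terminal(r0, a, b, sigma, T)
mean_rate = sum(rates) / len(rates)
expected_mean = b + (r0 - b) * math.exp(-a * T)
print(feller_condition_holds(a, b, sigma), round(mean_rate, 4), round(expected_mean, 4))
```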

We now list a number of exogenous short rate models.

The Ho–Lee model (1986) models the short rate as $dr_{t}=\theta _{t}\,dt+\sigma \,dW_{t}$ .^[12] The parameter $\theta _{t}$ allows the initial term structure of interest rates or bond prices to be an input of the model. The short rate again follows an arithmetic Brownian motion, now with a time-dependent deterministic drift.
The Hull–White model (1990)—also called the extended Vasicek model—posits $dr_{t}=(\theta _{t}-\alpha _{t}r_{t})\,dt+\sigma _{t}\,dW_{t}$ . In many presentations one or more of the parameters $\theta ,\alpha$ and $\sigma$ are not time-dependent. The distribution of the short rate is normal, and the model allows for negative rates. The model with constant $\alpha$ and $\sigma$ is the most commonly used and it allows for closed form solutions for bond prices, bond options, caps and floors, and swaptions through Jamshidian's trick. This model allows for an exact calibration of the initial term structure of interest rates through the time dependent function $\theta _{t}$ . Lattice-based implementation for Bermudan swaptions and for products without analytical formulas is usually trinomial.^[13]^[14]
The Black–Derman–Toy model (1990) has $d\ln(r)=[\theta _{t}+{\frac {\sigma '_{t}}{\sigma _{t}}}\ln(r)]dt+\sigma _{t}\,dW_{t}$ for time-dependent short rate volatility and $d\ln(r)=\theta _{t}\,dt+\sigma \,dW_{t}$ otherwise; the model is lognormal.^[15] The model has no closed-form formulas for options. Also, like all lognormal models, it suffers from the explosion of the expected bank account in finite time.
The Black–Karasinski model (1991), which is lognormal, has $d\ln(r)=[\theta _{t}-\phi _{t}\ln(r)]\,dt+\sigma _{t}\,dW_{t}$ .^[16] The model may be seen as the lognormal application of Hull–White;^[17] its lattice-based implementation is similarly trinomial (binomial requiring varying time-steps).^[6] The model has no closed-form solutions, and even basic calibration to the initial term structure has to be done with numerical methods to generate the zero-coupon bond prices. This model too suffers from the explosion of the expected bank account in finite time.
The Kalotay–Williams–Fabozzi model (1993) has the short rate as $d\ln(r_{t})=\theta _{t}\,dt+\sigma \,dW_{t}$ , a lognormal analogue to the Ho–Lee model, and a special case of the Black–Derman–Toy model.^[18] This approach is effectively similar to "the original Salomon Brothers model" (1987),^[19] also a lognormal variant on Ho–Lee.^[20]
The CIR++ model, introduced and studied in detail by Brigo and Mercurio^[5] in 2001 and formulated earlier by Scott (1995),^[21] uses the CIR model but, instead of introducing time-dependent parameters in the dynamics, adds an external shift. The model is formulated as $dx_{t}=a(b-x_{t})\,dt+{\sqrt {x_{t}}}\,\sigma \,dW_{t},\ \ r_{t}=x_{t}+\phi (t)$ where $\phi$ is a deterministic shift. The shift can be used to absorb the market term structure and make the model fully consistent with it. This model preserves the analytical tractability of the basic CIR model, allowing for closed-form solutions for bonds and all linear products, and for options such as caps, floors and swaptions through Jamshidian's trick. The model maintains positive rates if the shift is constrained to be positive, and allows for negative rates if the shift is allowed to go negative. It has often been applied in credit risk too, for credit default swaps and swaptions, in this original version or with jumps.^[22]
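
To make concrete how a time-dependent drift absorbs the initial term structure, the following sketch works through the simplest exogenous model in the list, Ho–Lee. It relies on two standard results for that model, stated here as assumptions of the example: the drift that fits a market forward curve $f^{M}(0,t)$ is $\theta _{t}=\partial f^{M}(0,t)/\partial t+\sigma ^{2}t$ , and the model bond price is $P(0,T)=\exp(-r_{0}T-\int _{0}^{T}(T-s)\theta _{s}\,ds+\sigma ^{2}T^{3}/6)$ . The market forward curve used is hypothetical.

```python
import math

SIGMA = 0.01  # illustrative volatility (assumption)


def market_forward(t):
    """Hypothetical market instantaneous forward curve f^M(0,t)."""
    return 0.02 + 0.01 * (1.0 - math.exp(-0.3 * t))


def market_bond(T):
    """P^M(0,T) = exp(-int_0^T f^M(0,s) ds), integrated in closed form."""
    return math.exp(-(0.03 * T - 0.01 * (1.0 - math.exp(-0.3 * T)) / 0.3))


def ho_lee_theta(t, h=1e-4):
    """theta_t = d f^M(0,t)/dt + sigma^2 t, slope by central difference."""
    slope = (market_forward(t + h) - market_forward(t - h)) / (2.0 * h)
    return slope + SIGMA**2 * t


def ho_lee_bond(T, n=2000):
    """Ho-Lee model price of the zero-coupon bond maturing at T."""
    r0 = market_forward(0.0)  # the short rate today is f^M(0,0)
    ds = T / n
    # Midpoint rule for int_0^T (T - s) * theta_s ds
    weighted = sum((T - (i + 0.5) * ds) * ho_lee_theta((i + 0.5) * ds) for i in range(n)) * ds
    return math.exp(-r0 * T - weighted + SIGMA**2 * T**3 / 6.0)


model_price = ho_lee_bond(5.0)
market_price = market_bond(5.0)
print(f"model {model_price:.6f} vs market {market_price:.6f}")
```

With this choice of $\theta _{t}$ the model price agrees with the market price up to discretization error, which is what exact calibration to the initial term structure means.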

The idea of a deterministic shift can also be applied to other models that have desirable properties in their endogenous form. For example, one could apply the shift $\phi$ to the Vasicek model, but because the Ornstein–Uhlenbeck process is linear, this is equivalent to making $b$ a time-dependent function, and the result would thus coincide with the Hull–White model.^[5]
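
A short sketch of the deterministic-shift idea, applied to the Vasicek model for simplicity: with $r_{t}=x_{t}+\phi (t)$ , the model bond price is $\exp(-\int _{0}^{T}\phi (s)\,ds)\,P^{x}(0,T)$ , so matching a market curve requires $\phi (t)=f^{M}(0,t)-f^{x}(0,t)$ , where $f^{x}$ is the forward rate implied by the unshifted process. The market curve, parameter values and function names below are hypothetical.

```python
import math

# Illustrative parameters of the unshifted Vasicek factor x (assumptions).
A, B_MEAN, SIGMA, X0 = 0.5, 0.04, 0.01, 0.025


def market_forward(t):
    """Hypothetical market instantaneous forward curve f^M(0,t)."""
    return 0.02 + 0.01 * (1.0 - math.exp(-0.3 * t))


def market_bond(T):
    """P^M(0,T) = exp(-int_0^T f^M(0,s) ds), integrated in closed form."""
    return math.exp(-(0.03 * T - 0.01 * (1.0 - math.exp(-0.3 * T)) / 0.3))


def vasicek_forward(t):
    """Forward rate f^x(0,t) implied by the unshifted Vasicek factor."""
    decay = math.exp(-A * t)
    return X0 * decay + B_MEAN * (1.0 - decay) - SIGMA**2 / (2.0 * A**2) * (1.0 - decay) ** 2


def vasicek_bond(T):
    """Closed-form bond price P^x(0,T) of the unshifted Vasicek factor."""
    B = (1.0 - math.exp(-A * T)) / A
    ln_a = (B_MEAN - SIGMA**2 / (2.0 * A**2)) * (B - T) - SIGMA**2 * B**2 / (4.0 * A)
    return math.exp(ln_a - B * X0)


def phi(t):
    """Deterministic shift that absorbs the market term structure."""
    return market_forward(t) - vasicek_forward(t)


def shifted_model_bond(T, n=2000):
    """Bond price in the shifted model: exp(-int_0^T phi) * P^x(0,T)."""
    ds = T / n
    integral_phi = sum(phi((i + 0.5) * ds) for i in range(n)) * ds  # midpoint rule
    return math.exp(-integral_phi) * vasicek_bond(T)


shifted_price = shifted_model_bond(5.0)
print(f"shifted model {shifted_price:.6f} vs market {market_bond(5.0):.6f}")
```

The same construction with the CIR bond price and forward rate in place of the Vasicek ones would give the shift of the CIR++ model.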

Multi-factor short-rate models

Besides the above one-factor models, there are also multi-factor models of the short rate; the best known are the Longstaff and Schwartz two-factor model and the Chen three-factor model (also called the "stochastic mean and stochastic volatility model"). For the purposes of risk management, "to create realistic interest rate simulations", these multi-factor short-rate models are sometimes preferred over one-factor models, as they produce scenarios that are, in general, better "consistent with actual yield curve movements".^[23]

The Longstaff–Schwartz model (1992)^[24] supposes the short rate dynamics are given by

{\begin{aligned}dX_{t}&=(a_{t}-bX_{t})\,dt+{\sqrt {X_{t}}}\,c_{t}\,dW_{1t},\\[3pt]dY_{t}&=(d_{t}-eY_{t})\,dt+{\sqrt {Y_{t}}}\,f_{t}\,dW_{2t},\end{aligned}}

where the short rate is defined as

dr_{t}=(\mu X+\theta Y)\,dt+\sigma _{t}{\sqrt {Y}}\,dW_{3t}.

The Chen model (1996),^[25] which has a stochastic mean and volatility of the short rate, is given by

{\begin{aligned}dr_{t}&=(\theta _{t}-\alpha _{t})\,dt+{\sqrt {r_{t}}}\,\sigma _{t}\,dW_{t},\\[3pt]d\alpha _{t}&=(\zeta _{t}-\alpha _{t})\,dt+{\sqrt {\alpha _{t}}}\,\sigma _{t}\,dW_{t},\\[3pt]d\sigma _{t}&=(\beta _{t}-\sigma _{t})\,dt+{\sqrt {\sigma _{t}}}\,\eta _{t}\,dW_{t}.\end{aligned}}

The two-factor Hull–White and G2++ models have been used because of their tractability. They are summarized and shown to be equivalent in Brigo and Mercurio (2006). The G2++ model obtains the short rate by adding two possibly correlated Ornstein–Uhlenbeck (Vasicek) processes plus a shift. It allows for exact calibration of the term structure, semi-closed-form solutions for options, control of the volatility term structure of instantaneous forward rates through the correlation parameter, and, notably, negative rates, which became important once rates turned negative in financial markets.^[26]
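
The construction just described can be sketched as follows, assuming the usual G2++ parametrization $r_{t}=x_{t}+y_{t}+\phi (t)$ with $dx_{t}=-ax_{t}\,dt+\sigma \,dW_{1t}$ , $dy_{t}=-by_{t}\,dt+\eta \,dW_{2t}$ and correlation $\rho$ between the two Brownian motions. The Euler discretization, the constant value used for the shift at $T$ and all parameter values are assumptions made for illustration. The sample variance of $r_{T}$ is compared with the analytic variance of the sum of two correlated Ornstein–Uhlenbeck processes.

```python
import math
import random
import statistics


def simulate_g2pp_terminal(a, sigma, b, eta, rho, phi_T, T,
                           n_paths=5000, n_steps=50, seed=0):
    """Euler simulation of r_T = x_T + y_T + phi(T), starting at x_0 = y_0 = 0."""
    rng = random.Random(seed)
    dt = T / n_steps
    sqrt_dt = math.sqrt(dt)
    rho_perp = math.sqrt(1.0 - rho**2)
    terminal = []
    for _ in range(n_paths):
        x = y = 0.0
        for _ in range(n_steps):
            z1 = rng.gauss(0.0, 1.0)
            # z2 has unit variance and correlation rho with z1
            z2 = rho * z1 + rho_perp * rng.gauss(0.0, 1.0)
            x += -a * x * dt + sigma * sqrt_dt * z1
            y += -b * y * dt + eta * sqrt_dt * z2
        terminal.append(x + y + phi_T)
    return terminal


def g2pp_variance(a, sigma, b, eta, rho, T):
    """Analytic variance of x_T + y_T for two correlated OU processes."""
    var_x = sigma**2 * (1.0 - math.exp(-2.0 * a * T)) / (2.0 * a)
    var_y = eta**2 * (1.0 - math.exp(-2.0 * b * T)) / (2.0 * b)
    cov_xy = rho * sigma * eta * (1.0 - math.exp(-(a + b) * T)) / (a + b)
    return var_x + var_y + 2.0 * cov_xy


# Illustrative parameter values (assumptions).
args = dict(a=0.8, sigma=0.01, b=0.1, eta=0.008, rho=-0.7, T=1.0)
phi_T = 0.03  # hypothetical value of the deterministic shift at T
sample = simulate_g2pp_terminal(phi_T=phi_T, **args)
sample_var = statistics.variance(sample)
model_var = g2pp_variance(**args)
print(f"sample variance {sample_var:.3e}, analytic {model_var:.3e}")
```

Because both factors are Gaussian and start at zero, the simulated $r_{T}$ is centred on the shift and takes negative values with positive probability, in line with the remark above about negative rates.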

Other interest rate models

The other major framework for interest rate modelling is the Heath–Jarrow–Morton framework (HJM). Unlike the short rate models described above, this class of models is generally non-Markovian. This makes general HJM models computationally intractable for most purposes. The great advantage of HJM models is that they give an analytical description of the entire yield curve, rather than just the short rate. For some purposes (e.g., valuation of mortgage backed securities), this can be a big simplification. The Cox–Ingersoll–Ross and Hull–White models in one or more dimensions can both be straightforwardly expressed in the HJM framework. Other short rate models do not have any simple dual HJM representation.

The HJM framework with multiple sources of randomness, including as it does the Brace–Gatarek–Musiela model and market models, is often preferred for models of higher dimension.

Models based on Fischer Black's shadow rate are used when interest rates approach the zero lower bound.

References

[1] Short rate models, Prof. Andrew Lesniewski, NYU
[2] Brigo, Damiano; Mercurio, Fabio (2006). Interest rate models: theory and practice. Springer Finance. Heidelberg: Springer-Verlag. doi:10.1007/978-3-540-34604-3. ISBN 978-3-540-22149-4.
[3] An Overview of Interest-Rate Option Models (archived 2012-04-06 at the Wayback Machine), Prof. Farshid Jamshidian, University of Twente
[4] Continuous-Time Short Rate Models (archived 2012-01-23 at the Wayback Machine), Prof. Martin Haugh, Columbia University
[5] Brigo, D. and Mercurio, F. (2001). A deterministic–shift extension of analytically–tractable and time–homogeneous short–rate models. Finance and Stochastics 5, 369–387. https://doi.org/10.1007/PL00013541
[6] Binomial Term Structure Models, Mathematica in Education and Research, Vol. 7 No. 3 1998. Simon Benninga and Zvi Wiener.
[7] Merton, Robert C. (1973). "Theory of Rational Option Pricing". Bell Journal of Economics and Management Science. 4 (1): 141–183. doi:10.2307/3003143. hdl:1721.1/49331. JSTOR 3003143.
[8] Vasicek, Oldrich (1977). "An Equilibrium Characterisation of the Term Structure". Journal of Financial Economics. 5 (2): 177–188. CiteSeerX 10.1.1.456.1407. doi:10.1016/0304-405X(77)90016-2.
[9] Rendleman, R.; Bartter, B. (1980). "The Pricing of Options on Debt Securities". Journal of Financial and Quantitative Analysis. 15 (1): 11–24. doi:10.2307/2979016. JSTOR 2979016. S2CID 154495945.
[10] Dothan, L.U. (1978). On the term structure of interest rates. Jour. of Fin. Ec., 6:59–69
[11] Cox, J.C., J.E. Ingersoll and S.A. Ross (1985). "A Theory of the Term Structure of Interest Rates". Econometrica. 53 (2): 385–407. doi:10.2307/1911242. JSTOR 1911242.
[12] T.S.Y. Ho and S.B. Lee (1986). "Term structure movements and pricing interest rate contingent claims". Journal of Finance. 41 (5): 1011–1029. doi:10.2307/2328161. JSTOR 2328161.
[13] John Hull; Alan White (1990). "Pricing interest-rate derivative securities". Review of Financial Studies. 3 (4): 573–592. doi:10.1093/rfs/3.4.573.
[14] Markus Leippold; Zvi Wiener (2004). "Efficient Calibration of Trinomial Trees for One-Factor Short Rate Models" (PDF). Review of Derivatives Research. 7 (3): 213–239. CiteSeerX 10.1.1.203.4729. doi:10.1007/s11147-004-4810-8.
[15] Black, F.; Derman, E.; Toy, W. (1990). "A One-Factor Model of Interest Rates and Its Application to Treasury Bond Options" (PDF). Financial Analysts Journal: 24–32. Archived from the original (PDF) on 2008-09-10.
[16] Black, F.; Karasinski, P. (1991). "Bond and Option pricing when Short rates are Lognormal". Financial Analysts Journal. 47 (4): 52–59. doi:10.2469/faj.v47.n4.52.
[17] Short Rate Models, Professor Ser-Huang Poon, Manchester Business School
[18] Kalotay, Andrew J.; Williams, George O.; Fabozzi, Frank J. (1993). "A Model for Valuing Bonds and Embedded Options". Financial Analysts Journal. 49 (3): 35–46. doi:10.2469/faj.v49.n3.35.
[19] Kopprasch, Robert (1987). "Effective duration of callable bonds: the Salomon Brothers term structure-based option pricing model". Salomon Bros. OCLC 16187107.
[20] See pg 218 in Tuckman, Bruce & Angel Serrat (2011). Fixed Income Securities: Tools for Today's Markets. Hoboken, NJ: Wiley. ISBN 978-0-470-89169-8.
[21] Scott, L. (1995). The valuation of interest rate derivatives in a multi-factor term-structure model with deterministic components. University of Georgia. Working paper.
[22] Brigo, D. and El-Bachir, N. (2010). An exact formula for default swaptions pricing in the SSRJD stochastic intensity model. Mathematical Finance. July 2010, pp. 365-382, https://doi.org/10.1111/j.1467-9965.2010.00401.x
[23] Pitfalls in Asset and Liability Management: One Factor Term Structure Models (archived 2012-04-03 at the Wayback Machine), Dr. Donald R. van Deventer, Kamakura Corporation
[24] Longstaff, F.A.; Schwartz, E.S. (1992). "Interest Rate Volatility and the Term Structure: A Two-Factor General Equilibrium Model" (PDF). Journal of Finance. 47 (4): 1259–82. doi:10.1111/j.1540-6261.1992.tb04657.x.
[25] Lin Chen (1996). "Stochastic Mean and Stochastic Volatility — A Three-Factor Model of the Term Structure of Interest Rates and Its Application to the Pricing of Interest Rate Derivatives". Financial Markets, Institutions & Instruments. 5: 1–88.
[26] Giacomo Burro, Pier Giuseppe Giribone, Simone Ligato, Martina Mulas, and Francesca Querci (2017). Negative interest rates effects on option pricing: Back to basics? International Journal of Financial Engineering 4(2), https://doi.org/10.1142/S2424786317500347
