Model

read about the model different ways: High-Level, no math (this page) | More details, minimal math | All math

What does the model have to do?

The model has to generate a probability distribution for the price of the underlying at expiration. With that distribution, calculating expected return and the variance of the return are straightforward.

What am I modeling?

In short, I am making a prediction about people. Participants in this sort of market are all essentially tasked with guessing at what price the other participants are willing to transact and where that price will be in the future.

Many factors may influence this guess (company fundamentals, news, performance of the broader market, etc.), but one stands out to me: price history. This information is available to any participant, it incorporates the influence of many other factors, and I believe, it is what most participants are actaully using to make their guesses. Here’s the argument that convinced me: first, people’s guesses are biased by memorable and readily available information. We can’t escape lingering pereptions that shares may continue to fall after a rough week, and it’s dificult to resist finding patterns in a time series (pattern finding is human nature, after all). Next, price history is the most reported and most memorable metric for goods that are traded. If you don’t believe that last point, here’s a rhetorical question: which do you find easier to recall for AAPL stock, P/E or share price? How about the P/E or share price 1 year ago? 10 years ago? Finally, share price is inseparable from other predictors. Big swings in share price influence news, which can lead to price moves, which in turn spawn more news, etc. Strong share price performance can mean favorable financing or new growth initiatives, directly impacting fundamentals. So, even if pariticipants use other metrics, they are (at least implicitly) making their guesses using the share price history.

With my model, I try to answer the following question: given the (recent) history of share price, what is the probability distribution for the share price at some point in the future? Since I believe price is essentially driven by people’s guesses about other people’s guesses (and so on…), I am trying to model the perceived distribution of participants in price space based on price history—guessing other people’s guesses!—and then predict how that distribution will evolve.

Model Assumptions

Explicit assumptions are as follows:

An implicit assumption of my approach is that prices are not memoryless. This is unconventional (more often, prices are modeled by Martingales), but I believe it aligns more closely with how people actually react to price changes.

Training

Given a functional form for the distribution P(X|\theta, data) with parameters \theta, I can construct a training dataset, then find \theta that maximizes the likelihood of observing that data. The training dataset comprises price observations over a window leading the prediction day (“inputs” to the model, along with \theta), as well as observations lagging the prediction day (think of these as observed “outputs” of the model). I used MCMC methods to model the posterior P(\theta|X, data), and selected the most likely \theta.

More details

There is a lot more to say about the assumptions I’ve made and how they lead to the form of the distribution P(X|\theta, data).
See how deep the rabbit hole goes…

How does the model perform?

See the (live) performance of the model

Model performance