This is a post forecast aggregation. Extremization in particular. If you’re unfamiliar with forecast aggregation, see e.g. this post.
Suppose we have \(n\) equally skilled and knowledgeable forecasters with incomplete information, intending to forecast a binary event. The available information is completely encoded in \(\mu\in\mathbb{R}\), and the probability of the event occurring is \(p=F(\mu)\), where \(F\) is the cumulative distribution of the logistic function.
The forecasters do not load completely on the available information. In other words, they are not aware of all the available information at one point in time – or are not able to effectively compute what to do with it. The forecasters have, however, several sources of idiosyncratic and incorrect sources of information they load on.
Summing these idiosyncratic sources of information together with an error term, we can treat them as a single source of error, denoted \(s\).
Let’s model the situation using a logistic model. \[ \begin{eqnarray*} z_{i} & \sim & \text{Logis}(\lambda\mu,s),\\ z_{i} & = & \log\left(\frac{p_{i}}{1-p_{i}}\right). \end{eqnarray*} \] Here \(\log(p/(1-p))\) is the quantile function of the logistic function, hence we assume that \(p_{i}=F(z_{i}),\) where \(F\) is the cumulative distribution function of the logistic distribution. I used the same transform to go from \(\mu\) to \(p\).
Since the logistic function is symmetric, we have \(EZ_{i}=\lambda\mu\). It follows that the available information equals \(\mu=EZ_{i}/\lambda\). A natural estimator of \(EZ_{i}\) is \[ \frac{1}{n}\sum\log\left(\frac{p_{i}}{1-p_{i}}\right), \] and a natural estimator of \(\mu\) is \[ \hat{\mu}=\frac{1}{\lambda}\frac{1}{n}\sum\log\left(\frac{p_{i}}{1-p_{i}}\right). \] This is an extremizing estimator of the log-odds provided \(0<\lambda<1\), which is very reasonable. Moreover, if \(\lambda=1/\sqrt{3}\approx0.58,\) the estimator approximates the extremizing estimator derived by Neyman and Roughgarden, discussed by Jamie Sevilla here. My argument is similar to the one of Satopää et al., 2014.
Comments