1. When true distribution is outside the model
Singular learning theory currently assumes that the true distribution lies inside the family of models under consideration. One problem is to extend, or at least understand, the theory when this assumption fails.
Problem 1.1.
Suppose the true probability distribution $q$ of a data-generating process lies outside the family of models $\mathcal{M}$ under consideration. Under what conditions does the posterior distribution concentrate on the distribution in $\mathcal{M}$ with the smallest KL divergence to $q$?
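For a parametrized family $\mathcal{M} = \{p_\theta\}$ (a standard framing in the misspecified setting; the parametrization itself is an assumption not stated in the problem), the natural candidate limit is the pseudo-true parameter minimizing the KL divergence to $q$:

```latex
\theta^{*} \;=\; \operatorname*{arg\,min}_{\theta}\; \mathrm{KL}\bigl(q \,\|\, p_\theta\bigr)
        \;=\; \operatorname*{arg\,min}_{\theta}\; \mathbb{E}_{X \sim q}\!\left[\log \frac{q(X)}{p_\theta(X)}\right].
```

The problem then asks when the posterior concentrates around $p_{\theta^*}$, and this minimizer need not be unique in singular models.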
Problem 1.2.
[Russell] Understand the asymptotics of the stochastic complexity when the true distribution is not in any model under consideration.
Problem 1.3.
As a concrete example, suppose the data is generated from a log-normal distribution, and suppose we try to fit two model classes: (a) a normal distribution, and (b) a gamma distribution. Understand the asymptotic behavior in this case.
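A minimal numerical sketch of this setup (the specific log-normal parameters, sample size, and seed are illustrative choices, not part of the problem): draw data from a log-normal distribution and fit both misspecified model classes by maximum likelihood. The class with the higher average log-likelihood is the one whose KL-closest member is nearer to the true distribution $q$.

```python
import numpy as np
from scipy import stats

# True distribution q: log-normal (parameters chosen for illustration).
rng = np.random.default_rng(0)
data = rng.lognormal(mean=0.0, sigma=1.0, size=5000)

# (a) Normal model class: MLE is the sample mean and standard deviation.
mu_hat, sd_hat = data.mean(), data.std()
ll_normal = stats.norm.logpdf(data, loc=mu_hat, scale=sd_hat).sum()

# (b) Gamma model class: MLE of shape and scale, location fixed at 0.
shape_hat, _, scale_hat = stats.gamma.fit(data, floc=0)
ll_gamma = stats.gamma.logpdf(data, shape_hat, loc=0, scale=scale_hat).sum()

# (1/n) * log-likelihood estimates -E_q[log p_theta] up to the constant
# entropy of q, so comparing the two fits compares KL divergences to q.
print(f"normal: {ll_normal:.1f}  gamma: {ll_gamma:.1f}")
```

Since the log-normal is positive and right-skewed, the gamma family (also positive and skewed) fits markedly better than the symmetric normal here; neither class contains $q$, which is exactly the regime Problems 1.1 and 1.2 ask about.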
Problem 1.4.
[Yongli Zhang] As another example, suppose the true model is the regression $Y = \exp(X) + \epsilon$, where $\epsilon \sim N(\mu,\sigma^2)$, and suppose we try to fit a linear regression model $y = bx + \epsilon$ to the data.
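A small simulation of this example (the choices $X \sim N(0,1)$, $\mu = 0$, a small noise scale, and a no-intercept fit are assumptions added for illustration, not part of the problem statement): generate data from $Y = \exp(X) + \epsilon$ and fit the misspecified linear model by least squares.

```python
import numpy as np

# True process: Y = exp(X) + eps; misspecified fit: y = b*x (no intercept).
# X ~ N(0,1) and the noise scale are illustrative assumptions.
rng = np.random.default_rng(1)
n = 20_000
x = rng.normal(size=n)
y = np.exp(x) + rng.normal(scale=0.1, size=n)

# Least-squares slope for the no-intercept model: b = sum(x*y) / sum(x*x).
b_hat = (x * y).sum() / (x * x).sum()

# Under X ~ N(0,1), the pseudo-true slope is
#   b* = E[X exp(X)] / E[X^2] = e^{1/2},
# the b minimizing E[(Y - bX)^2], i.e. the KL-closest linear model.
print(b_hat, np.exp(0.5))
```

The fitted slope converges to the pseudo-true value $b^* = e^{1/2}$ rather than to any "true" parameter, since no member of the linear family matches the data-generating process.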
Cite this as: AimPL: Singular learning theory, available at http://aimpl.org/singularlearning.