When describing the failure mode, you have the approximate-Solomonoff agent try to predict s_2 (by assigning approximately uniform probability, since it can't invert the hash in O(n^2) time), and then plug that distribution into the reward function to compute the "expected reward" of action 1 (which comes out very small, since only one s_2 can be correct).
However, this problem would be circumvented if the agent did things the opposite way - first try to predict the reward (by assigning probability 0.9 to reward 1 under action 1, based on the frequency data), and only then try to invert the hash and fail.
There might be a more general rule here: orderings of prediction that fail later rather than earlier give better results. Or: the more approximate an answer is, the less precedence it should have in the final prediction.
This is still a bit unsatisfying - it's not abstract reasoning (at least not obviously) - but I think the equivalent would look more like abstract reasoning if the underlying predictor had to use a smarter search over a smaller hypothesis space than Solomonoff induction does.
I think the issue presented in the post is that the Solomonoff hypothesis cannot be sampled from, even though its probability density can be computed. If we try to compute the expected reward of our action by sampling, we run into the curse of dimensionality: a single point contributes most of the reward. A Solomonoff inductor would correctly find a probability density function under which h(s_2) = s_1 holds with high probability.
However, I think that if we ask the Solomonoff predictor to predict the reward directly, then it will correctly arrive at a model that predicts the rewards. So we can fix the presented agent.
Yes, I think part of the issue is the gap between being able to sample s_2 given s_1 and being able to evaluate the density of s_2 given s_1. However, I am not sure how we should score hypotheses that assign density to future observations (instead of predicting bits one at a time). We will have difficulty computing the probability such hypotheses assign to the observations seen so far.
Predicting the rewards directly seems to fix this issue. I don't know if this solution can be generalized to environments other than betting environments.
Summary: we examine a game involving hash functions and show that an algorithm based on resource-bounded Solomonoff induction is outperformed by a simple strategy. This game has some parallels to Vingean reflection.
Betting environments
Consider the following family of environments (which we might call "betting environments"). In each iteration:

1. The environment samples an observation s_1 and shows it to the agent.
2. The agent takes an action a ∈ {0, 1}.
3. The environment samples a second observation s_2 and shows it to the agent.
4. The agent receives reward r(s_1, s_2, a), where r is a known function of the observations and the action.
The idea of these environments is that the agent's action affects nothing except reward, so acting well in these environments only requires predicting the next reward correctly given the agent's action. The second observation has no apparent effect on what the agent's strategy should be, but we'll see later why it is included. First, we will see how one general approach to problems like this fails.
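To make the protocol concrete, here is a minimal Python sketch of the interaction loop, assuming the structure described above; the class and method names (`sample_s1`, `sample_s2`, `reward`, and the agent's `act`/`observe` interface) are illustrative, not part of the original formalism.

```python
class BettingEnvironment:
    """Abstract interaction protocol for a betting environment.

    One iteration: the agent observes s1, takes an action, observes s2,
    and receives reward r(s1, s2, action). The action affects only the reward.
    """

    def sample_s1(self):
        raise NotImplementedError

    def sample_s2(self, s1):
        raise NotImplementedError

    def reward(self, s1, s2, action):
        raise NotImplementedError


def run_episode(env, agent, n_iterations):
    """Run the betting-environment protocol; the agent's act/observe interface is assumed."""
    total = 0.0
    for _ in range(n_iterations):
        s1 = env.sample_s1()
        action = agent.act(s1)
        s2 = env.sample_s2(s1)
        reward = env.reward(s1, s2, action)
        agent.observe(action, s2, reward)
        total += reward
    return total
```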
A resource-bounded algorithm for betting environments
Consider the following resource-bounded variant of Solomonoff induction. Each hypothesis is represented as a program q that takes as input a bit string and returns a probability (interpreted as the probability that the next bit will be 1). As in ordinary Solomonoff induction, the prior probability of a hypothesis q is 2^{-length(q)}. The probability that hypothesis q assigns to an observation history b_1 b_2 ... b_n is ∏_{i=1}^{n} (if b_i = 1 then q(b_{1:i-1}) else 1 - q(b_{1:i-1})), i.e. the product of the conditional probabilities of each bit given the previous bits. We restrict attention to programs q that run in time O(n^2), where n is the length of the input.
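As a concreteness check, here is a small Python sketch of how a hypothesis would be scored under this variant; the function and argument names are my own, and the O(n^2) resource bound is not enforced here.

```python
import math

def log_score(q, bits, program_length):
    """Log of (prior x likelihood) for a hypothesis q on a bit string.

    q: a callable mapping a prefix (list of 0/1 bits) to P(next bit = 1),
       standing in for the program.
    program_length: length(q) in bits, giving the 2^{-length(q)} prior.
    Assumes q never returns exactly 0 or 1.
    """
    log_prior = -program_length * math.log(2)
    log_likelihood = 0.0
    for i, b in enumerate(bits):
        p_one = q(bits[:i])  # conditional probability that bit i is 1
        log_likelihood += math.log(p_one if b == 1 else 1.0 - p_one)
    return log_prior + log_likelihood
```

Hypotheses compete on this prior-times-likelihood score; the "winning" hypotheses discussed below are, roughly, the ones whose score stays competitive as the history grows.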
Using this variant of Solomonoff induction, we can develop an AIXI-like algorithm for acting in betting environments. Specifically, when choosing an action, this algorithm does the following:

1. Condition the resource-bounded Solomonoff mixture on the interaction history so far, including the current observation s_1.
2. For each action a, estimate the expected reward E[r(s_1, s_2, a)] under the mixture's predictive distribution for s_2 (e.g. by sampling s_2 and averaging the rewards).
3. Take the action with the highest estimated expected reward, except that with some probability it takes a random action instead.
The purpose of the random action is to learn the correct distribution for each counterfactual reward r(s_1, s_2, a) in the long run. For example, when choosing the i-th action, we could pick a random action with probability 1/(i+1). This way, the rate of exploration approaches 0 over time, but the total number of exploration steps approaches infinity.
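Below is a minimal sketch of the action-selection step under my reading of the algorithm above: expected reward for each action is estimated by Monte Carlo sampling of s_2 from the predictor, and exploration happens with probability 1/(i+1). The helper names (`sample_s2`, `r`) are assumptions.

```python
import random

def choose_action(i, s1, sample_s2, r, actions=(0, 1), n_samples=1000):
    """Choose the i-th action in a betting environment.

    sample_s2: draws s_2 from the current posterior-weighted hypothesis,
               conditioned on the history and on s1 (assumed to exist).
    r: the known reward function r(s1, s2, a).
    """
    if random.random() < 1.0 / (i + 1):
        return random.choice(actions)  # exploration step
    estimates = {}
    for a in actions:
        samples = [r(s1, sample_s2(s1), a) for _ in range(n_samples)]
        estimates[a] = sum(samples) / n_samples  # Monte Carlo expected reward
    return max(estimates, key=estimates.get)
```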
Failure in an unpredictable environment
Here's an example of a betting environment that this algorithm fails on. Suppose we have some family of hash functions h_i for each natural number i, such that h_i returns strings of length i. Assume that inverting h_i takes time Ω(2^i). Now consider the following betting environment. On iteration n:

1. With probability 0.9, the environment samples a random string s_2 and sets s_1 = h_n(s_2); otherwise, s_1 and s_2 are sampled independently at random, so that h_n(s_2) ≠ s_1 with overwhelming probability.
2. Action 0 always yields reward 0.5. Action 1 yields reward 1 if h_n(s_2) = s_1, and reward 0 otherwise.
Immediately, it is clear that always choosing action 1 will yield 0.9 expected reward per iteration. This is the optimal strategy for an agent that lacks the computational resources to tell anything about r(s_1, s_2, 1) given s_1. I believe that if h_i is an appropriate family of hash functions, this will be true for all polynomial-time algorithms in the long run.
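For intuition, here is a toy simulation consistent with the description above. Truncated SHA-256 stands in for h_n (a real construction would need stronger inversion-hardness guarantees), and the exact sampling details are my assumptions; always choosing action 1 averages about 0.9 reward per round.

```python
import hashlib
import random

def h(n, s):
    """Stand-in for h_n: the first n bits of SHA-256 of s."""
    digest = hashlib.sha256(s).digest()
    bits = ''.join(f'{byte:08b}' for byte in digest)
    return bits[:n]

def step(n):
    """One round of the hash betting environment (details are assumptions)."""
    s2 = random.getrandbits(128).to_bytes(16, 'big')  # the hidden string
    if random.random() < 0.9:
        s1 = h(n, s2)                                  # hash matches 90% of the time
    else:
        s1 = ''.join(random.choice('01') for _ in range(n))  # almost surely a mismatch
    reward_0 = 0.5                                     # payoff of action 0
    reward_1 = 1.0 if h(n, s2) == s1 else 0.0          # payoff of action 1
    return s1, s2, reward_0, reward_1

rounds = 10000
print(sum(step(32)[3] for _ in range(rounds)) / rounds)  # ~0.9 for "always action 1"
```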
Now let's see how our resource-bounded algorithm does on this problem. After a large number of iterations, any winning hypothesis for our resource-bounded Solomonoff induction variant will satisfy:

- it assigns close to the correct conditional distribution to s_1, given the history so far;
- it assigns close to the correct conditional distribution to the reward, given the history, s_1, s_2, and the action taken.
This is because these conditions state that the hypothesis has close to the correct conditional distributions for s_1 and r. If a hypothesis fails to satisfy one of these properties, we can replace it with a variant that does satisfy the property, yielding a hypothesis that assigns infinitely higher log-probability to the data (in the long run) without being much more complicated.
This only leaves the distribution of s_2 given s_1. Correctly predicting s_2 would require inverting the hash function. Given that the hypothesis must run in polynomial time, if we take many samples of s_2 given s_1 using the hypothesis, we should expect an (exponentially) small fraction of them to satisfy h_n(s_2) = s_1. So any hypothesis will predict that, with high probability, h_n(s_2) ≠ s_1.
Given this, our algorithm will take action 0: under its winning hypotheses, action 0 achieves expected reward 0.5, while action 1 achieves an exponentially small expected reward (even though its true expected reward is 0.9). Obviously, this behavior is highly pathological.
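Continuing the toy simulation above, the failure is easy to reproduce: a polynomial-time hypothesis cannot concentrate its samples on the preimage of s_1, so a sampling-based estimate of action 1's reward comes out near zero even though the true expected reward is 0.9. This reuses `h` and `step` from the earlier sketch; sampling s_2 uniformly at random stands in for sampling from a resource-bounded hypothesis.

```python
s1, _, _, _ = step(32)  # observe s_1 for one round

# Sample s_2 values the way a poly-time hypothesis effectively must:
# without being able to invert h_n, essentially none of them hit the preimage.
samples = [random.getrandbits(128).to_bytes(16, 'big') for _ in range(10000)]
estimated_reward_action_1 = sum(h(32, s2) == s1 for s2 in samples) / len(samples)

print(estimated_reward_action_1)  # ~0.0, so the agent prefers action 0's 0.5
```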
Conclusion
As an alternative to this algorithm, consider the same algorithm except that it completely ignores observation s_2. This new algorithm will correctly predict that, about 90% of the time, action 1 yields reward 1. Therefore, it will take action 1. Intuitively, this seems like the correct behavior in general. Since winning hypotheses will predict reward given the action and s_1 well, subject to resource bounds, they can be used in a straightforward way for estimating expected reward for each action.
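In this particular environment, ignoring s_2 effectively reduces to tracking the empirical reward frequencies for each action. The sketch below is a frequency-based stand-in for the modified inductor, not the inductor itself; the class name and interface are illustrative and match the interaction loop sketched earlier.

```python
import random
from collections import defaultdict

class RewardPredictingAgent:
    """Estimate expected reward per action from observed frequencies, ignoring s_2."""

    def __init__(self, actions=(0, 1)):
        self.actions = actions
        self.reward_sums = defaultdict(float)  # total reward seen per action
        self.counts = defaultdict(int)         # times each action was taken
        self.t = 0

    def act(self, s1):
        """Choose an action; s1 is unused because reward frequencies suffice here."""
        self.t += 1
        untried = [a for a in self.actions if self.counts[a] == 0]
        if untried or random.random() < 1.0 / self.t:
            return random.choice(untried or list(self.actions))  # exploration
        return max(self.actions, key=lambda a: self.reward_sums[a] / self.counts[a])

    def observe(self, action, s2, reward):
        """Update reward statistics; s_2 is deliberately ignored."""
        self.reward_sums[action] += reward
        self.counts[action] += 1
```

Run against the hash betting environment, this agent quickly converges to taking action 1 and so averages about 0.9 reward per round.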
The problem with our original algorithm parallels a problem with naïve approaches to Vingean reflection. If the agent's environment contains a hash function inverter, then it may encounter a situation similar to the betting environment in this post: it will see the hash code of a string before the string itself. Since the agent is unable to predict the behavior of the hash function inverter, it must use abstract reasoning to decide how much utility to expect from each action, rather than actually simulating the hash function inverter. The same reasoning sometimes applies to smart agents other than hash function inverters.
For future research, it will be useful to see how strategies for solving the hash-function betting environment (i.e. predicting an expected reward of 0.9 for action 1) might translate into strategies for Vingean reflection.