Related to infraBook Club I: Corrigibility is bad ashkually
One of my old blog posts I never wrote (I did not even list it in a "posts I will never write" document) is one about how corrigibility is anti-correlated with goal security.
Something like: If you build an AI that doesn't resist someone trying to change its goals, it will also not try to stop bad actors from changing its goals. (I don't think this particular worry applies to Paul's version of corrigibility, but this blog post idea was from before I learned about his definition.)
I might steal the exorcism metaphor for the post I probably will write about the complexity prior.
This post was written for the first Refine blog post day, at the end of a week of readings, discussions, and exercises about epistemology for doing good conceptual research.
(With courtesy to Adam Shimi, who suggested the title and idea.)
Rationality, Probability, Uncertainty, Reasoning
Foundations of Reasoning
Vibes of Mathematics
Life, Complexity, Optimisation, Entropy, Death & Decay
infraBook Club
Miscellaneous
Predicative mathematics is an approach to the foundations of mathematics that rejects 'impredicative' definitions. Roughly speaking, you can think of predicative mathematics as rejecting the powerset axiom.
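To make "impredicative" a bit more concrete, here is a standard textbook-style example, offered as an illustrative sketch rather than as the definition this tag has in mind. The set A and the property φ are hypothetical placeholders.

```latex
% Illustrative only: a textbook-style impredicative definition.
% A is any set and \varphi is any property of subsets of A (both hypothetical).
\[
  S \;=\; \bigcap \,\{\, X \in \mathcal{P}(A) \;:\; \varphi(X) \,\}
\]
% S is itself a subset of A, so S \in \mathcal{P}(A): the definition of S
% quantifies over a totality that already contains S.
```

The definition picks out S by ranging over all subsets of A, and S is one of those subsets. The powerset is what supplies that totality, which is roughly why rejecting the powerset axiom blocks definitions of this shape.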