AI ALIGNMENT FORUM

Goal-Directedness

Edited by Jon Garcia, adamShimi, et al. Last updated 30th Dec 2024.

Goal-directedness is the property of a system that aims at some goal. The concept still lacks a formalization, but it may prove important in deciding which kinds of AI to try to align.

A goal may be defined as a world-state that an agent tries to achieve. Goal-directed agents may generate internal representations of desired end states, compare them against their internal representation of the current state of the world, and formulate plans for navigating from the latter to the former.

The goal-generating function may be a pre-programmed lookup table (for simple worlds), may come from directly inverting the agent's utility function (for simple utility functions), or may be learned from experience by mapping states to rewards and predicting which states will produce the largest rewards. The plan-generating algorithm could range from shortest-path algorithms like A* or Dijkstra's algorithm (for fully representable world graphs), to policy functions that learn through RL which actions bring the current state closer to the goal state (for simple AI), to some combination or extrapolation of these (for more advanced AI).
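As a rough illustration of explicit goal-directedness, the sketch below pairs a goal-generating step with a plan-generating step. It assumes a world small enough to write down as a weighted graph and a utility function simple enough to maximize by enumeration. The names `pick_goal`, `plan`, `WORLD`, and `UTILITY` are hypothetical and chosen only for this example; the planner is a plain Dijkstra search, one of the options mentioned above, and is not meant as a definitive design.

```python
import heapq

def pick_goal(states, utility):
    """Goal generation for a simple utility function: pick the highest-utility state."""
    return max(states, key=utility)

def plan(graph, start, goal):
    """Dijkstra's algorithm: cheapest list of states from start to goal, or None."""
    frontier = [(0, start, [start])]   # (cost so far, state, path taken)
    best = {start: 0}                  # cheapest known cost to reach each state
    while frontier:
        cost, state, path = heapq.heappop(frontier)
        if state == goal:
            return path
        if cost > best.get(state, float("inf")):
            continue                   # stale queue entry, a cheaper route was found
        for nxt, step_cost in graph.get(state, {}).items():
            new_cost = cost + step_cost
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                heapq.heappush(frontier, (new_cost, nxt, path + [nxt]))
    return None                        # goal is unreachable from start

# Hypothetical toy world: state -> {neighbor: cost of moving there}.
WORLD = {
    "start": {"hall": 1, "yard": 4},
    "hall": {"kitchen": 1, "yard": 1},
    "yard": {"garden": 1},
    "kitchen": {"garden": 5},
    "garden": {},
}
# Hypothetical utility assigned to each state.
UTILITY = {"start": 0, "hall": 1, "yard": 2, "kitchen": 3, "garden": 10}

goal = pick_goal(WORLD, UTILITY.get)   # desired end state
route = plan(WORLD, "start", goal)     # plan from the current state to the goal
print(goal, route)
```

Here the agent holds an explicit representation of the desired state (`goal`) and compares it against the current state while searching, which is what distinguishes this case from the implicit one.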

Implicit goal-directedness may come about in agents that do not have explicit internal representations of goals but that nevertheless learn or enact policies that cause the environment to converge on a certain state or set of states. Such implicit goal-directedness may arise, for instance, in simple reinforcement learning agents, which learn a policy function π:S→A that maps states directly to actions.
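A minimal sketch of the implicit case, assuming a hypothetical five-state line world and tabular Q-learning with uniformly random exploration. All names and hyperparameters (`step`, `train`, `rollout`, `alpha`, `gamma`, the episode count) are illustrative assumptions. The point is that the learned policy is only a table from states to actions, with no stored representation of a goal state, yet following it drives the environment to the same state from every starting point.

```python
import random

N_STATES = 5          # states 0..4 on a line (hypothetical toy environment)
ACTIONS = (-1, +1)    # step left or step right
REWARD_STATE = 4      # entering this state gives reward 1 and ends the episode

def step(state, action):
    """Deterministic transition: move along the line, clipped at both ends."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if nxt == REWARD_STATE else 0.0
    return nxt, reward, nxt == REWARD_STATE

def train(episodes=2000, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning; returns the greedy policy as a state -> action table."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        state = rng.randrange(N_STATES - 1)      # start anywhere but the reward state
        for _ in range(20):                      # cap on episode length
            action = rng.choice(ACTIONS)         # purely random exploration
            nxt, reward, done = step(state, action)
            future = 0.0 if done else gamma * max(q[(nxt, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (reward + future - q[(state, action)])
            if done:
                break
            state = nxt
    # The policy pi: S -> A maps states directly to actions.
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES - 1)}

def rollout(policy, state, max_steps=10):
    """Follow the policy from a state and record every state visited."""
    visited = [state]
    while state in policy and len(visited) <= max_steps:
        state, _, _ = step(state, policy[state])
        visited.append(state)
    return visited

policy = train()
trajectories = {s: rollout(policy, s) for s in policy}
print(policy)
print(trajectories)
```

Nothing in `policy` names state 4 as a target; the convergence is visible only in the behavior recorded in `trajectories`.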

Posts tagged Goal-Directedness
34 · Literature Review on Goal-Directedness · adamShimi, Michele Campolo, Joe Collman · 5y · 21
52 · Coherence arguments do not entail goal-directed behavior · Rohin Shah · 7y · 50
11 · Behavioral Sufficient Statistics for Goal-Directedness · adamShimi · 4y · 12
25 · AI safety without goal-directed behavior · Rohin Shah · 7y · 6
26 · Will humans build goal-directed agents? · Rohin Shah · 7y · 22
28 · Measuring Coherence of Policies in Toy Environments · dx26, Richard_Ngo · 1y · 0
20 · Intuitions about goal-directed behavior · Rohin Shah · 7y · 5
13 · Deliberation Everywhere: Simple Examples · Oliver Sourbut · 3y · 0
9 · Goal-directed = Model-based RL? · adamShimi · 6y · 0
10 · Focus: you are allowed to be bad at accomplishing your goals · adamShimi · 5y · 9
11 · Locality of goals · adamShimi · 5y · 7
7 · Goals and short descriptions · Michele Campolo · 5y · 8
4 · Goal-Directedness: What Success Looks Like · adamShimi · 5y · 0
3 · Goal-directedness is behavioral, not structural · adamShimi · 5y · 12
51 · Searching for Search · NicholasKees, janus · 3y · 0