Do-What-I-Mean hierarchy

by Eliezer Yudkowsky Mar 23 2016 updated Jun 6 2016

Successive levels of "Do What I Mean" or AGIs that understand their users increasingly well

[summary: "Do What I Mean" or "DWIM" refers to the degree to which an AGI can rapidly identify an intended goal and find a safe plan to it, based on the AI's understanding of what the user means or wants.

Levels of DWIM-ness could range over:

Do-What-I-Mean refers to an aligned AGI's ability to produce better-aligned plans, based on an explicit model of what the user wants or believes.

Successive levels of DWIM-ness:

Risks from pushing toward higher levels of DWIM might include: