{ localUrl: '../page/advanced_safety.html', arbitalUrl: 'https://arbital.com/p/advanced_safety', rawJsonUrl: '../raw/2l.json', likeableId: '1505', likeableType: 'page', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], pageId: 'advanced_safety', edit: '9', editSummary: '', prevEdit: '20', currentEdit: '9', wasPublished: 'true', type: 'wiki', title: 'Advanced safety', clickbait: 'An agent is *really* safe when it has the capacity to do anything, but chooses to do what the programmer wants.', textLength: '3002', alias: 'advanced_safety', externalUrl: '', sortChildrenBy: 'likes', hasVote: 'false', voteType: '', votesAnonymous: 'false', editCreatorId: 'AlexeiAndreev', editCreatedAt: '2015-12-16 06:05:43', pageCreatorId: 'EliezerYudkowsky', pageCreatedAt: '2015-03-26 21:36:28', seeDomainId: '0', editDomainId: 'EliezerYudkowsky', submitToDomainId: '0', isAutosave: 'false', isSnapshot: 'false', isLiveEdit: 'true', isMinorEdit: 'false', indirectTeacher: 'false', todoCount: '4', isEditorComment: 'false', isApprovedComment: 'true', isResolved: 'false', snapshotText: '', anchorContext: '', anchorText: '', anchorOffset: '0', mergedInto: '', isDeleted: 'false', viewCount: '564', text: 'A proposal meant to produce [2v value-aligned agents] is 'advanced-safe' if it succeeds, or fails safely, in [2c scenarios where the AI becomes much smarter than its human developers]. \n\n### Definition\n\nA proposal for a value-alignment methodology, or some aspect of that methodology, is alleged to be 'advanced-safe' if that proposal is claimed robust to scenarios where the agent:\n\n- Knows more or has better probability estimates than us\n- Learns new facts unknown to us\n- Searches a larger strategy space than we can consider\n- Confronts new instrumental problems we didn't foresee in detail\n- Gains power quickly\n- Has access to greater levels of cognitive power than in the regime where it was previously tested\n- Wields strategies [2j that wouldn't make sense to us even if we were told about them in advance]\n\n### Importance\n\nIt seems reasonable to expect that there will be difficulties of dealing with minds smarter than our own, doing things we didn't imagine, that will be qualitatively different from designing a toaster oven to not burn down a house, or from designing an AI system that is dumber than human. This means that the concept of 'advanced safety' will end up importantly different from the concept of robust pre-advanced AI.\n\nConcretely, it has been argued to be [ foreseeable] for several difficulties including e.g. [10f programmer deception] and [47 unforeseen maximums], that they won't materialize before an agent is advanced, or won't materialize in the same way, or won't materialize as severely. This means that practice with dumber-than-human AIs may not train us against these difficulties, requiring a separate theory and mental discipline for making advanced AIs safe.\n\nWe have observed in practice that many proposals for 'AI safety' do not seem to have been thought through against advanced agent scenarios; thus, there seems to be a practical urgency to emphasizing the concept and the difference.\n\nKey problems of advanced safety that are new or qualitatively different compared to pre-advanced AI safety include:\n\n- [2w Edge instantiation]\n- [47 Unforeseen maximums]\n- [6q Context change problems]\n- [10f]\n- [ Programmer maximization]\n- [ Philosophical competence]\n\nNon-advanced-safe methodologies may conceivably be useful if a [ known algorithm nonrecursive agent] can be created that is (a) [2s powerful enough to be relevant] and (b) can be known not to become advanced. Even here there may be grounds for worry that such an agent finds unexpectedly strong strategies in some particular subdomain - that it exhibits flashes of domain-specific advancement that break a non-advanced-safe methodology.\n\n### Omni-safety\n\nAs an extreme case, an 'omni-safe' methodology allegedly remains value-aligned, or fails safely, even if the agent suddenly becomes omniscient and omnipotent (acquires delta probability distributions on all facts of interest and has all describable outcomes available as direct options). See: [2x real-world agents should be omni-safe].', metaText: '', isTextLoaded: 'true', isSubscribedToDiscussion: 'false', isSubscribedToUser: 'false', isSubscribedAsMaintainer: 'false', discussionSubscriberCount: '1', maintainerCount: '1', userSubscriberCount: '0', lastVisit: '2016-02-27 20:49:20', hasDraft: 'false', votes: [], voteSummary: 'null', muVoteSummary: '0', voteScaling: '0', currentUserVote: '-2', voteCount: '0', lockedVoteType: '', maxEditEver: '0', redLinkCount: '0', lockedBy: '', lockedUntil: '', nextPageId: '', prevPageId: '', usedAsMastery: 'false', proposalEditNum: '0', permissions: { edit: { has: 'false', reason: 'You don't have domain permission to edit this page' }, proposeEdit: { has: 'true', reason: '' }, delete: { has: 'false', reason: 'You don't have domain permission to delete this page' }, comment: { has: 'false', reason: 'You can't comment in this domain because you are not a member' }, proposeComment: { has: 'true', reason: '' } }, summaries: {}, creatorIds: [ 'EliezerYudkowsky', 'AlexeiAndreev' ], childIds: [ 'unbounded_analysis', 'AI_safety_mindset', 'daemons', 'nearest_unblocked', 'safe_useless', 'distinguish_advancement', 'goodness_estimate_bias', 'goodharts_curse', 'context_disaster', 'foreseeable_difficulties', 'actual_effectiveness' ], parentIds: [ 'ai_alignment' ], commentIds: [ '75' ], questionIds: [ '38x' ], tagIds: [], relatedIds: [], markIds: [], explanations: [], learnMore: [], requirements: [], subjects: [], lenses: [], lensParentId: '', pathPages: [], learnMoreTaughtMap: {}, learnMoreCoveredMap: {}, learnMoreRequiredMap: {}, editHistory: {}, domainSubmissions: {}, answers: [], answerCount: '0', commentCount: '0', newCommentCount: '0', linkedMarkCount: '0', changeLogs: [ { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '22181', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'newChild', createdAt: '2017-02-22 04:43:26', auxPageId: 'actual_effectiveness', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '20240', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'newChild', createdAt: '2016-10-23 00:49:50', auxPageId: 'goodharts_curse', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '16054', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'newChild', createdAt: '2016-07-07 21:59:09', auxPageId: 'goodness_estimate_bias', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '14162', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'newChild', createdAt: '2016-06-20 22:12:35', auxPageId: 'distinguish_advancement', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '12189', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'deleteChild', createdAt: '2016-06-09 17:37:16', auxPageId: 'advanced_agent', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '11993', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '9', type: 'newChild', createdAt: '2016-06-08 00:41:30', auxPageId: 'safe_useless', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '9134', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'deleteChild', createdAt: '2016-03-27 20:49:17', auxPageId: 'omni_test', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '8877', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '9', type: 'newChild', createdAt: '2016-03-22 01:35:28', auxPageId: 'daemons', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4544', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'deleteChild', createdAt: '2015-12-28 21:13:59', auxPageId: 'probable_environment_hacking', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4539', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'deleteUsedAsTag', createdAt: '2015-12-28 21:04:08', auxPageId: 'distant_SIs', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4537', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '9', type: 'newUsedAsTag', createdAt: '2015-12-28 21:03:51', auxPageId: 'distant_SIs', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4533', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'deleteChild', createdAt: '2015-12-28 21:03:33', auxPageId: 'distant_SIs', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4531', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '9', type: 'newChild', createdAt: '2015-12-28 21:02:49', auxPageId: 'distant_SIs', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4271', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '9', type: 'newChild', createdAt: '2015-12-23 03:49:40', auxPageId: 'AI_safety_mindset', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '4129', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '0', type: 'deleteChild', createdAt: '2015-12-17 23:00:08', auxPageId: 'ontology_identification', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '3889', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '0', type: 'newAlias', createdAt: '2015-12-16 06:05:43', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '3890', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '9', type: 'newEdit', createdAt: '2015-12-16 06:05:43', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '930', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'unbounded_analysis', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '931', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'context_disaster', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '932', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'advanced_agent', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '933', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'nearest_unblocked', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '935', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'foreseeable_difficulties', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '936', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'ontology_identification', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '937', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'omni_test', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '938', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newChild', createdAt: '2015-10-28 03:46:58', auxPageId: 'probable_environment_hacking', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '389', pageId: 'advanced_safety', userId: 'AlexeiAndreev', edit: '1', type: 'newParent', createdAt: '2015-10-28 03:46:51', auxPageId: 'ai_alignment', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2193', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '20', type: 'newEdit', createdAt: '2015-07-16 01:02:11', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2192', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '19', type: 'newEdit', createdAt: '2015-06-08 04:42:02', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2191', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '18', type: 'newEdit', createdAt: '2015-06-07 20:03:00', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2190', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '17', type: 'newEdit', createdAt: '2015-04-04 23:42:45', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2189', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '16', type: 'newEdit', createdAt: '2015-04-04 23:42:19', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2188', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '15', type: 'newEdit', createdAt: '2015-04-04 21:27:36', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2187', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '14', type: 'newEdit', createdAt: '2015-03-27 00:01:59', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2186', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '13', type: 'newEdit', createdAt: '2015-03-26 23:32:19', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2185', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '12', type: 'newEdit', createdAt: '2015-03-26 23:30:01', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2184', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '11', type: 'newEdit', createdAt: '2015-03-26 23:22:19', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2183', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '10', type: 'newEdit', createdAt: '2015-03-26 23:22:04', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2182', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '8', type: 'newEdit', createdAt: '2015-03-26 23:13:35', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2181', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '7', type: 'newEdit', createdAt: '2015-03-26 23:13:15', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2180', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '6', type: 'newEdit', createdAt: '2015-03-26 23:12:48', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2179', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '5', type: 'newEdit', createdAt: '2015-03-26 23:08:08', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2178', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '4', type: 'newEdit', createdAt: '2015-03-26 22:59:17', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2177', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '3', type: 'newEdit', createdAt: '2015-03-26 22:08:28', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2176', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '2', type: 'newEdit', createdAt: '2015-03-26 21:37:12', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' }, { likeableId: '0', likeableType: 'changeLog', myLikeValue: '0', likeCount: '0', dislikeCount: '0', likeScore: '0', individualLikes: [], id: '2175', pageId: 'advanced_safety', userId: 'EliezerYudkowsky', edit: '1', type: 'newEdit', createdAt: '2015-03-26 21:36:28', auxPageId: '', oldSettingsValue: '', newSettingsValue: '' } ], feedSubmissions: [], searchStrings: {}, hasChildren: 'true', hasParents: 'true', redAliases: {}, improvementTagIds: [], nonMetaTagIds: [], todos: [], slowDownMap: 'null', speedUpMap: 'null', arcPageIds: 'null', contentRequests: {} }