모형한테 introspect 시켜

~24 min · introspection, drift-audit, evidence

Level 0Window Watcher

0 XP0/50 lessons0/13 achievements

0/100 XP to next level100 XP to go0% complete

Confidence는 signal 아니야

'이해했나?' 묻지 마. 모형이 yes 할 거야. 묻기 — live rule list, must-not 리스트, 각 사실 source 명명, 빠졌다 생각하는 거 식별. Evidence-based introspection이 confidence prompt가 숨기는 drift 잡아.

Risk 주변에 introspection 스케줄

Drift audit를 compaction 전, 긴 interrupt 후, commit 전, publish 전, task 전환 후에 돌려. Unverified 가정이 가장 비싼 순간들. 뭐 깨진 후 reactive introspection은 yield 낮아.

Gap 인정하게 audit frame

Audit prompt는 명시적으로 모형한테 'missing' 말할 초대. Prompt가 모형이 통과하고 싶은 quiz처럼 생기면, 빈칸 인정 X priors에서 reconstruct. 정직한 gap 보상하는 prompt 만들어.

Code

Drift audit prompt·text

Drift audit:
Using only current context (no general knowledge), list:
  1. Live rules I must obey, with source.
  2. Actions I must not take, with source.
  3. Sources I have verified this turn (paths/IDs).
  4. The next safe action.
  5. Anything I think might be missing — say 'missing' rather than reconstruct.
Then wait for human confirmation before continuing.

언제 돌릴지·yaml

introspection_triggers:
  - before_compaction
  - after_long_interruption
  - before_commit
  - before_push_or_deploy
  - on_topic_switch
  - when_human_corrects_authority_rule
  - first_action_of_a_resumed_session

External links

OpenAI compaction guide

Exercise

Drift audit prompt를 현재 AI session에 돌려, 또는 한 session output 시뮬레이션. 모형이 confidently reconstruct vs 정직하게 'missing' 인정 어디 했는지 노트.

Hint

중요한 부분은 quiz 통과 X missing context 인정.

Progress

Progress is local-only — sign in to sync across devices.

← ⚡ Caching과 Stable Prefix퀴즈 · 5 questions Next →Re-Prompt 나선

이 페이지에서 버그를 발견하셨거나 피드백이 있으세요?문제 신고

🔔 답글 알림 (로그인 필요)

로그인 — 댓글을 남기려면 로그인해 주세요.

아직 댓글이 없어요. 첫 댓글을 남겨보세요.