1M-Token Context Window

~14 min · gemini, context-window, long-context, performance

Level 0🌱 입문자

0 XP0/70 lessons0/11 achievements

0/120 XP to next level120 XP to go0% complete

Gemini 가 가져오는 context 이점

Gemini 2.5 Pro 와 3.0 Pro 가 1M-token context window ship — Claude Sonnet 의 200K (또는 1M-context variant 의 1M) 와 Codex 의 256K 보다 significantly 큼. 가능한 일이 변해: 중간 사이즈 codebase 통째로 fit, 500-page PDF fit, compaction 없이 몇 주 대화 history fit.

트레이드오프는 latency 와 비용 — 큰 context = 토큰당 더 많은 compute. long-context 승리가 진짜 값 하려면 작업이 폭에서 이득 봐야 함: cross-cutting 리팩토링, 문서 합성, large-corpus search-and-summarize. 빡빡한 작업엔 1M window overkill.

Code

When 1M context is the right tool·text

Yes — 1M context shines:
- Audit a 200K-line codebase in one pass
- Synthesize a 1000-page legal corpus
- Multi-week conversation that hasn't been /compact-ed
- Cross-file refactors that need to see *all* affected sites
- Analyze a folder of 50 PDFs in one query

No — 1M context is overkill:
- "Fix this bug in src/auth.ts"
- "Generate a commit message"
- Single-file review
- Anything that fits in 50K tokens

Pin the long-context model + measure·bash

# Switch to the 1M-context model
gemini --model gemini-2.5-pro

# Or pin in settings
# ~/.gemini/settings.json
{ "model": { "default": "gemini-2.5-pro" } }

# Track usage
gemini /stats
# Tokens in:  847,212
# Tokens out:   8,401
# Cache hit:   612,000  (the cache pays off on long context)

External links

Gemini long context

Exercise

같은 작업 두 가지로 run: (a) Gemini + default 작은 context, 타깃 파일 몇 개; (b) Gemini + 1M-context 모델, 전체 repo 로딩. quality, latency, 비용 비교. 어느 작업 모양이 진짜 1M window 자격 있는지 적어.

Progress

Progress is local-only — sign in to sync across devices.

← PreviousSandbox 컨테이너 Next →고급 패턴 — Vibe Coding, 도구 합성

이 페이지에서 버그를 발견하셨거나 피드백이 있으세요?문제 신고

🔔 답글 알림 (로그인 필요)

로그인 — 댓글을 남기려면 로그인해 주세요.

아직 댓글이 없어요. 첫 댓글을 남겨보세요.