Grok API 가격 가이드 2026: 토큰, 무료 티어, 사용 사례 비용

AI Free API Team

•2026년 7월 2일•18 분 소요•API 가이드

Grok API 비용은 토큰 단가만으로 끝나지 않습니다. 공식 가격 행, 캐시, 출력, 도구 호출, Batch, Priority, 저장소, 재시도, 계정 한도를 함께 봐야 합니다.

Grok API 가격 가이드 2026: 토큰, 무료 티어, 사용 사례 비용

2026년 7월 2일 xAI Docs 기준으로 Grok API 예산을 잡을 때 먼저 봐야 할 현재 기본 행은 grok-4.3입니다. 문서에는 input token 100만 개당 1.25달러, cached input 100만 개당 0.20달러, output token 100만 개당 2.50달러로 표시되어 있습니다. 공개 xAI 문서는 영구적이고 모든 계정에 동일한 공식 free API tier를 보장하지 않습니다. 안전한 기본값은 자신의 xAI console에서 credits, eligibility, billing mode, 지역, rate limit, 사용 가능한 model row를 확인한 뒤 비용을 계산하는 것입니다.

Grok API의 실제 비용은 공식 token row 하나로 계산되지 않습니다. fresh input, cached input, output과 reasoning tokens, Web Search, X Search, Code Execution, file search, RAG, Batch, Priority, storage, downloads, retry rate, spending limit을 같은 worksheet에 넣어야 합니다. 외부 자료에는 무료 티어, 무료 key, Groq API 무료 같은 다른 서비스 주장, 오래된 credit, provider route, consumer subscription이 함께 섞입니다.

먼저 소액 prepaid로 테스트하고 postpaid limit을 낮게 또는 0으로 설정한 뒤 model ID, 토큰 수, tool calls, retry, accepted output을 기록하세요. 콘솔 로그, xAI 공식 가격표, worksheet가 서로 맞을 때만 트래픽을 키우는 것이 안전합니다.

지출 전에 확인할 네 가지

질문	2026-07-02 기준 안전한 답	예산 작업
Grok API 기본 가격 행은?	xAI Docs는 grok-4.3을 input $1.25, cached input $0.20, output $2.50 / 100만 tokens로 표시합니다.	이 행을 기본값으로 쓰고 공개 전 pricing page를 다시 확인합니다.
공식 무료 티어가 있나?	공개 문서는 영구 공식 free API tier를 보장하지 않습니다. quickstart는 API 사용 전 account에 credits를 load하라고 안내합니다.	자신의 console에서 credits, 만료, billing mode를 확인합니다.
인보이스를 가장 크게 바꾸는 것은?	output length, cache hit, tool calls, Batch, Priority, storage, retry, rate-limit tier입니다.	workload별 worksheet를 만듭니다.
가장 안전한 테스트 방식은?	소액 prepaid, 낮은 postpaid limit, pinned model ID, 전체 로그입니다.	console row나 token log가 worksheet와 다르면 중단합니다.

이 표는 공식 가격과 실제 workload cost를 분리합니다. 공식 가격 행은 xAI Docs가 소유합니다. 실제 비용은 prompt, retrieval, tools, output, retries, limits가 만듭니다.

현재 공식 Grok API 가격 행

공식 기준은 xAI pricing documentation입니다. 제3자 summary나 provider calculator는 독자의 혼란을 보여줄 수 있지만 공식 가격 행은 아닙니다. 2026년 7월 2일 확인한 주요 Chat API 행은 다음과 같습니다.

Model row	Docs context	Input / 1M	Cached input / 1M	Output / 1M	사용 방식
grok-4.3	1M	$1.25	$0.20	$2.50	대부분의 text 및 image-input 작업에서 먼저 예산화할 행.
grok-build-0.1	256k	$1.00	$0.20	$2.00	사용 가능하고 품질이 맞으면 더 낮은 build-oriented 행.
grok-4.20-multi-agent-0309	1M	$1.25	$0.20	$2.50	특수 행이므로 console availability와 목적을 확인합니다.
grok-4.20-0309-reasoning	1M	$1.25	$0.20	$2.50	reasoning workload는 실제 output과 behavior로 판단합니다.
grok-4.20-0309-non-reasoning	1M	$1.25	$0.20	$2.50	가격만 보지 말고 품질과 output length를 테스트합니다.

Grok 4.3 model page는 grok-4.3, grok-4.3-latest, grok-latest, text/image input, text output, 1M context window를 보여줍니다. 같은 페이지는 200K context를 넘는 요청에 다른 rate가 적용될 수 있다고 말합니다. 긴 문서나 큰 retrieval을 다루는 경우 일반 행을 그대로 적용하지 말고 별도 샘플로 측정해야 합니다.

Grok API 공식 가격 변수 보드. grok-4.3, grok-build-0.1, 도구, Batch, Priority, 저장소 행을 정리한다.

이 숫자를 영구값으로 고정하지 마세요. xAI는 price, model availability, alias, region, rate limit, console behavior를 바꿀 수 있습니다. 운영 예산표에는 date, exact model ID, docs URL, team/account, sample size, usage log를 남겨야 나중에 검증할 수 있습니다.

도구, Batch, Priority, Storage 비용

Grok API 예산이 틀리는 가장 흔한 이유는 token만 계산하는 것입니다. xAI pricing page에는 agent workflow에서 크게 작용하는 별도 비용 면이 있습니다.

비용 면	2026-07-02 확인 규칙	커지는 상황
Web Search	$5 / 1000 calls	최신 web evidence가 필요한 agent.
X Search	$5 / 1000 calls	X 기반 realtime 정보나 social evidence.
Code Execution	$5 / 1000 calls	coding, data, sandbox execution.
File Attachments search	$10 / 1000 calls	업로드 파일이나 큰 문서 QA.
Collections Search / RAG	$2.50 / 1000 calls	retrieval-heavy knowledge base.
Batch API	text/language token은 보통 20%-50% 할인, 대개 24시간 이내	실시간이 필요 없는 요약, 분류, 추출.
Priority Processing	prompt caching discount 후 standard token rate의 2배	latency가 비용보다 중요한 명시적 priority route.
File storage	$0.025/GiB/day	업로드 파일을 보존할 때.
Collection storage	$0.10/GiB/day	retrieval collection을 저장할 때.
Downloads	$0.20/GiB downloaded	export 또는 download-heavy workflow.

support bot, RAG assistant, coding assistant, research agent는 같은 비용 구조가 아닙니다. support는 cache와 짧은 output이 중요합니다. RAG는 file search나 collection search가 token 절감보다 커질 수 있습니다. research agent는 token보다 search call이 비쌀 수 있습니다. Batch는 latency를 포기할 수 있을 때만 의미가 있습니다.

무료 티어와 Credits 경계

2026년 7월 2일 확인한 공개 xAI Docs는 영구 공식 free API tier를 보장하지 않습니다. Quickstart는 account에 credits를 load한 뒤 API를 사용하라고 설명합니다. 이것은 모든 계정에 매달 무료 API가 있다는 뜻이 아닙니다.

Route	의미	안전한 설명
Official xAI API	xAI team/account 안에서 usage billed or credited.	console에서 credits, eligibility, billing mode, model list, rate limits를 확인합니다.
Console credits or promotions	특정 account balance, trial, promotion.	account state로 기록하고 universal free tier라고 쓰지 않습니다.
Third-party free route	provider가 sponsorship, proxy, throttle, limit를 제공하는 route.	provider contract이지 official xAI price row가 아닙니다.
Grok app / X subscription / SuperGrok	consumer access product.	developer API billing과 분리합니다.

한국어 자료에는 Grok API 가격, 무료 티어, 무료 key, Groq API 무료, 오래된 Grok 3 credit 이야기가 섞입니다. 독자에게 필요한 것은 무료 실험을 막는 것이 아니라 공식 API, account credits, provider route, consumer subscription을 별도 계약으로 나누는 것입니다.

비용 공식

먼저 명시적인 식을 만들고, 그 다음 console log로 교체하세요.

text
estimated cost =
  fresh_input_count / 1,000,000 * input_price
+ cached_input_count / 1,000,000 * cached_input_price
+ output_count / 1,000,000 * output_price
+ tool_calls / 1,000 * tool_call_price
+ storage_gib_days * storage_price
+ downloads_gib * download_price
+ retry_cost
+ priority_multiplier_or_batch_discount

이 공식은 세 가지 실수를 줄입니다. cached input은 실제 cache hit가 있을 때만 절감입니다. server-side tools는 agent 내부에 숨어 있어도 무료가 아닙니다. base token row는 retry, schema repair, Priority, storage, downloads가 있으면 invoice와 달라집니다.

Worksheet에는 model ID, fresh input 토큰 수, cached input 토큰 수, output 토큰 수, tool calls, retry rate, accepted output count, Batch eligible, Priority required, storage retained, console limit, fallback rule을 넣으세요. first response cost가 아니라 accepted result cost를 봐야 합니다.

사용 사례별 비용 예시

다음 예시는 보편적인 월 비용이 아니라 비용 driver를 보여줍니다. 실제 request volume, output length, tools, retry를 자신의 로그로 바꾸세요.

Grok API 사용 사례 비용 예시. support chat, documents, coding, research workloads를 비교한다.

Support chat

Support bot은 output length와 cache hit에 민감합니다. 안정적인 system prompt, tone rules, policy block, tool instructions는 cached input 후보입니다. 비용을 올리는 것은 긴 답변, handoff summary, rejected answer retry입니다.

Assumption	Example value	Cost implication
Requests	100000 replies/month	volume이 높으면 작은 차이도 커집니다.
Fresh input	800 tokens/reply	base input은 비교적 제어 가능합니다.
Cached input	1200 tokens/reply	cache hit rate가 비용을 크게 바꿉니다.
Output	350 tokens/reply	output price가 예상보다 중요합니다.
Tools	0 to 1 retrieval/search call/reply	매번 tool을 쓰면 token savings를 넘을 수 있습니다.

Control rule: stable instructions를 cache하고, answer length를 제한하고, accepted와 retried replies를 나누어 기록하고, 모든 ticket에 연결하기 전에 품질을 sampling합니다.

Documents and RAG

문서 QA는 input-heavy입니다. 하나의 answer에는 retrieved passages, file search, user query, policy text, long output이 들어갈 수 있습니다. token row가 싸 보여도 file search나 collection search가 자주 발생하면 전체 비용은 달라집니다.

Assumption	Example value	Cost implication
Requests	20000 answers/month	medium volume이라도 context가 크면 비싸질 수 있습니다.
Fresh input	6000 tokens/answer	retrieval window가 가장 큰 lever입니다.
Cached input	1000 tokens/answer	fixed instruction은 도움이 되지만 retrieved chunks는 fresh인 경우가 많습니다.
Output	700 tokens/answer	citation과 summary가 output을 늘립니다.
Tools	File Attachments search or Collections Search	tool row를 별도로 계산해야 합니다.

Control rule: fewer and better chunks, compact citations, maximum context budget, quality comparison을 사용하고 retrieval window를 넓히기 전에 작은 window로 통과하는지 봅니다.

Coding assistant

Coding work는 짧은 suggestion이면 저렴할 수 있지만 agentic loop에서는 비싸집니다. 비용 driver는 files, diffs, tests, Code Execution, patch explanation, accepted change까지의 attempts입니다.

Assumption	Example value	Cost implication
Tasks	5000 coding turns/month	turn count는 multi-attempt work를 숨깁니다.
Fresh input	2500 tokens/turn	files, diffs, tests, repo rules가 누적됩니다.
Cached input	500 tokens/turn	repeated repo instructions가 도움이 될 수 있습니다.
Output	900 tokens/turn	patch explanation과 structured output은 길어질 수 있습니다.
Tools	Code Execution when enabled	tool fee와 failed test loop를 기록해야 합니다.

Control rule: first response가 아니라 accepted change cost를 측정합니다. failed tests, retry, human review minutes, rollback을 같은 표에 넣습니다.

Research agent

Research agent는 token이 싸 보이고 tools가 비싼 workload입니다. Web Search, X Search, file search, long evidence summary가 지배적일 수 있습니다. 오래된 사실을 섞을 위험도 가장 큽니다.

Assumption	Example value	Cost implication
Reports	1000 reports/month	volume이 낮아도 task당 비용이 큽니다.
Fresh input	4000 tokens/report	query plan, evidence, instructions가 큽니다.
Cached input	800 tokens/report	reusable report scaffold는 cache될 수 있습니다.
Output	1500 tokens/report	evidence packet과 summary는 output-heavy입니다.
Tools	Multiple Web Search or X Search calls	tool calls가 total cost를 지배할 수 있습니다.

Control rule: tool calls를 cap하고, source quality를 요구하고, non-urgent work는 Batch를 검토하고, current official source를 제시하지 못하는 claim은 멈춥니다.

Rate limit와 Billing control

xAI rate-limit docs는 API team마다 model별 RPS와 TPM이 있고, tier는 2026년 1월 1일 이후 cumulative API spend에 기반한다고 설명합니다. TPM에는 prompt, completion, reasoning, cached prompt, image, audio token이 모두 포함됩니다. model page의 숫자는 참고용이고, 실제 운영 기준은 자신의 team console입니다.

Billing management는 invoices, payment methods, prepaid credit balance, top-ups, historical usage, postpaid invoice preview, spending limits를 다룹니다. 첫 production test는 이렇게 진행하세요.

prepaid credits로 시작합니다.
prepaid-only에 가깝게 운영하려면 postpaid limit을 낮게 또는 0으로 설정합니다.
토큰 수, cached token 수, model ID, tool calls, retry, errors, latency, accepted result를 기록합니다.
작은 sample 이후 actual spend와 worksheet를 비교합니다.
spend, quality, latency, failure rate가 안정될 때만 limit을 올립니다.

Grok API 지출 제어 체크리스트. docs verification, console credits, limits, logs, cache, Batch, old model warning을 정리한다.

Stop rule은 월말 invoice가 아니라 code나 운영 절차에 들어가야 합니다. request volume, tool calls, retry rate, output tokens, Priority use가 worksheet threshold를 넘으면 route를 멈춥니다.

오래된 모델 행과 May 15 retirement

May 15 retirement notice는 Grok API pricing에서 freshness warning입니다. xAI는 2026년 5월 15일 이후 여러 retired slugs가 grok-4.3으로 redirect되고, 그 이후 deprecated slug request는 grok-4.3 pricing으로 billed된다고 설명합니다. Grok 4.1 Fast, Grok 3, 오래된 free credits 중심의 외부 문장은 그대로 예산에 넣을 수 없습니다.

보이는 주장	먼저 보는 방식	더 안전한 행동
Grok 4.1 Fast가 현재 저가 기본값	official docs 또는 console이 증명할 때까지 stale.	pricing page와 model list를 재확인합니다.
모든 계정에 월별 free credits	xAI Docs가 명시하지 않는 한 account-specific or provider-specific.	credit balance와 expiration을 확인합니다.
provider calculator의 free usage	provider contract.	official xAI row와 분리합니다.
grok-latest alias	편하지만 움직일 수 있음.	cost test에서는 exact model ID를 pin합니다.

요청이 성공한다고 해서 예상 가격으로 billed된다는 뜻은 아닙니다. 현재 official row와 자신의 console behavior를 기준으로 예산을 세우세요.

안전한 테스트 계획

Grok API 사용량을 키우기 전에 실제 workload와 가까운 작은 test를 실행합니다.

Step	Action	Pass signal
1. Pin model	grok-4.3 또는 테스트할 exact row로 시작합니다.	logs에 expected model ID와 team/account가 보입니다.
2. Set spend stop	prepaid credits와 낮은 postpaid limit을 사용합니다.	runaway test가 큰 invoice를 만들 수 없습니다.
3. Run real sample	real prompts, retrieval, tools, output format을 사용합니다.	sample이 production work와 닮아 있습니다.
4. Count accepted cost	accepted outputs, retries, tool calls, review time을 셉니다.	cost per accepted result가 분명합니다.
5. Compare alternatives	lower row, Batch, cache, fewer tools, shorter output을 시험합니다.	더 싼 route가 quality gate를 통과합니다.
6. Scale gradually	logs와 worksheet가 맞은 뒤 limit을 올립니다.	spend, quality, latency, failure rate가 안정됩니다.

model ID, alias, migration, rollout은 Grok 4.3 API 가이드에서 다루고, 여기서는 pricing, billing, workload behavior에 집중합니다.

자주 묻는 질문

Grok API 비용은 얼마인가요?

2026년 7월 2일 xAI Docs 기준 grok-4.3은 input $1.25, cached input $0.20, output $2.50 / 100만 tokens입니다. 실제 비용은 output length, cache hit, tool calls, Batch, Priority, storage, retry, account limits에 따라 달라집니다.

Grok API는 무료인가요?

공개 xAI Docs는 영구 공식 free API tier를 보장하지 않습니다. Quickstart는 account에 credits를 load한 뒤 API를 사용하라고 안내합니다. account credits나 provider free route는 official xAI price row와 별개입니다.

어떤 Grok 모델을 먼저 예산화해야 하나요?

일반적인 Grok API 작업은 grok-4.3부터 시작하세요. grok-build-0.1이나 grok-4.20 row를 테스트할 때는 price뿐 아니라 quality, availability, output behavior를 worksheet에 넣어야 합니다.

Cached input은 왜 더 저렴한가요?

반복 prompt content가 cache behavior 대상일 때 cached input row가 적용됩니다. stable system prompt, policy block, template에 유용하지만 자동 절감은 아닙니다. cache hit을 측정한 뒤 예산을 낮추세요.

Tools가 Grok API pricing을 바꾸나요?

네. Web Search, X Search, Code Execution, File Attachments search, Collections Search/RAG는 별도 가격이 있습니다. workflow가 tools를 쓰면 공식에 넣어야 합니다.

Batch API는 언제 써야 하나요?

실시간 응답이 필요 없을 때 검토하세요. xAI는 eligible text/language batch work에 20%-50% discount를 제시하지만, image/video generation은 standard rate일 수 있습니다.

가장 자주 빠지는 비용은 무엇인가요?

tool calls, retries, long output, Priority multiplier, file/collection storage, retired slug redirect, postpaid limit입니다. 최저 token price보다 accepted result cost를 보는 것이 더 안전합니다.