Grok API Pricing Guide 2026: Tokens, Free Tier, and Cost Examples by Use Case

AI Free API Team

•July 2, 2026•18 min read•API Guide

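Grok API pricing is not determined by the per-token rate alone. Check the official rows, caching, output, tools, Batch, Priority, storage, retries, and console limits in the same table.

According to xAI Docs as checked on July 2, 2026, the first row to look at as the current Grok API baseline is grok-4.3. It is priced at $1.25 per 1M input tokens, $0.20 per 1M cached input tokens, and $2.50 per 1M output tokens. The public xAI Docs do not guarantee a permanent, universal official free API tier. The safe assumption is to check credits, eligibility, region, billing mode, rate limits, and available model rows in your own xAI console before budgeting.

The real cost of the Grok API is not determined by the official token row alone. Fresh input, cached input, output and reasoning tokens, Web Search, X Search, Code Execution, file search, RAG, Batch, Priority, storage, downloads, retries, and spending limits all need to go into the same worksheet. Claims of a "free tier," "monthly credits," or "cheapest" in third-party material tend to mix up the official API, account credits, provider routes, the Grok app, and X subscriptions.

Start with a small prepaid balance, set the postpaid limit low or to 0, and log the model ID, token counts, tool calls, retries, and accepted outputs. It is safer not to expand production traffic until the console actuals, the xAI Docs pricing rows, and the worksheet numbers agree.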

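Quick decisions before you spend

Question	Safe answer as of 2026-07-02	Next budgeting action
What is the base Grok API pricing row?	xAI Docs list grok-4.3 at $1.25 input, $0.20 cached input, and $2.50 output per 1M tokens.	Estimate with this row first, then recheck the pricing page before going live.
Is there an official free tier?	The public docs do not guarantee a permanent official free API tier. The quickstart shows a flow where you load credits and then use the API.	Check credits, expiration, and billing mode in your own console.
What changes the bill the most?	Output length, cache hits, tool calls, Batch, Priority, storage, retries, and rate-limit tier.	Build a worksheet per workload.
What is the safest way to try it?	A small prepaid balance, a low postpaid limit, a pinned model ID, and complete usage logs.	Stop if the model row or token log does not match the worksheet.

This table separates the official pricing rows from real workload cost. The official rows are the responsibility of xAI Docs. The real cost is the result of your prompts, retrieval, tools, output, retries, and limits.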

Current official Grok API pricing rows

The official baseline is xAI's pricing documentation. Third-party summaries and provider calculators can hint at what readers struggle with, but they are not official pricing rows. The main Chat API rows as checked on July 2, 2026 are as follows.

Model row	Context in docs	Input / 1M	Cached input / 1M	Output / 1M	How to use it
grok-4.3	1M	$1.25	$0.20	$2.50	Baseline row for most text and image-input work.
grok-build-0.1	256k	$1.00	$0.20	$2.00	Lower-priced, build-oriented row when it is available and the quality fits.
grok-4.20-multi-agent-0309	1M	$1.25	$0.20	$2.50	Specialized use. Confirm availability and purpose in the console.
grok-4.20-0309-reasoning	1M	$1.25	$0.20	$2.50	Evaluate the reasoning row on measured output and behavior.
grok-4.20-0309-non-reasoning	1M	$1.25	$0.20	$2.50	Do not choose on the token row alone; measure quality and output volume.

The Grok 4.3 model page lists grok-4.3, grok-4.3-latest, grok-latest, text/image input, text output, and a 1M context window. The same page also notes that a different rate may apply to requests that exceed 200K context. If you handle long documents or large retrieval, measure those requests separately from the standard row.

Official Grok API pricing variable board, organizing grok-4.3, grok-build-0.1, tools, Batch, Priority, and storage.

Do not store these numbers as fixed values. xAI can change prices, model availability, aliases, regions, rate limits, and console behavior. An internal budget sheet is easier to audit if it records the date, exact model ID, docs URL, team/account, sample size, and usage log.

Tools, Batch, Priority, and Storage

Grok API estimates go wrong when they count only tokens. The xAI pricing page also lists additional charges that grow in agent workflows.

Cost surface	Rule as checked on 2026-07-02	Where it matters
Web Search	$5 / 1000 calls	Agents that need current web evidence.
X Search	$5 / 1000 calls	Real-time information and social evidence on X.
Code Execution	$5 / 1000 calls	Coding, data, and sandbox execution.
File Attachments search	$10 / 1000 calls	Document QA over large uploaded files.
Collections Search / RAG	$2.50 / 1000 calls	Knowledge base retrieval.
Batch API	Text/language tokens typically 20%-50% off, typically completed within 24 hours	Non-real-time summarization, classification, and extraction.
Priority Processing	2x the standard token rate after the prompt caching discount	Routes where latency clearly matters.
File storage	$0.025/GiB/day	When you retain uploaded files.
Collection storage	$0.10/GiB/day	When you store retrieval collections.
Downloads	$0.20/GiB downloaded	Export or download-heavy flows.

A support bot, a RAG assistant, a coding assistant, and a research agent cannot share one formula. Support benefits from caching and short answers. In RAG, file search can dominate. For a research agent, search calls can cost more than tokens. Batch summarization can be cheaper if the latency is acceptable.
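The non-token rows in the table above can be sketched as three small helpers. This is a minimal sketch, not xAI's billing logic: the dictionary keys and function names are hypothetical, the rates are the ones quoted in this guide as of 2026-07-02, and the call volumes in the example are made-up inputs.

```python
# Rates as listed in this guide (checked 2026-07-02); re-verify before use.
# The key names are illustrative labels, not official xAI identifiers.
TOOL_FEE_PER_1000_CALLS = {
    "web_search": 5.00,
    "x_search": 5.00,
    "code_execution": 5.00,
    "file_attachments_search": 10.00,
    "collections_search": 2.50,
}
STORAGE_FEE_PER_GIB_DAY = {"file": 0.025, "collection": 0.10}
DOWNLOAD_FEE_PER_GIB = 0.20


def tool_cost(calls: dict[str, int]) -> float:
    """Total tool fee in USD for a mapping of tool name -> call count."""
    return sum(
        TOOL_FEE_PER_1000_CALLS[name] * count / 1000
        for name, count in calls.items()
    )


def storage_cost(kind: str, gib: float, days: float) -> float:
    """Storage fee in USD for `gib` GiB retained for `days` days."""
    return STORAGE_FEE_PER_GIB_DAY[kind] * gib * days


def download_cost(gib: float) -> float:
    """Download fee in USD for `gib` GiB downloaded."""
    return DOWNLOAD_FEE_PER_GIB * gib


# Hypothetical month: the volumes below are assumptions for illustration.
calls = {"web_search": 3000, "collections_search": 20000}
print(f"tools:     ${tool_cost(calls):.2f}")
print(f"storage:   ${storage_cost('collection', gib=10, days=30):.2f}")
print(f"downloads: ${download_cost(5):.2f}")
```

With these inputs the three lines come to $65.00, $30.00, and $1.00. An unknown tool name raises `KeyError` on purpose, so a new tool cannot slip into the estimate at zero cost.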

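The boundary between a free tier and credits

The public xAI Docs as checked on July 2, 2026 do not guarantee a permanent official free API tier. The Quickstart shows a flow where you sign up, load credits into your account, and then use the API. That does not mean that everyone gets a free API allowance every month.

Route	What it means	Safe way to describe it
Official xAI API	Usage is billed or credited within the xAI team/account.	Check credits, eligibility, billing mode, model list, and rate limits in the console.
Console credits or promotions	A balance, trial, or promotion on a specific account.	Record it as account state; do not call it a universal free tier.
Third-party free route	A route where the provider holds the sponsorship, proxy, and limits.	It is a provider contract, not an official xAI price row.
Grok app / X subscription / SuperGrok	Consumer access products.	Keep them separate from developer API billing.

In Japanese-language third-party material, Grok API pricing, free tiers, token rates, confusion between Grok and Groq, and the old $150 credit story appear side by side. What readers need is not a denial of free routes, but to treat official rows, account credits, provider routes, and consumer access as separate contracts.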

Cost formula

Estimate with an explicit formula first, then replace the inputs with console logs.

estimated cost =
  ( fresh_input_count / 1,000,000 * input_price
  + cached_input_count / 1,000,000 * cached_input_price
  + output_count / 1,000,000 * output_price
  ) * priority_multiplier_or_batch_discount
+ tool_calls / 1,000 * tool_call_price
+ storage_gib_days * storage_price
+ downloads_gib * download_price
+ retry_cost

This formula prevents three mistakes. Cached input is cheaper only when cache hits can actually be measured. Server-side tools sit behind the agent, but they are not free. The base token row will not match the invoice once retries, schema repair, Priority, storage, and downloads are involved.

In the worksheet, include the model ID, fresh input token count, cached input token count, output token count, tool calls, retry rate, accepted output count, Batch eligible, Priority required, storage retained, console limit, and fallback rule. In practice, look at accepted result cost rather than first response cost.
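The formula and worksheet fields can be turned into a small estimator. The sketch below is one possible reading, not an official calculator: `WorksheetRow`, its field names, and `token_rate_factor` are hypothetical, and applying Priority or Batch as a multiplier on the token portion only is an assumption based on how this guide describes those rows. Default prices are the grok-4.3 row quoted above.

```python
from dataclasses import dataclass


@dataclass
class WorksheetRow:
    # Token counts for the whole sample, not per request.
    fresh_input_tokens: int
    cached_input_tokens: int
    output_tokens: int
    # USD per 1M tokens; defaults are the grok-4.3 row quoted in this guide.
    input_price: float = 1.25
    cached_input_price: float = 0.20
    output_price: float = 2.50
    # Non-token cost surfaces.
    tool_calls: int = 0
    tool_price_per_1000: float = 0.0
    storage_gib_days: float = 0.0
    storage_price: float = 0.0
    downloads_gib: float = 0.0
    download_price: float = 0.0
    retry_cost: float = 0.0
    # Assumption: 2.0 for Priority, below 1.0 for a Batch discount, else 1.0.
    token_rate_factor: float = 1.0
    accepted_outputs: int = 1


def estimated_cost(row: WorksheetRow) -> float:
    """Estimated spend in USD for the sample described by `row`."""
    token_cost = (
        row.fresh_input_tokens / 1_000_000 * row.input_price
        + row.cached_input_tokens / 1_000_000 * row.cached_input_price
        + row.output_tokens / 1_000_000 * row.output_price
    ) * row.token_rate_factor
    return (
        token_cost
        + row.tool_calls / 1_000 * row.tool_price_per_1000
        + row.storage_gib_days * row.storage_price
        + row.downloads_gib * row.download_price
        + row.retry_cost
    )


def cost_per_accepted_result(row: WorksheetRow) -> float:
    """Spend divided by accepted outputs, the figure the worksheet tracks."""
    if row.accepted_outputs <= 0:
        raise ValueError("accepted_outputs must be positive")
    return estimated_cost(row) / row.accepted_outputs


sample = WorksheetRow(
    fresh_input_tokens=1_000_000,
    cached_input_tokens=1_000_000,
    output_tokens=1_000_000,
    tool_calls=1_000,
    tool_price_per_1000=5.0,
    accepted_outputs=10,
)
print(f"total: ${estimated_cost(sample):.2f}")
print(f"per accepted result: ${cost_per_accepted_result(sample):.4f}")
```

Dividing by accepted outputs rather than requests is the design choice that matters here: retries and rejected answers raise the numerator without raising the denominator.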

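Estimation examples by use case

The following examples show cost drivers; they are not universal monthly prices. Replace the request volume, output length, tools, and retries with your own logs.

Grok API cost examples by use case, comparing support chat, documents, coding, and research workloads.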

Support chat

A support bot is sensitive to output length and cache hits. A fixed system prompt, tone rules, policy block, and tool instructions are candidates for cached input. What tends to get expensive is long answers, handoff summaries, and retries of rejected answers.

Assumption	Example value	Cost implication
Requests	100000 replies/month	At high volume, small differences add up.
Fresh input	800 tokens/reply	Base input is relatively easy to control.
Cached input	1200 tokens/reply	The cache hit rate matters a lot.
Output	350 tokens/reply	Output price matters more than expected.
Tools	0 to 1 retrieval/search call/reply	Calling a tool on every reply outweighs the token savings.

Control rule: cache stable instructions, cap answer length, record accepted and retried replies separately, and sample quality before expanding to all tickets.
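To see why the Tools row matters, the table values can be plugged into the grok-4.3 row. This is a rough sketch under stated assumptions: every cached token is billed at the cached rate, there are no retries, and the retrieval call is billed at the Collections Search fee quoted in this guide. The function names are illustrative.

```python
# grok-4.3 row and Collections Search fee as quoted in this guide (USD).
INPUT_PRICE, CACHED_PRICE, OUTPUT_PRICE = 1.25, 0.20, 2.50  # per 1M tokens
COLLECTIONS_SEARCH_FEE = 2.50                                # per 1,000 calls

# Example values from the support chat table.
REPLIES = 100_000
FRESH, CACHED, OUTPUT = 800, 1200, 350  # tokens per reply


def monthly_token_cost() -> float:
    """Token spend per month, assuming a 100% cache hit on CACHED tokens."""
    per_reply = (
        FRESH * INPUT_PRICE + CACHED * CACHED_PRICE + OUTPUT * OUTPUT_PRICE
    ) / 1_000_000
    return REPLIES * per_reply


def monthly_tool_cost(calls_per_reply: float) -> float:
    """Retrieval spend per month for an average number of calls per reply."""
    return REPLIES * calls_per_reply * COLLECTIONS_SEARCH_FEE / 1_000


tokens = monthly_token_cost()
for rate in (0.0, 0.25, 1.0):
    tools = monthly_tool_cost(rate)
    print(f"calls/reply={rate:.2f}  tokens=${tokens:.2f}  tools=${tools:.2f}")
```

Under these assumptions the token bill is $211.50 per month, while one retrieval call on every reply adds $250.00, more than all tokens combined.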

Documents and RAG

Document QA is input-heavy. A single answer includes retrieved passages, file search, the user query, policy text, and long output. Even if the token row is cheap, the total changes as file search or collection search calls grow.

Assumption	Example value	Cost implication
Requests	20000 answers/month	Even medium volume gets expensive when context is large.
Fresh input	6000 tokens/answer	The retrieval window is the main lever.
Cached input	1000 tokens/answer	Fixed instructions benefit, but retrieved chunks are fresh.
Output	700 tokens/answer	Citations and summaries increase output.
Tools	File Attachments search or Collections Search	Account for the tool row separately.

Control rule: use fewer and better chunks, compact citations, a maximum context budget, and quality comparison, and confirm whether a small window passes before widening retrieval.
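The effect of the retrieval window and the choice of search tool can be compared side by side. The sketch below uses the table values and the rates quoted in this guide; `rag_monthly_cost`, the dictionary keys, and the default of one search call per answer are assumptions for illustration.

```python
INPUT_PRICE, CACHED_PRICE, OUTPUT_PRICE = 1.25, 0.20, 2.50  # USD per 1M tokens
# USD per 1,000 calls; the keys are illustrative labels.
SEARCH_FEE_PER_1000 = {"file_attachments": 10.00, "collections": 2.50}


def rag_monthly_cost(
    answers: int,
    fresh_tokens: int,
    cached_tokens: int,
    output_tokens: int,
    search_kind: str,
    searches_per_answer: float = 1.0,
) -> float:
    """Monthly USD estimate: token spend plus per-call search fees."""
    token_cost = answers * (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * CACHED_PRICE
        + output_tokens * OUTPUT_PRICE
    ) / 1_000_000
    search_cost = (
        answers * searches_per_answer * SEARCH_FEE_PER_1000[search_kind] / 1_000
    )
    return token_cost + search_cost


# Table values, then a narrower retrieval window (3000 fresh tokens).
for window in (6000, 3000):
    for kind in ("file_attachments", "collections"):
        cost = rag_monthly_cost(20_000, window, 1000, 700, kind)
        print(f"window={window:>5}  search={kind:<16}  ${cost:.2f}/month")
```

With the table values and one File Attachments search per answer, the search fee ($200) is slightly larger than the token spend ($189), which is why the tool row is counted separately.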

Coding assistant

Coding work is cheap for short suggestions and expensive in agentic loops. The cost drivers are files, diffs, tests, Code Execution, patch explanations, and the attempts needed to reach an accepted change.

Assumption	Example value	Cost implication
Tasks	5000 coding turns/month	Turn count hides multi-attempt work.
Fresh input	2500 tokens/turn	Files, diffs, tests, and repo rules are added.
Cached input	500 tokens/turn	Repeated repo instructions may benefit.
Output	900 tokens/turn	Patch explanations and structured output are long.
Tools	Code Execution when enabled	Record tool fees and failed test loops.

Control rule: measure the cost of an accepted change, not the first response. Put failed tests, retries, human review minutes, and rollbacks in the same table.
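The gap between cost per turn and cost per accepted change can be made explicit. This is a small sketch using the table values; the number of accepted changes and the Code Execution calls per turn are hypothetical inputs, and human review time is left out.

```python
INPUT_PRICE, CACHED_PRICE, OUTPUT_PRICE = 1.25, 0.20, 2.50  # USD per 1M tokens
CODE_EXECUTION_FEE = 5.00                                    # USD per 1,000 calls
FRESH, CACHED, OUTPUT = 2500, 500, 900                       # tokens per turn


def cost_per_turn(exec_calls: float = 0.0) -> float:
    """USD for one coding turn, including optional Code Execution calls."""
    tokens = (
        FRESH * INPUT_PRICE + CACHED * CACHED_PRICE + OUTPUT * OUTPUT_PRICE
    ) / 1_000_000
    return tokens + exec_calls * CODE_EXECUTION_FEE / 1_000


def cost_per_accepted_change(
    turns: int, accepted_changes: int, exec_calls_per_turn: float = 0.0
) -> float:
    """Total spend over all turns, divided by the changes that were accepted."""
    if accepted_changes <= 0:
        raise ValueError("accepted_changes must be positive")
    return turns * cost_per_turn(exec_calls_per_turn) / accepted_changes


# Assumption: 5000 turns produce 2000 accepted changes, one execution per turn.
print(f"per turn:            ${cost_per_turn(1):.6f}")
print(f"per accepted change: ${cost_per_accepted_change(5000, 2000, 1):.6f}")
```

In this example each accepted change costs 2.5 times a single turn, because 5000 turns were needed for 2000 accepted changes.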

Research agent

A research agent is a workload where tokens look cheap and tools are expensive. Web Search, X Search, file search, and long evidence summaries dominate. It is also the area with the highest risk of mixing in stale facts.

Assumption	Example value	Cost implication
Reports	1000 reports/month	Even at low volume, each task is expensive.
Fresh input	4000 tokens/report	The query plan, evidence, and instructions are heavy.
Cached input	800 tokens/report	A reusable report scaffold may be cacheable.
Output	1500 tokens/report	Evidence packets and summaries are output-heavy.
Tools	Multiple Web Search or X Search calls	Tool calls dominate total cost.

Control rule: cap tool calls, require source quality, consider Batch for non-urgent work, and do not output claims that cannot cite a current official source.
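How much of a report's cost comes from search calls, and what a cap changes, can be sketched as follows. The function and the cap of 8 calls are hypothetical; the code only models billing and assumes the agent loop really stops searching once the cap is reached.

```python
INPUT_PRICE, CACHED_PRICE, OUTPUT_PRICE = 1.25, 0.20, 2.50  # USD per 1M tokens
SEARCH_FEE = 5.00                       # USD per 1,000 Web Search or X Search calls
FRESH, CACHED, OUTPUT = 4000, 800, 1500  # tokens per report, from the table


def report_cost(requested_searches: int, search_cap: int) -> dict[str, float]:
    """Per-report cost split, with search calls limited to `search_cap`."""
    billed_calls = min(requested_searches, search_cap)
    token_cost = (
        FRESH * INPUT_PRICE + CACHED * CACHED_PRICE + OUTPUT * OUTPUT_PRICE
    ) / 1_000_000
    tool_cost = billed_calls * SEARCH_FEE / 1_000
    total = token_cost + tool_cost
    return {
        "tokens": token_cost,
        "tools": tool_cost,
        "tool_share": tool_cost / total,
    }


capped = report_cost(requested_searches=12, search_cap=8)
print(f"tokens ${capped['tokens']:.5f}  tools ${capped['tools']:.5f}  "
      f"tool share {capped['tool_share']:.0%}")
```

Even with the cap, search calls make up roughly four fifths of the per-report cost in this example, so the cap is a stronger lever than trimming tokens.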

Rate limits and billing control

The xAI rate-limit docs explain that each API team has per-model RPS and TPM limits, and that the tier is based on cumulative API spend since 2026-01-01. TPM includes prompt, completion, reasoning, cached prompt, image, and audio tokens. The numbers on the model page are a useful reference, but for operations, check your own team console.
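Because TPM counts more than prompt and completion tokens, a headroom check should sum every category named above. The sketch below is illustrative only: `MinuteTokens`, `tpm_headroom`, and the limit of 1,000,000 are hypothetical, and the real limit has to come from your own team console.

```python
from dataclasses import dataclass, fields


@dataclass
class MinuteTokens:
    """Tokens observed in a one-minute window, by the categories TPM counts."""
    prompt: int = 0
    completion: int = 0
    reasoning: int = 0
    cached_prompt: int = 0  # cheaper to bill, but still counted toward TPM
    image: int = 0
    audio: int = 0

    def total(self) -> int:
        return sum(getattr(self, f.name) for f in fields(self))


def tpm_headroom(window: MinuteTokens, tpm_limit: int) -> float:
    """Unused fraction of the limit; a negative value means over the limit."""
    if tpm_limit <= 0:
        raise ValueError("tpm_limit must be positive")
    return 1 - window.total() / tpm_limit


window = MinuteTokens(
    prompt=300_000, completion=50_000, reasoning=20_000, cached_prompt=100_000
)
print(window.total(), f"{tpm_headroom(window, tpm_limit=1_000_000):.0%}")
```

Counting only prompt and completion here would report 350,000 tokens instead of 470,000, which overstates the remaining headroom.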

Billing management covers invoices, payment methods, prepaid credit balance, top-ups, historical usage, postpaid invoice preview, and spending limits. Run the first production test in this order.

1. Start with prepaid credits.
2. To stay close to prepaid-only, set the postpaid limit low or to 0.
3. Record token counts, cached token counts, model ID, tool calls, retries, errors, latency, and accepted results.
4. After a small sample, compare actual spend with the worksheet.
5. Raise the limit only after spend, quality, latency, and failure rate are stable.

Grok API spending control checklist, organizing docs verification, console credits, limits, logs, cache, Batch, and the old model warning.

Build the stop rule into code and operating procedures rather than waiting for the invoice. Stop the route when request volume, tool calls, retry rate, output tokens, or Priority use exceeds the worksheet thresholds.
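A stop rule of this kind can be a plain comparison of observed usage against worksheet thresholds. The sketch below is one way to write it; `RouteUsage`, the field names, and the threshold values are hypothetical, and how usage is collected is left to your own logging.

```python
from dataclasses import asdict, dataclass


@dataclass
class RouteUsage:
    """Counters for one route over the budget window (or their thresholds)."""
    requests: int
    tool_calls: int
    retry_rate: float       # retried requests / total requests
    output_tokens: int
    priority_requests: int


def breached_limits(usage: RouteUsage, limits: RouteUsage) -> list[str]:
    """Names of the metrics where observed usage exceeds the threshold."""
    observed, allowed = asdict(usage), asdict(limits)
    return [name for name, cap in allowed.items() if observed[name] > cap]


def should_stop(usage: RouteUsage, limits: RouteUsage) -> bool:
    """True as soon as any single threshold is exceeded."""
    return bool(breached_limits(usage, limits))


# Hypothetical worksheet thresholds and one observed window.
limits = RouteUsage(10_000, 2_000, 0.05, 5_000_000, 0)
usage = RouteUsage(8_000, 2_500, 0.08, 4_000_000, 0)
if should_stop(usage, limits):
    print("stop route:", ", ".join(breached_limits(usage, limits)))
```

Returning the breached metric names, not just a boolean, keeps the stop event explainable when someone later compares it with the invoice.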

Old model rows and the May 15 retirement

The May 15 retirement notice is the freshness warning for this topic. xAI explains that from May 15, 2026 onward, several retired slugs redirect to grok-4.3, and requests to deprecated slugs are billed at grok-4.3 pricing. Snippets centered on Grok 4.1 Fast, Grok 3, or old free credits cannot be used as budget inputs as they stand.

Claim you may see	How to treat it first	Safe action
Grok 4.1 Fast is the current cheap default	Stale until official docs or the console prove it.	Recheck the pricing page and model list.
Everyone gets free credits every month	Account-specific or provider-specific unless xAI Docs state it explicitly.	Check the credit balance and expiration.
Free usage in a provider calculator	A provider contract.	Separate it from the official xAI row.
grok-latest alias	Convenient, but it moves.	Pin the exact model ID in cost tests.

A successful request does not guarantee billing at the price you expected. Budget against the current official row and your own console behavior.
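Two cheap checks follow from this section: reject moving aliases in cost tests, and flag requests where the logged model differs from the requested one. This is a minimal sketch; the alias set comes from the model page names quoted earlier in this guide, the "-latest" suffix rule is an assumption, and where the served model ID is read from is left to your own logs.

```python
# Aliases named on the Grok 4.3 model page, per this guide.
KNOWN_ALIASES = {"grok-4.3-latest", "grok-latest"}


def is_pinned(model_id: str) -> bool:
    """False for known aliases and, as a heuristic, any '-latest' suffix."""
    return model_id not in KNOWN_ALIASES and not model_id.endswith("-latest")


def was_redirected(requested_model: str, served_model: str) -> bool:
    """True when the logged model differs from the one the request named."""
    return requested_model != served_model


for model_id in ("grok-4.3", "grok-latest"):
    print(model_id, "pinned" if is_pinned(model_id) else "moving alias")

# "retired-slug-example" is a placeholder, not a real model ID.
print(was_redirected("retired-slug-example", "grok-4.3"))
```

A redirect flag does not say the price is wrong, only that the budget row should be the served model's row rather than the requested one.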

A safe test plan

Before expanding Grok API usage, run a small test that is close to the real workload.

Step	Action	Pass signal
1. Pin model	Start from grok-4.3 or the exact row under test.	Logs show the expected model ID and team/account.
2. Set spend stop	Use prepaid credits and a low postpaid limit.	A runaway test cannot create a large invoice.
3. Run real sample	Use real prompts, retrieval, tools, and output format.	The sample is close to production.
4. Count accepted cost	Count accepted outputs, retries, tool calls, and review time.	Cost per accepted result is clear.
5. Compare alternatives	Try a lower row, Batch, cache, fewer tools, and shorter output.	The cheaper route passes the quality gate.
6. Scale gradually	Raise the limit after logs and worksheet agree.	Spend, quality, latency, and failure rate are stable.
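The pass signals for steps 1 and 6 can be checked mechanically. The sketch below is an illustration under assumptions: the function names are hypothetical and the 10% tolerance is an arbitrary example value, not a figure from xAI.

```python
def relative_gap(worksheet_usd: float, console_usd: float) -> float:
    """Signed gap between console spend and the worksheet estimate."""
    if worksheet_usd <= 0:
        raise ValueError("worksheet_usd must be positive")
    return (console_usd - worksheet_usd) / worksheet_usd


def may_raise_limit(
    worksheet_usd: float,
    console_usd: float,
    expected_model: str,
    logged_models: set[str],
    tolerance: float = 0.10,  # assumption: accept a gap of up to 10%
) -> bool:
    """True only if logs show just the pinned model and spend matches."""
    only_expected_model = logged_models == {expected_model}
    spend_matches = abs(relative_gap(worksheet_usd, console_usd)) <= tolerance
    return only_expected_model and spend_matches


print(may_raise_limit(20.0, 21.0, "grok-4.3", {"grok-4.3"}))
print(may_raise_limit(20.0, 26.0, "grok-4.3", {"grok-4.3"}))
```

The first call passes with a 5% gap and the second fails with a 30% gap. Quality, latency, and failure rate still need their own gates before scaling.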

Decisions about model IDs, aliases, migration, and rollout are covered separately in the Grok 4.3 API guide. This guide focuses on pricing, billing, and workload behavior.

Frequently asked questions

How much does the Grok API cost?

According to xAI Docs as checked on July 2, 2026, grok-4.3 costs $1.25 input, $0.20 cached input, and $2.50 output per 1M tokens. Real cost varies with output length, cache hits, tool calls, Batch, Priority, storage, retries, and account limits.

Does the Grok API have a free tier?

The public xAI Docs do not guarantee a permanent official free API tier. The Quickstart shows a flow where you load credits into your account before using the API. Account credits and provider free routes are separate from the official xAI price row.

Which model should I budget for first?

For general Grok API work, start from grok-4.3. If you try grok-build-0.1 or a grok-4.20 row, put quality, availability, and output behavior into the worksheet, not just price.

Why is cached input cheaper?

When repeated prompt content qualifies for cache behavior, the cached input row applies. Stable system prompts, policy blocks, and templates are candidates, but they do not become cheaper automatically.
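Since the discount depends on the measured hit rate, it helps to look at the blended input price rather than the cached row alone. This is a simple weighted average using the grok-4.3 prices quoted in this guide; the function name and the sample hit rates are illustrative.

```python
INPUT_PRICE, CACHED_PRICE = 1.25, 0.20  # USD per 1M tokens, grok-4.3 row


def blended_input_price(cache_hit_rate: float) -> float:
    """Effective USD per 1M tokens for cacheable prompt content.

    `cache_hit_rate` is the measured share of those tokens billed as cached.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return cache_hit_rate * CACHED_PRICE + (1 - cache_hit_rate) * INPUT_PRICE


for hit_rate in (0.0, 0.6, 1.0):
    print(f"hit rate {hit_rate:.0%}: ${blended_input_price(hit_rate):.2f} per 1M")
```

At a 60% hit rate the cacheable part of the prompt costs $0.62 per 1M tokens, about three times the cached row, so a worksheet that assumes $0.20 everywhere will undercount.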

Do tools change Grok API pricing?

Yes. Web Search, X Search, Code Execution, File Attachments search, and Collections Search/RAG are charged separately. If the workflow uses tools, include them in the formula.

When should I use the Batch API?

Consider it for work that does not need to be real-time. xAI indicates a 20%-50% discount for eligible text/language batch work, but image/video generation may remain at the standard rate.
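Because the discount is a range and may not cover every token, a Batch estimate is better kept as a low and high bound. The sketch below assumes the 20%-50% range applies only to the eligible share of spend; `eligible_share` and the function name are hypothetical.

```python
def batch_cost_range(
    standard_cost: float, eligible_share: float = 1.0
) -> tuple[float, float]:
    """(lowest, highest) Batch estimate in USD.

    `eligible_share` is the fraction of standard spend that is eligible
    text/language work; the rest is assumed to stay at the standard rate.
    """
    if not 0.0 <= eligible_share <= 1.0:
        raise ValueError("eligible_share must be between 0 and 1")
    lowest = standard_cost * (1 - 0.50 * eligible_share)   # 50% discount
    highest = standard_cost * (1 - 0.20 * eligible_share)  # 20% discount
    return lowest, highest


low, high = batch_cost_range(200.0)
print(f"all eligible:  ${low:.2f} to ${high:.2f}")
low, high = batch_cost_range(200.0, eligible_share=0.5)
print(f"half eligible: ${low:.2f} to ${high:.2f}")
```

Budgeting against the high bound until console logs show the actual discount keeps the worksheet on the safe side.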

What is easiest to overlook?

Tool calls, retries, long output, the Priority multiplier, file/collection storage, retired slug redirects, and the postpaid limit. In practice, measuring accepted result cost is safer than chasing the lowest token price.