gemini-2.5-flash-image-preview
| Context | Input | Output | Tool calling |
|---|---|---|---|
| 32K | text, image | image | Supported |
About this model
Gemini 2.5 Flash Image Preview is an efficient multimodal model from Google, optimized for fast image understanding and analysis. It pairs rapid processing with strong visual reasoning and handles combined image-text input within its 32K context window, which suits real-time applications.
Capabilities
| Capability | Availability |
|---|---|
| text | Available through the unified API |
| image | Available through the unified API |
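Because the model accepts both text and image input, a single user message could carry both. The sketch below only builds such a payload and does not send it. It assumes OpenAI-style content parts (`type: "text"` and `type: "image_url"` with a base64 data URL), which this page does not confirm for OneInfer; the helper names `image_message` and `build_payload` are hypothetical.

```python
import base64
import json

# Model ID as it appears in this page's curl examples.
MODEL_ID = "dc5fa6dd92a1404ba8457e4663a8a703"


def image_message(prompt: str, image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build one user message holding a text part and an inline image part.

    ASSUMPTION: the endpoint accepts OpenAI-style content parts with a
    base64 data URL. This page does not document the image input format.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {
                "type": "image_url",
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }


def build_payload(prompt: str, image_bytes: bytes) -> dict:
    """Wrap the multimodal message in a chat-completions style request body."""
    return {"model": MODEL_ID, "messages": [image_message(prompt, image_bytes)]}
```

The resulting dictionary can be serialized with `json.dumps` and posted to the chat completions endpoint in place of the text-only body, provided the assumed message format holds.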
Quick start
curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dc5fa6dd92a1404ba8457e4663a8a703",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
Providers
Available routing options for this model through OneInfer.
| Provider | Input | Output | Routing |
|---|---|---|---|
| dc5fa6dd92a1404ba8457e4663a8a703 | $0.000 / 1M | $0.000 / 1M | OneInfer optimized |
Pricing
Current OneInfer pricing for this model.
| Usage | Price |
|---|---|
| Input tokens | $0.000 / 1M |
| Output tokens | $0.000 / 1M |
Performance
Published evaluation results associated with this model.
Vision-Language Understanding
| Benchmark | Score |
|---|---|
| MMMU | 68.9 |
| ScienceQA-IMG | 82.7 |
| VQAv2 | 76.4 |
| TextVQA | 71.8 |
| ChartQA | 73.9 |
Multimodal Reasoning
| Benchmark | Score |
|---|---|
| Visual-CommonSense | 78.6 |
| Spatial-Understanding | 72.4 |
| Object-Interaction | 75.8 |
| Scene-Understanding | 79.3 |
Technical Performance
| Benchmark | Score |
|---|---|
| Context-Processing | 88.7 |
| Batch-Efficiency | 91.5 |
| Memory-Optimization | 89.3 |
| Scalability | 92.8 |
Real-time Applications
| Benchmark | Score |
|---|---|
| Live-Image-Analysis | 85.4 |
| Document-Processing | 81.9 |
| Visual-Search | 83.7 |
| Content-Moderation | 87.2 |
Enterprise Performance
| Benchmark | Score |
|---|---|
| Business-Document | 79.8 |
| Product-Identification | 84.6 |
| Quality-Control | 82.1 |
| Workflow-Integration | 88.9 |
API example
curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dc5fa6dd92a1404ba8457e4663a8a703",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
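The same request can be sketched in Python using only the standard library. The URL, headers, model ID, and message body mirror the curl example; the function names `build_request` and `send` are hypothetical, and the response is returned as raw decoded JSON because this page does not document its shape.

```python
import json
import urllib.request

# Endpoint and model ID copied from the curl example.
API_URL = "https://api.oneinfer.ai/v1/chat/completions"
MODEL_ID = "dc5fa6dd92a1404ba8457e4663a8a703"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build the same POST request as the curl example, without sending it."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response body.

    The response schema is not described on this page, so the parsed
    JSON is returned unchanged for the caller to inspect.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call such as `send(build_request("Hello!", os.environ["ONEINFER_API_KEY"]))` (with `import os`) would perform the request. Keeping request construction separate from sending makes the payload easy to inspect or test without network access.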