novita

qwen-image-txt2img

Context

Input	text
Output	image
Tool calling	Not listed

About this model

Qwen-Image-Txt2Img is a 14B-parameter diffusion model specialized in high-quality text-to-image generation with enhanced prompt understanding and stylistic versatility. Part of the Qwen multimodal family, it features advanced composition control, style adaptation, and detail preservation across diverse visual domains.

Capabilities

text	Available through the unified API
image	Available through the unified API

Quick start

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "5e7b17e3a7cb4f15bf7d8c56037e18c1",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
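
The same request can be sketched in Python with only the standard library. The endpoint, model ID, headers, and message body are copied from the curl example above; the helper names `build_request` and `send` are hypothetical, and the response is returned as raw JSON because its shape for this image model is not described on this page.

```python
import json
import os
import urllib.request

# Endpoint and model ID copied from the curl example above.
API_URL = "https://api.oneinfer.ai/v1/chat/completions"
MODEL_ID = "5e7b17e3a7cb4f15bf7d8c56037e18c1"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the same POST request as the curl example."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON body without assuming its fields."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    api_key = os.environ.get("ONEINFER_API_KEY")
    if api_key:
        print(json.dumps(send(build_request("Hello!", api_key)), indent=2))
    else:
        print("Set ONEINFER_API_KEY to send the request.")
```

Keeping request construction separate from sending makes it possible to inspect the payload and headers before any network call is made.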

Providers

Available routing options for this model through OneInfer.

Provider	Model ID	Status	Input	Output	Routing
novita	5e7b17e3a7cb4f15bf7d8c56037e18c1	Available	$0.000 / 1M	$0.000 / 1M	OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage	Price
Input tokens	$0.000 / 1M
Output tokens	$0.000 / 1M
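
As a minimal sketch of how the per-1M rates in the table translate into a charge, the function below multiplies token counts by the listed prices and divides by one million. The function name `estimate_cost` is hypothetical, and it assumes billing is purely per token as the table suggests; whether image outputs are actually metered in tokens is not stated here.

```python
from decimal import Decimal

TOKENS_PER_UNIT = 1_000_000  # prices in the table are quoted per 1M tokens

# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_PRICE_PER_1M = Decimal("0.000")
OUTPUT_PRICE_PER_1M = Decimal("0.000")


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: Decimal = INPUT_PRICE_PER_1M,
    output_price: Decimal = OUTPUT_PRICE_PER_1M,
) -> Decimal:
    """Return the estimated charge in USD for one request."""
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # With the listed $0.000 rates, any request is estimated at zero cost.
    print(estimate_cost(1_000, 500))
```

Passing different `input_price` and `output_price` values lets the same arithmetic be reused if the listed rates change.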

Performance

Published evaluation results associated with this model.

Image Quality

Metric	Score
FID-30K	12.8
CLIP Score	33.2
Inception Score	192.4
Precision	87.6
Recall	85.3

Prompt Understanding

Metric	Score
Attribute Binding	89.4
Spatial Reasoning	84.7
Complex Prompt Accuracy	86.9
Style Adaptation	91.2

Domain Specialization

Domain	Score
Photorealistic	90.8
Artistic Styles	92.4
Concept Art	88.9
Technical Illustration	85.7
Product Visualization	87.3
