High-throughput LLM inference endpoint. OpenAI-compatible · GPU-accelerated · Low latency.
POST https://api.ynnova.eu/v1/chat/completions
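Since the endpoint is OpenAI-compatible, a request can be built with the standard chat-completions body shape. A minimal sketch, assuming a placeholder model name (`example-model`) and a bearer-token API key — substitute whatever your deployment actually exposes:

```python
import json

# Endpoint from the text above.
API_URL = "https://api.ynnova.eu/v1/chat/completions"

def build_chat_request(model, messages, **params):
    """Build an OpenAI-compatible chat-completions request body."""
    body = {"model": model, "messages": messages}
    body.update(params)  # optional fields, e.g. temperature, max_tokens, stream
    return body

headers = {
    "Authorization": "Bearer $YOUR_API_KEY",  # placeholder, not a real key
    "Content-Type": "application/json",
}

payload = build_chat_request(
    "example-model",  # assumed model name; list available models via your deployment
    [{"role": "user", "content": "Hello!"}],
    temperature=0.7,
)

print(json.dumps(payload))
# Send by POSTing `payload` as JSON to API_URL with `headers`,
# e.g. requests.post(API_URL, headers=headers, json=payload)
```

Any OpenAI-compatible client should also work by pointing its base URL at `https://api.ynnova.eu/v1`.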