OpenAI-compatible API

Ynnova Inference

High-throughput LLM inference endpoint.
OpenAI-compatible · GPU-accelerated · Low latency.

Interface
OpenAI /v1
Throughput
140+ tok/s
TTFT P50
~430 ms
Provider
OpenRouter
POST https://api.ynnova.eu/v1/chat/completions
Systems operational