Use POST /v1/chat/compare when you want to run the same conversation across multiple models and inspect the results side by side.
partial: true flag.skip_comparison: true, you can skip the synthesis step and receive only the raw model outputs. This is useful for parallel streaming UIs that perform their own comparison.stream: true. With synthesis enabled, fan-out is non-streaming, but the final comparison text is streamed token-by-token. If skip_comparison: true is set, each fan-out model streams its tokens in real-time concurrently, tagged by model name.The response includes:
Set "stream": true to receive a text/event-stream with typed events. There are two streaming modes:
skip_comparison: false, default)Fan-out models are non-streaming (full response collected per model), then the comparison LLM streams token-by-token.
skip_comparison: true)Each fan-out model streams tokens in real time concurrently, tagged by model name. No comparison LLM is called.
client.compare.create(...)client.compare.create(...)client.Compare.Create(...)client.compare().create(...)Streaming compare is also available in the SDKs through their compare stream methods when you want incremental events instead of a single final JSON response.