Model discovery JSON can be corrupted by the inference proxy's SSE streaming path

### Problem Statement

`GET /v1/models` (model discovery) returns a single JSON model list, but the sandbox inference proxy routes it through the Server-Sent Events (SSE) streaming path. When that path truncates a response at the streaming size cap or the idle timeout, it appends an SSE error frame to the body. That corrupts a payload the client parses as one JSON object.
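The failure mode can be sketched in a few lines. This is a simplified model of the behavior described above, not the proxy's actual code: `relay_streaming`, the chunking, and the text of `SSE_ERROR_FRAME` are all hypothetical, and the idle-timeout case is omitted.

```rust
/// Hypothetical SSE error frame. The real frame text is not given in this issue.
const SSE_ERROR_FRAME: &str = "event: error\ndata: {\"error\":\"response truncated\"}\n\n";

/// Simplified stand-in for the streaming path: relay upstream chunks until
/// the next chunk would exceed `cap` bytes, then report the truncation
/// in-band by appending an SSE error frame to whatever was already relayed.
fn relay_streaming(chunks: &[&str], cap: usize) -> String {
    let mut body = String::new();
    for chunk in chunks {
        if body.len() + chunk.len() > cap {
            // In-band error signalling is fine for an SSE client, but a JSON
            // client sees a partial object followed by non-JSON text.
            body.push_str(SSE_ERROR_FRAME);
            return body;
        }
        body.push_str(chunk);
    }
    body
}

/// Example upstream body for a model list, split into arbitrary chunks.
fn model_list_chunks() -> Vec<&'static str> {
    vec![
        "{\"object\":\"list\",\"data\":[",
        "{\"id\":\"model-a\"},",
        "{\"id\":\"model-b\"}]}",
    ]
}
```

With a generous cap the relayed body is the original JSON document. With a cap that trips partway through, the body is a JSON prefix followed by `event: error ...`, which a client cannot parse as a single JSON object.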

### Proposed Design

Make response framing a property of the inference protocol. Add a `ResponseFraming` field to `InferenceApiPattern`, set once per pattern in `default_patterns`. `model_discovery` and `openai_embeddings` are `Buffered`; the SSE protocols (chat completions, completions, responses, Anthropic messages) stay `Streaming`.
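A minimal sketch of the shape this could take. Only `ResponseFraming`, `InferenceApiPattern`, `default_patterns`, `model_discovery`, `openai_embeddings`, and the `Buffered`/`Streaming` split come from this proposal. The struct's other fields, the remaining protocol names, the route table, and the `framing_for` helper are assumptions for illustration and may not match the existing code.

```rust
/// How the proxy frames the upstream response for the client.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ResponseFraming {
    /// Read the whole body first, then forward it (or fail cleanly).
    Buffered,
    /// Forward chunks as they arrive; errors are reported in-band as SSE frames.
    Streaming,
}

/// Assumed layout: the real struct likely has different or additional fields.
#[allow(dead_code)]
#[derive(Debug, Clone)]
struct InferenceApiPattern {
    method: &'static str,
    path: &'static str,
    protocol: &'static str,
    framing: ResponseFraming,
}

/// Framing is declared once per pattern, next to the route it belongs to.
fn default_patterns() -> Vec<InferenceApiPattern> {
    use ResponseFraming::{Buffered, Streaming};
    let p = |method, path, protocol, framing| InferenceApiPattern { method, path, protocol, framing };
    vec![
        p("GET", "/v1/models", "model_discovery", Buffered),
        p("POST", "/v1/embeddings", "openai_embeddings", Buffered),
        // Protocol names below are hypothetical labels for the SSE protocols.
        p("POST", "/v1/chat/completions", "openai_chat_completions", Streaming),
        p("POST", "/v1/completions", "openai_completions", Streaming),
        p("POST", "/v1/responses", "openai_responses", Streaming),
        p("POST", "/v1/messages", "anthropic_messages", Streaming),
    ]
}

/// Hypothetical lookup: exact method and path match, no wildcards.
fn framing_for(method: &str, path: &str) -> Option<ResponseFraming> {
    default_patterns()
        .into_iter()
        .find(|p| p.method == method && p.path == path)
        .map(|p| p.framing)
}
```

Keeping the field on the pattern means the relay code branches on one value instead of re-deriving framing from paths or headers.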

### Alternatives Considered

Inspect the request's `stream` flag to choose framing per request. Deferred: this would also let non-streaming chat and completion responses be served buffered, but it is a larger change.
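For comparison, the deferred alternative might look roughly like the following. This is a sketch only: `framing_for_request` is a hypothetical name, and it assumes the proxy has already parsed the request body far enough to read an optional boolean `stream` field, treating an absent flag as non-streaming.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ResponseFraming {
    Buffered,
    Streaming,
}

/// Choose framing per request rather than per pattern.
/// `pattern_default` is the framing the protocol would get statically;
/// `stream_flag` is the request's `stream` field, if present.
fn framing_for_request(
    pattern_default: ResponseFraming,
    stream_flag: Option<bool>,
) -> ResponseFraming {
    match (pattern_default, stream_flag) {
        // An SSE-capable protocol only streams when the client asked for it.
        (ResponseFraming::Streaming, Some(true)) => ResponseFraming::Streaming,
        // Assumption: a missing or false flag means a single JSON response.
        (ResponseFraming::Streaming, _) => ResponseFraming::Buffered,
        // Protocols that never stream stay buffered regardless of the flag.
        (ResponseFraming::Buffered, _) => ResponseFraming::Buffered,
    }
}
```

The extra cost is that the proxy must inspect the request body before it can pick a path, which is part of why this is the larger change.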

### Agent Investigation

_No response_

### Checklist

- [x] I've reviewed existing issues and the architecture docs
- [x] This is a design proposal, not a "please build this" request

Model discovery JSON can be corrupted by the inference proxy's SSE streaming path #1772
