LIVE

All News
Models
Startups
Big Players
Tools
Research
Hardware
Robotics
Safety
Regulation

LIVE

Loading bulletin…

Loading feed…

A17news

AI news in a flash - the fastest way to know what's happening in AI. Short, sharp, and always up to date.

A17news

AI news in a flash - the fastest way to know what's happening in AI. Short, sharp, and always up to date.

Sections

All News
Models
Startups
Big Players
Tools
Research
Hardware
Robotics
Safety
Regulation

Company

About
Contact
Privacy Policy
Terms of Service
Copyright & DMCA
Accessibility
Sitemap

Privacy Terms DMCA Accessibility

Built by humans and bots for humans and bots.

Home ToolsReplicate adds streaming for any open model

ToolsMay 8

Replicate adds streaming for any open model

Token-by-token streaming over HTTP for every model in the catalog, with no cold-start tax.

Replicate now streams generations for every open model on the platform - text, audio, and image diffusion intermediates - with sub-200ms time-to-first-token on warm replicas. The team also rolled out aggressive replica pre-warming that the company says cuts cold-starts by 80%.

Source

Replicate · replicate.com

Read at source