A 22B-active mixture-of-experts model with Apache 2.0 weights, approaching GPT-4-class quality on internal benchmarks.
Mistral has open-sourced Mixtral 200B, a sparse MoE that activates 22B parameters per token, under the Apache 2.0 license. Mistral's internal benchmarks place it within 2 points of GPT-4-Turbo on MMLU-Pro and ahead of it on multilingual reasoning. The weights are available on Hugging Face, with vLLM and TGI support out of the box; a minimal serving sketch follows.
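For readers who want to try the release, here is a minimal sketch of serving it with vLLM's offline API, under stated assumptions: the Hugging Face repo id `mistralai/Mixtral-200B` and the `tensor_parallel_size` value are placeholders, not confirmed details from the announcement.

```python
# Minimal vLLM serving sketch. The repo id is an assumption;
# substitute the actual Hugging Face path from the release notes.
from vllm import LLM, SamplingParams

llm = LLM(
    model="mistralai/Mixtral-200B",  # hypothetical repo id
    tensor_parallel_size=8,          # assumption: a model this size needs multi-GPU sharding
)

params = SamplingParams(temperature=0.7, max_tokens=256)
outputs = llm.generate(["Summarize sparse mixture-of-experts routing."], params)
print(outputs[0].outputs[0].text)
```

On the TGI side, pointing `text-generation-launcher --model-id` at the same repo should work, assuming the installed version supports the architecture.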