Mistral Large 2

Mistral AI

model

Mistral Large 2 is a large language model from Mistral AI, launched on July 24, 2024. It is designed to excel at code generation, mathematics, reasoning, and multilingual tasks.

Architecture: Mistral Large 2 uses a Transformer decoder architecture. It employs a “dense” neural network in which every part of the network is connected.
Parameters: It has 123 billion parameters, enabling it to handle complex language tasks with high accuracy. This size lets the model handle complex language tasks with great nuance. Mistral AI designed the model size so that it can operate at scale on a single node.
Context window: It has a context window of 128,000 tokens, which helps maintain coherence and relevance across long conversations or documents.
Multilingual support: Mistral Large 2 supports many languages, including Russian, Chinese, Japanese, Korean, Spanish, and Italian.
Programming languages: It excels in more than 80 programming languages, such as Python, Java, C, C++, and JavaScript.
Performance: Mistral Large 2 shows strong performance in various benchmarks and competes with models like OpenAI’s GPT-4o and Meta’s Llama 3 405B. It does well on Wild Bench, where it placed second behind GPT-4o. On Arena Hard it placed third, behind GPT-4o and Claude 3.5 Sonnet.
Function calling: Mistral Large 2 outperforms larger models, such as GPT-4o and Claude 3.5 Sonnet, at function calling.
Efficiency: Mistral Large 2 sets a new standard for the performance/price ratio, delivering great performance at an affordable price.
Reduced hallucinations: Mistral AI has focused on minimizing inaccuracies by adding stricter accuracy checks and feedback systems to ensure the model provides reliable information. Mistral claims that Large 2 produces more concise responses than leading AI models.
Licensing: Mistral Large 2 is available under the Mistral Research License for open-source use and modifications for research and non-commercial purposes. A Mistral Commercial License is required for commercial use.

Podobné služby

Open AI Sora

OpenAI

Sora is OpenAI's advanced AI for generating realistic videos. It can create complex scenes with characters, motion, and detail, while preser...

model videos

OpenAI DALL-E 3

OpenAI

DALL-E is an artificial intelligence model from OpenAI that generates images from text prompts. It creates original, creative visuals based ...

images model

Flux.1

Black Forest Labs

2024/08

Flux.1 is an advanced AI model for generating images from text prompts, developed by Black Forest Labs. With 12 billion parameters it delive...

model

GPT-4o

OpenAI

2024/06

images model texts

OpenAI o1

OpenAI

2024/12

A model designed for advanced reasoning and solving complex problems, particularly in science, mathematics, and programming.

model texts

OpenAI o3

OpenAI

2025/02

A model focused on improved capabilities in coding, mathematics, and the natural sciences.

model texts

DeepSeek-LLM

DeepSeek

2024/11

A large language model designed to generate human-like text and conduct context-aware dialogues, suitable for chatbots and customer service.

model texts

DeepSeek-V2

DeepSeek

2024/05

A model with a Mixture-of-Experts (MoE) architecture, optimized for efficient training and inference.

model texts