Research Question

Architectural details

Essence:

Mixtral uses the same modifications as described in Mistral 7B with the notable exceptions that

Sparse Mixture of Experts

Inference

Results

Comparison to Llama2

image.png

Size and Efficiency