Second Major Open Release of the Day - MiniMax M3

TG AI News·June 12, 2026 at 3:20 PM·
Trusted Source
It turned out that the model has 428 billion parameters, with 23B active, quite small compared to competitors. The main innovation of the model is yet another variant of sparse attention, MSA (MiniMax Sparse Attention), which is noticeably more efficient than GQA in large contexts.
Second Major Open Release of the Day - MiniMax M3 | AI News | AIventa