Musings by Chris Hayduk
Subscribe
Sign in
The Sciences
Understanding DeepSeek Part I: DeepSeekMoE
Chris Hayduk
Jan 30
4
2
Mixture of experts models with a twist
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Understanding DeepSeek Part I: DeepSeekMoE
Mixture of experts models with a twist