Found 2 repositories(showing 2)
thunlp
Source codes for paper "BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity".
amadeobonde
Research prototype: BlockFFN + Mamba hybrid with routing-gated sequence mixing
All 2 repositories loaded