I did some testing on the scalability of FWKV. It hits a speed bottleneck at 1B due to the T4โs bandwidth limitations. Theoretically, it should match RWKVโs inference speed if the GPU had more bandwidth. So the 1B size is not accurate.
I started a new project called **FWKV** (Feed-forward Weighted Key Value, or Floored Weighted Key Value), a RWKV-style LM that uses FFNNs (Feed-Forward Neural Networks) instead of RNN and floor(WยทKยทV). I'm hoping to make it much more efficient and scalable than RWKV.
So far I have:
- FlameF0X/FWKV-29M โ this one is undertrained and doesn't have a Space yet. In the attached image you can see its speed on a T4 compared to models with the same configuration.
The development of SnowflakeCore-G1-7B-MoE it getting delay. In the mean time I am working on SnowflakeCore-G1-1B-MoE witch would be a pre-train chatbot.
Hello! Important announcement, I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2 because it's going to have the same parameters as the Tiny version, but this one is trained on more data.