GoofyLM Lab

community

Verified

https://goofylm.site/

https://github.com/GoofyLM

Activity Feed Request to join this org

AI & ML interests

Making funny and goofy LM's and AI's

Recent Activity

FlameF0X submitted a paper 5 days ago

Triplet-Block Diffusion RWKV

aquiffoo updated a collection 10 months ago

FlameF0X updated a model 10 months ago

GoofyLM/N2.1-Eye-1.3B

View all activity

FlameF0X

submitted a paper to Daily Papers 5 days ago

Triplet-Block Diffusion RWKV

Paper • 2605.25969 • Published 8 days ago • 20

FlameF0X

posted an update 16 days ago

Post

215

I did some testing on the scalability of FWKV. It hits a speed bottleneck at 1B due to the T4’s bandwidth limitations. Theoretically, it should match RWKV’s inference speed if the GPU had more bandwidth. So the 1B size is not accurate.

FlameF0X

posted an update 18 days ago

Post

270

Greetings Hugging Face!

I started a new project called **FWKV** (Feed-forward Weighted Key Value, or Floored Weighted Key Value), a RWKV-style LM that uses FFNNs (Feed-Forward Neural Networks) instead of RNN and floor(W·K·V). I'm hoping to make it much more efficient and scalable than RWKV.

So far I have:

- FlameF0X/FWKV-29M — this one is undertrained and doesn't have a Space yet. In the attached image you can see its speed on a T4 compared to models with the same configuration.

The only model that's fully working right now is:
- FlameF0X/FWKV-TinyStories — trained on TinyStories for one epoch. The demo Space is FlameF0X/FWKV-demo.

2 replies

FlameF0X

posted an update 10 months ago

Post

4372

I am very sad to say that the budget in creating of SnowflakeCore-G1 1b and 7b MoE models ran out and I can't pre-train them anymore.

7 replies

aquiffoo

updated a collection 10 months ago

Nx

Collection

Main series of models by GoofyLM. • 6 items • Updated Aug 9, 2025

FlameF0X

updated 3 models 10 months ago

published a model 10 months ago

GoofyLM/N2.3-Eye-1.3B-DEV

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 5 • 2

FlameF0X

posted an update 10 months ago

Post

819

the training for SnowflakeCore-G1-1B and 7B would be retaken because now I implemented DeepSpeed and management to use two gpus.

FlameF0X

published a model 10 months ago

GoofyLM/N2.2-Eye-1.3B

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 8 • 2

FlameF0X

updated a collection 10 months ago

Nx

Collection

Main series of models by GoofyLM. • 6 items • Updated Aug 9, 2025

aquiffoo

updated a model 10 months ago

GoofyLM/N2.1-Eye-1.3B

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 6 • 3

FlameF0X

updated a collection 10 months ago

Nx

Collection

Main series of models by GoofyLM. • 6 items • Updated Aug 9, 2025

FlameF0X

in GoofyLM/N2.1-Eye-1.3B 10 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

SFconvertbot

FlameF0X

published a model 10 months ago

GoofyLM/N2.1-Eye-1.3B

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 6 • 3

FlameF0X

posted an update 10 months ago

Post

277

The development of SnowflakeCore-G1-7B-MoE it getting delay. In the mean time I am working on SnowflakeCore-G1-1B-MoE witch would be a pre-train chatbot.

1 reply

FlameF0X

posted an update 10 months ago

Post

2958

The development of SnowflakeCore-G1-7B-MoE. I can't say when it would be publish yet because it's big and it requires a lot of computational power.

1 reply

FlameF0X

posted an update 11 months ago

Post

293

I just finished the benchmarks for https://huggingface.co/FlameF0X/SnowflakeCore-G1-Tiny and https://huggingface.co/FlameF0X/SnowflakeCore-G1-Tiny2 in comparation with openai-community/gpt2 .

FlameF0X

posted an update 11 months ago

Post

315

Hello! Important announcement, I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2 because it's going to have the same parameters as the Tiny version, but this one is trained on more data.

1 reply

AI & ML interests

Recent Activity

Team members 3

GoofyLM's activity

Adding `safetensors` variant of this model