Cross

dillfrescott

AI & ML interests

AI, anime, computers

Recent Activity

liked a model about 11 hours ago
MiniMaxAI/MiniMax-Text-01
liked a Space 3 days ago
hexgrad/Kokoro-TTS
liked a model 4 days ago
unsloth/phi-4-GGUF

Organizations

The Waifu Research Department

dillfrescott's activity

reacted to Jaward's post with 👍❤️ 17 days ago
nanoBLT: A simplified, lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 layers deep (n_layers_encoder, n_layers_latent, n_layers_decoder) and was trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
reacted to hexgrad's post with 🤗👍 18 days ago
Tonight, Adam & Michael join the 82M Apache TTS model in hexgrad/Kokoro-82M
reacted to AdinaY's post with 🤗👍 20 days ago
The Chinese community is shipping 🚒

DeepSeek V3 (685B MoE) has been quietly released on the Hub!
Base: deepseek-ai/DeepSeek-V3-Base
Instruct: deepseek-ai/DeepSeek-V3

Can’t wait to see what’s next!