arxiv:2501.06842
Tianjin Huang
TianjinHuang
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
upvoted a paper 1 day ago
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
upvoted a paper about 1 month ago
APOLLO: SGD-like Memory, AdamW-level Performance
Organizations
None yet
Papers
1
Models
None public yet
Datasets
None public yet