jmyoon's Collections
nubbury
updated
StarCoder 2 and The Stack v2: The Next Generation
Paper • 2402.19173 • Published • 137
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 53
Simple linear attention language models balance the recall-throughput tradeoff
Paper • 2402.18668 • Published • 19
Priority Sampling of Large Language Models for Compilers
Paper • 2402.18734 • Published • 17
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 607
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper • 2402.17193 • Published • 23
Towards Optimal Learning of Language Models
Paper • 2402.17759 • Published • 16
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation
Paper • 2402.17245 • Published • 10
Disentangled 3D Scene Generation with Layout Learning
Paper • 2402.16936 • Published • 10
VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction
Paper • 2402.17427 • Published • 9