47 30 152

Asankhaya Sharma

codelion

http://asankhaya.github.io/

AI & ML interests

AI/ML, Dev Tools and Application Security

Recent Activity

upvoted a paper about 11 hours ago

Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series

liked a model about 11 hours ago

NovaSky-AI/Sky-T1-32B-Preview

upvoted a paper about 11 hours ago

Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning

View all activity

Organizations

codelion's activity

upvoted 2 papers about 11 hours ago

Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series

Paper • 2401.03955 • Published Jan 8, 2024 • 7

Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning

Paper • 2412.08587 • Published Dec 11, 2024 • 1

upvoted 2 papers 1 day ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 7 days ago • 30

Adaptive Margin Global Classifier for Exemplar-Free Class-Incremental Learning

Paper • 2409.13275 • Published Sep 20, 2024 • 1

upvoted 2 papers 2 days ago

Protoformer: Embedding Prototypes for Transformers

Paper • 2206.12710 • Published Jun 25, 2022 • 1

Overcoming catastrophic forgetting in neural networks

Paper • 1612.00796 • Published Dec 2, 2016 • 1

upvoted a paper 5 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 7 days ago • 218

upvoted a paper about 2 months ago

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator

Paper • 2312.04474 • Published Dec 7, 2023 • 30

upvoted 2 collections 3 months ago

MobileLLM

Collection

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 102

indic-evals

Collection

Translated versions of popular LLM benchmarks. • 4 items • Updated Oct 23, 2024 • 2

upvoted a paper 3 months ago

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 139

upvoted 4 papers 4 months ago

upvoted 2 articles 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 216

Article

Faster fine-tuning using TRL & Unsloth

Jan 10, 2024

• 44

upvoted 3 papers 4 months ago

Patched RTC: evaluating LLMs for diverse software development tasks

Paper • 2407.16557 • Published Jul 23, 2024 • 1

Evaluating Pre-trained Language Models for Repairing API Misuses

Paper • 2310.16390 • Published Oct 25, 2023 • 1

Patched MOA: optimizing inference for diverse software development tasks

Paper • 2407.18521 • Published Jul 26, 2024 • 1