Ken Tsui PRO

kenhktsui

AI & ML interests

ML Engineer Lead. Researcher on Small Language Model - Building Classifiers to Find High Quality Data/ Reasoning Benchmark/ Synthetic Data

Recent Activity

liked a model about 19 hours ago

Qwen/Qwen2.5-Math-PRM-7B

published an article 2 days ago

Embodied AI == Unlimited Training Data

upvoted a paper 6 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Articles

Embodied AI == Unlimited Training Data

2 days ago

• 2

∞🧙🏼‍♂️AnyClassifier - Generating Synthetic Data For Text Classification

Aug 19, 2024

• 8

Low Latency CPU Based Educational Value Classifier With Generic Educational Value

Jun 12, 2024

• 9

Organizations

kenhktsui's activity

liked a model about 19 hours ago

Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated about 6 hours ago • 391 • 26

published an article 2 days ago

Article

Embodied AI == Unlimited Training Data

•

2 days ago

• 2

upvoted a paper 6 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 7 days ago • 218

liked a model 7 days ago

microsoft/phi-4

Text Generation • Updated 7 days ago • 72.3k • 1.28k

liked a model 15 days ago

Qwen/Qwen2.5-Math-7B-Instruct

Text Generation • Updated Sep 23, 2024 • 29.7k • 43

liked a dataset 15 days ago

tasksource/PRM800K

Preview • Updated May 31, 2023 • 96 • 22

updated a collection 16 days ago

LongTalk

Collection

A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training • 5 items • Updated 16 days ago

updated a model 16 days ago

kenhktsui/llama3.1-8b-instruct-thinking-sft-merged-gguf

Updated 16 days ago • 34 • 1

liked a model 16 days ago

kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged-gguf

Updated 16 days ago • 87 • 1

updated a model 16 days ago

kenhktsui/llama3.1-8b-instruct-thinking-sft-merged

Text Generation • Updated 16 days ago • 33

liked 3 datasets 16 days ago

updated a collection 16 days ago

LongTalk

Collection

A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training • 5 items • Updated 16 days ago

updated 2 models 16 days ago

kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged

Text Generation • Updated 16 days ago • 35

kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged-gguf

Updated 16 days ago • 87 • 1

updated a dataset 16 days ago

kenhktsui/longtalk-cot-v0.1

Viewer • Updated 16 days ago • 61.2k • 163 • 11