Steve Li
CHNtentes
AI & ML interests
None yet
Recent Activity
new activity 1 day ago in Qwen/Qwen2.5-Coder-32B-Instruct: Thieves!
new activity 5 days ago in unsloth/DeepSeek-V3-GGUF: why use q5 for key cache?
new activity 5 days ago in bullerwins/DeepSeek-V3-GGUF: I can't run this either. What on earth are you all running it on? Good grief.
Organizations
None yet
CHNtentes's activity
Thieves! (6) · #36 opened 2 days ago by supercharge19
why use q5 for key cache? (1) · #7 opened 5 days ago by CHNtentes
I can't run this either. What on earth are you all running it on? Good grief. (5) · #8 opened 8 days ago by ss996
Can we use "temporal tiling support" with these gguf models? (1) · #10 opened 16 days ago by CHNtentes
minimum vram? (11) · #9 opened 20 days ago by CHNtentes
GGUF weights? (7) · #1 opened 22 days ago by luijait
How was r7b? (6) · #3 opened about 1 month ago by MRU4913
transformers version? (1) · #5 opened about 2 months ago by CHNtentes
Q4_0, Q4_1, Q5_0, Q5_1 can be dropped? (1) · #1 opened 2 months ago by CHNtentes
Where is 't5xxl.safetensors'? (4) · #12 opened 2 months ago by ajavamind
Hardware requirements (6) · #10 opened 4 months ago by ZahirHamroune
T4 - bfloat 16 not support (10) · #2 opened 4 months ago by SylvainV
🚩 Report: Spam · #150 opened 4 months ago by CHNtentes
Is it using ggml to compute? (1) · #30 opened 5 months ago by CHNtentes
For the fastest inference on 12GB VRAM, are the following GGUF models appropriate to use? (3) · #4 opened 5 months ago by ViratX
Inquiry on Minimum Configuration and Cost for Running Gemma-2-9B Model Efficiently (3) · #39 opened 5 months ago by ltkien2003
Error in readme? (1) · #6 opened 5 months ago by CHNtentes
Good work! (1) · #1 opened 5 months ago by CHNtentes
Compared to the regular FP8 model, what is the better performance of the 8BIT model here (4) · #16 opened 5 months ago by demo001s