view post Post 524 Check out my collection of pre-made GGUF LoRA adapters!This allow you to use both normal + abliterated version of popular models like llama, qwen, etc, without having to double to amount of VRAM usage. ngxson/gguf_lora_collection See translation
view post Post 2209 I made this small tool that can be useful for debugging Ollama chat template: ngxson/ollama_template_testCC @bartowski you may need this ;-) See translation
Extracted LoRA (mergekit) PEFT-compatible LoRA adapters produced by mergekit-extract-lora Running 2 📁 Extracted LoRA - GGUF version Redirection to ggml-org collection ngxson/LoRA-Qwen2.5-3B-Instruct-abliterated Updated 6 days ago ngxson/LoRA-Qwen2.5-7B-Instruct-abliterated-v3 Updated 8 days ago ngxson/LoRA-Qwen2.5-14B-Instruct-abliterated-v2 Updated 8 days ago • 1
MiniThinky: extra small reasoning models My first trial to make reasoning models Running 55 🧠 Llama 3.2 Reasoning WebGPU Small and powerful reasoning LLM that runs in your browser ngxson/MiniThinky-v2-1B-Llama-3.2 Text Generation • Updated 5 days ago • 3.2k • 24 ngxson/MiniThinky-v2-1B-Llama-3.2-Q8_0-GGUF Updated 7 days ago • 227 • 5 ngxson/MiniThinky-1B-Llama-3.2 Text Generation • Updated 7 days ago • 220 • 3