Xuan Son NGUYEN

ngxson

https://blog.ngxson.com

AI & ML interests

Doing AI for fun, not for profit

Recent Activity

new activity about 2 hours ago

5CD-AI/Vintern-1B-v3_5: Deployment as server?

upvoted a collection about 12 hours ago

2025 January

replied to their post about 22 hours ago

Check out my collection of pre-made GGUF LoRA adapters! This allows you to use both the normal and abliterated versions of popular models like llama, qwen, etc., without having to double the amount of VRAM usage. https://huggingface.co/spaces/ngxson/gguf_lora_collection

Articles

Code a simple RAG from scratch

Oct 29, 2024

• 16

Organizations

Posts 2

Post

524

Check out my collection of pre-made GGUF LoRA adapters!

This allows you to use both the normal and abliterated versions of popular models like llama, qwen, etc., without having to double the amount of VRAM usage.

ngxson/gguf_lora_collection
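As a rough illustration of why an adapter avoids doubling memory, the sketch below compares the parameter count of one full weight matrix with that of a low-rank adapter for the same matrix. It assumes the standard LoRA formulation, in which a rank-r update is stored as two small matrices. The dimensions and rank are hypothetical and are not taken from any adapter in the collection.

```python
def full_matrix_params(d_out: int, d_in: int) -> int:
    """Parameters in one dense weight matrix of shape (d_out, d_in)."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in a LoRA update stored as B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


def adapter_overhead(d_out: int, d_in: int, rank: int) -> float:
    """Adapter size as a fraction of the full matrix it modifies."""
    return lora_params(d_out, d_in, rank) / full_matrix_params(d_out, d_in)


# Hypothetical sizes chosen for illustration only.
D_OUT, D_IN, RANK = 4096, 4096, 32

full = full_matrix_params(D_OUT, D_IN)       # second model copy: this many extra parameters
adapter = lora_params(D_OUT, D_IN, RANK)     # adapter: only this many extra parameters
overhead = adapter_overhead(D_OUT, D_IN, RANK)
```

With these assumed sizes the adapter adds 262,144 parameters next to the 16,777,216 in the base matrix, about 1.6% extra, whereas loading a second full model would add 100%.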

Post

2209

I made this small tool that can be useful for debugging Ollama chat templates: ngxson/ollama_template_test

CC @bartowski you may need this ;-)

Collections 3

spaces 8

pinned

Running

🦙

Wllama

Run GGUF directly in your browser!

pinned

Running

🚧

Ollama Template Test

This is a web-based tool for testing Ollama chat templates

Running

📁

Extracted LoRA - GGUF version

Redirection to ggml-org collection

Paused

🏆

Mergekit Extract Lora

Running

🐞

Debug Ollama Manifest

Show the Ollama registry manifest for debugging

Sleeping

💻

Test Llamacpp Zerogpu

models 36

datasets 4

ngxson/MiniThinky-dataset-v3

Viewer • Updated 4 days ago • 41.2k • 40

ngxson/MiniThinky-dataset

Viewer • Updated 7 days ago • 88.2k • 103 • 8

ngxson/ThisTTS_dataset_no_file

Viewer • Updated 10 days ago • 867 • 27

ngxson/ThisTTS_dataset

Viewer • Updated 10 days ago • 867 • 26

Articles

Introducing GGUF-my-LoRA

Code a simple RAG from scratch

Introduction to ggml

Collections 3

Extracted LoRA - GGUF version

Llama 3.2 Reasoning WebGPU
