wolfram commited on
Commit
be696a5
·
verified ·
1 Parent(s): 47c8fe3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -10,7 +10,8 @@ tags:
10
  - chat
11
  library_name: transformers
12
  ---
13
-
 
14
 
15
  # QVQ-72B-Preview
16
 
 
10
  - chat
11
  library_name: transformers
12
  ---
13
+ > [!NOTE]
14
+ > EXL2 4.65bpw-h6 quantized version of [Qwen/QVQ-72B-Preview](https://huggingface.co/Qwen/QVQ-72B-Preview). Supports 32K context with Q4 cache on systems with 48 GB VRAM.
15
 
16
  # QVQ-72B-Preview
17