iproskurina
's Collections
Quantized LLMs
updated
iproskurina/Mistral-7B-v0.3-GPTQ-4bit-g128
Text Generation
•
Updated
•
17
iproskurina/bloom-7b1-GPTQ-4bit-g128
Text Generation
•
Updated
•
36
•
2
iproskurina/bloom-1b7-GPTQ-4bit-g128
Text Generation
•
Updated
•
19
iproskurina/bloom-3b-GPTQ-4bit-g128
Text Generation
•
Updated
•
29
iproskurina/bloom-560m-GPTQ-4bit-g128
Text Generation
•
Updated
•
15
iproskurina/bloom-1b1-GPTQ-4bit-g128
Text Generation
•
Updated
•
30
iproskurina/opt-2.7b-GPTQ-4bit-g128
Text Generation
•
Updated
•
15
iproskurina/opt-13b-GPTQ-4bit-g128
Text Generation
•
Updated
•
26
iproskurina/opt-6.7b-GPTQ-4bit-g128
Text Generation
•
Updated
•
41
iproskurina/opt-125m-GPTQ-4bit-g128
Text Generation
•
Updated
•
13
iproskurina/opt-350m-GPTQ-4bit-g128
Text Generation
•
Updated
•
14
iproskurina/opt-1.3b-GPTQ-4bit-g128
Text Generation
•
Updated
•
59
iproskurina/Mistral-7B-v0.1-GPTQ-8bit-g128
Text Generation
•
Updated
•
7
iproskurina/Mistral-7B-v0.3-GPTQ-8bit-g128
Text Generation
•
Updated
•
8
iproskurina/Mistral-7B-v0.1-GPTQ-3bit-g64
Text Generation
•
Updated
•
6
iproskurina/Mistral-7B-v0.1-GPTQ-8bit-g64
Text Generation
•
Updated
•
6
iproskurina/Mistral-7B-v0.1-GPTQ-4bit-g128
Text Generation
•
Updated
•
9
iproskurina/Mistral-7B-v0.1-GPTQ-3bit-g128
Text Generation
•
Updated
•
5
TheBloke/Mistral-7B-Instruct-v0.1-GPTQ
Text Generation
•
Updated
•
2.94k
•
79
TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
Text Generation
•
Updated
•
426k
•
50
TheBloke/bloomz-176B-GPTQ
Text Generation
•
Updated
•
10
•
20
TheBloke/BLOOMChat-176B-v1-GPTQ
Text Generation
•
Updated
•
13
•
31
TheBloke/Llama-2-13B-chat-GPTQ
Text Generation
•
Updated
•
28.5k
•
362
When Quantization Affects Confidence of Large Language Models?
Paper
•
2405.00632
•
Published