How to quantize

#10

by supercharge19 - opened Jan 9, 2024

Discussion

supercharge19

Jan 9, 2024

Is there an example, code, to use quantize this model or is there a quantized version available?

supercharge19

Jan 9, 2024

I found this: https://huggingface.co/WhereIsAI/UAE-Large-V1/blob/main/onnx/model_quantized.onnx

But how to use it, please give some example.

SeanLee97

WhereIsAI org Jan 10, 2024

@supercharge19 hi, you can use optimum to load the quantized onnx model, as follows:

from optimum.onnxruntime import ORTModelForFeatureExtraction
from optimum.pipelines import pipeline

model = ORTModelForFeatureExtraction.from_pretrained('WhereIsAI/UAE-Large-V1', file_name="onnx/model_quantized.onnx")
extractor = pipeline('feature-extraction', model=model)
output = extractor('hello world')

supercharge19

Jan 10, 2024

@supercharge19 hi, you can use optimum to load the quantized onnx model, as follows:

from optimum.onnxruntime import ORTModelForFeatureExtraction
from optimum.pipelines import pipeline

model = ORTModelForFeatureExtraction.from_pretrained('WhereIsAI/UAE-Large-V1', file_name="onnx/model_quantized.onnx")
extractor = pipeline('feature-extraction', model=model)
output = extractor('hello world')

Thanks man.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment