Inferencing in Rust
Hey, just wanted to bring this to your attention if you're interested: https://github.com/huggingface/text-embeddings-inference/issues/468.
In any case, are you aware of any way to run your models directly in Rust already?
Hey @do-me, cool, thanks for creating that ticket! I'm not aware of a way to run our models directly in Rust, but I think the speedup would probably be much smaller than for larger models, since our biggest bottleneck is the tokenizer. Will follow that ticket though, curious to see what comes out of it!
If I remember correctly, the tokenizer is already written in Rust anyway, right?
Personally, I'm not looking for speed improvements but simply compatibility. I'm currently developing a Tauri 2 app based on your multilingual embedding model (with do-me/foursquare_places_100M), where I'm forced to run inference in Rust in the backend.
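For what it's worth, since your models are static, the actual inference should boil down to a token-id lookup plus mean pooling, so the Rust side is mostly a tokenizer call. A rough, untested sketch of what I have in mind (only the `tokenizers` crate is assumed; the embedding-matrix loading, file names and the 256 dims are placeholders):

```rust
// deps: tokenizers = "0.*" (Hugging Face's Rust tokenizers crate)
use tokenizers::Tokenizer;

/// Embed a sentence with a static model: look up each token's vector,
/// mean-pool, then L2-normalize.
fn embed(tokenizer: &Tokenizer, embedding_matrix: &[Vec<f32>], text: &str) -> Vec<f32> {
    let encoding = tokenizer.encode(text, false).expect("tokenization failed");
    let ids = encoding.get_ids();

    let dim = embedding_matrix[0].len();
    let mut pooled = vec![0.0f32; dim];

    // Sum the static vectors of all token ids in the sentence.
    for &id in ids {
        for (p, v) in pooled.iter_mut().zip(&embedding_matrix[id as usize]) {
            *p += v;
        }
    }

    // Mean-pool.
    let n = ids.len().max(1) as f32;
    for p in pooled.iter_mut() {
        *p /= n;
    }

    // L2-normalize.
    let norm = pooled.iter().map(|v| v * v).sum::<f32>().sqrt().max(1e-12);
    for p in pooled.iter_mut() {
        *p /= norm;
    }
    pooled
}

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // tokenizer.json as shipped with the model on the Hub.
    let tokenizer = Tokenizer::from_file("tokenizer.json")?;

    // Placeholder: in the real app this matrix would be loaded from the
    // model's exported weights (one row per token id).
    let embedding_matrix = vec![vec![0.0f32; 256]; tokenizer.get_vocab_size(true)];

    let vector = embed(&tokenizer, &embedding_matrix, "hello world");
    println!("embedding dim = {}", vector.len());
    Ok(())
}
```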
Indeed, so I think the speedups would be very marginal in our case. Compatibility is interesting though; it's not something on our roadmap at the moment, since neither of us knows Rust well enough to add support ourselves, but it would be awesome if someone added it via the ticket you posted :).
Brief follow-up: I settled on https://github.com/StarlightSearch/EmbedAnything in Rust, a nice wrapper around candle. They don't seem to mention your models anywhere directly. However, considering that they support ONNX models as well as regular BERT-based models, your static models should work out of the box, right? I haven't had time to give it a spin yet, but will try it in the coming days.