Hugging Face
Mex Ivanov
MexIvanov
2 followers · 8 following
AI & ML interests
NLP, Coding, Quantum Computing and more.
Recent Activity
Reacted to m-ric's post 7 days ago:
Since I published it on GitHub a few days ago, Hugging Face's new agentic library smolagents has gathered nearly 4k stars 🤯 But we are just getting started on agents: so we are hiring an ML Engineer to join me and double down on this effort! The plan is to build GUI agents: agents that can act on your computer with mouse & keyboard, like Claude Computer Use. We will make it work better, and fully open. ✨ Sounds like something you'd like to do? Apply here: https://apply.workable.com/huggingface/j/AF1D4E3FEB/
Reacted to the same m-ric post with 🔥 7 days ago.
Reacted to singhsidhukuldeep's post with 🔥 22 days ago:
Exciting News in AI: JinaAI Releases JINA-CLIP-v2! The team at Jina AI has just released a groundbreaking multilingual multimodal embedding model that's pushing the boundaries of text-image understanding. Here's why this is a big deal:

Technical Highlights:
- Dual encoder architecture combining a 561M parameter Jina XLM-RoBERTa text encoder and a 304M parameter EVA02-L14 vision encoder
- Supports 89 languages with 8,192 token context length
- Processes images up to 512×512 pixels with 14×14 patch size
- Implements FlashAttention2 for text and xFormers for vision processing
- Uses Matryoshka Representation Learning for efficient vector storage

Under The Hood:
- Multi-stage training process with progressive resolution scaling (224→384→512)
- Contrastive learning using InfoNCE loss in both directions
- Trained on massive multilingual dataset including 400M English and 400M multilingual image-caption pairs
- Incorporates specialized datasets for document understanding, scientific graphs, and infographics
- Uses hard negative mining with 7 negatives per positive sample

Performance:
- Outperforms previous models on visual document retrieval (52.65% nDCG@5)
- Achieves 89.73% image-to-text and 79.09% text-to-image retrieval on CLIP benchmark
- Strong multilingual performance across 30 languages
- Maintains performance even with 75% dimension reduction (256D vs 1024D)

Key Innovation:
The model solves the long-standing challenge of unifying text-only and multi-modal retrieval systems while adding robust multilingual support. Perfect for building cross-lingual visual search systems! Kudos to the research team at Jina AI for this impressive advancement in multimodal AI!
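The "75% dimension reduction (256D vs 1024D)" claim in the post refers to how Matryoshka-style embeddings are usually consumed: keep only the leading components of a vector and rescale it to unit length before comparing. A minimal sketch of that step follows, assuming plain Python lists as embeddings. The vectors are made-up toy values, not output from JINA-CLIP-v2, and the helper names `truncate_and_normalize` and `cosine` are hypothetical.

```python
import math


def truncate_and_normalize(vec, dim):
    """Keep the first `dim` components and rescale the result to unit length."""
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy 1024-dimensional vectors (illustrative values only, not model output).
text_vec = [math.sin(i * 0.01) for i in range(1024)]
image_vec = [math.sin(i * 0.01 + 0.1) for i in range(1024)]

# Keep the first 256 of 1024 dimensions, i.e. a 75% reduction.
text_256 = truncate_and_normalize(text_vec, 256)
image_256 = truncate_and_normalize(image_vec, 256)

print(len(text_256))
print(cosine(text_256, image_256))
```

Truncation only preserves retrieval quality when the model was trained so that the leading dimensions carry most of the signal, which is what Matryoshka Representation Learning aims for; applying it to an ordinary embedding model would likely lose much more.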
Organizations
None yet
Models (6), sorted by recently updated
MexIvanov/MistRAG-7B-ruen-v1-merged • Text Generation • Updated Nov 25, 2024 • 9 downloads
MexIvanov/MistRAG-7B-ruen-v1 • Text Generation • Updated Nov 25, 2024
MexIvanov/MistRAG-7B-ruen-v1-gguf • Text Generation • Updated Nov 25, 2024 • 6 downloads
MexIvanov/zephyr-python-ru • Text Generation • Updated Nov 11, 2024 • 2 downloads
MexIvanov/zephyr-python-ru-merged • Text Generation • Updated Nov 11, 2024 • 221 downloads
MexIvanov/zephyr-python-ru-gguf • Text Generation • Updated Nov 11, 2024 • 34 downloads • 4 likes
Datasets (4), sorted by recently updated
MexIvanov/RAG-v1-ruen • Viewer • Updated Nov 11, 2024 • 51.4k rows • 38 downloads
MexIvanov/image-gen-vector-consistency • Viewer • Updated Aug 30, 2024 • 184 rows • 30 downloads
MexIvanov/CodeExercise-Python-27k-ru • Viewer • Updated Dec 19, 2023 • 27.2k rows • 34 downloads • 1 like
MexIvanov/Vezora-Tested-22k-Python-Alpaca-ru • Viewer • Updated Dec 19, 2023 • 22.6k rows • 33 downloads • 2 likes