Mex Ivanov

MexIvanov

AI & ML interests

NLP, Coding, Quantum Computing and more.

Recent Activity

reacted to m-ric's post with 🚀 7 days ago

Since I published it on GitHub a few days ago, Hugging Face's new agentic library 𝘀𝗺𝗼𝗹𝗮𝗴𝗲𝗻𝘁𝘀 has gathered nearly 4k stars 🤯 ➡️ But we are just getting started on agents: so we are hiring an ML Engineer to join me and double down on this effort! The plan is to build GUI agents: agents that can act on your computer with mouse & keyboard, like Claude Computer Use. We will make it work better, and fully open. ✨ Sounds like something you'd like to do? Apply here 👉 https://apply.workable.com/huggingface/j/AF1D4E3FEB/

reacted to m-ric's post with 🔥 7 days ago

reacted to singhsidhukuldeep's post with 🔥 22 days ago

Exciting News in AI: JinaAI Releases JINA-CLIP-v2!

The team at Jina AI has just released a groundbreaking multilingual multimodal embedding model that's pushing the boundaries of text-image understanding. Here's why this is a big deal:

🚀 Technical Highlights:
- Dual encoder architecture combining a 561M parameter Jina XLM-RoBERTa text encoder and a 304M parameter EVA02-L14 vision encoder
- Supports 89 languages with 8,192 token context length
- Processes images up to 512×512 pixels with 14×14 patch size
- Implements FlashAttention2 for text and xFormers for vision processing
- Uses Matryoshka Representation Learning for efficient vector storage

⚡️ Under The Hood:
- Multi-stage training process with progressive resolution scaling (224→384→512)
- Contrastive learning using InfoNCE loss in both directions
- Trained on massive multilingual dataset including 400M English and 400M multilingual image-caption pairs
- Incorporates specialized datasets for document understanding, scientific graphs, and infographics
- Uses hard negative mining with 7 negatives per positive sample

📊 Performance:
- Outperforms previous models on visual document retrieval (52.65% nDCG@5)
- Achieves 89.73% image-to-text and 79.09% text-to-image retrieval on CLIP benchmark
- Strong multilingual performance across 30 languages
- Maintains performance even with 75% dimension reduction (256D vs 1024D)

🎯 Key Innovation: The model solves the long-standing challenge of unifying text-only and multi-modal retrieval systems while adding robust multilingual support. Perfect for building cross-lingual visual search systems!

Kudos to the research team at Jina AI for this impressive advancement in multimodal AI!
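
The post above mentions Matryoshka Representation Learning and a 75% dimension reduction (256D vs 1024D). As a rough illustration of what that truncation can look like on the consumer side, here is a minimal sketch in plain Python. It assumes the common Matryoshka convention of keeping the leading dimensions and re-normalizing; the function names are hypothetical and this is not taken from the Jina AI codebase or API.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (raises on a zero vector)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def truncate_embedding(vec, dim):
    """Keep the first `dim` components and re-normalize.

    Assumption: the embedding was trained Matryoshka-style, so the
    leading components carry most of the useful signal.
    """
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    return l2_normalize(vec[:dim])


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    a_unit, b_unit = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_unit, b_unit))


if __name__ == "__main__":
    # Toy 8-dimensional vectors standing in for 1024D embeddings.
    text_vec = [0.9, 0.3, 0.1, 0.05, 0.02, 0.01, 0.01, 0.0]
    image_vec = [0.8, 0.4, 0.2, 0.01, 0.03, 0.02, 0.0, 0.01]

    full = cosine_similarity(text_vec, image_vec)
    # Keep 25% of the dimensions, mirroring the 1024D -> 256D ratio.
    small = cosine_similarity(
        truncate_embedding(text_vec, 2),
        truncate_embedding(image_vec, 2),
    )
    print(f"full: {full:.4f}  truncated: {small:.4f}")
```

Storing only the truncated vectors cuts index size proportionally. How much retrieval quality survives depends on the model, so the similarity values printed here are illustrative only.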

Organizations

None yet

models 6

MexIvanov/MistRAG-7B-ruen-v1-merged

Text Generation • Updated Nov 25, 2024 • 9

MexIvanov/MistRAG-7B-ruen-v1

Text Generation • Updated Nov 25, 2024

MexIvanov/MistRAG-7B-ruen-v1-gguf

Text Generation • Updated Nov 25, 2024 • 6

MexIvanov/zephyr-python-ru

Text Generation • Updated Nov 11, 2024 • 2

MexIvanov/zephyr-python-ru-merged

Text Generation • Updated Nov 11, 2024 • 221

MexIvanov/zephyr-python-ru-gguf

Text Generation • Updated Nov 11, 2024 • 34 • 4

datasets 4

MexIvanov/RAG-v1-ruen

Viewer • Updated Nov 11, 2024 • 51.4k • 38

MexIvanov/image-gen-vector-consistency

Viewer • Updated Aug 30, 2024 • 184 • 30

MexIvanov/CodeExercise-Python-27k-ru

Viewer • Updated Dec 19, 2023 • 27.2k • 34 • 1

MexIvanov/Vezora-Tested-22k-Python-Alpaca-ru

Viewer • Updated Dec 19, 2023 • 22.6k • 33 • 2