Nandan Thakur's picture

3 7 45

Nandan Thakur

nthakur

·

https://thakur-nandan.github.io

AI & ML interests

NLP, IR, QA

Recent Activity

new activity about 22 hours ago

Shitao/bge-m3-data:Is the training split available?

updated a dataset about 2 months ago

miracl/nomiracl

upvoted a paper about 2 months ago

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation

View all activity

Organizations

nthakur's activity

upvoted a paper about 2 months ago

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation

Paper • 2312.11361 • Published Dec 18, 2023 • 1

upvoted an article 3 months ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

Oct 27, 2024

• 37

upvoted a paper 7 months ago

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 44

upvoted an article 8 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 171

upvoted a collection 9 months ago

🦢SWIM-IR Dataset [NAACL'24]

29 million Synthetic Wikipedia-based Multilingual Retrieval Training Pairs. • 4 items • Updated Nov 23, 2024 • 7

upvoted a paper 9 months ago

Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval

Paper • 2311.05800 • Published Nov 10, 2023 • 3

upvoted a paper about 1 year ago

BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

Paper • 2104.08663 • Published Apr 17, 2021 • 3