The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19, 2024 • 129
The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29, 2024 • 17
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper • 2410.15999 • Published Oct 21, 2024 • 19
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated about 6 hours ago • 96
Adapting Neural Link Predictors for Data-Efficient Complex Query Answering Paper • 2301.12313 • Published Jan 29, 2023
Attention Is All You Need But You Don't Need All Of It For Inference of Large Language Models Paper • 2407.15516 • Published Jul 22, 2024 • 1
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations Paper • 2410.18860 • Published Oct 24, 2024 • 9