🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated about 6 hours ago • 96
Enhancing Automated Interpretability with Output-Centric Feature Descriptions Paper • 2501.08319 • Published about 19 hours ago • 4 • 1
Enhancing Automated Interpretability with Output-Centric Feature Descriptions Paper • 2501.08319 • Published about 19 hours ago • 4
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated about 6 hours ago • 96
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated about 6 hours ago • 96
Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization Paper • 2412.04619 • Published Dec 5, 2024 • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated about 6 hours ago • 96
Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models Paper • 2412.16247 • Published 27 days ago • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated about 6 hours ago • 96