Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer Paper • 2405.16436 • Published May 26, 2024
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs Paper • 2410.08067 • Published Oct 10, 2024
DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs Paper • 2411.13611 • Published Nov 20, 2024
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 26 days ago • 38
Ego4D: Around the World in 3,000 Hours of Egocentric Video Paper • 2110.07058 • Published Oct 13, 2021
Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation Paper • 2305.03907 • Published May 6, 2023 • 1
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders Paper • 2412.09586 • Published Dec 12, 2024 • 5
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published Oct 14, 2024 • 15
Distilling an End-to-End Voice Assistant Without Instruction Training Data Paper • 2410.02678 • Published Oct 3, 2024 • 22
Development of Cognitive Intelligence in Pre-trained Language Models Paper • 2407.01047 • Published Jul 1, 2024
Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors Paper • 2403.15482 • Published Mar 21, 2024 • 1
Modeling Motivational Interviewing Strategies On An Online Peer-to-Peer Counseling Platform Paper • 2211.05182 • Published Nov 9, 2022
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback Paper • 2305.08982 • Published May 15, 2023
Human Behavioral Benchmarking: Numeric Magnitude Comparison Effects in Large Language Models Paper • 2305.10782 • Published May 18, 2023
Pre-training LLMs using human-like development data corpus Paper • 2311.04666 • Published Nov 8, 2023
When FLUE Meets FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain Paper • 2211.00083 • Published Oct 31, 2022
ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment Paper • 2305.14463 • Published May 23, 2023
Revisiting non-English Text Simplification: A Unified Multilingual Benchmark Paper • 2305.15678 • Published May 25, 2023
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment Paper • 2405.19332 • Published May 29, 2024 • 15
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 Text Generation • Updated May 13, 2024 • 23