DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published Dec 10, 2024 • 45
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks Paper • 2408.03615 • Published Aug 7, 2024 • 31
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17, 2024 • 49
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper • 2407.04363 • Published Jul 5, 2024 • 28
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding Paper • 2403.11481 • Published Mar 18, 2024 • 13
Evaluating Very Long-Term Conversational Memory of LLM Agents Paper • 2402.17753 • Published Feb 27, 2024 • 18
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18, 2024 • 34
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement Paper • 2401.14215 • Published Jan 25, 2024 • 2
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries Paper • 2402.13043 • Published Feb 20, 2024 • 2
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published Dec 10, 2024 • 26
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 73
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 76
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published Dec 10, 2024 • 26
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 127
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published Dec 6, 2024 • 50
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 46
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection Paper • 2412.04455 • Published Dec 5, 2024 • 37
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules Paper • 2310.08992 • Published Oct 13, 2023 • 10
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published Dec 5, 2024 • 9
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 123
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 12
Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs Paper • 2411.08719 • Published Nov 10, 2024
Little Giants: Synthesizing High-Quality Embedding Data at Scale Paper • 2410.18634 • Published Oct 24, 2024
A Survey on Data Synthesis and Augmentation for Large Language Models Paper • 2410.12896 • Published Oct 16, 2024
Self-Improvement in Language Models: The Sharpening Mechanism Paper • 2412.01951 • Published Dec 2, 2024
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12, 2024 • 63
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 57
MALT: Improving Reasoning with Multi-Agent LLM Training Paper • 2412.01928 • Published Dec 2, 2024 • 40
Multi-Agent Large Language Models for Conversational Task-Solving Paper • 2410.22932 • Published Oct 30, 2024
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Paper • 2412.03248 • Published Dec 4, 2024 • 26
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Paper • 2412.02592 • Published Dec 3, 2024 • 21
Scaling Image Tokenizers with Grouped Spherical Quantization Paper • 2412.02632 • Published Dec 3, 2024 • 10
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models Paper • 2412.01824 • Published Dec 2, 2024 • 65
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published Nov 28, 2024 • 33
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning Paper • 2412.00568 • Published Nov 30, 2024 • 14
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos Paper • 2412.01800 • Published Dec 2, 2024 • 6
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models Paper • 2411.19477 • Published Nov 29, 2024 • 6
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper • 2412.00869 • Published Dec 1, 2024 • 4
World-consistent Video Diffusion with Explicit 3D Modeling Paper • 2412.01821 • Published Dec 2, 2024 • 4
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset Paper • 2411.15640 • Published Nov 23, 2024 • 4
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Paper • 2411.17691 • Published Nov 26, 2024 • 11
Learning 3D Representations from Procedural 3D Programs Paper • 2411.17467 • Published Nov 25, 2024 • 8
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 49
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published Nov 25, 2024 • 42
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge Paper • 2411.16594 • Published Nov 25, 2024 • 37
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20, 2024 • 26
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 8
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models Paper • 2411.15671 • Published Nov 23, 2024 • 7
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 58
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 20
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Paper • 2411.13543 • Published Nov 20, 2024 • 18
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 58
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024 • 30
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 40
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models Paper • 2411.14257 • Published Nov 21, 2024 • 9
Patience Is The Key to Large Language Model Reasoning Paper • 2411.13082 • Published Nov 20, 2024 • 7
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper • 2411.13503 • Published Nov 20, 2024 • 30
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published Nov 17, 2024 • 52
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published Nov 20, 2024 • 15
Continuous Speculative Decoding for Autoregressive Image Generation Paper • 2411.11925 • Published Nov 18, 2024 • 15
Building Trust: Foundations of Security, Safety and Transparency in AI Paper • 2411.12275 • Published Nov 19, 2024 • 10
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages Paper • 2411.12240 • Published Nov 19, 2024 • 6
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published Nov 16, 2024 • 44
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published Nov 18, 2024 • 20
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published Nov 18, 2024 • 17
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering Paper • 2411.09213 • Published Nov 14, 2024 • 7
Evaluating the role of 'Constitutions' for learning from AI feedback Paper • 2411.10168 • Published Nov 15, 2024 • 5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use Paper • 2411.10323 • Published Nov 15, 2024 • 31
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 113
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published Nov 14, 2024 • 71
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11, 2024 • 35
Scaling Properties of Diffusion Models for Perceptual Tasks Paper • 2411.08034 • Published Nov 12, 2024 • 13
GRS-QA -- Graph Reasoning-Structured Question Answering Dataset Paper • 2411.00369 • Published Nov 1, 2024 • 6
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models Paper • 2410.13080 • Published Oct 16, 2024
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published Oct 31, 2024 • 59
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference Paper • 2410.21465 • Published Oct 28, 2024 • 11
RARe: Retrieval Augmented Retrieval with In-Context Examples Paper • 2410.20088 • Published Oct 26, 2024 • 5
Game-theoretic LLM: Agent Workflow for Negotiation Games Paper • 2411.05990 • Published Nov 8, 2024 • 7
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published Nov 6, 2024 • 32
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 113
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7, 2024 • 50
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model Paper • 2411.04496 • Published Nov 7, 2024 • 22
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 65
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4, 2024 • 47
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published Nov 4, 2024 • 33
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published Nov 4, 2024 • 24
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published Nov 4, 2024 • 23
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks Paper • 2411.01192 • Published Nov 2, 2024 • 3
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published Oct 30, 2024 • 46
Survey of User Interface Design and Interaction Techniques in Generative AI Applications Paper • 2410.22370 • Published Oct 28, 2024 • 11
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments Paper • 2410.23918 • Published Oct 31, 2024 • 19
AAAR-1.0: Assessing AI's Potential to Assist Research Paper • 2410.22394 • Published Oct 29, 2024 • 14
On Memorization of Large Language Models in Logical Reasoning Paper • 2410.23123 • Published Oct 30, 2024 • 18
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions Paper • 2410.20424 • Published Oct 27, 2024 • 40
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization Paper • 2410.19609 • Published Oct 25, 2024 • 17
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper • 2410.18603 • Published Oct 24, 2024 • 32
Counting Ability of Large Language Models and Impact of Tokenization Paper • 2410.19730 • Published Oct 25, 2024 • 10
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22, 2024 • 89
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published Oct 24, 2024 • 40
Unbounded: A Generative Infinite Game of Character Life Simulation Paper • 2410.18975 • Published Oct 24, 2024 • 35
Multi-Draft Speculative Sampling: Canonical Architectures and Theoretical Limits Paper • 2410.18234 • Published Oct 23, 2024 • 3
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23, 2024 • 18
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Paper • 2311.05556 • Published Nov 9, 2023 • 82
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Paper • 2310.04378 • Published Oct 6, 2023 • 19
Aligning Text-to-Image Diffusion Models with Reward Backpropagation Paper • 2310.03739 • Published Oct 5, 2023 • 21
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024 • 13
The Role of Summarization in Generative Agents: A Preliminary Perspective Paper • 2305.01253 • Published May 2, 2023
Generative Agents: Interactive Simulacra of Human Behavior Paper • 2304.03442 • Published Apr 7, 2023 • 12
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents Paper • 2403.08715 • Published Mar 13, 2024 • 20
DRLC: Reinforcement Learning with Dense Rewards from LLM Critic Paper • 2401.07382 • Published Jan 14, 2024 • 2
Secrets of RLHF in Large Language Models Part II: Reward Modeling Paper • 2401.06080 • Published Jan 11, 2024 • 26
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs Paper • 2403.05020 • Published Mar 8, 2024 • 2
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards Paper • 2403.07708 • Published Mar 12, 2024
Large Language Model-based Human-Agent Collaboration for Complex Task Solving Paper • 2402.12914 • Published Feb 20, 2024
Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions Paper • 2408.15787 • Published Aug 28, 2024
Building Cooperative Embodied Agents Modularly with Large Language Models Paper • 2307.02485 • Published Jul 5, 2023 • 11
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents Paper • 2412.03563 • Published Dec 4, 2024
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios Paper • 2410.19346 • Published Oct 25, 2024
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Paper • 2411.00927 • Published Nov 1, 2024
Positive Experience Reflection for Agents in Interactive Text Environments Paper • 2411.02223 • Published Nov 4, 2024
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons Paper • 2412.08442 • Published Dec 11, 2024
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024 • 69
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1, 2024 • 57
Scaling Instructable Agents Across Many Simulated Worlds Paper • 2404.10179 • Published Mar 13, 2024 • 28
CodeNav: Beyond tool-use to using real-world codebases with LLM agents Paper • 2406.12276 • Published Jun 18, 2024
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale Paper • 2409.16299 • Published Sep 9, 2024 • 11
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Paper • 2412.18450 • Published 22 days ago • 32
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 23 days ago • 39
GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models Paper • 2412.12735 • Published 29 days ago
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models Paper • 2412.07171 • Published Dec 10, 2024 • 1
In Case You Missed It: ARC 'Challenge' Is Not That Challenging Paper • 2412.17758 • Published 23 days ago • 16
Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation Paper • 2412.15255 • Published about 1 month ago • 3
Rethinking Thinking Tokens: Understanding Why They Underperform in Practice Paper • 2411.11371 • Published Nov 18, 2024
Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation Paper • 2412.11831 • Published 30 days ago
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published 27 days ago • 15
A Survey on Inference Optimization Techniques for Mixture of Experts Models Paper • 2412.14219 • Published 28 days ago
PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model Paper • 2411.08212 • Published Nov 12, 2024
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 27 days ago • 85
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 23 days ago • 45
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published 23 days ago • 29
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 23 days ago • 42
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published 24 days ago • 29
Outcome-Refining Process Supervision for Code Generation Paper • 2412.15118 • Published 27 days ago • 19
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 23 days ago • 21
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World Paper • 2412.17589 • Published 23 days ago • 12
Agent-SafetyBench: Evaluating the Safety of LLM Agents Paper • 2412.14470 • Published 27 days ago • 12
NILE: Internal Consistency Alignment in Large Language Models Paper • 2412.16686 • Published 25 days ago • 8
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 26 days ago • 38
SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation Paper • 2412.13649 • Published 28 days ago • 20
DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs Paper • 2412.14838 • Published 27 days ago
SCBench: A KV Cache-Centric Analysis of Long-Context Methods Paper • 2412.10319 • Published Dec 13, 2024 • 9
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 27 days ago • 73
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Paper • 2412.15204 • Published 27 days ago • 33
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks Paper • 2412.18072 • Published 23 days ago • 17
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Paper • 2412.17483 • Published 23 days ago • 30
CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era Paper • 2412.18702 • Published 22 days ago • 6
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Paper • 2412.19326 • Published 20 days ago • 18
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published about 1 month ago • 53
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 21 days ago • 89
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published 18 days ago • 43
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 14 days ago • 93
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 13 days ago • 47
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 13 days ago • 46
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Paper • 2501.01423 • Published 13 days ago • 35
ProgCo: Program Helps Self-Correction of Large Language Models Paper • 2501.01264 • Published 13 days ago • 24
Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published 13 days ago • 20
Dynamic Scaling of Unit Tests for Code Reward Modeling Paper • 2501.01054 • Published 13 days ago • 16
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing Paper • 2501.00658 • Published 15 days ago • 7
Deep Learning-based Approaches for State Space Models: A Selective Review Paper • 2412.11211 • Published about 1 month ago
On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages Paper • 2412.19350 • Published 20 days ago
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 10 days ago • 36
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Paper • 2501.02506 • Published 10 days ago • 9
Scaling Laws for Floating Point Quantization Training Paper • 2501.02423 • Published 10 days ago • 24
Revisiting Graph Neural Networks on Graph-level Tasks: Comprehensive Experiments, Analysis, and Improvements Paper • 2501.00773 • Published 14 days ago
Personalized Graph-Based Retrieval for Large Language Models Paper • 2501.02157 • Published 12 days ago • 27
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling Paper • 2408.16737 • Published Aug 29, 2024 • 1
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model Paper • 2410.13639 • Published Oct 17, 2024 • 17
Personalized Audiobook Recommendations at Spotify Through Graph Neural Networks Paper • 2403.05185 • Published Mar 8, 2024 • 23
Dynamic graph neural networks for enhanced volatility prediction in financial markets Paper • 2410.16858 • Published Oct 22, 2024
A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection Paper • 2307.03759 • Published Jul 7, 2023
A Survey of Graph Neural Networks for Social Recommender Systems Paper • 2212.04481 • Published Dec 8, 2022
Multi-Reranker: Maximizing performance of retrieval-augmented generation in the FinanceRAG challenge Paper • 2411.16732 • Published Nov 23, 2024 • 1
'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization Paper • 2408.03762 • Published Aug 7, 2024
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning Paper • 2411.03314 • Published Nov 5, 2024
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models Paper • 2402.10986 • Published Feb 16, 2024 • 77
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 7 days ago • 75
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published 7 days ago • 33
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 7 days ago • 78
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Paper • 2501.04686 • Published 7 days ago • 47
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 7 days ago • 218
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 11 days ago • 78
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 8 days ago • 61
Synthetic Vision: Training Vision-Language Models to Understand Physics Paper • 2412.08619 • Published Dec 11, 2024
Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published Dec 13, 2024 • 32
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Paper • 2501.02955 • Published 9 days ago • 40
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides Paper • 2501.03936 • Published 8 days ago • 18
Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback Paper • 2501.03916 • Published 8 days ago • 14
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Paper • 2501.02393 • Published 11 days ago • 6
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published 5 days ago • 56
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published 5 days ago • 16
Demystifying Domain-adaptive Post-training for Financial LLMs Paper • 2501.04961 • Published 6 days ago • 9
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 6 days ago • 41
An Empirical Study of Autoregressive Pre-training from Videos Paper • 2501.05453 • Published 6 days ago • 34
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 2 days ago • 62
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 4 days ago • 19
O1 Replication Journey: A Strategic Progress Report -- Part 1 Paper • 2410.18982 • Published Oct 8, 2024 • 3