Pending Papers - a admarcosai Collection

Video Creation by Demonstration

Paper • 2412.09551 • Published Dec 12, 2024 • 8

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 71

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7, 2024 • 31

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 49

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5, 2024 • 28

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Paper • 2403.11481 • Published Mar 18, 2024 • 13

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27, 2024 • 18

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18, 2024 • 34

Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement

Paper • 2401.14215 • Published Jan 25, 2024 • 2

Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries

Paper • 2402.13043 • Published Feb 20, 2024 • 2

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 47

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

Paper • 2412.07774 • Published Dec 10, 2024 • 26

Granite Guardian

Paper • 2412.07724 • Published Dec 10, 2024 • 18

Fully Open Source Moxin-7B Technical Report

Paper • 2412.06845 • Published Dec 8, 2024 • 10

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 73

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 76

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published Dec 10, 2024 • 26

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 127

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 50

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 46

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published Dec 5, 2024 • 37

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Paper • 2310.08992 • Published Oct 13, 2023 • 10

Densing Law of LLMs

Paper • 2412.04315 • Published Dec 5, 2024 • 17

Discriminative Fine-tuning of LVLMs

Paper • 2412.04378 • Published Dec 5, 2024 • 10

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Paper • 2412.04448 • Published Dec 5, 2024 • 9

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 123

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published Dec 4, 2024 • 12

Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs

Paper • 2411.08719 • Published Nov 10, 2024

Little Giants: Synthesizing High-Quality Embedding Data at Scale

Paper • 2410.18634 • Published Oct 24, 2024

A Survey on Data Synthesis and Augmentation for Large Language Models

Paper • 2410.12896 • Published Oct 16, 2024

Self-Improvement in Language Models: The Sharpening Mechanism

Paper • 2412.01951 • Published Dec 2, 2024

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 63

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

MALT: Improving Reasoning with Multi-Agent LLM Training

Paper • 2412.01928 • Published Dec 2, 2024 • 40

Multi-Agent Large Language Models for Conversational Task-Solving

Paper • 2410.22932 • Published Oct 30, 2024

AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning

Paper • 2412.03248 • Published Dec 4, 2024 • 26

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 21

Scaling Image Tokenizers with Grouped Spherical Quantization

Paper • 2412.02632 • Published Dec 3, 2024 • 10

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Paper • 2412.01824 • Published Dec 2, 2024 • 65

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published Nov 28, 2024 • 33

The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning

Paper • 2412.00568 • Published Nov 30, 2024 • 14

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Paper • 2412.01800 • Published Dec 2, 2024 • 6

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Paper • 2411.19477 • Published Nov 29, 2024 • 6

Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting

Paper • 2412.00869 • Published Dec 1, 2024 • 4

World-consistent Video Diffusion with Explicit 3D Modeling

Paper • 2412.01821 • Published Dec 2, 2024 • 4

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 25

Reverse Thinking Makes LLMs Stronger Reasoners

Paper • 2411.19865 • Published Nov 29, 2024 • 20

AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset

Paper • 2411.15640 • Published Nov 23, 2024 • 4

Large Language Model-Brained GUI Agents: A Survey

Paper • 2411.18279 • Published Nov 27, 2024 • 29

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 11

Learning 3D Representations from Procedural 3D Programs

Paper • 2411.17467 • Published Nov 25, 2024 • 8

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 49

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 42

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 37

Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Paper • 2411.15221 • Published Nov 20, 2024 • 26

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Paper • 2411.16508 • Published Nov 25, 2024 • 8

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Paper • 2411.15671 • Published Nov 23, 2024 • 7

LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24, 2024 • 8

Predicting Emergent Capabilities by Finetuning

Paper • 2411.16035 • Published Nov 25, 2024 • 6

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 58

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 20

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Paper • 2411.13543 • Published Nov 20, 2024 • 18

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21, 2024 • 30

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 40

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 9

Patience Is The Key to Large Language Model Reasoning

Paper • 2411.13082 • Published Nov 20, 2024 • 7

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 30

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 52

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 15

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 15

Building Trust: Foundations of Security, Safety and Transparency in AI

Paper • 2411.12275 • Published Nov 19, 2024 • 10

Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages

Paper • 2411.12240 • Published Nov 19, 2024 • 6

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 75

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 44

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18, 2024 • 17

Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering

Paper • 2411.09213 • Published Nov 14, 2024 • 7

Evaluating the role of `Constitutions' for learning from AI feedback

Paper • 2411.10168 • Published Nov 15, 2024 • 5

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published Nov 15, 2024 • 31

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 71

Hardware and Software Platform Inference

Paper • 2411.05197 • Published Nov 7, 2024 • 3

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 35

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published Nov 12, 2024 • 13

GRS-QA -- Graph Reasoning-Structured Question Answering Dataset

Paper • 2411.00369 • Published Nov 1, 2024 • 6

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31, 2024 • 14

Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models

Paper • 2410.13080 • Published Oct 16, 2024

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31, 2024 • 59

ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published Oct 28, 2024 • 11

RARe: Retrieval Augmented Retrieval with In-Context Examples

Paper • 2410.20088 • Published Oct 26, 2024 • 5

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8, 2024 • 17

Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published Nov 8, 2024 • 7

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 32

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 113

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 64

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 50

Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published Nov 7, 2024 • 22

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 17

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 65

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 47

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published Nov 4, 2024 • 33

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published Nov 4, 2024 • 24

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published Nov 4, 2024 • 23

Constrained Diffusion Implicit Models

Paper • 2411.00359 • Published Nov 1, 2024 • 6

Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks

Paper • 2411.01192 • Published Nov 2, 2024 • 3

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 46

Personalization of Large Language Models: A Survey

Paper • 2411.00027 • Published Oct 29, 2024 • 31

Survey of User Interface Design and Interaction Techniques in Generative AI Applications

Paper • 2410.22370 • Published Oct 28, 2024 • 11

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 19

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 23

AAAR-1.0: Assessing AI's Potential to Assist Research

Paper • 2410.22394 • Published Oct 29, 2024 • 14

On Memorization of Large Language Models in Logical Reasoning

Paper • 2410.23123 • Published Oct 30, 2024 • 18

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Paper • 2410.20424 • Published Oct 27, 2024 • 40

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 17

A Survey of Small Language Models

Paper • 2410.20011 • Published Oct 25, 2024 • 40

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 32

Counting Ability of Large Language Models and Impact of Tokenization

Paper • 2410.19730 • Published Oct 25, 2024 • 10

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch

Paper • 2410.18693 • Published Oct 24, 2024 • 40

Unbounded: A Generative Infinite Game of Character Life Simulation

Paper • 2410.18975 • Published Oct 24, 2024 • 35

Multi-Draft Speculative Sampling: Canonical Architectures and Theoretical Limits

Paper • 2410.18234 • Published Oct 23, 2024 • 3

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23, 2024 • 18

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 82

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Paper • 2310.04378 • Published Oct 6, 2023 • 19

Conditional Diffusion Distillation

Paper • 2310.01407 • Published Oct 2, 2023 • 20

Aligning Text-to-Image Diffusion Models with Reward Backpropagation

Paper • 2310.03739 • Published Oct 5, 2023 • 21

Large Concept Models: Language Modeling in a Sentence Representation Space

Paper • 2412.08821 • Published Dec 11, 2024 • 13

The Role of Summarization in Generative Agents: A Preliminary Perspective

Paper • 2305.01253 • Published May 2, 2023

Generative Agents: Interactive Simulacra of Human Behavior

Paper • 2304.03442 • Published Apr 7, 2023 • 12

SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents

Paper • 2403.08715 • Published Mar 13, 2024 • 20

Generative Agent Simulations of 1,000 People

Paper • 2411.10109 • Published Nov 15, 2024 • 3

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 146

DRLC: Reinforcement Learning with Dense Rewards from LLM Critic

Paper • 2401.07382 • Published Jan 14, 2024 • 2

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11, 2024 • 26

Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs

Paper • 2403.05020 • Published Mar 8, 2024 • 2

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

Paper • 2403.07708 • Published Mar 12, 2024

Large Language Model-based Human-Agent Collaboration for Complex Task Solving

Paper • 2402.12914 • Published Feb 20, 2024

Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions

Paper • 2408.15787 • Published Aug 28, 2024

Building Cooperative Embodied Agents Modularly with Large Language Models

Paper • 2307.02485 • Published Jul 5, 2023 • 11

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

Challenges in Human-Agent Communication

Paper • 2412.10380 • Published Nov 28, 2024

From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Paper • 2412.03563 • Published Dec 4, 2024

AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Paper • 2410.19346 • Published Oct 25, 2024

ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents

Paper • 2411.00927 • Published Nov 1, 2024

Simulating User Agents for Embodied Conversational-AI

Paper • 2410.23535 • Published Oct 31, 2024

Positive Experience Reflection for Agents in Interactive Text Environments

Paper • 2411.02223 • Published Nov 4, 2024

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Paper • 2412.08442 • Published Dec 11, 2024

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 69

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 57

Scaling Instructable Agents Across Many Simulated Worlds

Paper • 2404.10179 • Published Mar 13, 2024 • 28

CodeNav: Beyond tool-use to using real-world codebases with LLM agents

Paper • 2406.12276 • Published Jun 18, 2024

Code Agents are State of the Art Software Testers

Paper • 2406.12952 • Published Jun 18, 2024 • 1

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Paper • 2409.16299 • Published Sep 9, 2024 • 11

Reinforcement Learning: An Overview

Paper • 2412.05265 • Published Dec 6, 2024 • 4

Automated Reinforcement Learning: An Overview

Paper • 2201.05000 • Published Jan 13, 2022

3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding

Paper • 2412.18450 • Published 22 days ago • 32

3D Scene Graph Guided Vision-Language Pre-training

Paper • 2411.18666 • Published Nov 27, 2024

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published 23 days ago • 39

GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models

Paper • 2412.12735 • Published 29 days ago

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

Paper • 2412.07171 • Published Dec 10, 2024 • 1

In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Paper • 2412.17758 • Published 23 days ago • 16

Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation

Paper • 2412.15255 • Published about 1 month ago • 3

Rethinking Thinking Tokens: Understanding Why They Underperform in Practice

Paper • 2411.11371 • Published Nov 18, 2024

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 29 days ago • 91

A NotSo Simple Way to Beat Simple Bench

Paper • 2412.12173 • Published Dec 12, 2024

Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation

Paper • 2412.11831 • Published 30 days ago

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published 27 days ago • 15

A Survey on Inference Optimization Techniques for Mixture of Experts Models

Paper • 2412.14219 • Published 28 days ago

PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model

Paper • 2411.08212 • Published Nov 12, 2024

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 27 days ago • 85

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 23 days ago • 45

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 23 days ago • 29

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 23 days ago • 42

OpenAI o1 System Card

Paper • 2412.16720 • Published 25 days ago • 31

Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published 24 days ago • 29

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published 26 days ago • 22

Outcome-Refining Process Supervision for Code Generation

Paper • 2412.15118 • Published 27 days ago • 19

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published 23 days ago • 21

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published 23 days ago • 12

A Survey on Human-Centric LLMs

Paper • 2411.14491 • Published Nov 20, 2024

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published 27 days ago • 12

NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published 25 days ago • 8

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 27 days ago • 50

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published 26 days ago • 38

SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

Paper • 2412.13649 • Published 28 days ago • 20

DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs

Paper • 2412.14838 • Published 27 days ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 9

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 27 days ago • 73

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 27 days ago • 33

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 22 days ago • 44

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks

Paper • 2412.18072 • Published 23 days ago • 17

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Paper • 2412.17483 • Published 23 days ago • 30

CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era

Paper • 2412.18702 • Published 22 days ago • 6

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published 20 days ago • 18

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published about 1 month ago • 53

1.58-bit FLUX

Paper • 2412.18653 • Published 22 days ago • 72

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 21 days ago • 89

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published 18 days ago • 43

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 14 days ago • 93

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published 13 days ago • 47

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 13 days ago • 46

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published 13 days ago • 35

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published 13 days ago • 24

Unifying Specialized Visual Encoders for Video Language Models

Paper • 2501.01426 • Published 13 days ago • 20

Dynamic Scaling of Unit Tests for Code Reward Modeling

Paper • 2501.01054 • Published 13 days ago • 16

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Paper • 2501.00658 • Published 15 days ago • 7

Attamba: Attending To Multi-Token States

Paper • 2411.17685 • Published Nov 26, 2024

Xmodel-2 Technical Report

Paper • 2412.19638 • Published 19 days ago • 25

Deep Learning-based Approaches for State Space Models: A Selective Review

Paper • 2412.11211 • Published about 1 month ago

On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages

Paper • 2412.19350 • Published 20 days ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 10 days ago • 36

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Paper • 2501.02506 • Published 10 days ago • 9

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published 10 days ago • 24

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published 13 days ago • 17

Revisiting Graph Neural Networks on Graph-level Tasks: Comprehensive Experiments, Analysis, and Improvements

Paper • 2501.00773 • Published 14 days ago

Personalized Graph-Based Retrieval for Large Language Models

Paper • 2501.02157 • Published 12 days ago • 27

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Paper • 2408.16737 • Published Aug 29, 2024 • 1

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Paper • 2410.13639 • Published Oct 17, 2024 • 17

Reinforcement Learning Enhanced LLMs: A Survey

Paper • 2412.10400 • Published Dec 5, 2024

Personalized Audiobook Recommendations at Spotify Through Graph Neural Networks

Paper • 2403.05185 • Published Mar 8, 2024 • 23

Dynamic graph neural networks for enhanced volatility prediction in financial markets

Paper • 2410.16858 • Published Oct 22, 2024

Cooperative Graph Neural Networks

Paper • 2310.01267 • Published Oct 2, 2023 • 1

A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection

Paper • 2307.03759 • Published Jul 7, 2023

Spatio-Temporal Graph Neural Networks: A Survey

Paper • 2301.10569 • Published Jan 25, 2023

A Survey of Graph Neural Networks for Social Recommender Systems

Paper • 2212.04481 • Published Dec 8, 2022

Multi-Reranker: Maximizing performance of retrieval-augmented generation in the FinanceRAG challenge

Paper • 2411.16732 • Published Nov 23, 2024 • 1

FinGen: A Dataset for Argument Generation in Finance

Paper • 2405.20708 • Published May 31, 2024

'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization

Paper • 2408.03762 • Published Aug 7, 2024

MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

Paper • 2411.03314 • Published Nov 5, 2024

A Survey of Large Language Models in Finance (FinLLMs)

Paper • 2402.02315 • Published Feb 4, 2024

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 77

GeAR: Generation Augmented Retrieval

Paper • 2501.02772 • Published 9 days ago • 20

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 7 days ago • 75

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published 7 days ago • 33

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 7 days ago • 78

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published 7 days ago • 47

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 7 days ago • 218

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 11 days ago • 78

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 8 days ago • 61

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 19 days ago • 22

Synthetic Vision: Training Vision-Language Models to Understand Physics

Paper • 2412.08619 • Published Dec 11, 2024

Large Action Models: From Inception to Implementation

Paper • 2412.10047 • Published Dec 13, 2024 • 32

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published 9 days ago • 40

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

Paper • 2501.03936 • Published 8 days ago • 18

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

Paper • 2501.03916 • Published 8 days ago • 14

Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers

Paper • 2501.02393 • Published 11 days ago • 6

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published 5 days ago • 56

Infecting Generative AI With Viruses

Paper • 2501.05542 • Published 6 days ago • 12

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Paper • 2501.05707 • Published 5 days ago • 16

Demystifying Domain-adaptive Post-training for Financial LLMs

Paper • 2501.04961 • Published 6 days ago • 9

Enhancing Human-Like Responses in Large Language Models

Paper • 2501.05032 • Published 6 days ago • 41

An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published 6 days ago • 34

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 2 days ago • 62

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 4 days ago • 45

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 4 days ago • 19

O1 Replication Journey: A Strategic Progress Report -- Part 1

Paper • 2410.18982 • Published Oct 8, 2024 • 3

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 7 days ago • 30

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published 5 days ago • 24