7 32 60

zhang yuechen

julianjuaner

https://julianjuaner.github.io/

julianjuaner

AI & ML interests

Controllable Generation (Customization)

Recent Activity

authored a paper 7 days ago

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

upvoted a paper 7 days ago

Cosmos World Foundation Model Platform for Physical AI

upvoted a paper 8 days ago

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

View all activity

Organizations

julianjuaner's activity

authored a paper 7 days ago

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

Paper • 2501.03931 • Published 8 days ago • 14

upvoted a paper 7 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 8 days ago • 61

upvoted a paper 8 days ago

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

Paper • 2501.03931 • Published 8 days ago • 14

commented a paper 8 days ago

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

Paper • 2501.03931 • Published 8 days ago • 14 •

upvoted a paper 16 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 22 days ago • 72

upvoted a collection 26 days ago

X2I Dataset

Collection

Datasets used in OmniGen-v1 • 5 items • Updated 13 days ago • 9

liked a model 27 days ago

FastVideo/FastHunyuan

Text-to-Video • Updated 8 days ago • 792 • 148

upvoted a paper 28 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

authored a paper about 1 month ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 44

upvoted a paper about 1 month ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 54

commented a paper about 1 month ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published Dec 12, 2024 • 18 •

upvoted 3 papers about 1 month ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published Dec 12, 2024 • 18

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 44

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 105

liked a model about 1 month ago

Yuanshi/OminiControl

Image-to-Image • Updated Dec 10, 2024 • 5.78k • 108

liked a model about 2 months ago

ali-vilab/In-Context-LoRA

Text-to-Image • Updated 29 days ago • 101k • • 519

liked a model 2 months ago

THUDM/CogVideoX1.5-5B-SAT

Image-to-Video • Updated Nov 8, 2024 • 148

liked 2 models 3 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 1.27M • • 8.02k

genmo/mochi-1-preview

Text-to-Video • Updated 28 days ago • 42k • 1.14k

liked a model 4 months ago

alibaba-pai/CogVideoX-Fun-2b-InP

Image-to-Video • Updated Sep 23, 2024 • 340 • 20