OpenGVLab

community

https://github.com/opengvlab

opengvlab

OpenGVLab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

lixinhao new activity about 5 hours ago

OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448:RuntimeError: shape '[-1, 0]' is invalid for input of size 1206

lixinhao updated a model about 5 hours ago

OpenGVLab/VideoChat-Flash-Qwen2-7B_res224

lixinhao updated a model about 5 hours ago

OpenGVLab/VideoChat-Flash-Qwen2-7B_res448

View all activity

Organization Card

Community About org cards

OpenGVLab

Welcome to OpenGVLab! We are a research group from Shanghai AI Lab focused on Vision-Centric AI research. The GV in our name, OpenGVLab, means general vision, a general understanding of vision, so little effort is needed to adapt to new vision-based tasks.

Models

InternVL: a pioneering open-source alternative to GPT-4V.
InternImage: a large-scale vision foundation models with deformable convolutions.
InternVideo: large-scale video foundation models for multimodal understanding.
VideoChat: an end-to-end chat assistant for video comprehension.
All-Seeing-Project: towards panoptic visual recognition and understanding of the open world.

Datasets

ShareGPT4o: a groundbreaking large-scale resource that we plan to open-source with 200K meticulously annotated images, 10K videos with highly descriptive captions, and 10K audio files with detailed descriptions.
InternVid: a large-scale video-text dataset for multimodal understanding and generation.
MMPR: a high-quality, large-scale multimodal preference dataset.

Benchmarks

MVBench: a comprehensive benchmark for multimodal video understanding.
CRPE: a benchmark covering all elements of the relation triplets (subject, predicate, object), providing a systematic platform for the evaluation of relation comprehension ability.
MM-NIAH: a comprehensive benchmark for long multimodal documents comprehension.
GMAI-MMBench: a comprehensive multimodal evaluation benchmark towards general medical AI.

Collections 19

spaces 10

InternVL

MVBench Leaderboard

Running on Zero

InternVideo2 Chat 8B HD

ControlLLM

Running on Zero

VideoMamba

VideoChat2

models 140

OpenGVLab/VideoChat-Flash-Qwen2-7B_res224

Video-Text-to-Text • Updated about 5 hours ago • 55

OpenGVLab/VideoChat-Flash-Qwen2-7B_res448

Video-Text-to-Text • Updated about 5 hours ago • 249 • 3

OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448

Video-Text-to-Text • Updated about 5 hours ago • 174 • 4

OpenGVLab/VideoMAEv2-giant

Video Classification • Updated 1 day ago • 6 • 1

OpenGVLab/VideoMAEv2-Huge

Video Classification • Updated 1 day ago • 2

OpenGVLab/VideoMAEv2-Large

Video Classification • Updated 1 day ago • 2

OpenGVLab/VideoMAEv2-Base

Video Classification • Updated 1 day ago • 63

OpenGVLab/InternViT-300M-448px

Image Feature Extraction • Updated 7 days ago • 20.1k • 52

OpenGVLab/InternVL2_5-78B-MPO-AWQ

Image-Text-to-Text • Updated 9 days ago • 251 • 6

OpenGVLab/VideoChat-TPO

Video-Text-to-Text • Updated 13 days ago • 51 • 4

datasets 30

OpenGVLab/MMPR-v1.1

Preview • Updated 25 days ago • 917 • 34

OpenGVLab/MMPR

Preview • Updated 25 days ago • 454 • 44

OpenGVLab/GMAI-MMBench

Preview • Updated 29 days ago • 226 • 14

OpenGVLab/V2PE-Data

Preview • Updated Dec 14, 2024 • 799 • 5

OpenGVLab/InternVL-Domain-Adaptation-Data

Preview • Updated Dec 9, 2024 • 122 • 6

OpenGVLab/GUI-Odyssey

Viewer • Updated Nov 20, 2024 • 7.74k • 12.1k • 10

OpenGVLab/OmniCorpus-YT

Updated Nov 17, 2024 • 527 • 9

OpenGVLab/OmniCorpus-CC-210M

Viewer • Updated Nov 17, 2024 • 208M • 381 • 19

OpenGVLab/OmniCorpus-CC

Viewer • Updated Nov 17, 2024 • 986M • 20.5k • 12

OpenGVLab/MVBench

Viewer • Updated Oct 18, 2024 • 4k • 7.91k • 28