This repository contains the HandsOnVLM model presented in the paper HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction.
Project page: https://www.chenbao.tech/handsonvlm/ Code: https://github.com/Kami-code/HandsOnVLM-release
- Downloads last month
- 9
Inference API (serverless) does not yet support transformers models for this pipeline type.