TencentAILab-CVC
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Python 6.2k 584
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Python 5k 408
[CVPR 2024 & TPAMI 2025] UniRepLKNet
Python 1.1k 60
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.
Python 773 57
Official implementation of SEED-LLaMA (ICLR 2024).
Python 642 33
Multimodal Models in Real World
Jupyter Notebook 556 23