
Paper | OVTrack: Open-Vocabulary Multiple Object Tracking

与阳光共进早餐

1 basic info

OVTrack: Open-Vocabulary Multiple Object Tracking

paper
website
CVPR 2023
model name : OVTrack

2 introduction

Open-vocabulary MOT: tracking objects from categories beyond the predefined training categories.

The classes of the objects of interest are given at test time.

Detection: similar to open-vocabulary detection (OVD), use CLIP to align image features with text embeddings.
Association: CLIP feature distillation helps in learning better appearance representations.
In addition, denoising diffusion probabilistic models (DDPMs) are used to build an effective data hallucination strategy.
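The detection idea above (aligning region features with text embeddings) can be illustrated with a minimal sketch. It assumes each detected region already has an embedding in the same space as the text embeddings of the class names; the toy 3-d vectors, the `classify_region` helper, and the temperature value are hypothetical and stand in for real CLIP outputs. It is not the paper's implementation.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def classify_region(region_emb, text_embs, temperature=0.01):
    """Score one region embedding against every class-name text embedding.

    region_emb: embedding of a detected region (assumed to live in the text space).
    text_embs:  dict mapping class name -> text embedding (toy values here).
    temperature: hypothetical scaling factor for the similarities.
    """
    r = l2_normalize(region_emb)
    names = list(text_embs)
    sims = [sum(a * b for a, b in zip(r, l2_normalize(text_embs[n]))) for n in names]
    probs = softmax([s / temperature for s in sims])
    return dict(zip(names, probs))

# Toy embeddings: the class list is supplied at test time and can contain
# categories that were never seen during training.
text_embs = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.0, 1.0, 0.0],
    "zebra": [0.0, 0.0, 1.0],
}
probs = classify_region([0.1, 0.2, 0.9], text_embs)
best = max(probs, key=probs.get)
print(best)
```

Because the classifier is just a similarity lookup against text embeddings, adding a new class only requires adding its text embedding to `text_embs`.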

OVTrack sets a new SOTA on the TAO benchmark while using only static images as training data.

3 open-vocabulary MOT

The task setup is basically the same as in OVD.

The benchmark builds on the TAO benchmark.

4 OVTrack

framework:

OVTrack's functionality: localization, classification, and association.

Localization: train Faster R-CNN in a class-agnostic manner.
Classification: first, replace the original classifier in Faster R-CNN with a text head, and add an image head that generates embeddings. Then, use the CLIP text and image encoders to supervise these two heads. The supervision on the image head and the text head gives $L_{image}$ and $L_{text}$, respectively.
Association: use contrastive learning with paired objects in $I_{key}$ and $I_{ref}$.
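The association step can be illustrated with a small contrastive-loss sketch. This is a generic InfoNCE-style formulation chosen for illustration, not necessarily the exact loss used in the paper; the `contrastive_loss` function, the instance-id lists, and the temperature are all hypothetical. Each object embedding from $I_{key}$ is pulled towards the embedding of the same instance in $I_{ref}$ and pushed away from the others.

```python
import math

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def contrastive_loss(key_embs, ref_embs, key_ids, ref_ids, temperature=0.1):
    """InfoNCE-style loss over object pairs from a key and a reference image.

    key_embs / ref_embs: lists of object embeddings from I_key / I_ref.
    key_ids / ref_ids:   instance ids; equal ids mark the same object (a positive pair).
    Returns the mean loss over all positive pairs (0.0 if there are none).
    """
    losses = []
    for q, qid in zip(key_embs, key_ids):
        logits = [dot(q, r) / temperature for r in ref_embs]
        m = max(logits)
        # log-sum-exp over all reference objects (positives and negatives)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        for logit, rid in zip(logits, ref_ids):
            if rid == qid:
                losses.append(log_denom - logit)
    return sum(losses) / len(losses) if losses else 0.0

# Toy example with two objects per image.
key = [[1.0, 0.0], [0.0, 1.0]]
ref_matched = [[1.0, 0.0], [0.0, 1.0]]   # same instance has a similar embedding
ref_swapped = [[0.0, 1.0], [1.0, 0.0]]   # same instance has a dissimilar embedding
ids = [1, 2]

loss_matched = contrastive_loss(key, ref_matched, ids, ids)
loss_swapped = contrastive_loss(key, ref_swapped, ids, ids)
print(loss_matched < loss_swapped)
```

The loss is small when embeddings of the same instance agree across the two images and large when they do not, which is the signal that trains the appearance embeddings used for matching.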

Learning to track without video data:

Use the large-scale, diverse image dataset LVIS to train OVTrack.
Propose a DDPM-based data hallucination method, so that association can be learned from static images.

© Copyright belongs to the author. For reprints or content cooperation, please contact the author.