-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is MOTIP a class-agnostic model for tracking-by-detection task? #36
Comments
I'm sorry for the delay in my responses. I'll be in the hospital for a while (maybe a month or more), so my replies might be slower than usual. Thanks for your understanding and patience. Thanks for your interest in our work. |
Thanks for the excellent repo! So do you mean that we can separate detector and SeqDecoder during inference? For instance, I get a general usage detector (DETR or YOLO series), can I then do it like Sort those tracking-by-detection methods? Or we need to train both detector and SeqDecoder jointly and then do the inference? Clarify me if there's any misunderstanding! Thanks a lot! |
The codebase we provide in this repo does not support this feature. My above reply means that our thinking (ID prediction for target association) can be extended to the association-only model (similar to ReID methods) rather than the joint detection and association model (as we did). If you want to get an association-only tracking-by-detection model, you need to re-write your own code based on ours. Once you ensure that feature extraction in SeqDecoder is decoupled from the detector, then this SeqDecoder can be combined with any trained detector to become a tracking-by-detection method you mentioned (not tied to any specific detector). You can refer to work like MASA/PuTR for inspiration, where they trained a decoupled feature extractor.
For this repo, YES. Because the detector (Deformable DETR) also plays the role of feature extractor for SeqDecoder. Therefore, the detector and SeqDecoder are coupled together, trained together and inferred together. I hope this clarifies your concerns. Please let me know if you need additional details. |
Thanks for prompt response! Appreciated!
Just one last question, this repo used Deformable DETR as the detector as well feature extractor for SeqDecoder, is it feasible to replace it with other transformer? Thanks so much! |
Yes, of course. In this repo, we also provide MOTIP-DAB-Deformable-DETR except the default MOTIP-Deformable-DETR, as reported in dancetrack results. Specifically, you can refer to the following code to use your own transformer detector ( Lines 91 to 100 in 1dda4c4
Additionally, we need to make some modifications to the return values of the DETR detector to ensure it returns the target features ( MOTIP/models/deformable_detr/deformable_detr.py Lines 191 to 194 in 1dda4c4
|
Hi, thank you for amazing work!
Is MOTIP a class-agnostic model for tracking-by-detection task?
For example, if I have my Deformable DETR checkpoint trained on some marine dataset, can I use your pre-trained MOTIP models without any re-training or fine-tuning? Specifically, is the SeqDecoder module class-agnostic?
The text was updated successfully, but these errors were encountered: