MVTrack dataset is designed to fill the gaps in the field of MVOT. Compared to single-view datasets, we maintain competitive class diversity while adding multi-view capabilities. Compared to MVOT datasets, we provide significantly richer object categories and more videos with practical camera setups. MVTrack dataset is the only dataset that combines multi-view tracking, rich object categories, absent label annotations, and calibration information.
Below are the results on the MVTrack dataset. MITracker provides multi-view tracking results, whereas other single-view tracking methods yield results from individual views.
Model | AUC(%) | PNorm(%) | P(%) |
---|---|---|---|
DiMP | 43.14 | 59.52 | 53.13 |
PrDiMP | 48.61 | 66.09 | 58.93 |
MixFormer | 57.59 | 75.44 | 67.72 |
OSTrack | 60.04 | 77.72 | 70.06 |
GRM | 52.53 | 69.91 | 62.31 |
SeqTrack | 58.37 | 76.63 | 69.03 |
ARTrack | 53.23 | 70.25 | 62.49 |
HIPTrack | 60.45 | 78.92 | 70.53 |
EVPTrack | 61.37 | 79.76 | 71.97 |
AQATrack | 61.93🥉 | 80.00 🥉 | 72.69 🥉 |
ODTrack | 63.36 🥈 | 82.25 🥈 | 74.46 🥈 |
SAM2 ♦ | 46.49 | 63.12 | 56.82 |
SAM2Long ♦ | 55.30 | 72.84 | 67.40 |
MITracker (ours) | 71.13 🥇 | 91.87 🥇 | 83.95 🥇 |