Comparison with 3D tracking approaches: SceneTracker, SpatialTracker, and DELTA on in-the-wild videos.

The inference time (in seconds) is noted at the top of each video.