hierarchical representations for visual object tracking by detection