Crowdsourcing on Natural Language Descriptions for Visual Object Tracking

TL;DR We plan to carry out a crowd sourcing project that would annotate existing visual object tracking benchmark dataset with natural language (NL) descriptions. Following carefully designed experiments and instructions, we will obtain NL descriptions that would serve as training data for our research on tracking with NL.

read the full report here