TL;DR
We plan to carry out a crowd sourcing project that would annotate existing visual object tracking benchmark dataset with natural language (NL) descriptions. Following carefully designed experiments and instructions, we will obtain NL descriptions that would serve as training data for our research on tracking with NL.