Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
Keyword(s):
Keyword(s):