Yugang Li1, *, Haibo Sun1, Zhe Chen1, Yudan Ding1, Siqi Zhou2
CMC-Computers, Materials & Continua, Vol.65, No.3, pp. 2529-2541, 2020, DOI:10.32604/cmc.2020.011886
- 16 September 2020
Abstract Referring expressions comprehension is the task of locating the image region
described by a natural language expression, which refer to the properties of the region or
the relationships with other regions. Most previous work handles this problem by
selecting the most relevant regions from a set of candidate regions, when there are many
candidate regions in the set these methods are inefficient. Inspired by recent success of
image captioning by using deep learning methods, in this paper we proposed a framework
to understand the referring expressions by multiple steps of reasoning. We present a
model… More >