Abstract: Dense captioning creates diverse Region of Interests (RoIs) descriptions for complex visual scenes. While promising results have been obtained, several issues persist. In particular: 1) it ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results