Capturing Temporal Structures for Video Captioning by Spatio-temporal Contexts and Channel Attention Mechanism.

Evaluation results

Evaluation details

1