Does this dataset have a corresponding video-text pair version? Thank you.
Does this dataset have a corresponding video-text pair version? Thank you.