Model 70 Bolt Disassembly

BOLT: Boost Large Vision-Language Model Without Training for Long-Form Video Understanding

Abstract: Large video-language models (VLMs) have demonstrated promising progress in various video understanding tasks. However, their effectiveness in long-form video analysis is constrained by ...


