Jollyvids

The channel was originally filmed in Ollie’s living room. Its success allowed them to hire a small team (including members like Mike and Grace) and produce professional, long-form content specifically optimized for television viewing.

Signature Content Series

The golden age of the internet was over. Everyone agreed on that. The web had become a wasteland of rage-bait, doomsday scrolling, and cynical comment sections. People were angrier, lonelier, and more exhausted than ever before.

We present JollyVids, a curated collection of more than 1.2 million short video clips (average length ≈ 7 seconds) spanning 150 semantic categories, sourced from open‑license platforms. Each clip is paired with high‑quality textual captions, temporally aligned audio transcripts, and fine‑grained action annotations. JollyVids is designed to address three shortcomings of existing video corpora: (1) limited semantic diversity, (2) poor alignment between visual and linguistic modalities, and (3) insufficient scale for training modern transformer‑based video‑language models. We provide extensive baseline experiments on video‑text retrieval, zero‑shot video classification, and video captioning, demonstrating that models pretrained on JollyVids outperform those trained on previous datasets by 4–12% on standard downstream benchmarks.