I’ve been in the media business for a very long time, but I’ve never been a video-first journalist. My instincts have always leaned toward reporting, writing, and structuring ideas in a way that unfolds over paragraphs, not frames. Video has been part of the mix, increasingly so over the years, but it was never the foundation.

That’s why Jon Youshaei’s session at SXSW felt less like another “pivot to video” talk and more like a masterclass in how the medium actually works now. Youshaei is a former product and creator partnerships leader at Google and Instagram who has spent nearly a decade working directly with top creators and platforms—and now operates as a full-time creator himself, giving him a rare dual perspective on both how the algorithms work and what actually performs in practice.

Video is no longer an emerging format. It is the default interface through which people consume information, entertainment, and, increasingly, journalism. And within that dominance, a hierarchy has emerged that is much more defined than it was even a few years ago.

image

Short-form has become the front door. This is the discovery layer and where habits are formed.TikTok now reaches roughly 37% of U.S. adults, up from just 21% in 2021, according to Pew Research. Among teens, the numbers are even more pronounced—about 67% use TikTok, and roughly 16% say they are on it almost constantly. YouTube, meanwhile, is effectively universal, reaching 83% of U.S. adults and 93% of teens.

But the idea that this shift eliminates long-form video doesn’t hold up. In fact, the opposite is happening. Long-form is not disappearing; it’s consolidating into environments where attention can be sustained. Nielsen reports that streaming now accounts for 44.8% of total U.S. television viewing, surpassing broadcast and cable combined. Within that shift, YouTube alone represents more than 12% of total TV consumption in the U.S.. That makes it one of the largest “channels” in the country.

Regulators in the UK have observed something similar. Ofcom’s Media Nations report notes that while short videos dominate overall consumption on YouTube, many of the most popular videos increasingly resemble traditional television formats—interviews, episodic programming, and long-form storytelling.

Online video consumption is increasingly bifurcated: short-form content captures attention. Long-form converts it into time.

Jon Youshaei says that runs counter to many of the instincts that media companies have built over the last decade. His first point is almost countercultural at this stage: not every idea should be a video. “The biggest mistake… is that they make too many videos that, quite frankly, should be an article,” he says.

For years, video carried an implicit premium. It was assumed to be more engaging, more monetizable, more future-proof. What Youshaei is really arguing is that the constraint has shifted. The barrier to producing video has collapsed. The differentiator is now editorial judgment—knowing when video is the right medium and when it isn’t.

He introduces what he calls a “visual anchor” as the test. A story earns its place as a video when there is something that must be seen to be understood. He cites an example of Venezuelan currency being turned into sculptures—an image that communicates economic collapse more effectively than any paragraph of analysis. “You don’t have to know anything… to see that something’s going on here,” he says.

image

That framing is useful because it restores discipline to a medium that has become frictionless. It forces a distinction between what is inherently visual and what is simply being translated into video because the platform rewards it. 

In a feed-driven environment, the first frame has become the unit of competition. It functions as a headline, thumbnail, and narrative hook simultaneously. The viewer is not waiting for context. They are scanning for clarity. YouTube’s own guidance reflects this change, advising creators to ensure that short-form clips deliver immediate context and can be understood without sound.

“We’re past [the first few seconds]… It’s all about the zero-second mark,” he says. 

For anyone coming from a traditional editorial background, this is a fundamental shift. We are used to building narratives. The platforms reward immediate comprehension.

There’s also a more subtle adjustment happening at the level of presentation. Youshaei encourages creators to “go physical”—to use props, objects, and real-world setups instead of relying entirely on digital graphics.

That advice sounds stylistic, but it reflects something deeper about how people process information in crowded feeds. In an environment saturated with polished visuals and AI-generated imagery, something tangible reads as more immediate. It reduces cognitive load. It communicates faster. The more quickly a viewer understands what they are looking at, the more likely they are to stay.

If the first half of his framework is about choosing the right ideas, the second half is about structuring them. Youshaei focuses on outliers—videos that dramatically outperform the size of their producing channel.

That’s where he sees the real signal. Not in averages or best practices, but in anomalies. His conclusion is straightforward: success in modern video is increasingly format-driven. “Everything is a remix… copy with taste,” he says.

One of the more practical implications of this shift is how content gets distributed. Short-form is no longer just a format. It’s a layer. “It’s better that you clip yourself,” Youshaei says, referring to the growing practice of extracting short segments from long-form content and distributing them independently.

YouTube has formalized this behavior by allowing Shorts to link directly back to longer videos. The workflow is becoming standardized: produce long-form content, extract the most compelling moments, distribute them as short-form, and use those clips to drive discovery.

Youshaei also dismantles one of the more persistent excuses in the industry—the idea that certain categories don’t lend themselves to video. “This doesn’t work for a boring niche… oh really?” he says.

The data support him. Categories that were once considered too narrow or too technical—finance, logistics, industrial processes—are now generating large audiences on short-form platforms. They just need to be packaged for the format that people are looking for.

And then there is his closing point, which feels less like strategy and more like operating discipline. “Perfectionism is procrastination in disguise,” he says.

Short-form is where ideas are tested and attention is captured. Long-form is where trust is built and time is spent. Formats provide the structure. Clips provide the distribution. Volume provides the learning.

The industry spent years asking whether video mattered. That question has been resolved.

The pivot to video never really ended. LLMs are just finishing off consuming what SEO and Social Media began.  

And the very idea of an Open Web is being left behind.

You can watch his entire talk here.