At IBC, Discover How Media and Entertainment Audience Experiences Harness Generative AI

by Rick Champagne

Generative AI is enhancing media and entertainment by enabling hyper-personalized, dynamic and immersive fan experiences.

At IBC, a trade show focused on the intersection of media, entertainment and technology, NVIDIA and its partners will showcase the latest generative AI innovations helping media companies engage audiences with increasingly sophisticated digital experiences.

The spotlight will be on NVIDIA Holoscan for Media, an AI-enabled, software-defined platform for live media. It allows live video pipelines to run on the same infrastructure as AI, providing access to AI clusters for training and inference and bringing generative AI capabilities on premises via partner offerings. Developers can use the platform to easily integrate NVIDIA software development kits (SDKs), giving end users access to the latest AI technologies.

NVIDIA will also demonstrate how developers can use NVIDIA AI Workbench to create personal generative AI support assistants trained on provided documents. The toolkit enables easy GPU workstation setup and empowers developers to work, manage and collaborate across heterogeneous platforms — regardless of their skill level.

In addition, attendees will be able to step into the spotlight as the star of their very own custom sports trading card using the power of guided generative AI. NVIDIA will demonstrate this technology in the Dell booth in hall 7, stand 7.A45.

Partners Deliver Transformative Innovations

NVIDIA’s partners, including members of the NVIDIA Inception program for startups, will put their latest and greatest generative AI technologies on display at IBC.

Monks, a global marketing and technology services company, will be showcasing an AI computer vision demo on the NVIDIA Holoscan for Media platform that identifies objects, brand logos and other features in a live broadcast to create a searchable database of media in real time. Monks is a leading system integrator for Holoscan for Media and designed the cloud-based workflow using the company’s AI-centric solution Monks.Flow to help broadcast, media and entertainment companies develop hyper-personalized content for distribution across new media channels at speed and scale. Stop by Monks’ booth 14.AIB4 in the AI Tech Zone.

Speechmatics will showcase its real-time automatic speech recognition technology, featuring the highest accuracy with the lowest latency in the market. Its speech-to-text models assist in transcription and improve the quality and efficiency of offline and live broadcast services. Attendees can find Speechmatics in hall 8, stand 8.B77b.

Qvest will demonstrate an NVIDIA-accelerated video metadata capture and story recommendations engine. Media companies manage vast amounts of video content, making it challenging and time-consuming to locate, catalog and compile it into finished assets. Qvest’s AI video discovery engine, built on NVIDIA NIM microservices, accelerates these processes by automating the data capture of video files. This streamlines a user’s ability to contextualize how discovered videos can fit in their intended story. Attendees can find Qvest in hall 10, booth 10.C24.

Moments Lab will showcase the latest features powered by its award-winning, multimodal AI indexing model MXT-1.5. Attendees can see how MXT-1.5’s new automatic sound bite feature highlights the best quotes in a video, as well as how its moment-based and timeline searches enable editors and producers to find the exact clips they need in seconds. Moments Lab will also demo its scalable suite of products for media, sports and entertainment organizations, including Just Index with MXT-1.5, Cloud Media Hub, Live Asset Manager and Media Marketplace. Attendees can find Moments Lab in hall 5, stand 5.H60.

Deepdub will present a technical demo of its AI-driven localization technology at the AWS booth at Stand 5.C90, offering a look at new capabilities and technology that enable seamless and authentic multilingual content localization. Deepdub will be presenting a joint case study with Paul Robinson, President at Kartoon Channel on September 15 at 3 p.m. at the IBC AI Tech Zone. Attendees can visit Deepdub in hall 14, stand 14.AI10, in the AI Tech Zone.

Alugha will demonstrate its NVIDIA GPU-powered, AI-driven technologies for multilingual content processing. Alugha enables creators to produce and distribute videos in multiple languages, breaking down language barriers to help reach global audiences. Using the performance and versatility of NVIDIA technology, the company trains complex models efficiently, delivering high-quality, scalable language solutions. Attendees can find Alugha at hall 3, stand 3.B54-4.

Mobius Labs’ Aana SDK is an open-source toolkit designed for creating and deploying multimodal AI applications across text, images, audio and video. It offers high efficiency with models up to 10x smaller and faster that run on consumer hardware, making advanced AI more accessible and cost-effective. Aana’s modular design supports rapid, large-scale media and entertainment solutions, fostering innovation in the AI community. Find Mobius Labs in hall 14, stand 14.AIP2.

Bria will highlight its open and responsible generative AI platform, which is designed to equip developers in the media and entertainment industry with tools and capabilities, including source-available foundation models, APIs, and SDKs, for streamlining creative workflows using copyright- and privacy-cleared solutions for commercial use. These include a patented attribution engine that benefits data owners and artists. Attendees can explore real-world use cases, including AI-driven branded content generation at scale, dynamic media personalization, user-generated content campaigns for audience engagement and more. Find Bria in the AI Tech Zone, hall 14, stand 14.AI13.

Beamr will showcase a live 4Kp60-optimized content-adaptive application powered by the Holoscan for Media platform. Offering maximum efficiency without compromising quality, Beamr technologies can help save up to 50% on cloud storage and bandwidth, while enabling a new pipeline with faster video creation and delivery for generative AI applications. Visit Beamr in booth 7.A53.

Twelve Labs will demonstrate capabilities that allow computers to understand video content similarly to human cognition. Their video foundation model enables semantic video search, classification and other tasks by analyzing various modalities such as audio, speech and visual elements. Twelve Labs will highlight how its models can dynamically extract metadata, enhancing content discoverability and management within extensive video libraries to help enable the future of hyper-personalized content. Find Twelve Labs in hall 6, stand C22.

See It in Action

Generative AI is set to drive further advances in media and entertainment, offering increased efficiency, personalization and engagement.

Join NVIDIA at IBC at the Dell booth in hall 7, stand 7.A45, to learn about the emerging technologies in this space and how they can be used to drive innovation.