Visual captions: Using large language models to augment video conferences with dynamic visuals Google AI Google AI Blog
Posted by Ruofei Du, Research Scientist, and Alex Olwal, Senior Staff Research Scientist, Google Augmented Reality Recent advances in video conferencing have significantly improved remote video communication through features like live captioning and noise cancellation. However, there are various situations where dynamic visual augmentation would… Read More »Visual captions: Using large language models to augment video conferences with dynamic visuals Google AI Google AI Blog