Sam Liang is making things easier for the creators of the NVIDIA AI Podcast — and just about every remote worker.
He’s the CEO and co-founder of Otter.ai, which uses AI to produce speech-to-text transcriptions in real time or from recording uploads. The platform has a range of capabilities, from differentiating between multiple people, to understanding accents, to parsing through various background noises.
And now, Otter.ai, an NVIDIA Inception member, is making live captioning possible on a variety of platforms, including Zoom, Skype and Microsoft Teams. Even Liang’s conversation with AI Podcast host Noah Kravitz was captioned in real time over Skype.
This new capability has been enthusiastically received by remote workers — Liang says that Otter.ai has already transcribed tens of millions of meetings.
Liang envisions even more practical effects of Otter.ai’s live captions. The platform can already identify keywords. Soon he thinks it’ll be recognizing action items, helping manage agendas and providing notifications.
Key Points From This Episode:
- Otter.ai was founded in 2016 and is Liang’s second startup, after Alohar, a company focused on mobile behavior services. Once Alohar was acquired, Liang reflected that he needed better tools to help transcribe and share meetings, inspiring him to found Otter.ai.
- The company’s AI model was built from scratch. Although Siri and Alexa predate it, Otter.ai needed to comprehend multiple voices that could overlap and vary in accents — a different, more complex task than understanding and responding to just one voice.
“Though it’s been growing steadily before COVID, people have been using Otter on their laptop or on iOS or Android devices … you can use it anywhere.” — Sam Liang [7:32]
“Otter is your new meeting assistant. People will have the peace of mind that they don’t have to write down everything themselves.” — Sam Liang [22:07]
You Might Also Like:
Research engineer Sam Shleifer talks about Hugging Face’s natural language processing technology, which is in use at over 1,000 companies, including Apple, Bing and Grammarly, across fields ranging from finance to medical technology.
Serial entrepreneur Andrew Mason talks about his company, Descript Podcast Studio, which is using AI, NLP and automatic speech synthesis to make podcast editing easier and more collaborative.
SoundHound made its name as a music identification service. Since then, it’s leveraged its 10+ years in data analytics to create a voice recognition tool that companies can bake into any product. SoundHound VP of Product Marketing Mike Zagorsek speaks about how the company has grown into a significant player in voice-driven AI.
Tune in to the AI Podcast
Get the AI Podcast through iTunes, Google Podcasts, Google Play, Castbox, DoggCatcher, Overcast, PlayerFM, Pocket Casts, Podbay, PodBean, PodCruncher, PodKicker, Soundcloud, Spotify, Stitcher and TuneIn. If your favorite isn’t listed here, drop us a note.
Make the AI Podcast Better
Have a few minutes to spare? Fill out this listener survey. Your answers will help us make a better podcast.
NVIDIA Inception is the leading acceleration platform for AI, data science and HPC startups. Companies within the program benefit from go-to-market support, free access to NVIDIA’s self-paced Deep Learning Institute trainings, preferred pricing on NVIDIA GPUs and over $100,000 in credits through our cloud computing partners. Apply now.