NVIDIA - World Leader in Visual Computing Technologies
USA - United States
USA - United States

ARG - Argentina

BRA - Brasil

CHL - Chile

CHN - China

CLM - Colombia

DEU - Germany

ESP - Spain

FRA - France

GBR - United Kingdom

IND - India

ITA - Italy

JPN - Japan

KOR - Korea

MEX - Mexico

POL - Poland

RUS - Russia

TWN - Taiwan

THA - Thailand

TUR - Turkey

USA - United States

VEN - Venezuela

Change default
  • Drivers
    • GeForce Drivers
    • All NVIDIA Drivers
  • Products
    • Processors
      • GeForce
      • Quadro
      • Tegra
      • Tesla
      • Legacy
    • Technologies
      • SLI
      • PhysX
      • Optimus
      • Maximus
      • CUDA
      • Windows 8
      • All Technologies
    • Cloud Computing
      • Overview
      • Enterprise
      • Gaming
    • 3D Vision
    • Platforms
      • Desktops
      • Notebooks
      • Tablets
      • Smartphones
      • Workstations
      • Servers
      • High Performance Computing
      • Automotive
  • Communities
    • GeForce.com
    • TegraZone.com
    • 3D Vision Live
    • GPU Technology Conference
    • CUDA Zone
    • Developer Zone
    • Forums
    • GPU Venture Zone
    • PartnerForce
    • NVIDIA Research
  • Support
  • Shop
  • About NVIDIA
    • Company Information
    • Newsroom
    • NVIDIA Blog
    • Investors
    • Citizenship
Blog Home
  • Home
  • Corporate
  • 3D Vision
  • Gaming
  • Mobile
  • Notebook
  • Software
  • Supercomputing
  • Workstation

Nexiwave and the Future of Voice Search – Accelerated by CUDA

Calisa-Cole-headshot-2 By Calisa Cole on Jun 22 2010
In Software
No Comments 0 Comments

Recently we had a chance to interview CEO Ben Jiang of speech indexing startup, Nexiwave. Ben aims to help people retrieve spoken words as easily as we google text and images. Ben co-founded Cambridge, Mass.-based Nexiwave in 2008 with Nickolay Shmyrev and graduated from MIT, where he initiated a high-performance computing cluster. Take a look below for an excerpt of the interview.

Benjiang
Nexiwave CEO Ben Jiang


NVIDIA: Ben, what makes speech indexing compelling?

Ben: Ninety percent of human communication is through speech. The amount of spoken words that could potentially be indexed and searched is staggering. Skype callers have logged over 100 billion minutes of talk time. Conference call companies are carrying over a billion minutes of calls per month. There are hundreds of millions of podcasts on the web, with 24 hours of video uploaded to YouTube every minute.

The problem is that today's information retrieval applications, such as internet search, focus on textual content. Information retrieval from speech content still relies primarily on a human's memory. The objective of speech indexing is to enable us to easily extract information from archived audio and video content. Through the Nexiwave system, an end user can easily search the content and locate the exact location of interest, whether it's a word, a phrase or a general topic.

NVIDIA: What are some of the potentially big applications of speech indexing?

Ben: Think about the conference calls that happen 24×7 at companies around the world. We've all had moments where we thought: "Ahh, John said something really useful in the last call. I wish I could remember exactly what he said." In the future, with speech indexing-enabled conference calls, we will be able to easily do that via a quick search to locate the exact audio snippet. Another interesting market is call centers, where the ability to do a deep search (not just time of call and phone number) will enable companies to find out what customers are really telling them. Other markets are e-discovery (in the legal field), recorded educational media, podcasts and audio-centric enterprises.

NVIDIA: What stage is your technology in?

Ben: Nexiwave 1.0 was released in October 2009. Nexiwave 2.0, our NVIDIA GPU-enabled version, was released on June 3, 2010 and is in production. We offer a SaaS (software as a service) and cloud computing solution as well as software licenses.

NVIDIA: What is the connection between Nexiwave and CMU Sphinx, the speech recognition system from Carnegie Mellon?

Ben: CMU Sphinx is a very popular open source speech processing engine. Our system is built on top of it with many of our own proprietary improvements, such as CUDA-based acoustic scoring (a total re-write of the core acoustic scoring code). We are one of the major commercial companies contributing to it through code fixes, developer resources and user forum support.

NVIDIA: Where does the GPU fit into this?

Ben: Speech indexing is computationally intensive and has traditionally been very expensive. Speech indexing can be efficiently processed in parallel which means the GPU is a perfect fit for it. The GPU will solve the cost issue associated with indexing vast amounts of audio content quickly and accurately.

NVIDIA: How did you like programming/porting in the CUDA C environment?

Ben: Our experience with programming in CUDA C has been enjoyable. The CUDA Best Practices Guide provided tons of help in performance tuning.

NVIDIA: How does CUDA help you?

Ben: Nexiwave has been able to move 75% of our computing processes (or 11 million computation loops per audio minute) to CUDA C. This directly translates into cost reduction (we have released a large number of CPU machines back to our computing provider). The exciting thing about this speedup is that it enables us to move into markets where speech indexing has not been possible before.

Tagged: CUDA, GPGPU

0 Comments Post a Comment

Similar Stories

mars-rover

GPUs Processing Images From the Red Planet

By Gary Rainville on May 17 2012

ccoe-award-1

First Achievement Award Bestowed By CUDA Centers of Excellence

By Chandra Cheij on May 16 2012

iain-couzin-gtc-2012-keynote-2

Using GPUs to Decipher Animal (and Human) Crowd Behavior

By Tony Kontzer on May 16 2012

lego-session-gtc-2012-2

LEGO Locks In On CUDA To Build A Better Business

By Tony Kontzer on May 16 2012

gtc-poster-boards-3

GTC Poster Session Shows Breadth of GPU Research

By Tony Kontzer on May 14 2012

kepler-die-shot

What Makes Kepler Tick?

By Will Park on May 8 2012

supercomputing-pipeline

Contest: What would you do with a petaflop supercomputer?

By Sumit Gupta on Apr 24 2012

nab-2012

GPU’s Transform Digital Content Creation Workflow at NAB 2012

By Greg Estes on Apr 17 2012

NVIDIA CUDA Fellow

NVIDIA Names Four New CUDA Fellows

By Chandra Cheij on Apr 10 2012

fermi die shot-wide-2

No Free Lunch for Intel MIC (or GPU’s)

By Steve Scott on Apr 3 2012

+ More Similar Stories

Subscribe via: RSS Email

Connect & Share: Find us on Facebook Follow us on Twitter Find us on Flickr Watch us on YouTube

X

Enter your email address:

Most Discussed

144 CommentsContest: What would you do with a petaflop supercomputer?posted Apr 24 2012 at 15:52:32 PM
28 CommentsReal Ultrabooks Have GPU’sposted Mar 12 2012 at 15:18:08 PM
26 CommentsNo Free Lunch for Intel MIC (or GPU’s)posted Apr 3 2012 at 09:01:14 AM

Featured Series

  • NVIDIA Stories on TVTV shorts showing how GPUs are used
  • Inner GeekNVIDIA employees telling personal stories of how technology affects their lives
  • GPU Technology ConferenceThe latest GPU Technology Conference updates
  • Kizuna UpdateA series of Operation Kizuna updates detailing how we are aiding Japan's Tsunami recovery

Latest Tweets

nvidia That's a really good sign :) RT @filippospiga: Just after #GTC12 I already look forward for #GTC13 posted May 18 2012 at 13:57:29 PM

Popular Tags

3D 3d vision acer android arm asus CES ces2011 Corporate CSR CUDA Drivers ECS Events Gaming GeForce Global Citizenship GPGPU gpu gpu computing GPU Technology Conference GTC high performance computing hpc Inner Geek Medical Mobile mwc New GPU uses Notebooks NVIDIA NVIDIA Foundation NVIDIA in a Minute Optimus parallel computing Professional quad-core Quadro Steve Wildstrom Supercomputing super phone Tegra Tegra 3 Tesla Visual computing

NVIDIA on YouTube

NVIDIA on Flickr

Archives

NVIDIA Blog Authors

Disclaimer

All company and product names appearing in the NVIDIA Blog are trademarks and/or registered trademarks of NVIDIA Corporation or of their respective owners.

NVIDIA Blog Comment Policy

While we encourage you to interact with us by leaving comments, we reserve the right, in our sole discretion, to remove comments or block readers if they violate any of the following conditions.


To read the NVIDIA Blog commenting guidelines and privacy policy, click here.

Solutions: 3DTV Play | 3D PCs | Optimus | Graphics Cards | High Performance Computing | Visualization | CUDA | Tegra Android App
Corporate: About NVIDIA | Newsroom | Blog | Events | Affiliate Program | Developers | Channel Partners | Investor Relations
Employment | RSS Feeds | Newsletter
Copyright © 2012 NVIDIA Corporation | Legal Info | Privacy Policy