    Google Makes Speech-to-Text API More ‘Business Friendly’

    Written by

    Jaikumar Vijayan
    Published April 10, 2018

      Google has rolled out several major updates to its Cloud Speech-to-Text speech recognition technology. 

      The overhaul is the biggest since Google announced the service two years ago and is designed to make Speech-to-Text more useful for businesses. 

      Among the updates are pre-built models for transcribing phone calls and video, features that support automatic punctuation, and a new tagging and grouping mechanism for transcription workloads. In keeping with its business focus, the updates also come with a standard service level agreement (SLA) guaranteeing 99.9 percent availability. 
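To put the 99.9 percent figure in perspective, the short sketch below converts an availability percentage into the downtime it would still permit. It is only illustrative arithmetic: the function name and the 30-day and 365-day periods are assumptions made here, and the actual SLA terms define how availability is measured.

```python
def allowed_downtime_minutes(availability_percent, period_days):
    """Return the minutes of downtime an availability target still permits.

    Assumption: availability is measured as a simple fraction of wall-clock
    time over the period. The real SLA may define it differently.
    """
    total_minutes = period_days * 24 * 60
    unavailable_fraction = 1 - availability_percent / 100
    return round(total_minutes * unavailable_fraction, 2)


# Hypothetical measurement periods chosen for illustration.
per_30_days = allowed_downtime_minutes(99.9, 30)
per_year = allowed_downtime_minutes(99.9, 365)
print(f"30 days: {per_30_days} minutes; 365 days: {per_year} minutes")
```

Under these assumptions, a 99.9 percent target leaves room for roughly 43 minutes of downtime in a 30-day period.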

      “Access to quality speech transcription technology opens up a world of possibilities for companies that want to connect with and learn from their users,” wrote Google product manager Dan Aharon in a blog post on April 9. The update takes advantage of Google’s latest research into machine learning technology, he said. 

      Google announced Cloud Speech-to-Text in June 2016. The technology gives developers a way to convert audio to text. Google has described Speech-to-Text as an API that applies neural network models to the task of converting speech to text. The technology is designed to process both pre-recorded audio and real-time streaming audio, so it can work in a call-center setting just as well as it might in transcribing voicemail messages. 

      The API can be used to transcribe short and long-form audio in 120 languages and dialects in near real time. It is tailored to recognize and transcribe speech in real-world conditions involving multiple speakers and background noise. According to Google, Speech-to-Text can even transcribe proper nouns and appropriately format content such as dates and phone numbers. 

      Because Cloud Speech-to-Text is powered by Google’s machine learning technology, the accuracy of its transcription improves over time, the company has claimed. 

      Aharon listed several enterprise use cases for the technology, including human-computer interactions, call-center analytics, and automated transcription of phone calls, audio and video content. 

      As an example of the newly updated API’s capabilities, Aharon pointed to a TV broadcast of a game involving four speakers and heavy background noise. Depending on the length of the game, Speech-to-Text would be able to transcribe the contents of the broadcast in about two hours, he claimed. 

      The pre-built models that Google has made available with the latest update include those tailored for specific use cases, such as video transcription and phone call transcription. 

      The updates reflect feedback from organizations that have been testing Cloud Speech-to-Text since its launch in 2016, Aharon said. Information provided by customers of the technology has allowed Google to prioritize features and focus on what to do next, he said. 

      Pricing for the API starts at $0.006 per 15 seconds of audio. Video models start at $0.012 per 15 seconds, though they are available at a discount through May 31. 
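As a rough illustration of what those rates mean for longer recordings, the sketch below estimates the list-price cost of a transcription job. The two rates come from the figures quoted above; the function name, the choice to round partial 15-second increments up, and the omission of the discount and any other pricing terms are assumptions made for this example.

```python
import math

# Rates quoted in the article, in US dollars per 15-second increment.
STANDARD_RATE = 0.006
VIDEO_RATE = 0.012
INCREMENT_SECONDS = 15


def estimate_cost(audio_seconds, rate_per_increment):
    """Estimate the list-price cost of transcribing a recording.

    Assumption: a partial 15-second increment is billed as a full one.
    Discounts and any other pricing terms are ignored.
    """
    increments = math.ceil(audio_seconds / INCREMENT_SECONDS)
    return round(increments * rate_per_increment, 4)


# One hour of audio at each quoted rate.
one_hour_standard = estimate_cost(3600, STANDARD_RATE)
one_hour_video = estimate_cost(3600, VIDEO_RATE)
print(f"standard: ${one_hour_standard:.2f}, video: ${one_hour_video:.2f}")
```

On these assumptions, an hour of audio is 240 increments, which works out to about $1.44 at the standard rate and $2.88 at the video rate.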

      The updates to the Speech-to-Text API are the second major announcement from Google’s Cloud AI speech products group in recent days. Last month, Google introduced Cloud Text-to-Speech, a speech synthesis API that converts text to speech. 

      Vijayan is an award-winning independent journalist and tech content creation specialist covering data security and privacy, business intelligence, big data and data analytics.
