Close
  • Latest News
  • Artificial Intelligence
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Logo
  • Latest News
  • Artificial Intelligence
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Home Applications
    • Applications
    • Virtualization

    Microsoft’s AI-Powered Drawing Bot Turns Text into Images

    By
    Pedro Hernandez
    -
    January 18, 2018
    Share
    Facebook
    Twitter
    Linkedin
      AI for Ocean Research

      Microsoft already has already developed an artificial intelligence application that turns images into descriptive text. For example, the software maker’s Seeing AI app for Apple iOS devices turns images captured by the camera into text that is then spoken aloud to visually impaired users.

      Now, the company’s researchers have built a new AI system, simply dubbed “drawing bot,” that can turn text descriptions into images.

      Before coming full circle, Microsoft Research began its journey with CaptionBot, a machine learning technology that adds captions to photos. They then revisited the company’s research on a neural network-based system that can process visual information as a human would and answers questions about a photo’s content.

      To flesh out the “drawing” part of the new AI app, Microsoft had to devise a technology that would essentially “imagine” or fill out details that may be missing from a caption.

      That’s where a technology known as a Generative Adversarial Network, or GAN, comes into play.

      “The network consists of two machine learning models, one that generates images from text descriptions and another, known as a discriminator, that uses text descriptions to judge the authenticity of generated images,” explained Microsoft in a Jan 18 announcement. “The generator attempts to get fake pictures past the discriminator; the discriminator never wants to be fooled. Working together, the discriminator pushes the generator toward perfection.”

      Microsoft trained the system using datasets comprised of image and caption pairs. It draws images much like an artistically-inclined person would, first creating a rough outline and repeatedly referring to the written description to fill out the finer details.

      To turn long descriptions into detailed images, researchers created an attentional GAN, or AttnGAN, that mimics human attention and can break up a verbose sentence into individual words that are accurately represented as visual elements on-screen.

      The end result, is a text-to-image system with a nearly three-fold increase in image quality compared to previous techniques, claims Microsoft. As shown in this announcement blog, it creates painterly images, in this case depicting a bird on a branch.

      Microsoft’s drawing bot isn’t limited to visuals that are grounded in the real word. The technology can be used to generate fantastical scenes like “a floating double-decker bus,” according to the company.

      It can also fill in the blanks.

      Returning to the bird example, the drawing bot typically draws birds perched on tree branches even if the input text doesn’t even mention a branch. The system takes this liberty because many of the photos used to train the AI showed a bird sitting in a tree.

      Although it may be a while before the company’s text-to-image technology is used to paint masterpieces, Microsoft already foresees some practical applications. In the same way that Cortana and other virtual assistants help busy professionals plan their day and keep to a schedule, drawing bot’s successors may someday act as a sketch assistant for painters or interior designers, the company said. 

      Microsoft published a research paper describing the AttnGAN technology behind the drawing bot.

      Pedro Hernandez
      Pedro Hernandez is a contributor to eWEEK and the IT Business Edge Network, the network for technology professionals. Previously, he served as a managing editor for the Internet.com network of IT-related websites and as the Green IT curator for GigaOM Pro.
      Get the Free Newsletter!
      Subscribe to Daily Tech Insider for top news, trends & analysis
      This email address is invalid.
      Get the Free Newsletter!
      Subscribe to Daily Tech Insider for top news, trends & analysis
      This email address is invalid.

      MOST POPULAR ARTICLES

      Latest News

      Zeus Kerravala on Networking: Multicloud, 5G, and...

      James Maguire - December 16, 2022 0
      I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
      Read more
      Applications

      Datadog President Amit Agarwal on Trends in...

      James Maguire - November 11, 2022 0
      I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
      Read more
      IT Management

      Intuit’s Nhung Ho on AI for the...

      James Maguire - May 13, 2022 0
      I spoke with Nhung Ho, Vice President of AI at Intuit, about adoption of AI in the small and medium-sized business market, and how...
      Read more
      Applications

      Kyndryl’s Nicolas Sekkaki on Handling AI and...

      James Maguire - November 9, 2022 0
      I spoke with Nicolas Sekkaki, Group Practice Leader for Applications, Data and AI at Kyndryl, about how companies can boost both their AI and...
      Read more
      Cloud

      IGEL CEO Jed Ayres on Edge and...

      James Maguire - June 14, 2022 0
      I spoke with Jed Ayres, CEO of IGEL, about the endpoint sector, and an open source OS for the cloud; we also spoke about...
      Read more
      Logo

      eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

      Facebook
      Linkedin
      RSS
      Twitter
      Youtube

      Advertisers

      Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

      Advertise with Us

      Menu

      • About eWeek
      • Subscribe to our Newsletter
      • Latest News

      Our Brands

      • Privacy Policy
      • Terms
      • About
      • Contact
      • Advertise
      • Sitemap
      • California – Do Not Sell My Information

      Property of TechnologyAdvice.
      © 2022 TechnologyAdvice. All Rights Reserved

      Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

      ×