    Microsoft Is Teaching Computers to See Like People

    By Pedro Hernandez - November 28, 2015

      Microsoft’s quest to build computing systems that understand the world around them doesn’t end with the company’s Project Oxford machine-learning technology. Researchers at the Redmond, Wash., software maker are also developing systems that mimic how humans pull information from the things they see.

      “When a person is asked about something in a photo, they’re taking in a lot of details—a lot of words—to answer questions about it,” blogged Microsoft spokesperson Athima Chansanchai. “Now, a team of Microsoft researchers, together with colleagues from Carnegie Mellon University, has created a system that uses computer vision, deep learning and language understanding to analyze images and answer questions the same way humans would.”

      Together, the researchers created a model that “applies multi-step reasoning to answer questions about pictures,” said Chansanchai. The technology is being advanced by Li Deng, Xiaodong He and Jianfeng Gao from Microsoft Research’s Deep Learning Technology Center, along with Carnegie Mellon University researchers Zichao Yang and Alex Smola.

      “The system takes in information a human set of eyes and brain would, looking at a scene’s action (if there is any) and the relationships among multiple visual objects,” said Chansanchai. “Though it may sound simple for humans, it’s a lot for a computer to learn language and to find answers in an image. But using deep neural networks, it can.”

      Deng and his group are imbuing the system with the ability to pay attention, focus on visual cues and infer answers progressively to solve problems. It’s an advancement in human behavior modeling that was not possible a few years ago, he said.

      Microsoft envisions that the work will lead to systems that can anticipate human needs and provide real-time recommendations. Systems that can answer questions based on visual information are also key to developing artificial intelligence tools, according to the company.

      For example, the technology could lead to improved bike safety.

      The system “could power all kinds of applications, such as a warning system for bicyclists,” said Chansanchai, with a mounted camera continuously taking in the environment around the cyclist.

      The image analysis system builds on Microsoft’s prior work on technologies that can automatically caption photos. “The researchers say that was an important step in getting to this point because descriptions of scenes, annotated by people, provide meaning to a picture. That helps train the computer to understand the image the way a person would,” wrote Chansanchai.

      Microsoft is increasingly banking on machine-learning systems as a way to help developers build a new generation of intelligent apps. Last month, the company announced the public beta of the Project Oxford Language Understanding Intelligent Service (LUIS), enabling coders to create applications that understand spoken instructions and search queries, similar to Microsoft’s own virtual assistant, Cortana. Project Oxford is a collection of machine-learning application programming interfaces (APIs) that also includes face and emotion detection, speech recognition and computer vision.

      Pedro Hernandez
      Pedro Hernandez is a contributor to eWEEK and the IT Business Edge Network, the network for technology professionals. Previously, he served as a managing editor for the Internet.com network of IT-related websites and as the Green IT curator for GigaOM Pro.
