Close
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Logo
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Home Cloud
    • Cloud

    Microsoft Adopts Kubernetes to Scale Machine Learning Cloud Workloads

    Written by

    Pedro Hernandez
    Published December 29, 2017
    Share
    Facebook
    Twitter
    Linkedin

      eWEEK content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More.

      Add Kubernetes to the growing list of technologies that Microsoft is using to push the boundaries of cloud-based artificial intelligence.

      Kubernetes, the popular open-source container orchestration platform, had a breakout year in 2017. Now, in addition to helping enterprises manage their application container deployments in the cloud or on-premises, Kubernetes is being recruited to give cloud-based AI workloads the room they need to get the job done when demand picks up.

      Microsoft unveiled a new auto-scaling system that uses Kubernetes to expand or shrink the amount of cloud-computing resources required for learning training workloads. The system was developed in partnership with Litbit, a San Jose, Calif. technology startup that uses Internet of Things data to create “AI Personas” that workplaces can use to augment the capabilities of their employees based on their collective experiences and know-how.

      For example, an organization can create and train a persona that helps its field technicians detect and diagnose equipment problems before jumping in a work truck and physically visit machinery that is acting up to save time and expense.

      It’s a tall order and an unpredictable one, it turns out. Litbit discovered that AI training workloads loads varied wildly, since customers training their personas at different times.

      “Some of these training jobs (e.g., Spark ML) make heavy use of CPUs, while others (e.g., TensorFlow) make heavy use of GPUs. In the latter case, some jobs retrain a single layer of the neural net and finish very quickly, while others need to train an entire new neural net and can take several hours or even days,” explained Microsoft representatives in a blog post.

      Microsoft and Litbit settled on Kubernetes, partly because of its proven cluster management technology, but also because of the strong community support the project had attracted in a few short years. Although it started off at Google, the project is considered a crown jewel of the Linux Foundation’s Cloud Native Computing Foundation (CNCF).

      The companies set out to solve the problem of highly variable machine learning workloads by configuring a Kubernetes cluster on Azure with GPU support using the Azure CNI Networking plugin for Kubernetes. They then applied a node-level auto-scaler using the Helm package manager for Kubernetes, followed by some configuration changes to get the system up and running.

      The project was a success. The system has been running for four months and has enabled Litbit to scale to up to 40 nodes at a time and seamlessly downsize when demand abated. Microsoft has posted a complete walk-through of the Kubernetes auto-scaler on its developer blog.

      Reflecting the container craze that has gripped enterprise DevOps teams, Microsoft has doubled down on its support for Kubernetes.

      During the KubeCon conference earlier this December, Microsoft announced that its Azure Container Service is now abbreviated AKS, signifying the company’s customer-focused, Kubernetes-centric approach to cloud-native application development. The company also unveiled a new connector called Virtual Kubelet, which allows users to target Azure Container Instances (ACI), the company’s rapid container creation and deployment service.

      Pedro Hernandez
      Pedro Hernandez
      Pedro Hernandez is a writer for eWEEK and the IT Business Edge Network, the network for technology professionals. Previously, he served as a managing editor for the Internet.com network of IT-related websites and as the Green IT curator for GigaOM Pro.

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      MOST POPULAR ARTICLES

      Artificial Intelligence

      9 Best AI 3D Generators You Need...

      Sam Rinko - June 25, 2024 0
      AI 3D Generators are powerful tools for many different industries. Discover the best AI 3D Generators, and learn which is best for your specific use case.
      Read more
      Cloud

      RingCentral Expands Its Collaboration Platform

      Zeus Kerravala - November 22, 2023 0
      RingCentral adds AI-enabled contact center and hybrid event products to its suite of collaboration services.
      Read more
      Artificial Intelligence

      8 Best AI Data Analytics Software &...

      Aminu Abdullahi - January 18, 2024 0
      Learn the top AI data analytics software to use. Compare AI data analytics solutions & features to make the best choice for your business.
      Read more
      Latest News

      Zeus Kerravala on Networking: Multicloud, 5G, and...

      James Maguire - December 16, 2022 0
      I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
      Read more
      Video

      Datadog President Amit Agarwal on Trends in...

      James Maguire - November 11, 2022 0
      I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
      Read more
      Logo

      eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

      Facebook
      Linkedin
      RSS
      Twitter
      Youtube

      Advertisers

      Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

      Advertise with Us

      Menu

      • About eWeek
      • Subscribe to our Newsletter
      • Latest News

      Our Brands

      • Privacy Policy
      • Terms
      • About
      • Contact
      • Advertise
      • Sitemap
      • California – Do Not Sell My Information

      Property of TechnologyAdvice.
      © 2024 TechnologyAdvice. All Rights Reserved

      Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

      ×