Close
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Logo
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Home Cloud
    • Cloud
    • Cybersecurity
    • Networking
    • Storage

    Yahoo Adds Workflows, Authentication to Hadoop

    Written by

    Chris Preimesberger
    Published June 29, 2010
    Share
    Facebook
    Twitter
    Linkedin

      eWEEK content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More.

      SANTA CLARA, Calif.-Yahoo, which is running 38,000 Apache Hadoop Web servers and counting among its hundreds of thousands of other servers, is quickly building quite an ecosystem for its heavy-lifting cloud computing software platform.
      Hadoop, an open-source project created by Yahoo developers in 2005 that became an Apache project in late 2006, has engaged dozens of core developers, hundreds of contributors and thousands of interested IT folks since then.
      The Hadoop software layer handles control and scaling of Yahoo’s exponentially increasing volumes of data. In only five years, the company has taken Hadoop from a 20-server prototype in Yahoo Labs to the world’s largest Web server deployment running in production across Yahoo’s global network.
      The attendance at the annual Hadoop Summit at the Santa Clara Convention Center has been growing in parallel to the amount of data Yahoo has to process each day, if not quite as fast. The first Hadoop event attracted about 300 people in 2008, and it increased to about 600 last year, while this year more than 1,000 people crammed into the convention center ballroom here on June 29.
      So there’s much interest in how this software works. Thousands of Websites are experiencing problems in dealing with the processing and storage of a deluge of business and personal data, and there is a lot to learn from how Yahoo is approaching this.
      At the summit, Yahoo announced two key enhancements to the beta-level platform: Hadoop with Security, and Oozie, a new workflow engine.
      “Hadoop with Security is Hadoop integrated with Kerberos [authentication securityware], which amounts to a set of security updates that enable much stronger authentication,” Blake Irving, Yahoo’s new chief product officer, told summit attendees. “Hadoop with Security brings more secure collaboration and sharing of authenticated data.”
      Hadoop with Security also sets the stage for secure cloud computing multitenancy by providing authenticated secure access and processing of sensitive data, Irving said.
      Oozie, which integrates with Hadoop with Security, is the platform’s new open-source workflow management and coordination engine for developers managing jobs running on Hadoop servers. It includes Hadoop Distributed File System, Pig and MapReduce, Irving said. It is designed for Yahoo’s internal compute-intensive use cases that require managing complex work processes and ETL (extraction, transformation and loading) on a global scale, Irving said.
      Both Hadoop with Security and Oozie are available for free download here.
      Hadoop is currently used only for internal Yahoo purposes, but because it is open source, it is freely available for anybody to use.
      Yahoo originally used Hadoop for specific science projects, but it quickly morphed into the enterprise-class platform it is today to improve its own personalized user experiences. Hadoop plays a key role in Yahoo’s home page, Yahoo Search, Yahoo Mail and others by remembering user preferences, among other things.
      “Businesses across all sectors are looking for ways to leverage the vast quantities of data they are accumulating, and Apache Hadoop is an efficient solution for processing data at scale,” said IDC analyst Melanie Posey. “Hadoop has matured and is now becoming an enterprise-ready cloud computing technology with the addition of Kerberos authentication. Now organizations of various sizes can leverage Yahoo’s Hadoop investment and deployments to run it on their own systems and build out their own Hadoop deployments without starting from scratch on internal science experiments.”
      How else can Yahoo monetize this considerable five-year investment? Yahoo is not, and never has been, in the software-selling business.
      “We’re already monetizing Hadoop every day,” Shelton Shugar, senior vice president of cloud computing at Yahoo, told eWEEK. “We use it to optimize our advertising and ad placement businesses here at Yahoo, and it’s a very important ingredient in our overall IT environment.”
      On the idea of commercializing Hadoop-or parts of it-with special Yahoo “secret sauce” of some kind, Shugar told eWEEK that the company has certainly talked about it.
      “But at this time, we don’t have any plans to use Hadoop in that way,” he said.
      In other news from the summit, Cloudera announced a new distribution of its own Hadoop implementation.

      Chris Preimesberger
      Chris Preimesberger
      https://www.eweek.com/author/cpreimesberger/
      Chris J. Preimesberger is Editor Emeritus of eWEEK. In his 16 years and more than 5,000 articles at eWEEK, he distinguished himself in reporting and analysis of the business use of new-gen IT in a variety of sectors, including cloud computing, data center systems, storage, edge systems, security and others. In February 2017 and September 2018, Chris was named among the 250 most influential business journalists in the world (https://richtopia.com/inspirational-people/top-250-business-journalists/) by Richtopia, a UK research firm that used analytics to compile the ranking. He has won several national and regional awards for his work, including a 2011 Folio Award for a profile (https://www.eweek.com/cloud/marc-benioff-trend-seer-and-business-socialist/) of Salesforce founder/CEO Marc Benioff--the only time he has entered the competition. Previously, Chris was a founding editor of both IT Manager's Journal and DevX.com and was managing editor of Software Development magazine. He has been a stringer for the Associated Press since 1983 and resides in Silicon Valley.
      Linkedin Twitter

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      MOST POPULAR ARTICLES

      Artificial Intelligence

      9 Best AI 3D Generators You Need...

      Sam Rinko - June 25, 2024 0
      AI 3D Generators are powerful tools for many different industries. Discover the best AI 3D Generators, and learn which is best for your specific use case.
      Read more
      Cloud

      RingCentral Expands Its Collaboration Platform

      Zeus Kerravala - November 22, 2023 0
      RingCentral adds AI-enabled contact center and hybrid event products to its suite of collaboration services.
      Read more
      Artificial Intelligence

      8 Best AI Data Analytics Software &...

      Aminu Abdullahi - January 18, 2024 0
      Learn the top AI data analytics software to use. Compare AI data analytics solutions & features to make the best choice for your business.
      Read more
      Latest News

      Zeus Kerravala on Networking: Multicloud, 5G, and...

      James Maguire - December 16, 2022 0
      I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
      Read more
      Video

      Datadog President Amit Agarwal on Trends in...

      James Maguire - November 11, 2022 0
      I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
      Read more
      Logo

      eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

      Facebook
      Linkedin
      RSS
      Twitter
      Youtube

      Advertisers

      Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

      Advertise with Us

      Menu

      • About eWeek
      • Subscribe to our Newsletter
      • Latest News

      Our Brands

      • Privacy Policy
      • Terms
      • About
      • Contact
      • Advertise
      • Sitemap
      • California – Do Not Sell My Information

      Property of TechnologyAdvice.
      © 2024 TechnologyAdvice. All Rights Reserved

      Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

      ×