Intel Releases Hadoop Distribution for Big Data
Intel's move comes the same week that other players have made significant advances in big data and Hadoop. Hewlett-Packard announced a Hadoop plug-in for its ArcSight security software that will make it easier and faster for organizations to run through huge amounts of security data. Hortonworks' new beta of its Hadoop Data Platform will run on Microsoft's Windows Server, and EMC announced Feb. 25 a new Hadoop distribution, Pivotal HD, that works closely with the storage vendor's Greenplum massively parallel processing (MMP) database. Davis said Intel will leave much of the application work to its partners, but that the chip maker will create a foundation for Hadoop that will enable organizations to leverage the capabilities in its data center hardware. Intel's AES-NI technology will enable up to 20 times the encryption speed of other technologies, while Intel's SSD and cache acceleration will offer queries in Hive—the data warehouse system in Hadoop—that are 8.5 times faster. The combination of Intel's silicon and its Hadoop distribution means that analyzing a terabyte of data, which normally would take as long as 4 hours, can now be done in 7 minutes, according to Intel. During the Webcast, Intel offered a long list of partners to help integrate its software into various platforms, including Cisco Systems, Cray, Dell, Infosys, NextBio, Red Hat, SAP, SAS, Savvis and Teradata. SuperMicro announced Feb. 26 that it was adding Intel's Hadoop distribution to some of its servers and storage systems aimed at big data environments. Intel's investment arm, Intel Capital, also is investing in smaller big data companies, such as 10gen and Guavus Analytics.
The strong move into big data also will help fuel sales of its Xeon chips, driving organizations to run their big data workloads on Intel-based servers from the likes of HP and Dell. Davis said that "one of [Intel's] biggest motivators is to drive faster growth of the data center."