Apache Moves Sqoop Big Data Tool Out of Incubator - Application Development - News & Reviews - eWeek.com | eWeek

Apache Moves Sqoop Big Data Tool Out of Incubator

Written By
Darryl K. Taft
Darryl K. Taft
Apr 2, 2012
2 minute read
eWeek content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More

The Apache Software Foundation (ASF) announced that its Apache Sqoop big data tool has graduated from the Apache Incubator to become a top-level project (TLP).

Sqoop is designed to efficiently transfer bulk data between Apache Hadoop and structured data stores such as relational databases. Apache Sqoop allows the import of data from external data stores and enterprise data warehouses into a Hadoop Distributed File System or related systems like Apache Hive and HBase.

“The Sqoop Project has demonstrated its maturity by graduating from the Apache Incubator,” said Arvind Prabhakar, vice president of Apache Sqoop, in a statement. “With jobs transferring data on the order of billions of rows, Sqoop is proving its value as a critical component of production environments.”

ASF officials said Sqoop builds on the Hadoop infrastructure and parallelizes data transfer for fast performance and best use of system and network resources. In addition, Sqoop allows fast copying of data from external systems to Hadoop to make data analysis more efficient, and mitigates the risk of excessive load to external systems.

“Connectivity to other databases and warehouses is a critical component for the evolution of Hadoop as an enterprise solution, and that’s where Sqoop plays a very important role” said Deepak Reddy, Hadoop Manager at Coupons.com, in a statement. “We use Sqoop extensively to store and exchange data between Hadoop and other warehouses like Netezza. The power of Sqoop also comes in the ability to write free-form queries against structured databases and pull that data into Hadoop.”

Moreover, “Sqoop has been an integral part of our production data pipeline” said Bohan Chen, director of the Hadoop Development and Operations team at Apollo Group, also in a statement. “It provides a reliable and scalable way to import data from relational databases and export the aggregation results to relational databases.”

Since entering the Apache Incubator in June 2011, Sqoop was quickly embraced as a key SQL-to-Hadoop data transfer solution. The project provides connectors for popular systems such as MySQL, PostgreSQL, Oracle, SQL Server and DB2, and also allows for the development of drop-in connectors that provide high-speed connectivity with specialized systems like enterprise data warehouses.

Meanwhile, in a statement, Craig Ling, director of business systems at Tsavo Media, said: “We adopted the use of Sqoop to transfer data into and out of Hadoop with our other systems over a year ago. It is straightforward and easy to use, which has opened the door to allow team members to start consuming data autonomously, maximizing the analytical value of our data repositories.”

eWeek Logo

eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site's focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.