Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: reorg'd everything under 'community', 'users', and 'developers'

What is Apache Sqoop?

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. You can use Sqoop to import data from external structured datastores into Hadoop Distributed File System or related systems like Hive and HBase. Conversely, Sqoop can be used to extract data from Hadoop and export it to external structured datastores such as relational databases and enterprise data warehouses.

Sqoop provides a pluggable connector mechanism for optimal connectivity to external systems. The Sqoop extension API provides a convenient framework for building new connectors. New connectors can be dropped into Sqoop installations to provide connectivity to various systems. Sqoop itself comes bundled with various connectors that can be used for popular database and data warehousing systems.

Getting started

...

Section
Column
width50%

Presentations

  • Habits of Effective Sqoop Users
  • Sqooping 50 Million Rows a Day from MySQL
  • Scratching Your Own Itch
Column
width50%

Release 1.4.0-incubating

...

...

Resources

...

...

Section
Column
width25%

Community

...

Column
width25%

Users

...

...

Column
width25%

Developers

  • Proposed Design
Column
width25%

Archived