Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

Project Page

General information:

Future Work

Sqoop 1.99.* Releases

1.99.6 Release ( TBD )

1.99.5 Release  

Feature Docs

Design Docs

1.99.4 Release

Feature Docs


Feature Tickets 

Before 1.99.4 

Sqoop 1 Releases

Release 1.4.5

Archived 

Sqoop User Resources

Community

Sqoop Developer Resources

General Guidelines for Development

Note

If you are contributing to Sqoop 2, refer the guidelines for Sqoop2 for coding guidelines and review guidelines

 

News

What is Apache Sqoop?

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. You can use Sqoop to import data from external structured datastores into Hadoop Distributed File System or related systems like Hive and HBase. Conversely, Sqoop can be used to extract data from Hadoop and export it to external structured datastores such as relational databases and enterprise data warehouses.

Sqoop provides a pluggable connector mechanism for optimal connectivity to external systems. The Sqoop extension API provides a convenient framework for building new connectors. New connectors can be dropped into Sqoop installations to provide connectivity to various systems. Sqoop itself comes bundled with various connectors that can be used for popular database and data warehousing systems.

Getting started

Section
Column
width75%

Presentations

Column
width25%

Release 1.4.1-incubating

Resources

...

Column
width33%

Community

Column
width33%

Users

...

width33%

Developers

...

...

Sqoop 2 (1.99.* releases)

Sqoop 1

...

 

...