The following table lists the features that have been completed as of Pig 0.9.

| Feature | Available in Release | Comments |
|---|---|---|
| Describe Schema | 0.1 | |
| Explain Plan | 0.1 | |
| Add log4j to Pig Latin | 0.1 | |
| Parameterized Queries | 0.1 | |
| Streaming | 0.1 | |
| Documentation | 0.2 | Docs are never really done of course, but Pig now has a setup document, tutorial, Pig Latin users and reference guides, a cookbook, a UDF writers guide, and API javadocs. |
| Early error detection and failure | 0.2 | When this was originally added to the ProposedRoadMap it referred to being able to do type checking and other basic semantic checks. |
| Remove automatic string encoding | 0.2 | |
| Add ORDER BY DESC | 0.2 | |
| Add LIMIT | 0.2 | |
| Add support for NULL values | 0.2 | |
| Types beyond String | 0.2 | |
| Multiquery support | 0.3 | |
| Add skewed join | 0.4 | |
| Add merge join | 0.4 | |
| Add Zebra as contrib project | 0.4 | |
| Support Hadoop 0.20 | 0.5 | |
| Improved Sampling | 0.6 | There is still room for improvement for order by sampling |
| Change bags to spill after reaching fixed size | 0.6 | Also created bag backed by Hadoop iterator for single UDF cases |
| Add Accumulator interface for UDFs | 0.6 | |
| Switch local mode to Hadoop local mode | 0.6 | |
| Outer join for default, fragment-replicate, skewed | 0.6 | |
| Make configuration available to UDFs | 0.6 | |
| Load Store Redesign | 0.7 | |
| Pig Mix 2.0 | 0.7 | |
| Rewrite Logical Optimizer | 0.8 | |
| Cleanup of javadocs | 0.8 | |
| UDFs in scripting languages | 0.8 | |
| Ability to specify a custom partitioner | 0.8 | |
| Pig usage stats collection | 0.8 | |
| Make Pig available via Maven | 0.8 | |
| Standard UDFs Pig Should Provide | 0.8 | |
| Add Scalars To Pig Latin | 0.8 | |
| Run Map Reduce Jobs Directly From Pig | 0.8 | |
| Make Illustrate Work | 0.9 | |
| Better Parser and Scanner Technology | 0.9 | |
| Clarify Pig Latin Semantics | 0.9 | |
| Extending Pig to Include Branching, Looping, and Functions | 0.9 | |
| Typed maps | 0.9 | |
