This page describes tools that users have built on top of Pig. Some of these tools make developing Pig Latin scripts easier. Some facilitate running Pig in your environment. The tools are listed in alphabetical order.
Amazon
...
UDF Collections.
DataFu
DataFu is Linkedin's collection of Pig UDFs (http://sna-projects.com/datafu/)
...
)
http://code.google.com/p/pig-eclipse
Provides pig-latin editor in eclipse, have the feature of syntax highlighting. I just make it for interests, now it has less features than pigpen.
Elephant-Bird
http://github.com/kevinweil/elephant-bird/
...
RPM and Debian packages for Elephant Bird can be found at http://code.google.com/p/hadoop-gpl-packing/
Pygmalion
https://github.com/jeromatron/pygmalion
A project to facilitate using Pig with Apache Cassandra. It mostly focuses on tabular data with static column names.
Tools that help run Pig workflows
Amazon
Amazon Elastic MapReduce makes it easy to launch Pig in interactive or batch mode in AWS. The service supports Pig via AWS Management Console, Command Line Client, and its APIs.
HAMAKE
http://code.google.com/p/hamake/
'hamake' utility allows you to automate incremental processing of datasets stored on HDFS using Hadoop tasks written in Java or using PigLatin scripts.
Piglet
http://github.com/iconara/piglet
Piglet is a DSL for writing Pig Latin scripts in Ruby. Piglet aims to look like Pig Latin while allowing for things like loops and control of flow that are missing from Pig.
PigPy
http://code.google.com/p/pigpy/
A Python wrapper that helps users manage their Pig processes. It can manage many similar Pig Latin scripts, including running common root scripts and caching the results to be used in generation of the final output scripts. Pig can be run directly from PigPy, allowing users to inspect results of the Pig job and take further actions.
Editor plugins
Eclipse Plugin (Lightweight)
http://code.google.com/p/pig-eclipse
Provides pig-latin editor in eclipse, have the feature of syntax highlighting. I just make it for interests, now it has less features than pigpen.
Emacs Pig Latin Mode
http://github.com/cloudera/piglatin-mode
...
Another emacs mode for pig, with automatic indentation and full Pig 0.2 syntax support.
HAMAKE
http://code.google.com/p/hamake/
'hamake' utility allows you to automate incremental processing of datasets stored on HDFS using Hadoop tasks written in Java or using PigLatin scripts.
Pig Editor for Eclipse
http://romainr.github.com/PigEditor/
...
It works with Eclipse 3.7: http://romainr.github.com/PigEditor/updates/
...
http://github.com/iconara/piglet
Piglet is a DSL for writing Pig Latin scripts in Ruby. Piglet aims to look like Pig Latin while allowing for things like loops and control of flow that are missing from Pig.
PigPen
http://issues.apache.org/jira/browse/PIG-366
An Eclipse plugin for Pig. In addition to standard syntax highlighting it supports graphical construction of Pig Latin scripts (box and arrow) as well as showing the user the output schema of each line in a Pig Latin script and example output of his or her script. Pig can be run directly from PigPen in either local or Map-Reduce mode.
PigPy
http://code.google.com/p/pigpy/
A Python wrapper that helps users manage their Pig processes. It can manage many similar Pig Latin scripts, including running common root scripts and caching the results to be used in generation of the final output scripts. Pig can be run directly from PigPy, allowing users to inspect results of the Pig job and take further actions.
Pygmalion
https://github.com/jeromatron/pygmalion
A project to facilitate using Pig with Apache Cassandra. It mostly focuses on tabular data with static column names.
TextMate Plugin
http://www.github.com/kevinweil/pig.tmbundle
...
Javascript based online editor with syntax highlighting for Pig Latin.