Google Summer of Code 2008

                            Map-Reduce support for Apache Tuscany - Chris Trezzo

Project Details 

Timeline

Time Period             Task

April 21 - May 25       • Getting to know the Tuscany community.
                        • Discussion of the high-level approach with the community.
                        • Familiarization with the Tuscany SPIs required for extension development.
                        • Familiarization with Hadoop's Map/Reduce API.
                        • Design of the extension and its corresponding modules.

May 26 - July 6         Implementation of the extension.

July 7                  Midterm Evaluation

July 8 - August 11      • Finish implementation of the extension.
                        • Finish remaining unit tests.
                        • Finish documentation.
                        • Implement demo SCA Map/Reduce application.

August 12 - August 18   Buffer time.

August 19               Final Evaluation

Tuscany

Extending Tuscany

Hadoop

Hadoop Homepage
Hadoop Quick Start
Hadoop Map/Reduce Tutorial
Hadoop API

Maven

Getting Started
Introduction to the POM
Maven Test Properties

Project Log

05/13/08  -   Most dependencies now seem to be resolved. Looking through the sample CRUD implementation and the set of slides entitled Extending Tuscany.

05/12/08  -   Updated local repository, and am now having trouble with dependency resolution. Many files are not being downloaded from the remote repositories.

05/08/08  -   Successfully installed and ran an instance of Hadoop's DFS in pseudo-distributed mode.

05/07/08  -   Ran into an IllegalArgumentException that caused the Tuscany build to fail. Opened JIRA-2302.

05/05/08  -   Began exploring Hadoop's Map/Reduce API in greater detail. Started going through Hadoop's Map/Reduce Tutorial.

04/30/08  -   Downloaded the latest Tuscany release (1.2) and checked out the trunk source. Going through the samples and attempting a top-down build of Tuscany.

04/29/08  -   Submitted signed CLA via email to ASF.

04/21/08  -   Proposal accepted by GSoC 2008.
