Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • AOL
    • AOL has multiple clusters from a few nodes to several hundred nodes.
    • We use Hadoop for analytics and batch data processing for various applications.
    • Hadoop is used by ! MapQuest, Ad, Search, Truveo, and Media groups.
    • All of our jobs are written in Pig or native map reduce.

...

  • Stanford University WebBase Project
    • The ! WebBase project has crawled and archived fixed sets of Web sites very regularly for several years. These time series site snapshots include government, and a number of topic-specific destinations. We use Pig to power an emerging interface to these archives for social scientists. The goal is to support these scientists in analyzing the archives for their research. Pig and Hadoop are used for the underlying processing and indexing.

...