Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

Applies to release: none (trunk) Flume 1.2.0 as of 2012-0508-2112

Using Event Serializers

The hdfs and file_roll sinks support using event serializers. EventSerializer is an interface that allows arbitrary serialization of an event. While it is possible to implement this interface directly, many people will prefer to use one of the Avro serialization implementations built into Flume.

The recommended usage is to serialize your data as Avro. This is a great file format that has a lot of advantages over platform- and language-specific serialization formats. For Avro-serialized events you have two options in Flume: avro_event which is built-in, and writing a custom subclass of AbstractAvroEventSerializer. The first option uses the built-in Flume event schema, while the second option allows you to specify your own avro schema.

...

An example is provided as a unit test : https://svn.apache.org/viewvc/incubator/flume/trunk/in Git at flume-ng-core/src/test/java/org/apache/flume/serialization/SyslogAvroEventSerializer.java?view=markupImage Removed

Additional unit tests / examples : https://svn.apache.org/viewvc/incubator/flume/trunk/in Git at flume-ng-core/src/test/java/org/apache/flume/serialization/Image Removed

In this case, you must specify the Builder of your type as the serializer in the configuration file. For example:

...