Applies to release: none (trunk) Flume 1.2.0 as of 2012-0508-2112
Using Event Serializers
The hdfs
and file_roll
sinks support using event serializers. EventSerializer is an interface that allows arbitrary serialization of an event. While it is possible to implement this interface directly, many people will prefer to use one of the Avro serialization implementations built into Flume.
The recommended usage is to serialize your data as Avro. This is a great file format that has a lot of advantages over platform- and language-specific serialization formats. For Avro-serialized events you have two options in Flume: avro_event
which is built-in, and writing a custom subclass of AbstractAvroEventSerializer. The first option uses the built-in Flume event schema, while the second option allows you to specify your own avro schema.
...
An example is provided as a unit test : https://svn.apache.org/viewvc/incubator/flume/trunk/in Git at flume-ng-core/src/test/java/org/apache/flume/serialization/SyslogAvroEventSerializer.java?view=markup
Additional unit tests / examples : https://svn.apache.org/viewvc/incubator/flume/trunk/in Git at flume-ng-core/src/test/java/org/apache/flume/serialization/
In this case, you must specify the Builder of your type as the serializer
in the configuration file. For example:
...