...

Efficient Caching and Spilling on Nemo

In-memory caching and spilling are essential features of in-memory big data processing frameworks, and Nemo needs both.

  • Identify and persist frequently used data, and unpersist it once it is no longer used
  • Spill in-memory data to disk upon memory pressure
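
The two goals above can be illustrated with a minimal Python sketch. It is not Nemo's API: the class `SpillingCache`, its methods, and the use of an entry count as a stand-in for memory pressure are all assumptions made for illustration. Least recently used entries are spilled to disk when the in-memory budget is exceeded, and `unpersist` drops data that is no longer needed.

```python
import os
import pickle
import tempfile
from collections import OrderedDict


class SpillingCache:
    """Toy LRU cache that spills evicted entries to disk (hypothetical, not Nemo code)."""

    def __init__(self, max_in_memory, spill_dir=None):
        # Assumption: "memory pressure" is modeled as a simple entry count.
        self.max_in_memory = max_in_memory
        self.spill_dir = spill_dir or tempfile.mkdtemp(prefix="spill-")
        self.memory = OrderedDict()  # key -> value, least recently used first
        self.spilled = {}            # key -> path of the spilled file
        self._next_file_id = 0

    def persist(self, key, value):
        """Keep a value in memory, spilling older entries if over budget."""
        self._drop_spill_file(key)   # a fresh value supersedes any spilled copy
        self.memory[key] = value
        self.memory.move_to_end(key)
        self._spill_if_needed()

    def get(self, key):
        """Return a cached value, reloading it from disk if it was spilled."""
        if key in self.memory:
            self.memory.move_to_end(key)
            return self.memory[key]
        if key in self.spilled:
            with open(self.spilled[key], "rb") as f:
                value = pickle.load(f)
            self.persist(key, value)  # bring it back into memory
            return value
        raise KeyError(key)

    def unpersist(self, key):
        """Drop data whose usage has ended, both in memory and on disk."""
        self.memory.pop(key, None)
        self._drop_spill_file(key)

    def _drop_spill_file(self, key):
        path = self.spilled.pop(key, None)
        if path is not None:
            os.remove(path)

    def _spill_if_needed(self):
        # Evict least recently used entries until the budget is met.
        while len(self.memory) > self.max_in_memory:
            key, value = self.memory.popitem(last=False)
            path = os.path.join(self.spill_dir, f"block-{self._next_file_id}.bin")
            self._next_file_id += 1
            with open(path, "wb") as f:
                pickle.dump(value, f)
            self.spilled[key] = path
```

A real implementation would track actual memory usage and decide what to persist from the job's dataflow graph; the sketch only shows the persist, spill, and unpersist lifecycle.
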
Difficulty: Major
Project size: ~350 hours (large)
Potential mentors:
Jeongyoon Eo, mail: jeongyoon (at) apache.org
Project Devs, mail: dev (at) nemo.apache.org

Apache RocketMQ

RocketMQ-Connect Apache Doris

The Apache Doris Sink in RocketMQ allows moving data from RocketMQ to Doris. It writes data from topics in RocketMQ to tables in Doris.

In this project, you need to implement a sink based on the RocketMQ Streams API that will be executed on the RocketMQ Streams runtime.
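
As a rough illustration of the topic-to-table flow described above, the following Python sketch buffers records per topic and hands them to a writer in batches. It does not use the RocketMQ Streams or Doris APIs: `BatchingSink`, the `write_batch` callable, the `topic_to_table` mapping, and the batching strategy itself are all hypothetical names and assumptions introduced for this example.

```python
class BatchingSink:
    """Hypothetical sink that groups records by topic and writes them in batches."""

    def __init__(self, write_batch, topic_to_table, batch_size=3):
        # write_batch is an assumed callable: (table_name, list_of_rows) -> None.
        # In a real sink it would wrap the target database's load mechanism.
        self.write_batch = write_batch
        self.topic_to_table = topic_to_table
        self.batch_size = batch_size
        self.buffers = {}  # topic -> rows waiting to be written

    def put(self, topic, record):
        """Buffer one record and flush the topic once a full batch is ready."""
        rows = self.buffers.setdefault(topic, [])
        rows.append(record)
        if len(rows) >= self.batch_size:
            self.flush(topic)

    def flush(self, topic):
        """Write all buffered rows of a topic to its mapped table."""
        rows = self.buffers.pop(topic, [])
        if rows:
            self.write_batch(self.topic_to_table[topic], rows)

    def close(self):
        """Flush whatever is still buffered, for example on shutdown."""
        for topic in list(self.buffers):
            self.flush(topic)


# Usage: collect writes in a list instead of talking to a real database.
written = []
sink = BatchingSink(
    write_batch=lambda table, rows: written.append((table, rows)),
    topic_to_table={"orders": "db.orders"},
    batch_size=2,
)
sink.put("orders", {"id": 1})
sink.put("orders", {"id": 2})  # second record completes a batch
sink.put("orders", {"id": 3})
sink.close()                   # flushes the remaining record
```

Batching is a design choice rather than a requirement of the project; it trades a little latency for fewer round trips to the target table.
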

You should learn before applying for this topic

Difficulty: Medium
Project size: ~350 hours (large)
Potential mentors:
Tigerlee, mail: tigerlee (at) apache.org

RocketMQ-Connect ClickHouse

The ClickHouse Sink in RocketMQ allows moving data from RocketMQ to ClickHouse. It writes data from topics in RocketMQ to tables in ClickHouse.

In this project, you need to implement a sink based on the RocketMQ Streams API that will be executed on the RocketMQ Streams runtime.

You should learn before applying for this topic


Difficulty: Medium
Project size: ~350 hours (large)
Potential mentors:
Tigerlee, mail: tigerlee (at) apache.org


...