Apache CarbonData & Spark Meetup Apache Spark™ is a unified analytics engine for large-scale data processing.

      CarbonData is a high-performance data solution that supports various data analytic scenarios, including BI analysis, ad-hoc SQL query, fast filter lookup on detail record, streaming analytics, and so on. CarbonData has been deployed in many enterprise production environments, in one of the largest scenario it supports queries on single table with 3PB data (more than 5 trillion records) with response time less than 3 seconds!

      Apache CarbonData 是一种新的融合存储解决方案,利用先进的列式存储、索引、压缩和编码技术提高计算效率,从而加快查询速度,万亿级数据秒级响应,目前已在 20 企业生产环境上部署应用。 

  • Calendar:

Time

Topic

Speaker

9:00-9:40

What's New in Apache Spark 2.4?

李潇,Databricks Tech Lead Manager,Apache Spark PMC member,佛罗里达大学博士

9:40-10:20

Apache CarbonData技术介绍与行业实践

李昆,华为大数据平台架构师,Apache CarbonData PMC成员

10:20-10:35

短休交流


10:35-11:15

QATCodec: Past, Present, and Future

徐铖 ,Intel Engineering Manager,Apache Hive、Commons、ORC committer

11:15-11:55

Apache CarbonData的使用实践与性能调优

徐传印,华为某产品大数据平台系统工程师,Apache CarbonData Committer

11:55-12:10

合影留念



    • What's New in Apache Spark 2.4


    • 基于CarbonData构建万亿级数据仓库

    • QATCodec Past, Present and Future

    • QATCodec Past, Present and Future


  • No labels