Apache Kylin : Analytical Data Warehouse for Big Data
...
Please make sure that the historical table contains all the columns that you want to include in your streaming cube, and that their data types match.
Please choose "DAY_START" or "HOUR_START" as the partition column of the historical table, depending on how often you want to refresh segments. Refreshing a segment once a day is recommended.
    CREATE EXTERNAL TABLE IF NOT EXISTS lambda_flat_table (
        -- event timestamp and debug purpose column
        EVENT_TIME timestamp,
        str_minute_second string COMMENT "For debug purpose, maybe check timezone etc",

        -- dimension column
        act_type string COMMENT "What did user interact with our mobile app in this event",
        user_devide_type string COMMENT "Which kind of device did user use in this event",
        location_city string COMMENT "Which city did user locate in this event",
        video_id bigint COMMENT "Which video did user watch in this event",
        device_brand string,
        page_id string,

        -- measure column
        play_times bigint,
        play_duration decimal(23, 10),
        pageview_id string COMMENT "Identifier of a pageview",

        -- used by Kylin (dimension)
        MINUTE_START timestamp,
        HOUR_START timestamp,
        MONTH_START date
    )
    COMMENT 'Fact table. Store raw user action log.'
    PARTITIONED BY (DAY_START date)
    ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
    STORED AS TEXTFILE
    LOCATION 'hdfs:///LACUS/lambda_data/lambda_flat_table';
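When preparing rows for the historical table, the time columns (MINUTE_START, HOUR_START, DAY_START, MONTH_START) need values that are consistent with EVENT_TIME. The following is a minimal Python sketch of one way to compute them. It assumes each column is simply the event timestamp truncated to the start of its minute, hour, day, or month; the helper name `derived_time_columns` is hypothetical and not part of Kylin or Hive. Time zone handling is deliberately left out, so check it against your own data.

```python
from datetime import date, datetime


def derived_time_columns(event_time: datetime) -> dict:
    """Truncate an event timestamp to the start of each time bucket.

    Assumption: the derived columns are plain truncations of EVENT_TIME.
    """
    day_start = event_time.date()
    return {
        # timestamp columns keep the date and zero out the smaller units
        "MINUTE_START": event_time.replace(second=0, microsecond=0),
        "HOUR_START": event_time.replace(minute=0, second=0, microsecond=0),
        # date columns drop the time of day entirely
        "DAY_START": day_start,
        "MONTH_START": day_start.replace(day=1),
    }


cols = derived_time_columns(datetime(2019, 3, 15, 13, 47, 52))
for name, value in cols.items():
    print(name, value.isoformat())
```

The DAY_START value is the one that selects the partition a row belongs to in the table definition above.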
...