我遵循教程为流立方体建设从
kylin cube from streaming(Kafka)
所有的属性都是在上面提到的页面中设置的。
但是当触发来构建立方体的时候。在第一步保存Kafka的数据失败
说:
org.apache.kylin.engine.mr.exception.MapReduceException: no counters for job job_1547096967734_0086
at org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:173)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:164)
at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:70)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:164)
at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
我见过ApacheKylinCube在“没有作业计数器”时失败
但是这里的用例是用于普通的立方体构建,而不是通过kafka立方体构建流。
在mapred-root-historyserver.log中,下面的条目似乎没有帮助。
2019-01-22 11:33:15,557 INFO org.apache.hadoop.mapreduce.v2.hs.CompletedJob:
Loading job: job_1547096967734_0087 from file:
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist
2019-01-22 11:33:15,557 INFO org.apache.hadoop.mapreduce.v2.hs.CompletedJob:
Loading history file: [hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist]
2019-01-22 11:33:15,572 INFOorg.apache.hadoop.mapreduce.jobhistory.
JobSummary:jobId=job_1547096967734_0087,submitTime=1548149562328
,launchTime=1548149566816,firstMapTaskLaunchTime=1548149570064,
firstReduceTaskLaunchTime=0,finishTime=1548149585065,resourcesPerMap
=1024,resourcesPerReduce=0,numMaps=1,numReduces=0,user=root,queue=
default,status=FAILED,mapSlotSeconds=8,reduceSlotSeconds=0,jobName=
Kylin_Save_Kafka_Data_kylin_streaming_cube_Step
2019-01-22 11:33:15,572 INFO org.apache.hadoop.mapreduce.v2.hs.
HistoryFileManager: Deleting JobSummary file: [hdfs://localhost:9000/
tmp/hadoop-yarn/staging/history/done_intermediate/
root/job_1547096967734_0087.summary]
2019-01-22 11:33:15,574 INFO
org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Moving
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist to
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done/2019/01/22/000000/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist
2019-01-22 11:33:15,574 INFO
org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Moving
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087_conf.xml
to hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done/2019/01/22/000000/job_1547096967734_0087_conf.xml
2019-01-22 11:35:30,160 INFO org.apache.hadoop.mapreduce.v2.hs.JobHistory:
Starting scan to move intermediate done files
这是一个完全手动安装的kylin环境,以下是版本规范:
apache-hive-2.3.4-bin
apache-kylin-2.5.2-bin-hbase1x
hadoop-2.9.1
hbase-1.4.9
kafka_2.11-2.0.0
spark-2.3.2-bin-hadoop2.7
zookeeper-3.4.13
任何帮助都将不胜感激。
3条答案
按热度按时间cbjzeqam1#
请检查mr作业中Yarn上的第一个立体步骤。在作业中,您可以深入到每个Map器的日志中,然后您应该能够在那里看到一些异常。通常,可能的原因包括“无法与kafka连接”、“无法加载kafka客户端jar”等。
r7xajy2e2#
看来你的环境有问题。您可以查看错误消息的更多日志。你最好参考最新的文件http://kylin.apache.org/docs/tutorial/cube_streaming.html. 如果你想尽快启动Kylin。建议您试用kylin或使用集成沙盒(如hdp沙盒)开发它,并确保它至少有10gb内存。
wlzqhblo3#
我们可以通过在yarn-share-lib中提供kafka-client-2.0.0.jar来修复它。如mapreduce作业日志所示,未找到kafka的类def。