Apache Kylin streaming cube build fails with "no counters for job"

falq053o, posted 2021-06-07 in Kafka

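I followed the tutorial for building a streaming cube, "Kylin cube from streaming (Kafka)".
All properties are set as described on that page.
But when I trigger the cube build, it fails at the first step, which saves the data from Kafka, with: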

org.apache.kylin.engine.mr.exception.MapReduceException: no counters for job job_1547096967734_0086
at org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:173)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:164)
at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:70)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:164)
at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)

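I have seen the question "Apache Kylin cube fails with 'no counters for job'",
but that case concerns a normal cube build, not a streaming cube build from Kafka.
The entries below from mapred-root-historyserver.log do not seem to help.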

2019-01-22 11:33:15,557 INFO org.apache.hadoop.mapreduce.v2.hs.CompletedJob: Loading job: job_1547096967734_0087 from file: hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done_intermediate/root/job_1547096967734_0087-1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-1548149585065-0-0-FAILED-default-1548149566816.jhist
2019-01-22 11:33:15,557 INFO org.apache.hadoop.mapreduce.v2.hs.CompletedJob: Loading history file: [hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done_intermediate/root/job_1547096967734_0087-1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-1548149585065-0-0-FAILED-default-1548149566816.jhist]
2019-01-22 11:33:15,572 INFO org.apache.hadoop.mapreduce.jobhistory.JobSummary: jobId=job_1547096967734_0087,submitTime=1548149562328,launchTime=1548149566816,firstMapTaskLaunchTime=1548149570064,firstReduceTaskLaunchTime=0,finishTime=1548149585065,resourcesPerMap=1024,resourcesPerReduce=0,numMaps=1,numReduces=0,user=root,queue=default,status=FAILED,mapSlotSeconds=8,reduceSlotSeconds=0,jobName=Kylin_Save_Kafka_Data_kylin_streaming_cube_Step
2019-01-22 11:33:15,572 INFO org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Deleting JobSummary file: [hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done_intermediate/root/job_1547096967734_0087.summary]
2019-01-22 11:33:15,574 INFO org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Moving hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done_intermediate/root/job_1547096967734_0087-1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-1548149585065-0-0-FAILED-default-1548149566816.jhist to hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done/2019/01/22/000000/job_1547096967734_0087-1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-1548149585065-0-0-FAILED-default-1548149566816.jhist
2019-01-22 11:33:15,574 INFO org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Moving hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done_intermediate/root/job_1547096967734_0087_conf.xml to hdfs://localhost:9000/tmp/hadoop-yarn/staging/history/done/2019/01/22/000000/job_1547096967734_0087_conf.xml
2019-01-22 11:35:30,160 INFO org.apache.hadoop.mapreduce.v2.hs.JobHistory: Starting scan to move intermediate done files
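
The JobSummary entry in this log does carry a few usable facts (status, number of map tasks, timestamps), even though it does not name the cause. A minimal Python sketch for pulling those fields out of such a line follows. The function name `parse_job_summary` is hypothetical, and the parser assumes that no value contains an escaped comma or equals sign, which holds for the line above but may not hold in general.

```python
def parse_job_summary(line: str) -> dict:
    """Split a history-server JobSummary log line into key/value fields.

    Assumption: values contain no escaped ',' or '=' characters.
    """
    # Everything after the logger name "JobSummary:" is the payload.
    _, _, payload = line.partition("JobSummary:")
    fields = {}
    for pair in payload.strip().split(","):
        key, sep, value = pair.partition("=")
        if sep:  # skip fragments that are not key=value
            fields[key.strip()] = value.strip()
    return fields


if __name__ == "__main__":
    sample = (
        "2019-01-22 11:33:15,572 INFO "
        "org.apache.hadoop.mapreduce.jobhistory.JobSummary: "
        "jobId=job_1547096967734_0087,launchTime=1548149566816,"
        "finishTime=1548149585065,numMaps=1,numReduces=0,status=FAILED"
    )
    summary = parse_job_summary(sample)
    elapsed_ms = int(summary["finishTime"]) - int(summary["launchTime"])
    print(summary["jobId"], summary["status"], summary["numMaps"], elapsed_ms)
```

For the entry above this reports a FAILED job with a single map task and no reducers, so the log of that one mapper is the place to look next.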

This is a fully manual installation of the Kylin environment. The versions are:

apache-hive-2.3.4-bin
apache-kylin-2.5.2-bin-hbase1x
hadoop-2.9.1
hbase-1.4.9
kafka_2.11-2.0.0
spark-2.3.2-bin-hadoop2.7
zookeeper-3.4.13

Any help would be greatly appreciated.

Answer 1 (cbjzeqam)

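Check the MR job for the first cube step on YARN. In the job you can drill down into the log of each mapper, and you should be able to see an exception there. Typical causes include "cannot connect to Kafka" and "cannot load the Kafka client jar".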
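
If the YARN web UI is inconvenient, the same mapper logs can usually be fetched on the command line with `yarn logs -applicationId <id>`, provided log aggregation is enabled on the cluster (an assumption about this setup). The application ID is the job ID with the `job_` prefix replaced by `application_`. A minimal Python sketch that builds the command is below. The helper names are hypothetical, and the job ID is the one from the exception in the question.

```python
import re

# MapReduce job IDs look like job_<clusterTimestamp>_<sequence>.
JOB_ID = re.compile(r"^job_(\d+)_(\d+)$")


def to_application_id(job_id: str) -> str:
    """Map a MapReduce job ID to the matching YARN application ID."""
    match = JOB_ID.match(job_id)
    if match is None:
        raise ValueError(f"not a MapReduce job ID: {job_id!r}")
    return f"application_{match.group(1)}_{match.group(2)}"


def yarn_logs_command(job_id: str) -> list:
    """Build the argument list for fetching the aggregated container logs."""
    return ["yarn", "logs", "-applicationId", to_application_id(job_id)]


if __name__ == "__main__":
    # Job ID taken from the "no counters for job" exception.
    print(" ".join(yarn_logs_command("job_1547096967734_0086")))
```

The printed command can be run on a node with the Hadoop client installed. Searching its output for "Exception" or "Error" is a reasonable first pass.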

Answer 2 (r7xajy2e)

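It looks like something is wrong with your environment. Check the logs for more detail on the error message. You should also refer to the latest documentation at http://kylin.apache.org/docs/tutorial/cube_streaming.html. If you want to get Kylin running quickly, try it in an integrated sandbox (such as the HDP sandbox) and develop against that, and make sure the sandbox has at least 10 GB of memory.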

Answer 3 (wlzqhblo)

We were able to fix it by providing kafka-client-2.0.0.jar in the YARN share lib. The MapReduce job log showed that a Kafka class definition could not be found.
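
Before copying a jar around, it can help to confirm that it really contains the class the mapper could not load. A jar is a zip archive, so a short Python sketch is enough. The function name is hypothetical, and `org.apache.kafka.clients.consumer.KafkaConsumer` is used here only as a representative Kafka client class; the class actually named in your job log may differ.

```python
import sys
import zipfile

# Representative Kafka client class; replace with the one from your job log.
KAFKA_CONSUMER_CLASS = "org.apache.kafka.clients.consumer.KafkaConsumer"


def jar_contains_class(jar_path: str, class_name: str) -> bool:
    """Return True if the jar at jar_path has an entry for class_name."""
    # Classes are stored as path/to/ClassName.class inside the archive.
    entry = class_name.replace(".", "/") + ".class"
    with zipfile.ZipFile(jar_path) as jar:
        return entry in jar.namelist()


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_jar.py /path/to/some.jar
    found = jar_contains_class(sys.argv[1], KAFKA_CONSUMER_CLASS)
    print("found" if found else "missing", KAFKA_CONSUMER_CLASS)
```

This only checks the jar file itself. Whether the jar ends up on the classpath of the MapReduce tasks depends on how the cluster distributes libraries.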
