spark/yarn-connection error retryingblockfetcher尝试从随机端口获取块

kpbwa7wx  于 2021-05-27  发布在  Hadoop
关注(0)|答案(1)|浏览(466)

我想在aws机器上的Yarn上设置Spark。我的spark.driver.port是32975。我在Yarn容器日志中看到以下错误。它正在尝试连接到主资源管理器端口35653。我不知道它想从35653端口取什么块。有人能帮忙吗
spark命令
spark submit--deploy mode client--class org.apache.spark.examples.sparkpi$spark\u home/examples/jars/spark-examples\u 2.11-2.4.4.jar 10
hadoop版本:3.x spark版本:2.4.4
2019-12-01 19:09:54,590错误shuffle.retryingblockfetcher:开始提取1个未完成块时出现异常java.io.ioexception:连接到xyz.com/xx.xx.xx.xx:35653在org.apache.spark.network.client.transportclientfactory.createclient(transportclientfactory)处超时(120000毫秒)。java:243)在org.apache.spark.network.client.transportclientfactory.createclient(transportclientfactory)。java:187)在org.apache.spark.network.netty.nettyblocktransferservice$$anon$2.createandstart(nettyblocktransferservice。scala:114)在org.apache.spark.network.shuffle.retryingblockfetcher.fetchalloutstanding(retryingblockfetcher。java:141)在org.apache.spark.network.shuffle.retryingblockfetcher.start(retryingblockfetcher。java:121)在org.apache.spark.network.netty.nettyblocktransferservice.fetchblocks(nettyblocktransferservice。scala:124)在org.apache.spark.network.blocktransferservice.fetchblocksync(blocktransferservice。scala:98)在org.apache.spark.storage.blockmanager.getremotebytes(blockmanager。scala:757)在org.apache.spark.broadcast.torrentbroadcast$$anonfun$org$apache$spark$broadcast$torrentbroadcast$$readblocks$1.apply$mcvi$sp(torrentbroadcast)。scala:162)在org.apache.spark.broadcast.torrentbroadcast$$anonfun$org$apache$spark$broadcast$torrentbroadcast$$readblocks$1.apply(torrentbroadcast)。scala:151)在org.apache.spark.broadcast.torrentbroadcast$$anonfun$org$apache$spark$broadcast$torrentbroadcast$$readblocks$1.apply(torrentbroadcast)。scala:151)在scala.collection.immutable.list.foreach(列表。scala:392)在org.apache.spark.broadcast.torrentbroadcast.org$apache$spark$broadcast$torrentbroadcast$$readblocks(torrentbroadcast)。scala:151)在org.apache.spark.broadcast.torrentbroadcast$$anonfun$readbroadcastblock$1$$anonfun$apply$2.apply(torrentbroadcast)。scala:231)在scala.option.getorelse(选项。scala:121)在org.apache.spark.broadcast.torrentbroadcast$$anonfun$readbroadcastblock$1.apply(torrentbroadcast。scala:211)在org.apache.spark.util.utils$.tryorioexception(utils。scala:1326)在org.apache.spark.broadcast.torrentbroadcast.readbroadcastblock(torrentbroadcast。scala:207)在org.apache.spark.broadcast.torrentbroadcast.\价值$lzycompute(torrentbroadcast。scala:66)在org.apache.spark.broadcast.torrentbroadcast.\u value(torrentbroadcast。scala:66)在org.apache.spark.broadcast.torrentbroadcast.getvalue(torrentbroadcast。scala:96)在org.apache.spark.broadcast.broadcast.value(broadcast。scala:70)在org.apache.spark.scheduler.resulttask.runtask(resulttask。scala:84)在org.apache.spark.scheduler.task.run(task。scala:123)在org.apache.spark.executor.executor$taskrunner$$anonfun$10.apply(executor。scala:408)在org.apache.spark.util.utils$.trywithsafefinally(utils。scala:1360)在org.apache.spark.executor.executor$taskrunner.run(executor。scala:414)位于java.util.concurrent.threadpoolexecutor.runworker(threadpoolexecutor。java:1149)在java.util.concurrent.threadpoolexecutor$worker.run(threadpoolexecutor。java:624)在java.lang.thread.run(线程。java:748)

hxzsmxv2

hxzsmxv21#

请检查hadoop/Yarn是否正常工作。您应该首先启动hadoop,然后检查hadoop是否正在运行,而不仅仅是在终端中执行jps。

hadoop start-all.sh
jps

相关问题