Hive select statement

fzwojiic posted on 2021-05-31 in Hadoop

I am trying to run a select query against a Hive table from spark-shell. This is my code:

  scala> import org.apache.spark.sql.hive.HiveContext
  scala> val sqlContext = new HiveContext(sc)
  scala> val df = sqlContext.sql("select count(*) from timeserie")
  scala> df.head

But I get an error as soon as I run any command that actually reads the data (df.head, df.count, df.show). This is the error:

  org.apache.spark.sql.catalyst.errors.package$TreeNodeException: execute, tree:
  Exchange SinglePartition
  +- *(1) HashAggregate(keys=[], functions=[partial_count(1)], output=[count#13L])
     +- HiveTableScan HiveTableRelation `default`.`timeserie`, org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [data#0, temperature#1, hum#2]
    at org.apache.spark.sql.catalyst.errors.package$.attachTree(package.scala:56)
    at org.apache.spark.sql.execution.exchange.ShuffleExchangeExec.doExecute(ShuffleExchangeExec.scala:119)
    at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:131)
    ... 49 elided
  Caused by: java.io.IOException: Not a file: hdfs://sandbox-hdp.hortonworks.com:8020/warehouse/tablespace/managed/hive/timeserie/delta_0000001_0000001_0000
    at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:337)
    at org.apache.spark.rdd.HadoopRDD.getPartitions(HadoopRDD.scala:200)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:253)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:251)
    ... 73 more

PS: when I run a show tables query, I get the result without any error.
Output of show create table timeserie: (image not available)
Output of hdfs dfs -ls ../../warehouse/tablespace/managed/hive/bdp.db/timeserie: (image not available)
Output of hdfs dfs -ls -r ../../warehouse/tablespace/managed/hive/bdp.db/serie/: (image not available)
