配置单元-加载twitter json数据时出错

h9vpoimq  于 2021-05-29  发布在  Hadoop
关注(0)|答案(2)|浏览(434)

Hive路径= /usr/local/hive/ hadoop路径= /usr/local/hadoop/ hadoop版本=2.6.0
配置单元版本=2.3.2
我又加了一句。加进去 /lib 中的路径和hdf的目录 /input 下载链接=此处(hive-serdes-1.0-snapshot)
我在hiveshell中添加了.jar文件 add jar /usr/local/hive/lib/hive-serdes-1.0-SNAPSHOT.jar; 在创建一个外部表来存储json文件中的数据时,出现以下错误

  1. CREATE EXTERNAL TABLE twitter(id BIGINT,text STRING) ROW FORMAT SERDE 'com.cloudera.hive.serde.JSONSerDe' LOCATION '/input/';

执行错误,从org.apache.hadoop.hive.ql.exec.ddltask返回代码1。org/apache/hadoop/hive/serde2/serde
日志文件-

  1. > 2018-01-24T19:57:40,386 INFO [e81a3c51-48a3-49e9-8121-e50b1ca97a90 main] ql.Driver: Executing command(queryId=infoobjects_20180124195740_04de95b6-9188-4b4e-9561-66c9db233cb9): create external table twitter(id BIGINT,text STRING) ROW FORMAT SERDE 'com.cloudera.hive.serde.JSONSerDe' LOCATION '/input/'
  2. 2018-01-24T19:57:40,387 INFO [e81a3c51-48a3-49e9-8121-e50b1ca97a90 main] ql.Driver: Starting task [Stage-0:DDL] in serial mode
  3. 2018-01-24T19:57:40,388 ERROR [e81a3c51-48a3-49e9-8121-e50b1ca97a90 main] exec.DDLTask: java.lang.NoClassDefFoundError: org/apache/hadoop/hive/serde2/SerDe
  4. at java.lang.ClassLoader.defineClass1(Native Method)
  5. at java.lang.ClassLoader.defineClass(ClassLoader.java:763)
  6. at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
  7. at java.net.URLClassLoader.defineClass(URLClassLoader.java:467)
  8. at java.net.URLClassLoader.access$100(URLClassLoader.java:73)
  9. at java.net.URLClassLoader$1.run(URLClassLoader.java:368)
  10. at java.net.URLClassLoader$1.run(URLClassLoader.java:362)
  11. at java.security.AccessController.doPrivileged(Native Method)
  12. at java.net.URLClassLoader.findClass(URLClassLoader.java:361)
  13. at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
  14. at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:338)
  15. at java.lang.ClassLoader.loadClass(ClassLoader.java:411)
  16. at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
  17. at java.lang.Class.forName0(Native Method)
  18. at java.lang.Class.forName(Class.java:348)
  19. at org.apache.hadoop.conf.Configuration.getClassByNameOrNull(Configuration.java:2013)
  20. at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1978)
  21. at org.apache.hadoop.hive.ql.exec.DDLTask.validateSerDe(DDLTask.java:4213)
  22. at org.apache.hadoop.hive.ql.plan.CreateTableDesc.toTable(CreateTableDesc.java:723)
  23. at org.apache.hadoop.hive.ql.exec.DDLTask.createTable(DDLTask.java:4321)
  24. at org.apache.hadoop.hive.ql.exec.DDLTask.execute(DDLTask.java:354)
  25. at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:199)
  26. at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:100)
  27. at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:2183)
  28. at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1839)
  29. at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1526)
  30. at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1237)
  31. at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1227)
  32. at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:233)
  33. at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:184)
  34. at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:403)
  35. at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:821)
  36. at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:759)
  37. at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:686)
  38. at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  39. at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
  40. at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
  41. at java.lang.reflect.Method.invoke(Method.java:498)
  42. at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
  43. at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
  44. Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.hive.serde2.SerDe
  45. at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
  46. at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
  47. at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:338)
  48. at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
  49. ... 40 more
  50. 2018-01-24T19:57:40,388 ERROR [e81a3c51-48a3-49e9-8121-e50b1ca97a90 main] ql.Driver: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask. org/apache/hadoop/hive/serde2/SerDe

我为任何错误道歉,这是我在这里的第一个问题(因为我在网上找不到解决办法)。提前谢谢。
更新:阿里的回答对我有效。此外,我还必须重新格式化json以包含单行json对象。

sycxhyv7

sycxhyv71#

我终于找到了。
从Hive0.12开始,它有一个内置的
jsonserde(hcatalog核心中的hive 0.12及更高版本)。
我们使用的所有serde都与我们使用的版本不兼容(在我的例子中是hive2.3.2)
您可以添加与您的版本相对应的jar add jar HIVE_HOME/lib/hive-hcatalog-core-2.3.2.jar 然后在您的查询中更改“com.cloudera…”

  1. ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe'

希望有帮助

mkshixfv

mkshixfv2#

我也有同样的错误,但是当我修改为“row format serde'org.apache.hive.hcatalog.data.jsonserde'”时,它成功了,但是当我从表中选择时;这只显示空表Hive>从tweets中选择count();好的0“

相关问题