在输入中,我有一个要在配置单元上导入的json文件:
[
{
"code": "ACPBC3P",
"libelle": "Bon de commande Prime de satisfaction ACP",
"libelleCourt": "Bon de commande Prime de satisfaction ACP",
"libelleLong": "Bon de commande Prime de satisfaction ACP",
"dureeStockage": 24,
"dureeArchivage": 96,
"dureeEpuration": 120,
"dureeStockageReelle": 24,
"dureeArchivageReelle": 96,
"dureeEpurationReelle": 120,
"typologie": {
"code": "ACP",
"libelle": "ACP - Activ'projet"
},
"sousTypologie": {
"code": "ACPBC3P",
"libelle": "BC3P - Bon de commande Prime de satisfaction"
}
},
{
"code": "ACPC1",
"libelle": "C1 - Demande d'avoir",
"libelleCourt": "C1 - Demande d'avoir",
"libelleLong": "C1 - Demande d'avoir",
"dureeStockage": 36,
"dureeArchivage": 84,
"dureeEpuration": 120,
"dureeStockageReelle": 36,
"dureeArchivageReelle": 84,
"dureeEpurationReelle": 120,
"typologie": {
"code": "ACP",
"libelle": "ACP - Activ'projet"
},
"sousTypologie": {
"code": "ACPC1",
"libelle": "C1 - Demande d'avoir"
}
},
{
"code": "ACPC2",
"libelle": "C2 - Relance fournisseur",
"libelleCourt": "C2 - Relance fournisseur",
"libelleLong": "C2 - Relance fournisseur",
"dureeStockage": 36,
"dureeArchivage": 84,
"dureeEpuration": 120,
"dureeStockageReelle": 36,
"dureeArchivageReelle": 84,
"dureeEpurationReelle": 120,
"typologie": {
"code": "ACP",
"libelle": "ACP - Activ'projet"
},
我试图用这种复杂的类型来捕捉这些信息:
ARRAY <STRUCT <`code`: STRING,` libelle`: STRING, `libelleCourt`: STRING,` libelleLong`: STRING, `storage duration`: INT, `Archive duration` INT, `dureeEpuration`: INT,` dureeStockageReelle`: INT, `dureeArchivageReelle`: INT,` dureeEpurationReelle`: INT, `typologie`: STRUCT <` code` STRING, `libelle` STRING>,` sousTypologie`: STRUCT <`code`: STRING,` libelle`: STRING>, `modeCapture`: STRUCT <` code`: STRING, `libelle`: STRING>,` master`: STRING, `codeActivite`: STRING >> but unfortunately it do not work !
ARRAY <STRUCT <`code`: STRING,` libelle`: STRING, `libelleCourt`: STRING,` libelleLong`: STRING, `storage duration`: INT, `Archive duration` INT, `dureeEpuration`: INT,` dureeStockageReelle`: INT, `dureeArchivageReelle`: INT,` dureeEpurationReelle`: INT, `typologie`: STRUCT <` code` STRING, `libelle` STRING>,` sousTypologie`: STRUCT <`code`: STRING,` libelle`: STRING>, `modeCapture`: STRUCT <` code`: STRING, `libelle`: STRING>,` master`: STRING, `codeActivite`: STRING >> but unfortunately it do not work !
1条答案
按热度按时间8cdiaqws1#
你没有提到任何关于错误的事情。一般来说,在使用json-serde时需要注意两件事。
org.apache.hadoop.hive.serde2.jsonserde不支持以方括号“[”开头的json数据
jsonserde基于文本serde,每一个换行符都被视为一个新记录
有效格式:
格式1无效:
格式2无效:
在将输入数据加载到配置单元表之前,应将其预处理为以下格式
ddl地址:
选择数据的查询: