使用apache cassandra和elasticsearch时，如何在apache atlas中保存沿袭信息

yjghlzjz 于 2021-06-13 发布在 Cassandra

关注(0)|答案(1)|浏览(734)

我计划部署apacheatlas，使用apachecassandra作为存储后端，elasticsearch作为索引后端。我想知道如何用这个保存血统信息？它提供getapi来获取沿袭信息，但似乎没有办法保存它。

来源：https://stackoverflow.com/questions/62413196/how-do-i-save-lineage-info-in-apache-atlas-when-using-apache-cassandra-and-elast

1条答案

按热度按时间

mfpqipee1#

在atlas中，当它们通过使用输入和输出的过程连接起来时，谱系就被创建了。
示例：如果要查看两个配置单元表类型之间的沿袭，则如下所示： T1(hive_table)--->P1(hive_process)--->T2(hive_table) 因此，基本上实体需要通过一个流程类型进行链接。
在atlas中，流程是实体，可以使用api创建 POST: /v2/entity 其中定义了输入和输出，如上文所述 hive_process :

POST: /api/atlas/v2/entity
    {
      "entity": {
        "typeName": "hive_process",
        "attributes": {
          "outputs": [
            {
              "guid": "2", 
              "typeName": "hive_table",
              "uniqueAttributes": {
                "qualifiedName": "t2@primary"
              }
            }
          ],
          "qualifiedName": "p1@primary",
          "inputs": [
            {
              "guid": "1",
              "typeName": "hive_table",
              "uniqueAttributes": {
                "qualifiedName": "t1@primary"
              }
            }
          ],
          "name": "P1-Process"
        }
      }
    }

在创建流程之前需要注意的一点是，引用的实体（输入、输出）应该预先存在，否则流程创建将失败。
如果您的需求不包含预先存在的类型，那么您当然可以继续为atlas实体和流程定义您自己的类型
关于apache站点上atlas类型系统的更多信息

赞(0）回复(0）举报 2021-06-13

我来回答

使用apache cassandra和elasticsearch时，如何在apache atlas中保存沿袭信息

1条答案

相关问题

热门标签

最新问答