Getting the key hierarchy from a nested dict of other lists/dicts in Python

w8rqjzmb  posted on 2023-01-12 in Python

I have an input dict like this:

input={'boo': 'its', 'soo': 'your', 'roo': 'choice', 'qoo': 'this', 'fizz': 'is', 'buzz': 'very', 'yoyo': 'rambling', 'wazzw': 'lorem', 'bnn': 'ipsum', 'cc': [{'boo': 'fill', 'soo': 'ing', 'roo': 'in', 'qoo': 'the', 'fizz': 'words', 'buzz': 'here', 'yoyo': 'we', 'wazzw': 'go', 'nummm': 2, 'bsdfff': 3, 'hgdjgkk': 4, 'opu': 1, 'mnb': True}, {'boo': 'again', 'soo': 'loop', 'roo': 'de', 'qoo': 'loop', 'fizz': 'wowzers', 'buzz': 'try', 'yoyo': 'again', 'wazzw': 'how', 'nummm': 1, 'bsdfff': 7, 'hgdjgkk': 0, 'opu': 1, 'mnb': True}], 'soos': ['ya'], 'tyu': 'doin', 'dddd3': 'today'}

Using only Python's built-in libraries, how can I get the dot-separated hierarchy of each key? That is:

expected_output=['boo','soo','roo','qoo','fizz','buzz','yoyo','wazzw','bnn','cc','cc.boo','cc.soo','cc.roo','cc.qoo','cc.fizz','cc.buzz','cc.yoyo','cc.wazzw','cc.nummm','cc.bsdfff','cc.hgdjgkk','cc.opu','cc.mnb','soos','tyu','dddd3']

My first attempt, which does not handle lists:

def getKeys(object, prev_key = None, keys = []):
    if type(object) != type({}):
        keys.append(prev_key)
        return keys
    new_keys = []
    for k, v in object.items():
        if prev_key != None:
            new_key = "{}.{}".format(prev_key, k)
        else:
            new_key = k
        new_keys.extend(getKeys(v, new_key, []))
    return new_keys

bvpmtnay  #1

Use a recursive generator:

def hierarchy(d, prefix=None):
    if isinstance(d, dict):
        for k, v in d.items():
            prefix2 = f'{prefix}.{k}' if prefix else k
            yield prefix2
            if isinstance(v, list):
                seen = set()
                for x in v:
                    if isinstance(x, dict):
                        yield from hierarchy({k: v for k, v in x.items()
                                              if k not in seen},
                                             prefix=prefix2)
                        seen.update(x.keys())
                    else:
                        yield from hierarchy(x, prefix=prefix2)
            elif isinstance(v, dict):
                yield from hierarchy(v, prefix=prefix2)
                
out = list(hierarchy(input))

# validation
assert out == expected_output

Output:

['boo', 'soo', 'roo', 'qoo', 'fizz', 'buzz', 'yoyo', 'wazzw', 'bnn',
 'cc', 'cc.boo', 'cc.soo', 'cc.roo', 'cc.qoo', 'cc.fizz', 'cc.buzz',
 'cc.yoyo', 'cc.wazzw', 'cc.nummm', 'cc.bsdfff', 'cc.hgdjgkk', 'cc.opu', 'cc.mnb',
 'soos', 'tyu', 'dddd3']

A different example:

list(hierarchy({'l1': {'l2': {'l3': 'test', 'l4': [['abc'], {'l5': 'def'}]}}}))
# ['l1', 'l1.l2', 'l1.l2.l3', 'l1.l2.l4', 'l1.l2.l4.l5']
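
The generator above relies on `yield from` to pass along everything produced by the recursive call. Here is a minimal sketch of that mechanism on its own. The function name `leaves` is hypothetical and not part of the answer, and it collects leaf values rather than key paths, purely for illustration:

```python
def leaves(obj):
    """Yield every non-container value found in nested dicts and lists."""
    if isinstance(obj, dict):
        for value in obj.values():
            yield from leaves(value)  # re-yield whatever the recursion yields
    elif isinstance(obj, list):
        for item in obj:
            yield from leaves(item)
    else:
        yield obj


data = {'l1': {'l2': {'l3': 'test', 'l4': [['abc'], {'l5': 'def'}]}}}
values = list(leaves(data))
print(values)  # ['test', 'abc', 'def']
```

Because it is a generator, nothing is computed until the result is iterated, which is why the answer wraps the call in `list(...)`.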

u3r8eeie  #2

A revision of mozway's answer: https://www.mycompiler.io/view/6LB7k4TVOuj

# Includes $ for root node, and [] where access is through an array

def hierarchy(struct, path=None):
    if isinstance(struct, dict):
        path = path if path else '$'
        return set(
            child_path
                for key, obj   in struct.items()
                for child_path in hierarchy(obj, f'{path}.{key}')
        ).union(
            [path]
        )
    elif isinstance(struct, list):
        path = f'{path}[]' if path else '$[]'
        return set(
            child_path
                for obj        in struct
                for child_path in hierarchy(obj, path)
        ).union(
            [path]
        )
    else:
        return {path}

Or...

from itertools import chain

# Excludes those $ and [] markers

def hierarchy2(d):
    if isinstance(d, dict):
        return set(
            f'{k}.{x}' if x else k
                for k,v in d.items()
                for x in chain([''], hierarchy2(v))
        )
    elif isinstance(d, list):
        return set(
            v
                for l in d
                for v in hierarchy2(l)
                    if v
        )
    else:
        return set()
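
Both variants above return a `set`, so the order of the paths is not guaranteed. If the order matters, as in the expected output in the question, one option is to collect the paths as dict keys, since dicts preserve insertion order (Python 3.7+). This is only a sketch under that assumption, and `hierarchy_ordered` is a hypothetical name that does not come from the linked code:

```python
def hierarchy_ordered(d, prefix=''):
    """Return dot-separated key paths in first-seen order, without duplicates."""
    paths = {}  # a dict used as an ordered set; the values are ignored
    if isinstance(d, dict):
        for k, v in d.items():
            path = f'{prefix}.{k}' if prefix else str(k)
            paths[path] = None
            paths.update(hierarchy_ordered(v, path))
    elif isinstance(d, list):
        for item in d:
            # list elements share the path of the key that holds the list
            paths.update(hierarchy_ordered(item, prefix))
    return paths


sample = {'a': 1, 'cc': [{'x': 1, 'y': 2}, {'y': 3, 'z': 4}], 'b': {'c': None}}
ordered = list(hierarchy_ordered(sample))
print(ordered)  # ['a', 'cc', 'cc.x', 'cc.y', 'cc.z', 'b', 'b.c']
```

Updating a dict with a key it already holds keeps the key's original position, so `cc.y` stays where it was first seen.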

db2dz4w8  #3

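To handle sublists, you can iterate over the items of each list and check whether each one is a dict; if it is, recursively get the key paths of the sub-dict and prefix each of them with the current key: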

def get_keys(d):
    keys = []
    for key, value in d.items():
        if isinstance(value, list):
            for obj in value:
                if isinstance(obj, dict):
                    for path in get_keys(obj):
                        keys.append(f'{key}.{path}')
                else:
                    keys.append(key)
        else:
            keys.append(key)
    return keys

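So, given your sample input, get_keys(input) returns the following. Note that the cc.* paths appear once per dict in the list, and the bare 'cc' key itself is not included: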

['boo', 'soo', 'roo', 'qoo', 'fizz', 'buzz', 'yoyo', 'wazzw', 'bnn', 'cc.boo', 'cc.soo', 'cc.roo', 'cc.qoo', 'cc.fizz', 'cc.buzz', 'cc.yoyo', 'cc.wazzw', 'cc.nummm', 'cc.bsdfff', 'cc.hgdjgkk', 'cc.opu', 'cc.mnb', 'cc.boo', 'cc.soo', 'cc.roo', 'cc.qoo', 'cc.fizz', 'cc.buzz', 'cc.yoyo', 'cc.wazzw', 'cc.nummm', 'cc.bsdfff', 'cc.hgdjgkk', 'cc.opu', 'cc.mnb', 'soos', 'tyu', 'dddd3']

Demo: https://replit.com/@blhsing/OpenGoldenIntegrationtesting
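
If the repeated paths are unwanted, one possible follow-up step is `dict.fromkeys`, which drops duplicates while keeping first-seen order. This step is not part of the answer; the sketch below runs on a shortened, hand-written version of the list above rather than on the real output:

```python
# A shortened stand-in for the list returned by get_keys(input)
keys = ['boo', 'cc.boo', 'cc.soo', 'cc.boo', 'cc.soo', 'soos']

# dict keys are unique and keep insertion order, so this de-duplicates in order
unique = list(dict.fromkeys(keys))
print(unique)  # ['boo', 'cc.boo', 'cc.soo', 'soos']
```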
