Pandas DataFrame: return the indices of duplicated elements as a list

Posted by e3bfsja2 on 2022-11-10 in Other


I want to get the indices of the duplicated elements in a column as a list. The approach I have found so far is:

import numpy as np
import pandas as pd
test = ['a', 'a', 'b', 'c', 'b']
testdf = pd.DataFrame(test, columns=['test'])
# Positions at which the boolean duplicated() mask is True
np.asarray(np.where(list(testdf['test'].duplicated()))).tolist()[0]

# [1, 4]

This seems ridiculously convoluted.
Is there a better way?

numpy

Source: https://stackoverflow.com/questions/74144626/pandas-df-return-indices-of-duplicated-elements-as-a-list

3 Answers


kxe2p93d1#

You can combine .duplicated() with .tolist():

testdf.index[testdf.test.duplicated()].tolist()
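As a self-contained sketch of the one-liner above: `duplicated()` also accepts a `keep` argument that controls which occurrences are flagged. The variable names `later`, `all_dups`, and `earlier` are illustrative and not part of the original answer.

```python
import pandas as pd

testdf = pd.DataFrame({'test': ['a', 'a', 'b', 'c', 'b']})

# Default keep='first': every occurrence after the first one is flagged
later = testdf.index[testdf['test'].duplicated()].tolist()

# keep=False: every member of a duplicated group is flagged
all_dups = testdf.index[testdf['test'].duplicated(keep=False)].tolist()

# keep='last': every occurrence except the last one is flagged
earlier = testdf.index[testdf['test'].duplicated(keep='last')].tolist()

print(later)     # [1, 4]
print(all_dups)  # [0, 1, 2, 4]
print(earlier)   # [0, 2]
```

The default matches the `[1, 4]` result from the question; `keep=False` may be the better fit if the first occurrence should count as a duplicate too.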

Posted 2022-11-10

sczxawaw2#

Just index the index with the boolean mask:

testdf.index[testdf['test'].duplicated()]

Add to_list to get a plain list:

testdf.index[testdf['test'].duplicated()].to_list()

Output:

[1, 4]
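One difference from the question's `np.where` version is worth noting: indexing `testdf.index` returns index labels, whereas `np.where` returns integer positions. The two coincide only for the default `RangeIndex`. A minimal sketch, assuming hypothetical string labels `r0` to `r4` that are not in the original question:

```python
import numpy as np
import pandas as pd

# Hypothetical string labels, chosen only to make labels and positions differ
testdf = pd.DataFrame({'test': ['a', 'a', 'b', 'c', 'b']},
                      index=['r0', 'r1', 'r2', 'r3', 'r4'])
mask = testdf['test'].duplicated()

# Index labels of the duplicated rows
labels = testdf.index[mask].to_list()

# Integer positions of the duplicated rows, independent of the labels
positions = np.flatnonzero(mask.to_numpy()).tolist()

print(labels)     # ['r1', 'r4']
print(positions)  # [1, 4]
```

Which one to use depends on whether the result feeds `.loc` (labels) or `.iloc` (positions).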

Posted 2022-11-10

9wbgstp73#

%%time
# Jupyter cell magic; assumes pandas is already imported as pd

test = ['a', 'a', 'b', 'c', 'b']
testdf = pd.DataFrame(test, columns=['test'])
testdf[testdf.test.duplicated()].index.to_list()

# Wall time: 2 ms

# [1, 4]

Posted 2022-11-10
