pandas 将给定行移动到DataFrame的末尾

9vw9lbht 于 2023-05-27 发布在其他

关注(0)|答案(6)|浏览(233)

我想从DataFrame中获取给定的行，并将其前置或追加到同一DataFrame中。
我下面的代码就是这样做的，但是我不确定我是否用正确的方法来做，或者是否有更简单，更好，更快的方法？

testdf = df.copy()
#get row 
target_row = testdf.ix[[2],:]
#del row from df
testdf.drop([testdf.index[2]], axis=0, inplace=True)
#concat original row to end or start of df
newdf = pd.concat([testdf, target_row], axis=0)

谢谢

pandas

来源：https://stackoverflow.com/questions/30940773/move-given-row-to-end-of-dataframe

6条答案

按热度按时间

7gyucuyw1#

而不是concat，我会在shifting之后直接赋值给df，然后使用iloc引用你想要赋值的行的位置，你必须调用squeeze，这样你就只赋值了，失去了原始的索引值，否则它会引发一个ValueError：

In [210]:
df = pd.DataFrame({'a':np.arange(5)})
df

Out[210]:
   a
0  0
1  1
2  2
3  3
4  4

In [206]:
target_row = df.ix[[2],:]
target_row

Out[206]:
   a
2  2

In [211]:
df = df.shift()
df.iloc[0] = target_row.squeeze()
df

Out[211]:
   a
0  2
1  0
2  1
3  2
4  3

编辑

在末尾插入：

In [255]:
df = pd.DataFrame({'a':np.arange(5)})
target_row = df.ix[[2],:]
df = df.shift(-1)
df.iloc[-1] = target_row.squeeze()
df

Out[255]:
   a
0  1
1  2
2  3
3  4
4  2

另一个更新

感谢@AsheKetchum指出我之前的答案是不正确的，现在3年后再看这个，我意识到你可以只reindex orig df：
如果我们将索引的副本作为list：

In[24]:
idx = df.index.tolist()
idx

Out[24]: [0, 1, 2, 3, 4]

然后我们可以从这个列表中pop感兴趣的索引：

In[25]:
idx.pop(2)
idx

Out[25]: [0, 1, 3, 4]

现在我们可以通过在这个列表前面添加reindex：

In[26]:
df.reindex([2] + idx)

Out[26]: 
   a
2  2
0  0
1  1
3  3
4  4

或追加：

In[27]:    
df.reindex(idx+[2])

Out[27]: 
   a
0  0
1  1
3  3
4  4
2  2

赞(0）回复(0）举报 2023-05-27

b1uwtaje2#

为了提高性能，您可能需要考虑保留一个包含所有要移动到DataFrame末尾的行的运行列表，然后在一个pd.concat操作中一次性移动这些行。

df = pd.DataFrame(np.random.rand(5, 3), columns=list('ABC'))
target_rows = [1, 3, 4]

a = df.iloc[[i for i in df.index if i not in target_rows], :]
b = df.iloc[target_rows, :]
>>> pd.concat([a, b])
          A         B         C
0  0.818722  0.174153  0.522383
2  0.581577  0.840306  0.985089
1  0.645752  0.238476  0.670922
3  0.198271  0.501911  0.954477
4  0.965488  0.735559  0.701077

赞(0）回复(0）举报 2023-05-27

smdncfj33#

我可以把它简化成一行代码：

pd.concat([df.ix[0:1], df.ix[3:], df.ix[[2]]])

我看不出你的代码和我的代码之间有任何性能差异。据推测，抄袭是最大的罪魁祸首。

赞(0）回复(0）举报 2023-05-27

t98cgbkg4#

类似于YH Wu写的，如果你知道指数（或指数），你可以在一行中完成。但是，ix已被弃用，因此请使用loc：

import pandas as pd
import numpy as np

df = pd.DataFrame({'a':np.arange(5)})

#    a
# 0  0
# 1  1
# 2  2
# 3  3
# 4  4

# move the line with index 2 to the end:
df2 = df.drop(2).append(df.loc[2])

#    a
# 0  0
# 1  1
# 3  3
# 4  4
# 2  2

# several indices, moves 3 and 2 to the end in that order:
to_move = [3, 2]
df2 = df.drop(to_move).append(df.loc[to_move])

#    a
# 0  0
# 1  1
# 4  4
# 3  3
# 2  2

.drop删除具有您作为参数给予的索引（或多个索引）的行。使用df.loc[x]选择索引为x的行。如果你写df = df.drop…，则直接将更改应用于原始DataFrame。如果你想重置索引，你可以执行“.reset_index（drop=True）”（如果你不想将原始索引作为新列保留，则drop=True）。

赞(0）回复(0）举报 2023-05-27

8mmmxcuj5#

如果需要按值移动行（例如您知道索引的名称，但不知道它的位置）。然后，您可以使用以下内容（假设索引的值为2）：

df.reindex([index for index in df.index if index != 2] + [2], axis=0)

其逻辑如下：

1. [index for index in df.index if index != 2] # create a list of all indexes except for the one you want to move to the end
2. + [2] # append the index you want to move to the end
3. reindex across the index (axis=0)

这种方法的优点：
1.可以用于任何数量的索引，你想移动
1.可以很容易地修改，将所需的索引移动到前面而不是后面
1.无硬编码：我们按值而不是按位置移动索引
1.列表理解提供了良好的性能

赞(0）回复(0）举报 2023-05-27

ykejflvf6#

我只是删除一行并在末尾追加。

df = pd.DataFrame({'a':np.arange(5)})
df.drop(2).append(df.ix[2]).reset_index(drop=True) # move 3rd row
df.drop(df.head(2).index).append(df.head(2)).reset_index() # move first 2 rows

赞(0）回复(0）举报 2023-05-27

我来回答

pandas 将给定行移动到DataFrame的末尾

6条答案

相关问题

热门标签

最新问答