pandas 无法在numpy中分割 Dataframe

8qgya5xd 于 2023-04-04 发布在其他

关注(0)|答案(1)|浏览(137)

无法使用numpy split函数将dataframe的子集分配给

cols =["fLength","fWidth","fSize","fConc","fConcl","fAsym","fM3Long","fAlpha","fDist","class"]
df = pd.read_csv("magic04.data",names = cols)
df['class'] = (df['class']=='g').astype(int)

train, valid, test = np.split(df.sample(frac=1), [int(0.6*len(df)) , int(0.8*len(df)), ])

KeyError                                  Traceback (most recent call last)
/usr/local/lib/python3.9/dist-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance)
   3628             try:
-> 3629                 return self._engine.get_loc(casted_key)
   3630             except KeyError as err:

17 frames
KeyError: 0

The above exception was the direct cause of the following exception:

KeyError                                  Traceback (most recent call last)
/usr/local/lib/python3.9/dist-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance)
   3629                 return self._engine.get_loc(casted_key)
   3630             except KeyError as err:
-> 3631                 raise KeyError(key) from err
   3632             except TypeError:
   3633                 # If we have a listlike key, _check_indexing_error will raise

尝试阅读文档，但没有发现任何有用的内容。

pandas

来源：https://stackoverflow.com/questions/75900884/not-able-to-split-data-frame-in-numpy

1条答案

按热度按时间

xytpbqjk1#

代码中的错误是，您试图将numpy例程与pandas Dataframe 一起使用。最好的方法是将df.sample转换为numpy数组，然后使用np.split()。
试试这个-它在我的VSCode上运行得很好：

npsample=np.array(df.sample(frac=1))
train, valid, test = np.split(npsample, [int(0.6*len(npdata)) , int(0.8*len(npdata)), ])

赞(0）回复(0）举报 2023-04-04

我来回答

pandas 无法在numpy中分割 Dataframe

无法使用numpy split函数将dataframe的子集分配给

1条答案

相关问题

热门标签

最新问答