pyspark 您访问的页面不存在!

i86rm4rw  于 2023-11-16  发布在  Spark
关注(0)|答案(1)|浏览(246)

我试图在一个大数据平台上使用kmeans构建一个聚类模型,我得到了这个错误,如何解决?

  1. File "C:\Users\knwafor\run_scripts\bigdata.py", line 473, in <module>
  2. kmeans_model = kmeans.fit(data_with_pca)
  3. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\pyspark\ml\base.py", line 205, in fit
  4. return self._fit(dataset)
  5. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\pyspark\ml\wrapper.py", line 381, in _fit
  6. java_model = self._fit_java(dataset)
  7. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\pyspark\ml\wrapper.py", line 377, in _fit_java
  8. self._transfer_params_to_java()
  9. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\pyspark\ml\wrapper.py", line 174, in _transfer_params_to_java
  10. pair = self._make_java_param_pair(param, self._defaultParamMap[param])
  11. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\pyspark\ml\wrapper.py", line 158, in _make_java_param_pair
  12. java_param = self._java_obj.getParam(param.name)
  13. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\py4j\java_gateway.py", line 1322, in __call__
  14. return_value = get_return_value(
  15. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\pyspark\errors\exceptions\captured.py", line 169, in deco
  16. return f(*a, **kw)
  17. File "C:\Users\knwafor\run_scripts\runscripts_env\lib\site-packages\py4j\protocol.py", line 326, in get_return_value
  18. raise Py4JJavaError(
  19. py4j.protocol.Py4JJavaError: An error occurred while calling o1468.getParam.
  20. : java.util.NoSuchElementException: Param maxBlockSizeInMB does not exist.
  21. at org.apache.spark.ml.param.Params.$anonfun$getParam$2(params.scala:705)
  22. at scala.Option.getOrElse(Option.scala:189)
  23. at org.apache.spark.ml.param.Params.getParam(params.scala:705)
  24. at org.apache.spark.ml.param.Params.getParam$(params.scala:703)
  25. at org.apache.spark.ml.PipelineStage.getParam(Pipeline.scala:41)
  26. at sun.reflect.GeneratedMethodAccessor41.invoke(Unknown Source)
  27. at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
  28. at java.lang.reflect.Method.invoke(Method.java:498)
  29. at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)
  30. at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)
  31. at py4j.Gateway.invoke(Gateway.java:282)
  32. at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)
  33. at py4j.commands.CallCommand.execute(CallCommand.java:79)
  34. at py4j.ClientServerConnection.waitForCommands(ClientServerConnection.java:182)
  35. at py4j.ClientServerConnection.run(ClientServerConnection.java:106)
  36. at java.lang.Thread.run(Thread.java:748)
  37. SUCCESS: The process with PID 13988 (child process of PID 17724) has been terminated.
  38. SUCCESS: The process with PID 17724 (child process of PID 16860) has been terminated.
  39. SUCCESS: The process with PID 16860 (child process of PID 7256) has been terminated.

字符串

mxg2im7a

mxg2im7a1#

我后来解决了这个问题,没有在pyspark中使用kmeans,而是使用了BisectingKMeans算法,它也给了我聚类。

相关问题