driver = webdriver.Chrome():: selenium 方法的问题-如何解决

64jmpszr  于 2023-06-06  发布在  其他
关注(0)|答案(1)|浏览(259)

嗯-我试图找出最简单的方法来收集数据从clutch.io

  • 我尝试了各种方法从网站(clutch.io)收集数据,但似乎都失败了:

看这里

  1. from bs4 import BeautifulSoup
  2. from selenium import webdriver
  3. driver = webdriver.Chrome()
  4. url = 'https://clutch.co/it-services/msp'
  5. driver.get(url=url)
  6. soup = BeautifulSoup(driver.page_source,"lxml")
  7. links = []
  8. for l in soup.find_all('li',class_='website-link website-link-a'):
  9. results = (l.a.get('href'))
  10. links.append(results)
  11. print(links, "\n", "Count links - ", len(links))

返回此错误:

  1. ---------------------------------------------------------------------------
  2. WebDriverException Traceback (most recent call last)
  3. <ipython-input-4-4f37092106f4> in <cell line: 4>()
  4. 2 from selenium import webdriver
  5. 3
  6. ----> 4 driver = webdriver.Chrome()
  7. 5
  8. 6 url = 'https://clutch.co/it-services/msp'
  9. 5 frames
  10. /usr/local/lib/python3.10/dist-packages/selenium/webdriver/remote/errorhandler.py in check_response(self, response)
  11. 243 alert_text = value["alert"].get("text")
  12. 244 raise exception_class(message, screen, stacktrace, alert_text) # type: ignore[call-arg] # mypy is not smart enough here
  13. --> 245 raise exception_class(message, screen, stacktrace)
  14. WebDriverException: Message: unknown error: cannot find Chrome binary
  15. Stacktrace:
  16. #0 0x55a6ebf424e3 <unknown>
  17. #1 0x55a6ebc71c76 <unknown>
  18. #2 0x55a6ebc98757 <unknown>
  19. #3 0x55a6ebc97029 <unknown>
  20. #4 0x55a6ebcd5ccc <unknown>
  21. #5 0x55a6ebcd547f <unknown>
  22. #6 0x55a6ebcccde3 <unknown>
  23. #7 0x55a6ebca22dd <unknown>
  24. #8 0x55a6ebca334e <unknown>
  25. #9 0x55a6ebf023e4 <unknown>
  26. #10 0x55a6ebf063d7 <unknown>
  27. #11 0x55a6ebf10b20 <unknown>
  28. #12 0x55a6ebf07023 <unknown>
  29. #13 0x55a6ebed51aa <unknown>
  30. #14 0x55a6ebf2b6b8 <unknown>
  31. #15 0x55a6ebf2b847 <unknown>
  32. #16 0x55a6ebf3b243 <unknown>
  33. #17 0x7ffb30c27609 start_thread

如何工作!?嗯-我正试图找出从www.example.com收集数据的最简单方法clutch.io

5tmbdcev

5tmbdcev1#

可以使用ChromeDriverManager

  1. from webdriver_manager.chrome import ChromeDriverManager
  2. from bs4 import BeautifulSoup
  3. from selenium import webdriver
  4. driver = webdriver.Chrome(ChromeDriverManager().install())
  5. url = 'https://clutch.co/it-services/msp'
  6. driver.get(url=url)
  7. soup = BeautifulSoup(driver.page_source,"lxml")

相关问题