在Playwright for Python中,如何获取与ElementHandle相关的元素(子元素、父元素、祖元素、兄弟元素)?

klh5stk1  于 2024-01-05  发布在  Python
关注(0)|答案(3)|浏览(211)

在playwright-python中,我知道我可以使用querySelector()获得elementHandle
示例(同步):

  1. from playwright import sync_playwright
  2. with sync_playwright() as p:
  3. for browser_type in [p.chromium, p.firefox, p.webkit]:
  4. browser = browser_type.launch()
  5. page = browser.newPage()
  6. page.goto('https://duckduckgo.com/')
  7. element = page.querySelector('input[id=\"search_form_input_homepage\"]')

字符串
如何根据elementHandle获取与此相关的an元素?即parent、grandparent、siblings、children句柄?

wfsdck30

wfsdck301#

原始答案:

使用querySelector()/querySelectorAllXPath (XML Path Language)可以检索elementHandle(分别是句柄的集合)。一般来说,**可以用于在XML文档中导航元素和属性。

  1. from playwright import sync_playwright
  2. with sync_playwright() as p:
  3. for browser_type in [p.chromium, p.firefox, p.webkit]:
  4. browser = browser_type.launch(headless=False)
  5. page = browser.newPage()
  6. page.goto('https://duckduckgo.com/')
  7. element = page.querySelector('input[id=\"search_form_input_homepage\"]')
  8. parent = element.querySelector('xpath=..')
  9. grandparent = element.querySelector('xpath=../..')
  10. siblings = element.querySelectorAll('xpath=following-sibling::*')
  11. children = element.querySelectorAll('xpath=child::*')
  12. browser.close()

字符串

更新(2022-07-22):

似乎browser.newPage()已被弃用,因此在较新版本的playwright中,该函数被称为browser.new_page()(注意不同的函数名称)。
可以选择先创建一个浏览器上下文(然后关闭它),并在该上下文上调用new_page()
访问孩子/父母/祖父母/兄弟姐妹的方式保持不变。

  1. from playwright import sync_playwright
  2. with sync_playwright() as p:
  3. for browser_type in [p.chromium, p.firefox, p.webkit]:
  4. browser = browser_type.launch(headless=False)
  5. context = browser.new_context()
  6. page = context.new_page()
  7. page.goto('https://duckduckgo.com/')
  8. element = page.querySelector('input[id=\"search_form_input_homepage\"]')
  9. parent = element.querySelector('xpath=..')
  10. grandparent = element.querySelector('xpath=../..')
  11. siblings = element.querySelectorAll('xpath=following-sibling::*')
  12. children = element.querySelectorAll('xpath=child::*')
  13. context.close()
  14. browser.close()

展开查看全部
svdrlsy4

svdrlsy42#

接受的答案是在旧版本的剧作家。使用以下格式为当前版本,它将工作。

  1. from playwright.sync_api import sync_playwright
  2. with sync_playwright() as p:
  3. for browser_type in [p.chromium, p.firefox, p.webkit]:
  4. browser = browser_type.launch(headless=False)
  5. context = browser.new_context()
  6. page =context.new_page()
  7. page.goto('https://duckduckgo.com/')
  8. element = page.query_selector('input[id=\"search_form_input_homepage\"]')
  9. parent = element.query_selector('xpath=..')
  10. grandparent = element.query_selector('xpath=../..')
  11. siblings = element.query_selector_all('xpath=following-sibling::*')
  12. children = element.query_selector_all('xpath=child::*')
  13. context.close()
  14. browser.close()

字符串

展开查看全部
l2osamch

l2osamch3#

现在不再使用query_selector,而是使用

  1. child_element = page.locator('input[id=\"search_form_input_homepage\"]')
  2. parent = page.locator('div').filter(has=child_element)
  3. grandparent = page.locator('xpath=../..')
  4. siblings = page.locator('xpath=following-sibling::*')
  5. children = page.locator('xpath=child::*')

字符串
参见链接

相关问题