Scrapy XPath returns nothing

but5z9lq posted on 2022-11-09 in Other


I am trying to get the phone number, but my XPath returns nothing. How can I fix this? This is the page link: https://aaos22.mapyourshow.com/8_0/exhibitor/exhibitor-details.cfm?exhid=999999999999

import scrapy
from scrapy.http import Request
from scrapy_selenium import SeleniumRequest


class TestSpider(scrapy.Spider):
    name = 'test'

    def start_requests(self):
        # Load the exhibitor gallery through Selenium so the page is rendered first.
        yield SeleniumRequest(
            url="https://aaos22.mapyourshow.com/8_0/explore/exhibitor-gallery.cfm?featured=false",
            wait_time=3,
            screenshot=True,
            callback=self.parse,
            dont_filter=True
        )

    def parse(self, response):
        # Collect the link to each exhibitor's detail page.
        books = response.xpath("//h3[@class='card-Title\nbreak-word\nf3\nmb1\nmt0']//a//@href").extract()

        for book in books:
            url = response.urljoin(book)
            yield Request(url, callback=self.parse_book)

    def parse_book(self, response):
        # This is the XPath in question: it prints None instead of the phone number.
        phone = response.xpath("//li[@class='dib  ml3  mr3'][2]").get()
        print(phone)

scrapy

Source: https://stackoverflow.com/questions/72872674/xpath-give-nothing

2 Answers


yruzcnhs1#

If you want to avoid relying on the index, this is one way to do it:

response.xpath("normalize-space(//*[starts-with(@class,'showcase-web-phone')]/li[./*[.='Phone:']]/span/following::text())").get()

2022-11-09

nxagd54h2#

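Assuming you are getting that HTML, you can adjust your XPath: select the <ul> by its class, then take its last <li>. Since the number is not inside the <span>, you have to ask for the span's following sibling: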

//ul[contains(@class,'showcase-web-phone')]/li[last()]/span/following-sibling::text()[1]

2022-11-09
