How to get the XPATH or CSS selector from dynamically loaded website to follow links?

Question

How to get the XPATH or CSS selector from dynamically loaded website to follow links?

68 views Asked by Raisul Islam At 21 October 2022 at 04:24

This is a dynamically-loaded website https://www.gelbeseiten.de/suche/hotels/n%c3%bcrnberg. I'm trying to follow every link from the results. I found //article[@class='mod mod-Treffer']/a to follow the search result links. But the problem is this XPATH works only for a couple of links. For the rest of the others, I don't find any Selector. Because the other are using probably JS to make this action. I'm not familiar with this kind of dynamic website. So, I don't know how to get the selector from this kind of website. Any suggestions will be highly appreciated.

Original Q&A

There are 1 answers

**Barry the Platipus** · Accepted Answer · 2022-10-21T07:51:15+00:00

I will post this as an answer, without actually giving you the code, as it might help you more in the long term.

First, load that page in browser with javascript disabled (there are ways with disabling js in browser directly, or use an extension like ublock origin, etc - look it up).

You will notice that only the first 2 hotels are fully loading - the rest are being loaded dynamically by javascript (which in this case is disabled). There are 13 hits for //article[@class='mod mod-Treffer']/a selector, while there are more hotels on that page. However, each hotel is wrapped in an <article> tag, and that tag has data-realid="[...]" attribute. The url for each hotel would be https://www.gelbeseiten.de/gsbiz/{data-realid}.

This is how you can get all those hotels' profile links.

TechQA.

How to get the XPATH or CSS selector from dynamically loaded website to follow links?

There are 1 answers

Related Questions in WEB-SCRAPING

Related Questions in XPATH

Related Questions in SCRAPY

Related Questions in CSS-SELECTORS

Related Questions in SCRAPINGHUB

Popular Questions

Trending Questions