Don’t let pagination stop you from extracting the custom data you need to fuel your marketing and sales. Extract it with Clay instead.
Clay (Free) to scrape and clean paginated data
Manually scraping data from large, paginated databases takes forever. And many web scrapers can’t navigate pagination, so they can’t do it automatically either.
Good news: this Claybook uses ZenRows and Clay’s custom formulas to scrape paginated databases in seconds.
How it works:
First, copy this template in Clay.
If you’re already a user, you’ll be directed to your workspace. If not, you’ll be prompted to create a free Clay account.
The interactive demos on the right will walk you through each step of this Claybook.
How many pages exist in the paginated database you want to scrape? Add the page numbers to the first column, giving each page its own row. We’ll use these page numbers in the next step to create dynamic URLs.
Next, so you don’t have to manually paste URLs into Clay, we’ll use a formula to add the page number to the database URL string automatically. Simply paste the URL of the database in the formula and insert the page number cell in the proper spot.
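For readers who think in code, the formula in this step amounts to simple string concatenation. Here is a minimal Python sketch, assuming a hypothetical database URL that takes the page number as a trailing `page` query parameter. Your database may put the page number elsewhere in the URL, and the function name is only illustrative.

```python
# Hypothetical base URL; replace it with the database you want to scrape.
BASE_URL = "https://example.com/directory?page="


def build_page_urls(base_url: str, page_count: int) -> list[str]:
    """Return one URL per page, numbered from 1 to page_count."""
    return [f"{base_url}{page}" for page in range(1, page_count + 1)]


urls = build_page_urls(BASE_URL, 3)
for url in urls:
    print(url)
```

Each printed URL corresponds to one row in the Clay table: the page number column supplies the part that changes, and the pasted URL supplies the rest.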
Once finished, run the ZenRows data scrape column.
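Clay handles the scrape for you, but it can help to see roughly what a scrape request looks like. The sketch below only builds a request URL and does not send it. The endpoint and the `apikey` and `url` parameter names are assumptions based on the public ZenRows API, so confirm them against the ZenRows documentation. The API key and target URL are placeholders.

```python
from urllib.parse import urlencode

# Assumed endpoint and parameter names; confirm against the ZenRows docs.
ZENROWS_ENDPOINT = "https://api.zenrows.com/v1/"


def zenrows_request_url(api_key: str, target_url: str) -> str:
    """Build the GET URL that asks the scraping API to fetch target_url."""
    # urlencode percent-encodes the target URL so it survives as one parameter.
    query = urlencode({"apikey": api_key, "url": target_url})
    return f"{ZENROWS_ENDPOINT}?{query}"


request_url = zenrows_request_url(
    "YOUR_API_KEY", "https://example.com/directory?page=2"
)
print(request_url)
```

The target URL has to be percent-encoded because it contains its own `?` and `=` characters, which would otherwise be read as part of the outer request.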
Next, to ensure we’re pulling usable data from often complex databases, use ChatGPT to clean the dataset.
Last, write your newly scraped data to a new table. Why? Each scraped page returns many records in a single cell, and writing to a new table is the only way to get every data point into its own column. In the new table, you can use this data for anything, like further enrichment or outbound messaging.
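Conceptually, writing to a new table flattens the per-page results into one row per record. A rough Python sketch of that idea follows, using made-up sample records and hypothetical column names. It is not how Clay implements the step.

```python
import csv
import io

# Hypothetical cleaned output: one list of records per scraped page.
cleaned_pages = [
    [{"name": "Acme", "city": "Austin"}, {"name": "Globex", "city": "Boston"}],
    [{"name": "Initech", "city": "Dallas"}],
]


def flatten_pages(pages):
    """Turn per-page lists of records into one flat list of rows."""
    return [record for page in pages for record in page]


def rows_to_csv(rows, columns):
    """Write rows to CSV text with one column per data point."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = flatten_pages(cleaned_pages)
table = rows_to_csv(rows, ["name", "city"])
print(table)
```

Two pages of nested records become three rows under a shared header, which is the shape you need for enrichment or outbound messaging.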
Start scraping paginated databases in seconds.
Simply copy this template, follow the steps above, and let Clay work its magic.
If you run into any issues, you can ask for help in Slack or schedule time with a Clay expert.
Check out Clay University, read our GTM blog, or try out our top templates to transform your growth ideas into outreach in minutes.