Unlock the Full Report

Enter your details to access premium pricing intelligence insights

Reference · Glossary

Glossary of Travel Data & Scraping Terms (A-Z)

By TravelScrape EngineeringUpdated June 2026Free · No paywall

Travel data has its own vocabulary, and mixing up the terms causes real confusion in projects. This A–Z reference from TravelScrape defines the words travel data and scraping teams use every day — clearly, in one place.

How to use this glossary

These are the core terms you'll meet across travel data scraping — from metrics like ADR and RevPAR to scraping concepts like proxies, rendering and rate limiting. Each definition is written to stand alone, so you can look up a single term or read the whole set.

Travel data & scraping terms (A–Z)

TermDefinition
ADRAverage Daily Rate — the average rental income per paid occupied room over a period. A core hotel revenue metric.
Anti-bot systemTechnology OTAs use to detect and block automated traffic — captchas, fingerprinting, behavioural analysis and rate limiting.
APIApplication Programming Interface — a sanctioned way for software to request data directly from a provider.
AvailabilityWhether a room, seat or product is bookable for a given date.
BackfillCollecting historical data after the fact to build a complete time series.
Batch scrapingCollecting data on a fixed schedule (e.g. nightly) rather than continuously in real time.
BotAn automated program that performs tasks online, such as visiting and reading web pages.
CaptchaA challenge designed to tell humans from bots, often triggered when a site suspects automation.
CrawlSystematically visiting many pages of a site, usually to discover URLs to scrape.
CSS selectorA rule that targets an element on a page so a scraper knows where a value sits.
Currency normalisationConverting prices to a common currency so they can be compared fairly.
Data pipelineThe end-to-end flow that collects, cleans, stores and delivers data.
De-duplicationRemoving repeated records so each observation appears once.
Dynamic pricingAdjusting prices automatically based on demand, competition and other signals.
FingerprintingIdentifying a visitor by browser and device characteristics, used to detect bots.
GDSGlobal Distribution System — networks like Amadeus and Sabre that distribute travel inventory to agents.
Geo-targetingCollecting from a specific country or city, since OTAs vary prices by location.
Headless browserA real browser run without a visible window, used to render JavaScript-heavy pages.
HTTP 403A 'Forbidden' response, often meaning an IP or request has been blocked.
HTTP 429A 'Too Many Requests' response, indicating rate limiting.
IngestionThe step where collected data enters your storage and validation layer.
IP banBlocking requests from a specific IP address that a site has flagged.
JSONA common structured data format used to deliver scraped data.
MetasearchSites like Kayak or Skyscanner that compare prices across multiple OTAs and suppliers.
Occupancy rateThe percentage of available rooms occupied over a period.
OTAOnline Travel Agency — a site that sells travel inventory online, e.g. Booking.com or Expedia.
ParsingReading raw page content and extracting the specific values you need.
ProxyAn intermediary server used to route requests through different IP addresses to avoid blocking.
Rate limitingA site's restriction on how many requests an IP can make in a time window.
Rate parityKeeping a hotel's price consistent across all the channels it is sold on.
Real-time scrapingCollecting data on demand, the moment it's needed, rather than on a schedule.
RenderingRunning a page's JavaScript so dynamic content (like prices) appears before extraction.
Residential proxyA proxy using a real consumer IP, harder for sites to detect than datacentre IPs.
RevPARRevenue Per Available Room — ADR multiplied by occupancy. A key hotel performance metric.
ScraperSoftware that automatically reads web pages and extracts structured data.
SelectorA rule (CSS or XPath) that tells a scraper where a piece of data sits on a page.
ThrottlingDeliberately slowing requests to stay within a site's limits.
User agentA string identifying the browser/device making a request; rotated to look natural.
ValidationChecking collected data for completeness and correctness before it's used.
XPathA path expression used to locate elements within a page's structure.

Want these terms in context?

Definitions are most useful alongside real workflows. See how the metrics and concepts above appear in practice in What Is OTA Scraping?, How to Structure Travel Data and Common Scraping Errors. Need a term we haven't covered? The TravelScrape team is happy to explain.

Want this data, not just the guide?

Get a free sample dataset from TravelScrape — hotel rates, flight prices or OTA listings for your market. Delivered by our engineering team, no commitment.