Build a Clinical Trial Pipeline Tracker with the ClinicalTrials.gov Scraper

Track any drug, sponsor or indication across ClinicalTrials.gov as structured JSON — phases, sponsors, enrollment and sites

ClinicalTrials.gov is the global registry of clinical studies — over 480,000 trials, every sponsor, drug, phase and site. For pharma and biotech teams it is the canonical source for pipeline intelligence, but its API v2 returns deeply nested “protocol section” trees that are painful to work with. This guide turns it into a flat, trackable feed.

TL;DR: Use the ClinicalTrials.gov Scraper to pull studies filtered by condition, drug, sponsor, status and location — flattened to one clean record per trial with phase, enrollment, sponsor and sites. Track a competitor’s pipeline, monitor an indication, or build a recruitment list. No API key, zero charge on empty runs, first 25 studies free.

Why a flat feed changes everything

The raw API nests every field inside modules: identification, status, sponsor/collaborators, design, arms/interventions, contacts/locations, eligibility. To build a simple “what trials is Pfizer running in oncology, and what phase are they in?” view, you’d walk that tree for every record and handle token pagination. The scraper does it once and gives you columns you can sort and filter immediately.

Track a competitor’s pipeline

Pull every trial a sponsor is running, by status:

{
  "sponsor": "Pfizer",
  "status": ["RECRUITING", "ACTIVE_NOT_RECRUITING"],
  "maxResults": 500
}

Each record returns the NCT ID, title, overall status, phase, conditions, interventions, enrollment, key dates and a study_url. Group by phases and you have a snapshot of where a competitor’s pipeline sits — how much is in Phase 1 versus Phase 3, and what’s about to read out.

Monitor an indication

Watch an entire disease area, not just one company:

{
  "condition": "non-small cell lung cancer",
  "intervention": "pembrolizumab",
  "status": ["RECRUITING"],
  "includeLocations": true
}

Turn on includeLocations and every study comes with its full list of sites — facility, city, country — which doubles as a map of where the trial activity (and the patients) are.

Build a recruitment or site-selection list

For a condition and geography, the recruiting trials and their sites are exactly the inputs a recruitment or site-selection team needs. Filter by location and status: ["RECRUITING"], and you have a sourced list of active sites in days, not weeks.

Pricing and reliability

The ClinicalTrials.gov Scraper is pay-per-result: first 25 studies free on every account, then $0.003 per study. Searches that return nothing are never charged. No API key — ClinicalTrials.gov’s API is fully open, and the actor flattens it into something you can actually use.

FAQ

Do I need an API key? No. The ClinicalTrials.gov API v2 is fully open.

How current is the data? Real-time — it reads the live registry, so newly posted and updated trials appear immediately.

Can I get every study site? Yes — turn on includeLocations for facility, city, state and country of every location.

Which statuses can I filter by? Recruiting, not yet recruiting, enrolling by invitation, active (not recruiting), completed, suspended, terminated, withdrawn and unknown.

Can I monitor a drug or sponsor over time? Yes — save an intervention or sponsor search as a task and schedule it; each run returns the latest matching trials.

Build a Clinical Trial Pipeline Tracker with the ClinicalTrials.gov Scraper

Why a flat feed changes everything

Track a competitor’s pipeline

Monitor an indication

Build a recruitment or site-selection list

Pricing and reliability

FAQ

Building a Legal & Regulatory Intelligence Pipeline with Court Records, Federal Rules, and Contract Data

The Economic Data Stack: GDP, Trade Flows, and Open Government Data as Clean JSON

Building an Academic Research Data Stack: Crossref, OpenAlex, and Citation-Aware RAG