Build a Clinical Trial Pipeline Tracker with the ClinicalTrials.gov Scraper
Track any drug, sponsor or indication across ClinicalTrials.gov as structured JSON — phases, sponsors, enrollment and sites
The actor referenced in this article is live on Apify. Pay only for results delivered.
ClinicalTrials.gov is the global registry of clinical studies — over 480,000 trials, every sponsor, drug, phase and site. For pharma and biotech teams it is the canonical source for pipeline intelligence, but its API v2 returns deeply nested “protocol section” trees that are painful to work with. This guide turns it into a flat, trackable feed.
TL;DR: Use the ClinicalTrials.gov Scraper to pull studies filtered by condition, drug, sponsor, status and location — flattened to one clean record per trial with phase, enrollment, sponsor and sites. Track a competitor’s pipeline, monitor an indication, or build a recruitment list. No API key, zero charge on empty runs, first 25 studies free.
Why a flat feed changes everything
The raw API nests every field inside modules: identification, status, sponsor/collaborators, design, arms/interventions, contacts/locations, eligibility. To build a simple “what trials is Pfizer running in oncology, and what phase are they in?” view, you’d walk that tree for every record and handle token pagination. The scraper does it once and gives you columns you can sort and filter immediately.
Track a competitor’s pipeline
Pull every trial a sponsor is running, by status:
{
"sponsor": "Pfizer",
"status": ["RECRUITING", "ACTIVE_NOT_RECRUITING"],
"maxResults": 500
}
Each record returns the NCT ID, title, overall status, phase, conditions, interventions, enrollment, key dates and a study_url. Group by phases and you have a snapshot of where a competitor’s pipeline sits — how much is in Phase 1 versus Phase 3, and what’s about to read out.
Monitor an indication
Watch an entire disease area, not just one company:
{
"condition": "non-small cell lung cancer",
"intervention": "pembrolizumab",
"status": ["RECRUITING"],
"includeLocations": true
}
Turn on includeLocations and every study comes with its full list of sites — facility, city, country — which doubles as a map of where the trial activity (and the patients) are.
Build a recruitment or site-selection list
For a condition and geography, the recruiting trials and their sites are exactly the inputs a recruitment or site-selection team needs. Filter by location and status: ["RECRUITING"], and you have a sourced list of active sites in days, not weeks.
Pricing and reliability
The ClinicalTrials.gov Scraper is pay-per-result: first 25 studies free on every account, then $0.003 per study. Searches that return nothing are never charged. No API key — ClinicalTrials.gov’s API is fully open, and the actor flattens it into something you can actually use.
FAQ
Do I need an API key? No. The ClinicalTrials.gov API v2 is fully open.
How current is the data? Real-time — it reads the live registry, so newly posted and updated trials appear immediately.
Can I get every study site? Yes — turn on includeLocations for facility, city, state and country of every location.
Which statuses can I filter by? Recruiting, not yet recruiting, enrolling by invitation, active (not recruiting), completed, suspended, terminated, withdrawn and unknown.
Can I monitor a drug or sponsor over time? Yes — save an intervention or sponsor search as a task and schedule it; each run returns the latest matching trials.
Try the scraper referenced in this article — live on Apify, pay only for results.
Open clinicaltrials-scraper on Apify →Building a Legal & Regulatory Intelligence Pipeline with Court Records, Federal Rules, and Contract Data
Track case law, new federal regulations, and government contract awards automatically. A step-by-step guide to wiring three public-data scrapers into a
The Economic Data Stack: GDP, Trade Flows, and Open Government Data as Clean JSON
Build a macroeconomic intelligence pipeline from authoritative open data. World Bank indicators, bilateral trade flows
Building an Academic Research Data Stack: Crossref, OpenAlex, and Citation-Aware RAG
How to assemble a literature-review and research-intelligence pipeline from open scholarly data. Search 150M+ works, map citation networks