The Mine Works
Browse on Apify
← Latest posts
Notes

Page 2 of 5

Reddit Official API vs Reddit Scraper in 2025: Costs, Limits, and What You Actually Get
comparison Jun 22, 2026 · 8 min read

Reddit Official API vs Reddit Scraper in 2025: Costs, Limits, and What You Actually Get

Reddit changed its API pricing in 2023 to $0.24 per 1,000 calls. Here is what that means for data collection workloads, and how scraping compares on cost and data coverage.

The Mine Works
tutorial Jun 22, 2026 · 2 min read

How to Search SEC EDGAR Filings by Keyword (Full-Text Search API)

SEC EDGAR has a free full-text search API called EFTS. Learn how to search 10-K, 10-Q, and 8-K filings by keyword, filter by form type and date, and extract matched text with Python.

Socrata API: How to Pull CDC, HHS, NYC, and 200+ Government Data Portals
tutorial Jun 22, 2026 · 6 min read

Socrata API: How to Pull CDC, HHS, NYC, and 200+ Government Data Portals

Socrata powers data portals for the CDC, HHS, Chicago, New York City, Texas, and 200+ other government entities. One API, same query syntax, all of them.

The Mine Works
tutorial Jun 22, 2026 · 2 min read

How to Scrape Trustpilot Reviews by Company Domain (Python Guide)

Trustpilot has no public API for review data. Learn how to pull business reviews, star ratings, trust scores, and business replies from any Trustpilot company page using Python.

Threads Has No Public API: Here Is How to Get Profile and Post Data Anyway
tutorial Jun 22, 2026 · 7 min read

Threads Has No Public API: Here Is How to Get Profile and Post Data Anyway

Meta has not released a public Threads API. Here is what the data looks like, what fields are available via scraping, and how to collect it without getting blocked.

USASpending.gov API: How to Pull Federal Contracts, Grants, and Awards Programmatically
tutorial Jun 22, 2026 · 7 min read

USASpending.gov API: How to Pull Federal Contracts, Grants, and Awards Programmatically

USASpending.gov tracks every federal dollar spent. The API is public and free but the endpoint structure is non-obvious. Here is how to actually use it in Python.

World Bank API in Python 2025: GDP, Inflation, and 1,400 Indicators Without the SOAP Hell
tutorial Jun 22, 2026 · 8 min read

World Bank API in Python 2025: GDP, Inflation, and 1,400 Indicators Without the SOAP Hell

The World Bank has a REST API but it returns XML by default, uses quirky pagination, and has undocumented quirks. Here is how to actually use it in Python.

World Bank Trade Data API: How to Pull Global Import and Export Statistics
tutorial Jun 22, 2026 · 7 min read

World Bank Trade Data API: How to Pull Global Import and Export Statistics

The World Bank WITS database covers bilateral trade flows between 200+ countries. Here is how to access it programmatically and what the data actually contains.

Building a Legal & Regulatory Intelligence Pipeline with Court Records, Federal Rules, and Contract Data
use-case Jun 15, 2026 · 9 min read

Building a Legal & Regulatory Intelligence Pipeline with Court Records, Federal Rules, and Contract Data

Track case law, new federal regulations, and government contract awards automatically. A step-by-step guide to wiring three public-data scrapers into a

The Economic Data Stack: GDP, Trade Flows, and Open Government Data as Clean JSON
use-case Jun 13, 2026 · 7 min read

The Economic Data Stack: GDP, Trade Flows, and Open Government Data as Clean JSON

Build a macroeconomic intelligence pipeline from authoritative open data. World Bank indicators, bilateral trade flows

Building an Academic Research Data Stack: Crossref, OpenAlex, and Citation-Aware RAG
use-case Jun 11, 2026 · 6 min read

Building an Academic Research Data Stack: Crossref, OpenAlex, and Citation-Aware RAG

How to assemble a literature-review and research-intelligence pipeline from open scholarly data. Search 150M+ works, map citation networks

The Healthcare Data Stack: Providers, Clinical Trials, and FDA Safety Signals
use-case Jun 9, 2026 · 7 min read

The Healthcare Data Stack: Providers, Clinical Trials, and FDA Safety Signals

Build a healthcare intelligence pipeline from authoritative public data. Look up providers via the NPI Registry, track trials on ClinicalTrials.gov

Literature Reviews and R&D Intelligence at Scale with the OpenAlex Scraper
use-case May 21, 2026 · 2 min read

Literature Reviews and R&D Intelligence at Scale with the OpenAlex Scraper

Search 250M+ research papers from OpenAlex as structured JSON — authors, citations, venues and abstracts

Monitor Federal Regulations: A Compliance Watch with the Federal Register API
use-case May 7, 2026 · 2 min read

Monitor Federal Regulations: A Compliance Watch with the Federal Register API

Build an automated regulatory watch with the Federal Register Scraper — rules, proposed rules, notices and executive orders as structured JSON

Automate FDA Recall Monitoring for Drugs, Devices and Food
use-case Apr 23, 2026 · 3 min read

Automate FDA Recall Monitoring for Drugs, Devices and Food

Build an automated FDA recall watch with the openFDA enforcement data — drug, device and food recalls as structured JSON, filtered by classification

Build a Clinical Trial Pipeline Tracker with the ClinicalTrials.gov Scraper
use-case Apr 9, 2026 · 3 min read

Build a Clinical Trial Pipeline Tracker with the ClinicalTrials.gov Scraper

Track any drug, sponsor or indication across ClinicalTrials.gov as structured JSON — phases, sponsors, enrollment and sites

Federal Contract Intelligence: Track Government Awards with the USAspending API
use-case Mar 26, 2026 · 3 min read

Federal Contract Intelligence: Track Government Awards with the USAspending API

How to mine USAspending.gov for competitor wins, re-compete timing and B2G leads — using the USAspending Federal Awards Scraper.

Pull SEC Filings into a RAG Pipeline with Claude and the SEC EDGAR Scraper
tutorial Mar 12, 2026 · 4 min read

Pull SEC Filings into a RAG Pipeline with Claude and the SEC EDGAR Scraper

How to turn 10-K, 10-Q and 8-K filings into a clean, chunked, citation-grounded knowledge base an LLM can answer questions over