Open Datasets

The data.

Twelve first-party research datasets across the Deep Synthesis constellation. Direct JSON download. Schema.org Dataset markup on every entry. CC-BY 4.0 — reuse freely with attribution.

12
Datasets
71
Variables tracked
7
Topical clusters
CC-BY 4.0
License
AI Tools & Productivity

AI Tools & Productivity

3 datasets
Nesyona·2026-05-09

AI API Token Price Decay 2022-2026

Time-series of input/output token prices for 12 frontier model families across 32 versions, with cost-per-reference-task normalized per entry.

Variables
input_price_per_1m_usdoutput_price_per_1m_usdcost_per_reference_task_usdeffective_datecontext_window_tokens
BagEngine·2026-05-09

True Cost of Amazon Selling Tools 2026

Per-tool pricing matrix across the Amazon seller SaaS ecosystem (Helium 10, Jungle Scout, Keepa, et al.) with feature-tier delta and effective monthly cost.

Variables
tool_idvendortier_namemonthly_usdannual_usdfeature_countcategory
DeskDeploy·2026-05-09

Standing Desk Claims vs Evidence 2026

Per-claim evidence audit of common standing-desk health and productivity marketing claims, sourced to peer-reviewed and meta-analytic literature.

Variables
claim_idclaim_textevidence_qualityeffect_sizestudy_countmeta_analysis_present
Health & Pharma

Health & Pharma

3 datasets
Health Britannica·2026-05-09

Supplement Adverse Events: CAERS 2014-2024

FDA CAERS-derived adverse event reports for dietary supplements over a decade: ingredient category, severity, outcome, year. Modeled estimates.

Variables
report_idyearingredient_categoryseverityoutcomeage_band
OmniRx·2026-05-09

PBM Spread Pricing 2026

Pharmacy benefit manager spread economics: NADAC reference price vs reimbursed price vs plan-paid price across high-volume generics.

Variables
drug_ndcnadac_usdpbm_paid_usdplan_paid_usdspread_usdperiod
RxGrab·2026-05-09

Brand-to-Generic Patent Cliff 2026

Brand-name drug patent expirations 2024-2028, generic entry timing, and per-drug savings on substitution.

Variables
brand_namegeneric_namepatent_expiry_datefirst_generic_datebrand_price_usdgeneric_price_usdsavings_pct
Finance & Tax

Finance & Tax

2 datasets
CeoCult·2026-05-09

1099-K Threshold State Map 2026

State-by-state 1099-K reporting threshold map with federal and state divergence for marketplace sellers and gig-economy operators.

Variables
statefederal_threshold_usdstate_threshold_usdstate_transaction_counteffective_year
GrantProbe·2026-05-09

Federal Grant Reality 2026

Grants.gov-derived award rate, applicant volume, and median award size across federal grant programs relevant to small business and nonprofits.

Variables
program_idagencyapplicantsawardsaward_rate_pctmedian_award_usd
Creator Economy

Creator Economy

1 dataset
LensPOV·2026-05-09

YouTube CPM by Niche 2026

Per-niche YouTube CPM and RPM estimates synthesized from creator-shared earnings data, partner studies, and ad-network rate cards.

Variables
nichecpm_low_usdcpm_high_usdrpm_estimate_usdsample_sizeperiod
Education

Education

1 dataset
EduBracket·2026-05-09

Online Course ROI: Hours-per-Dollar 2026

Per-course economics across MOOC and bootcamp providers: hours of instruction per dollar, completion rate, and outcome signal.

Variables
course_idproviderprice_usdhours_of_instructioncompletion_rate_pctoutcome_signal
Pet Health

Pet Health

1 dataset
PetMaxxing·2026-05-09

Pet Prescription Drug Pricing 2026

Vet-pharmacy vs online-pharmacy pricing for common pet prescription drugs across species and dosage forms.

Variables
drug_namespeciesdosage_formvet_pharmacy_usdonline_pharmacy_usdsavings_pct
Design & Typography

Design & Typography

1 dataset
FilmFont·2026-05-09

Movie Title Font Frequency 2026

Font and typeface frequency across recent theatrical-release movie posters and title cards, with genre and decade splits.

Variables
font_familycountgenredecadestudio_tier
How to cite

Attribution under CC-BY 4.0

All twelve datasets are released under the Creative Commons Attribution 4.0 International license. You may copy, redistribute, remix, transform, and build upon the material for any purpose, including commercially, provided you give appropriate credit, provide a link to the license, and indicate if changes were made.

Suggested citation format for a single dataset:

Couey, V. W. (2026). [Dataset Name]. Deep Synthesis. https://[site]/research/[slug]/

For research-grade citation, prefer the per-study page (each carries full Dataset + ScholarlyArticle JSON-LD with methodology notes).