Twelve first-party research datasets across the Deep Synthesis constellation. Direct JSON download. Schema.org Dataset markup on every entry. CC-BY 4.0 — reuse freely with attribution.
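Because each dataset is a direct JSON download, a downloaded file can be checked against the field names listed in its catalog entry before use. The sketch below is illustrative only: it assumes the file is a top-level JSON array of record objects, which this page does not state, and the function name `missing_fields` and the sample records are hypothetical placeholders rather than real data. The field names are those listed for the token-price dataset.

```python
import json

# Field names as listed in the catalog entry for the token-price dataset.
EXPECTED_FIELDS = {
    "input_price_per_1m_usd",
    "output_price_per_1m_usd",
    "cost_per_reference_task_usd",
    "effective_date",
    "context_window_tokens",
}


def missing_fields(json_text, expected=EXPECTED_FIELDS):
    """Return {record_index: sorted missing field names} for incomplete records.

    Assumption (not confirmed by the catalog): the download is a JSON array
    of objects, one object per record.
    """
    records = json.loads(json_text)
    if not isinstance(records, list):
        raise ValueError("expected a top-level JSON array of records")
    problems = {}
    for index, record in enumerate(records):
        absent = expected - set(record)
        if absent:
            problems[index] = sorted(absent)
    return problems


# Placeholder records, not real prices: the second one lacks two fields.
sample = json.dumps([
    {name: None for name in EXPECTED_FIELDS},
    {"input_price_per_1m_usd": None, "effective_date": None,
     "context_window_tokens": None},
])
print(missing_fields(sample))
```

If the real files wrap their records in an outer object instead, the `json.loads` result would need to be unwrapped before the loop.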
Time-series of input/output token prices for 12 frontier model families across 32 versions, with cost-per-reference-task normalized per entry.
input_price_per_1m_usd, output_price_per_1m_usd, cost_per_reference_task_usd, effective_date, context_window_tokens
Per-tool pricing matrix across the Amazon seller SaaS ecosystem (Helium 10, Jungle Scout, Keepa, et al.) with feature-tier delta and effective monthly cost.
tool_id, vendor, tier_name, monthly_usd, annual_usd, feature_count, category
Per-claim evidence audit of common standing-desk health and productivity marketing claims, sourced to peer-reviewed and meta-analytic literature.
claim_id, claim_text, evidence_quality, effect_size, study_count, meta_analysis_present
FDA CAERS-derived adverse event reports for dietary supplements over a decade: ingredient category, severity, outcome, year. Modeled estimates.
report_id, year, ingredient_category, severity, outcome, age_band
Pharmacy benefit manager spread economics: NADAC reference price vs reimbursed price vs plan-paid price across high-volume generics.
drug_ndc, nadac_usd, pbm_paid_usd, plan_paid_usd, spread_usd, period
Brand-name drug patent expirations 2024-2028, generic entry timing, and per-drug savings on substitution.
brand_name, generic_name, patent_expiry_date, first_generic_date, brand_price_usd, generic_price_usd, savings_pct
State-by-state 1099-K reporting threshold map with federal and state divergence for marketplace sellers and gig-economy operators.
state, federal_threshold_usd, state_threshold_usd, state_transaction_count, effective_year
Grants.gov-derived award rate, applicant volume, and median award size across federal grant programs relevant to small business and nonprofits.
program_id, agency, applicants, awards, award_rate_pct, median_award_usd
Per-niche YouTube CPM and RPM estimates synthesized from creator-shared earnings data, partner studies, and ad-network rate cards.
niche, cpm_low_usd, cpm_high_usd, rpm_estimate_usd, sample_size, period
Per-course economics across MOOC and bootcamp providers: hours of instruction per dollar, completion rate, and outcome signal.
course_id, provider, price_usd, hours_of_instruction, completion_rate_pct, outcome_signal
Vet-pharmacy vs online-pharmacy pricing for common pet prescription drugs across species and dosage forms.
drug_name, species, dosage_form, vet_pharmacy_usd, online_pharmacy_usd, savings_pct
Font and typeface frequency across recent theatrical-release movie posters and title cards, with genre and decade splits.
font_family, count, genre, decade, studio_tier
All twelve datasets are released under the Creative Commons Attribution 4.0 International license. You may copy, redistribute, remix, transform, and build upon the material for any purpose, including commercially, provided you give appropriate credit, provide a link to the license, and indicate if changes were made.
Suggested citation format for a single dataset:
Couey, V. W. (2026). [Dataset Name]. Deep Synthesis. https://[site]/research/[slug]/
For research-grade citation, prefer the per-study page (each carries full Dataset + ScholarlyArticle JSON-LD with methodology notes).
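The suggested citation pattern can be filled in mechanically. This is a minimal sketch, assuming a hypothetical helper named `format_citation`. The site host and slug are left for the caller to supply because the template gives them only as placeholders, and the example arguments are invented purely to show the output shape.

```python
def format_citation(dataset_name, site, slug, year=2026, author="Couey, V. W."):
    """Fill the suggested single-dataset citation template."""
    # Strip stray slashes so the URL keeps the /research/<slug>/ shape.
    host = site.strip("/")
    path = slug.strip("/")
    return (
        f"{author} ({year}). {dataset_name}. Deep Synthesis. "
        f"https://{host}/research/{path}/"
    )


# Hypothetical name, host, and slug, used only to show the output shape.
print(format_citation("Example Dataset", "example.org", "example-dataset"))
```

Under CC BY 4.0 the attribution should also link to the license and note any changes made, so a string like this covers only the credit portion.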
Preprints, submitted work, in-progress papers across physics, chemistry, bionic vision.
View publications
Co-bylined branded research at the same methodological standard as the open catalog above. Custom topic, your methodology requirements, distribution included.
Sponsorship inquiry