<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>AI Analytics — Technical writing</title>
    <link>https://ai-analytics.org/writing/</link>
    <atom:link href="https://ai-analytics.org/rss.xml" rel="self" type="application/rss+xml" />
    <description>Long-form technical notes from building intelligence infrastructure.</description>
    <language>en-us</language>
    <lastBuildDate>Thu, 18 Mar 2027 00:00:00 GMT</lastBuildDate>
    <item>
      <title>EPA Safe Drinking Water Act Site Visits: The Federal Record of Public Water System Inspections</title>
      <link>https://ai-analytics.org/writing/epa-sdwa-site-visits/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-sdwa-site-visits/</guid>
      <pubDate>Thu, 18 Mar 2027 00:00:00 GMT</pubDate>
      <description>EPA SDWA site visits cover ~433,150 public-water-system inspections via SDWIS/ECHO — sanitary surveys scored across eight evaluation areas (source water, treatment, distribution, storage, monitoring, management), the upstream half of the drinking-water compliance record.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA ICIS-Air: The Federal Database Behind Clean Air Act Stationary Source Compliance</title>
      <link>https://ai-analytics.org/writing/epa-icis-air/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-icis-air/</guid>
      <pubDate>Wed, 17 Mar 2027 00:00:00 GMT</pubDate>
      <description>EPA ICIS-Air covers ~279,262 Clean Air Act stationary sources via ECHO — Title V/synthetic-minor classification, permitted pollutants, compliance status, High Priority Violator flags, and full compliance evaluations, joinable to emissions and enforcement.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC Nutrition, Physical Activity, and Obesity: The Federal Surveillance Record of American Health Behavior</title>
      <link>https://ai-analytics.org/writing/cdc-npao-states/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-npao-states/</guid>
      <pubDate>Tue, 16 Mar 2027 00:00:00 GMT</pubDate>
      <description>The CDC Nutrition, Physical Activity, and Obesity dataset holds ~109,180 BRFSS-sourced rows — adult obesity, physical activity, and diet by state, year, and demographic (age/sex/race/income/education) — via the CDC Socrata API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Post-Acute Care Utilization: The Federal Database Behind Home Health, Hospice, and Skilled Nursing Spending</title>
      <link>https://ai-analytics.org/writing/cms-pac-utilization/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-pac-utilization/</guid>
      <pubDate>Mon, 15 Mar 2027 00:00:00 GMT</pubDate>
      <description>CMS Post-Acute Care utilization covers ~28,404 provider-by-measure rows across home health, hospice, and skilled nursing — episodes, days, beneficiaries, and Medicare payments under PDGM/PDPM/hospice per-diem, joinable to provider ownership and quality.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NVD CVE Database: The Federal Record of Every Known Software Vulnerability</title>
      <link>https://ai-analytics.org/writing/nvd-cves/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nvd-cves/</guid>
      <pubDate>Sun, 14 Mar 2027 00:00:00 GMT</pubDate>
      <description>The NIST NVD holds ~459,000 catalogued CVEs (current + archive) — CVE ID, CVSS v2/v3.1 severity, CWE weakness, affected CPE products, and references — joinable to CISA KEV for actively-exploited focus, via the NVD 2.0 REST API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Provider Ownership: The Federal Database Behind Private Equity in Nursing Homes, Home Health, and Hospice</title>
      <link>https://ai-analytics.org/writing/cms-provider-ownership/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-provider-ownership/</guid>
      <pubDate>Sat, 13 Mar 2027 00:00:00 GMT</pubDate>
      <description>CMS all-owners files (42 CFR 455.104) expose ~600,000 ownership records across Medicare nursing homes (280k), home health (101k), hospice (71k), and hospitals (147k) — owner role, percentage, PE/REIT flags, and a created-for-acquisition M&amp;A indicator.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SAM Exclusions and Debarments: The Federal List of Who Cannot Win Government Contracts</title>
      <link>https://ai-analytics.org/writing/sam-debarments/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sam-debarments/</guid>
      <pubDate>Fri, 12 Mar 2027 00:00:00 GMT</pubDate>
      <description>SAM.gov Exclusions lists ~64,400 active federal debarments and suspensions — governmentwide bars from contracts, grants, and loans under FAR 9.4 and 2 CFR 180, with excluding agency, cause, and dates, for vendor compliance screening.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA National Drug Code Directory: The Federal Index of Every US Drug Product</title>
      <link>https://ai-analytics.org/writing/fda-ndc/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-ndc/</guid>
      <pubDate>Thu, 11 Mar 2027 00:00:00 GMT</pubDate>
      <description>The FDA National Drug Code Directory indexes ~40,000 US drug products — NDC code, brand/generic name, labeler, dosage form, route, active ingredients, DEA schedule, and marketing dates — the join key to Medicare Part D, Medicaid, RxNorm, and openFDA drug data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FAA Airmen Certification Database: The Federal Record of Every US Pilot and Mechanic</title>
      <link>https://ai-analytics.org/writing/faa-airmen/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/faa-airmen/</guid>
      <pubDate>Wed, 10 Mar 2027 00:00:00 GMT</pubDate>
      <description>The FAA Airmen Certification Database (Releasable Airmen file) covers ~881,000 certificated pilots, mechanics, instructors, and dispatchers — certificate type/level, ratings, and medical class — joinable to NTSB accidents and FAA enforcement by airman ID.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FAA Aircraft Registry: The Federal Database Behind Every N-Numbered US Aircraft</title>
      <link>https://ai-analytics.org/writing/faa-aircraft/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/faa-aircraft/</guid>
      <pubDate>Tue, 09 Mar 2027 00:00:00 GMT</pubDate>
      <description>The FAA Aircraft Registry covers ~293,000 N-numbered civil aircraft — serial, manufacturer, model, year, registrant type (incl. Delaware trust/LLC ownership opacity), airworthiness class, and the Mode S hex code linking tail numbers to ADS-B flight tracking.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFTC Commitments of Traders: The Federal Database Behind Futures Market Positioning</title>
      <link>https://ai-analytics.org/writing/cftc-cot/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cftc-cot/</guid>
      <pubDate>Mon, 08 Mar 2027 00:00:00 GMT</pubDate>
      <description>CFTC Commitments of Traders covers ~98,000 market-week rows — Tuesday futures positions released each Friday, split by trader category (Legacy, Disaggregated, TFF) across crude, gold, grains, Treasuries, and equity-index contracts, via the CFTC Socrata API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA Device Classification Database: The Federal System Behind Every Medical Device Type</title>
      <link>https://ai-analytics.org/writing/fda-device-classifications/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-device-classifications/</guid>
      <pubDate>Sun, 07 Mar 2027 00:00:00 GMT</pubDate>
      <description>The FDA Product Classification database covers ~7,058 device types — 3-letter product code, risk class I/II/III, 21 CFR regulation, specialty panel, and 510(k)/PMA/De Novo pathway — the join key to clearances, approvals, MAUDE adverse events, and recalls.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Doctors and Clinicians: The Federal Database Behind Every Medicare Physician</title>
      <link>https://ai-analytics.org/writing/cms-doctors/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-doctors/</guid>
      <pubDate>Sat, 06 Mar 2027 00:00:00 GMT</pubDate>
      <description>The CMS Doctors and Clinicians file holds ~163,000 Medicare physician records — NPI, specialty, medical school, group and hospital affiliation, and Medicare assignment — joinable to Open Payments by NPI via the Provider Data API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA Enforcement Defendants: The Federal Database Behind 200,000 Environmental Cases</title>
      <link>https://ai-analytics.org/writing/epa-enforcement-defendants/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-enforcement-defendants/</guid>
      <pubDate>Fri, 05 Mar 2027 00:00:00 GMT</pubDate>
      <description>EPA ICIS holds 199,682 enforcement-case defendant records via ECHO — every party named in Clean Air Act, Clean Water Act, RCRA, and CERCLA cases, flagged for complaint vs settlement, joinable to the case file for penalties.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Form 144: The Federal Database Behind Insider Sales of Restricted and Control Stock</title>
      <link>https://ai-analytics.org/writing/sec-form-144/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-form-144/</guid>
      <pubDate>Thu, 04 Mar 2027 00:00:00 GMT</pubDate>
      <description>SEC Form 144 covers 1,681 insider notices of proposed sales of restricted and control stock under Rule 144 — seller, relationship, shares, aggregate value, and broker — machine-readable since the 2022 EDGAR e-filing mandate, pairing with Form 4 to measure follow-through.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC EDGAR Company Registry: The Federal Index That Resolves Every Public Company</title>
      <link>https://ai-analytics.org/writing/sec-companies/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-companies/</guid>
      <pubDate>Wed, 03 Mar 2027 00:00:00 GMT</pubDate>
      <description>The SEC EDGAR company registry maps 28,392 filers by Central Index Key — name, ticker, SIC, state of incorporation, exchange, and former names — the universal join key behind Form 4, 13F, N-PORT, Form D, and financial facts.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC N-PORT Mutual Fund Holdings: The Federal Database Behind Every Fund Portfolio Position</title>
      <link>https://ai-analytics.org/writing/sec-nport-holdings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-nport-holdings/</guid>
      <pubDate>Tue, 02 Mar 2027 00:00:00 GMT</pubDate>
      <description>SEC Form N-PORT holds 354,405 monthly fund-holding rows — every registered mutual fund and ETF portfolio position with value, percent of net assets, asset category, country, and Level 1/2/3 fair-value flags via EDGAR.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Schedule 13D Filings: The Federal Database Behind Activist Investor Stakes</title>
      <link>https://ai-analytics.org/writing/sec-schedule-13d/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-schedule-13d/</guid>
      <pubDate>Mon, 01 Mar 2027 00:00:00 GMT</pubDate>
      <description>SEC Schedule 13D/13G filings capture every 5 percent-plus beneficial-ownership stake — activist campaigns by Icahn, Elliott, Pershing Square, and Starboard, with target, percent of class, and Item 4 purpose, now on a 5-business-day deadline after the 2024 SEC amendments.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FRA Highway-Rail Grade Crossing Inventory: The Federal Database Behind 250,000 Railroad Crossings</title>
      <link>https://ai-analytics.org/writing/fra-grade-crossings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fra-grade-crossings/</guid>
      <pubDate>Sun, 28 Feb 2027 00:00:00 GMT</pubDate>
      <description>The FRA Highway-Rail Crossing Inventory covers 250,636 US crossings — DOT crossing ID, warning devices, train and traffic counts — paired with the crossing accident file behind Operation Lifesaver and the FHWA Section 130 program.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FMCSA Crash Data: The Federal Database Behind Large Truck and Bus Crashes</title>
      <link>https://ai-analytics.org/writing/fmcsa-crashes/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fmcsa-crashes/</guid>
      <pubDate>Sat, 27 Feb 2027 00:00:00 GMT</pubDate>
      <description>The FMCSA MCMIS crash file holds 258,057 reportable commercial-vehicle crashes — large trucks and buses, keyed to carrier USDOT number, feeding the CSA Crash Indicator BASIC and the Crash Preventability Determination Program.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA Pollutant Emissions: The Federal Database Behind 10 Million Facility-Level Air and Toxic Release Records</title>
      <link>https://ai-analytics.org/writing/epa-pollutant-emissions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-pollutant-emissions/</guid>
      <pubDate>Fri, 26 Feb 2027 00:00:00 GMT</pubDate>
      <description>EPA merges the National Emissions Inventory and Toxics Release Inventory into 10.4M facility-level records — one per facility per pollutant per year, keyed to the FRS Registry ID, covering criteria air pollutants and 188 hazardous air pollutants via Envirofacts and ECHO.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FMCSA Motor Carrier Census: The Federal Database Behind 2 Million Registered Trucking Companies</title>
      <link>https://ai-analytics.org/writing/fmcsa-carriers/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fmcsa-carriers/</guid>
      <pubDate>Thu, 25 Feb 2027 00:00:00 GMT</pubDate>
      <description>The FMCSA census covers 2.18M USDOT-registered carriers — trucking, bus, hazmat, brokers — with operating status, fleet size, MCS-150 data, and CSA/SMS safety scores across the 7 BASICs via the SAFER and QCMobile systems.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>IRS Exempt Organizations Business Master File: The Federal Record of 1.3 Million Tax-Exempt Nonprofits</title>
      <link>https://ai-analytics.org/writing/irs-exempt-organizations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/irs-exempt-organizations/</guid>
      <pubDate>Wed, 24 Feb 2027 00:00:00 GMT</pubDate>
      <description>The IRS EO BMF lists 1.26M tax-exempt organizations by EIN — 501(c) subsection, NTEE sector code, foundation type, ruling date, and coded asset/income ranges — the canonical census of the US nonprofit sector, published as free monthly bulk extracts.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA Food Enforcement Reports: The Federal Database Behind Food and Cosmetic Recalls</title>
      <link>https://ai-analytics.org/writing/fda-food-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-food-enforcement/</guid>
      <pubDate>Tue, 23 Feb 2027 00:00:00 GMT</pubDate>
      <description>openFDA Food Enforcement holds ~25,000 classified food and cosmetic recalls — Class I/II/III hazard, recall reason (undeclared allergens lead, then Listeria/Salmonella/E. coli), distribution footprint, and initiation-to-termination dates via the openFDA API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Occupational Employment and Wage Statistics: The Federal Database Behind Median Salary Data for Every US Occupation</title>
      <link>https://ai-analytics.org/writing/bls-oews/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-oews/</guid>
      <pubDate>Mon, 22 Feb 2027 00:00:00 GMT</pubDate>
      <description>The BLS OEWS survey covers 830 detailed occupations across every industry and metro area — employment counts and 10th through 90th percentile wage distributions for 1.1 million surveyed establishments, published annually each May.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFPB Consumer Complaint Database: The Federal Record Behind 3 Million Financial Product Complaints</title>
      <link>https://ai-analytics.org/writing/cfpb-complaints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cfpb-complaints/</guid>
      <pubDate>Sun, 21 Feb 2027 00:00:00 GMT</pubDate>
      <description>The CFPB complaint database contains 3 million+ consumer complaints since 2012 — mortgages, credit cards, student loans, debt collection, and credit reporting — with company response, resolution, and optional consumer narrative via the public REST API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FARA Foreign Agent Registrations: The Federal Database Behind Foreign Lobbying and Influence Disclosure</title>
      <link>https://ai-analytics.org/writing/fara-registrations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fara-registrations/</guid>
      <pubDate>Sat, 20 Feb 2027 00:00:00 GMT</pubDate>
      <description>The DOJ FARA database covers every US-based foreign agent registration — Saudi Arabia $450M+ since 2016, the Manafort/Podesta/Flynn Mueller-era enforcement surge, and 6-month supplemental statements with political contacts and disbursements via the eFARA bulk API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Form 4 Insider Trading: The Federal Database Behind Corporate Insider Stock Transactions</title>
      <link>https://ai-analytics.org/writing/sec-form-4/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-form-4/</guid>
      <pubDate>Fri, 19 Feb 2027 00:00:00 GMT</pubDate>
      <description>SEC Form 4 filings cover every open-market purchase and sale by corporate insiders — officers, directors, and 10%+ shareholders — with 4 million+ filings in EDGAR, 2-business-day filing requirement post-SOX 2002, and Rule 10b5-1 plan disclosure reformed in 2023.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NLRB Elections and Labor Enforcement Data: The Federal Database Behind Union Organizing and Unfair Labor Practice Cases</title>
      <link>https://ai-analytics.org/writing/nlrb-elections-labor-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nlrb-elections-labor-enforcement/</guid>
      <pubDate>Thu, 18 Feb 2027 00:00:00 GMT</pubDate>
      <description>The NLRB maintains a union representation election database (every supervised election since the 1930s) and a ULP case database — together covering Amazon ALU (2022), 400+ Starbucks petitions, UAW Volkswagen Chattanooga, and WGA/SAG-AFTRA AI bargaining cases.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NTSB Aviation Accident Database: The Federal Record Behind Every US Aircraft Accident Investigation</title>
      <link>https://ai-analytics.org/writing/ntsb-aviation-accidents/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ntsb-aviation-accidents/</guid>
      <pubDate>Wed, 17 Feb 2027 00:00:00 GMT</pubDate>
      <description>The NTSB aviation accident database holds 90,000+ accidents since 1962 — coded by aircraft type, phase of flight, probable cause, and injury severity — driving safety recommendations behind Colgan Air 3407, Southwest 1380, Boeing 737 MAX, and the Alaska Airlines door plug case.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NOAA Storm Events Database: The Federal Record Behind 50 Years of US Weather Disasters</title>
      <link>https://ai-analytics.org/writing/noaa-storm-events/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/noaa-storm-events/</guid>
      <pubDate>Tue, 16 Feb 2027 00:00:00 GMT</pubDate>
      <description>The NOAA NCEI Storm Events Database covers 48 weather event types back to 1950 — tornadoes (Joplin 2011/April 2011 Super Outbreak), hurricanes (Katrina/Harvey/Ian), floods, and heat waves — with property damage, crop damage, injuries, deaths, and event narratives for every US county.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USAID Foreign Assistance Data: Tracing $50 Billion in Annual US Development Spending</title>
      <link>https://ai-analytics.org/writing/usaid-foreign-assistance/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaid-foreign-assistance/</guid>
      <pubDate>Mon, 15 Feb 2027 00:00:00 GMT</pubDate>
      <description>ForeignAssistance.gov discloses $50B+ in annual US foreign assistance across 200 countries — covering economic development, global health (PEPFAR $7B+), security assistance, and MCC compacts — via a public API with country, year, and category filters.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NHTSA Vehicle Safety Complaints: The Federal Database Behind Auto Defect Investigations and Recalls</title>
      <link>https://ai-analytics.org/writing/nhtsa-vehicle-complaints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nhtsa-vehicle-complaints/</guid>
      <pubDate>Sun, 14 Feb 2027 00:00:00 GMT</pubDate>
      <description>The NHTSA complaints database contains 3 million+ consumer complaints covering brake failures, airbag malfunctions, fire risks, and steering defects — the primary data source driving defect investigations behind the Takata airbag recall (67M vehicles), Toyota unintended acceleration, and GM ignition switch cases.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA RCRA Hazardous Waste Data: The Federal Database Behind 400,000 Regulated Facilities</title>
      <link>https://ai-analytics.org/writing/epa-rcra-hazardous-waste/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-rcra-hazardous-waste/</guid>
      <pubDate>Sat, 13 Feb 2027 00:00:00 GMT</pubDate>
      <description>The EPA RCRA database tracks 400,000+ regulated hazardous waste facilities through a cradle-to-grave manifest system — from small quantity generators through ~1,500 permitted TSDFs — with compliance data via RCRAInfo and the EPA ECHO API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EIA Form 860: The Federal Database Behind Every US Power Plant and Electricity Generator</title>
      <link>https://ai-analytics.org/writing/eia-form-860-power-plants/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/eia-form-860-power-plants/</guid>
      <pubDate>Fri, 12 Feb 2027 00:00:00 GMT</pubDate>
      <description>EIA Form 860 captures data on 25,000+ generating units at 8,000+ US power plants in a mandatory annual survey — covering coal retirements, solar and wind additions, nuclear ownership, and interconnection queue data via the EIA Open Data API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NCES IPEDS: The Federal Database Behind Higher Education Statistics for 6,000 US Colleges</title>
      <link>https://ai-analytics.org/writing/nces-ipeds-higher-education/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nces-ipeds-higher-education/</guid>
      <pubDate>Thu, 11 Feb 2027 00:00:00 GMT</pubDate>
      <description>IPEDS covers 12 annual survey components across 6,000 Title IV-eligible institutions — enrollment by race and gender, CIP-code completions, graduation rates, faculty salaries, and endowment values, accessible via the Urban Institute Education Data Portal API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OFAC Civil Penalties: The Federal Database Behind Sanctions Violations and Treasury Enforcement</title>
      <link>https://ai-analytics.org/writing/ofac-civil-penalties/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ofac-civil-penalties/</guid>
      <pubDate>Wed, 10 Feb 2027 00:00:00 GMT</pubDate>
      <description>OFAC publishes every civil penalty settlement for sanctions violations — banks and corporations transacting with sanctioned countries, with BNP Paribas $963M, Standard Chartered $639M, and UniCredit $611M among the largest actions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>ORI Research Misconduct Database: The Federal Record Behind Scientific Fraud and Fabrication</title>
      <link>https://ai-analytics.org/writing/ori-research-misconduct/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ori-research-misconduct/</guid>
      <pubDate>Tue, 09 Feb 2027 00:00:00 GMT</pubDate>
      <description>The HHS Office of Research Integrity database covers every finding of fabrication, falsification, or plagiarism by PHS-funded researchers — hundreds of cases at major universities with debarment from federal funding.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USASpending Subawards: The Federal Database Behind Sub-Grant and Sub-Contract Flow Tracking</title>
      <link>https://ai-analytics.org/writing/usaspending-subawards/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaspending-subawards/</guid>
      <pubDate>Mon, 08 Feb 2027 00:00:00 GMT</pubDate>
      <description>USASpending subaward data tracks the flow of federal money beyond the prime awardee — sub-grants from universities to community organizations and sub-contracts from prime defense contractors to thousands of small suppliers.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FEC Super PAC and Dark Money Data: The Federal Database Behind Outside Political Spending</title>
      <link>https://ai-analytics.org/writing/fec-dark-money-super-pacs/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fec-dark-money-super-pacs/</guid>
      <pubDate>Sun, 07 Feb 2027 00:00:00 GMT</pubDate>
      <description>The FEC independent expenditure database covers Super PAC and outside group spending — $4 billion+ in the 2020 cycle, plus dark money via 501(c)(4) organizations including Crossroads GPS, Arabella Advisors, and Americans for Prosperity.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Congressional Voting Records: The Federal Database Behind Every House and Senate Roll Call Vote</title>
      <link>https://ai-analytics.org/writing/congress-voting-records/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/congress-voting-records/</guid>
      <pubDate>Sat, 06 Feb 2027 00:00:00 GMT</pubDate>
      <description>Congressional roll call vote data covers every recorded House and Senate vote back to 1789 — enabling DW-NOMINATE ideology scoring, party loyalty tracking, and 250 years of American legislative analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Grants.gov: The Federal Database Behind $500 Billion in Annual Federal Grant Opportunities</title>
      <link>https://ai-analytics.org/writing/grants-gov-federal-grants/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/grants-gov-federal-grants/</guid>
      <pubDate>Fri, 05 Feb 2027 00:00:00 GMT</pubDate>
      <description>Grants.gov lists every competitive federal grant from 26 grant-making agencies — covering $500 billion+ in annual awards to universities, state governments, nonprofits, and businesses.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA Drinking Water Violations: The Federal Database Behind Safe Drinking Water Act Enforcement</title>
      <link>https://ai-analytics.org/writing/epa-drinking-water-violations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-drinking-water-violations/</guid>
      <pubDate>Thu, 04 Feb 2027 00:00:00 GMT</pubDate>
      <description>The EPA SDWIS tracks every Safe Drinking Water Act violation by 150,000 public water systems — MCL exceedances, monitoring failures, and treatment technique violations including Flint and PFAS contamination.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Regulations.gov: The Federal Database Behind 25 Million Public Comments on US Rulemaking</title>
      <link>https://ai-analytics.org/writing/regulations-gov-dockets/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulations-gov-dockets/</guid>
      <pubDate>Wed, 03 Feb 2027 00:00:00 GMT</pubDate>
      <description>Regulations.gov hosts dockets for every significant federal regulation from 170+ agencies — 25 million public comments and supporting documents covering the entire notice-and-comment rulemaking process.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FHWA HPMS: The Federal Database Behind US Road Condition and Highway Performance Monitoring</title>
      <link>https://ai-analytics.org/writing/fhwa-hpms-highway-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fhwa-hpms-highway-data/</guid>
      <pubDate>Tue, 02 Feb 2027 00:00:00 GMT</pubDate>
      <description>The FHWA Highway Performance Monitoring System collects pavement condition ratings, traffic volumes, and lane miles for 4.1 million miles of public roads — driving federal highway funding formulas and infrastructure condition tracking.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FAA Civil Aviation Registry: The Federal Database Behind 700,000 Pilots and 300,000 Aircraft</title>
      <link>https://ai-analytics.org/writing/faa-civil-aviation-registry/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/faa-civil-aviation-registry/</guid>
      <pubDate>Mon, 01 Feb 2027 00:00:00 GMT</pubDate>
      <description>The FAA Civil Aviation Registry maintains the Airmen Certification Database covering 700,000 active pilots and the Aircraft Registration Database covering 300,000 registered civil aircraft.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOE EV Charging Station Data: The Federal Database Behind 180,000 US Alternative Fuel Stations</title>
      <link>https://ai-analytics.org/writing/doe-ev-charging-stations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/doe-ev-charging-stations/</guid>
      <pubDate>Sun, 31 Jan 2027 00:00:00 GMT</pubDate>
      <description>The DOE Alternative Fuels Station Locator database tracks every publicly accessible EV charging station, hydrogen, propane, and CNG outlet in the United States — 180,000+ stations with real-time DCFC status.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USGS Wind and Solar Energy Data: The Federal Database Behind US Renewable Energy Infrastructure</title>
      <link>https://ai-analytics.org/writing/usgs-wind-solar-energy/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usgs-wind-solar-energy/</guid>
      <pubDate>Sat, 30 Jan 2027 00:00:00 GMT</pubDate>
      <description>The USGS US Wind Turbine Database and Solar PV Database cover 72,000+ wind turbines and thousands of utility-scale solar installations with GPS coordinates, capacity ratings, hub heights, and rotor diameters.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SBA Loan Programs: The Federal Database Behind $50 Billion in Annual Small Business Financing</title>
      <link>https://ai-analytics.org/writing/sba-loan-programs/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sba-loan-programs/</guid>
      <pubDate>Fri, 29 Jan 2027 00:00:00 GMT</pubDate>
      <description>The SBA 7(a) and 504 loan guarantee programs back over $50 billion in small business financing per year — every loan disclosed in a public dataset covering borrower name, location, loan amount, lender, industry, and jobs supported.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>US Attorney Prosecution Data: The Federal Database Behind 80,000 Annual Criminal Cases</title>
      <link>https://ai-analytics.org/writing/usao-prosecution-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usao-prosecution-data/</guid>
      <pubDate>Thu, 28 Jan 2027 00:00:00 GMT</pubDate>
      <description>The 94 United States Attorneys offices prosecute every federal crime — drug trafficking, financial fraud, public corruption, terrorism, and violent crime — documenting over 80,000 criminal defendants per year in federal court.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SAMHSA Treatment Data: The Federal Database Behind Substance Abuse and Mental Health Program Statistics</title>
      <link>https://ai-analytics.org/writing/samhsa-treatment-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/samhsa-treatment-data/</guid>
      <pubDate>Wed, 27 Jan 2027 00:00:00 GMT</pubDate>
      <description>SAMHSA publishes the most comprehensive federal data on addiction treatment and mental health services — NSDUH survey, Treatment Episode Data Set covering 2 million annual admissions, and the National Mental Health Services Survey.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>PHMSA Pipeline Safety Data: The Federal Database Behind Gas and Liquid Pipeline Incidents</title>
      <link>https://ai-analytics.org/writing/phmsa-pipeline-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/phmsa-pipeline-safety/</guid>
      <pubDate>Tue, 26 Jan 2027 00:00:00 GMT</pubDate>
      <description>PHMSA maintains incident reports for every significant gas and liquid pipeline accident in the United States — spills, explosions, injuries, fatalities, and property damage across 2.7 million miles of US pipeline infrastructure.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC Foodborne Outbreak Database: The Federal Record Behind 25,000 Annual Illness Clusters</title>
      <link>https://ai-analytics.org/writing/cdc-foodborne-outbreaks/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-foodborne-outbreaks/</guid>
      <pubDate>Mon, 25 Jan 2027 00:00:00 GMT</pubDate>
      <description>The CDC Foodborne Disease Outbreak Surveillance System tracks every reported multi-person foodborne illness outbreak — pathogen, implicated food, setting, illness count, hospitalizations, and deaths.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OSHA 300A Injury Data: The Federal Database Behind Establishment-Level Workplace Injury Rates</title>
      <link>https://ai-analytics.org/writing/osha-300a-injury-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/osha-300a-injury-data/</guid>
      <pubDate>Sun, 24 Jan 2027 00:00:00 GMT</pubDate>
      <description>The OSHA 300A Summary data collects annual establishment-level injury and illness totals from 750,000 employers — TRC rates, DART rates, and industry benchmarks for every major US employer.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOJ Civil Rights Division: The Federal Database Behind Police Reform Consent Decrees and Civil Rights Enforcement</title>
      <link>https://ai-analytics.org/writing/doj-civil-rights/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/doj-civil-rights/</guid>
      <pubDate>Sat, 23 Jan 2027 00:00:00 GMT</pubDate>
      <description>The DOJ Civil Rights Division enforces federal civil rights laws through pattern-or-practice investigations, consent decrees, voting rights litigation, and fair housing enforcement.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USDA ERS Food Economics: The Federal Database Behind Farm Income, Food Prices, and Rural America</title>
      <link>https://ai-analytics.org/writing/usda-ers-food-economics/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usda-ers-food-economics/</guid>
      <pubDate>Fri, 22 Jan 2027 00:00:00 GMT</pubDate>
      <description>The USDA Economic Research Service publishes comprehensive federal data on food and agricultural economics — farm income, food price indices, food security measurements, and rural county classifications.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Medicare Part D Prescriber Data: The Federal Database Behind Drug Spending for 1 Million Providers</title>
      <link>https://ai-analytics.org/writing/cms-part-d-prescribers/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-part-d-prescribers/</guid>
      <pubDate>Thu, 21 Jan 2027 00:00:00 GMT</pubDate>
      <description>CMS publishes annual Medicare Part D prescriber-level drug spending data for every provider who prescribed drugs covered under Medicare — opioid outlier identification, specialty drug spending, and fraud detection.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DEA Registrant Enforcement: The Federal Database Behind Controlled Substance License Revocations</title>
      <link>https://ai-analytics.org/writing/dea-registrant-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dea-registrant-enforcement/</guid>
      <pubDate>Wed, 20 Jan 2027 00:00:00 GMT</pubDate>
      <description>The DEA publishes every order to show cause, immediate suspension order, and final order revoking a DEA registration — the controlled substance prescribing licenses held by physicians, pharmacies, and distributors.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NRC Reactor Oversight Process: The Federal Database Behind Nuclear Plant Safety Ratings</title>
      <link>https://ai-analytics.org/writing/nrc-reactor-oversight/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nrc-reactor-oversight/</guid>
      <pubDate>Tue, 19 Jan 2027 00:00:00 GMT</pubDate>
      <description>The NRC Reactor Oversight Process evaluates every US commercial nuclear power plant across seven safety cornerstones — performance indicators, inspection findings, and action matrix dispositions from monitoring to shutdown.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFTC Enforcement Actions: The Federal Database Behind Commodity Market Fraud Penalties</title>
      <link>https://ai-analytics.org/writing/cftc-enforcement-actions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cftc-enforcement-actions/</guid>
      <pubDate>Mon, 18 Jan 2027 00:00:00 GMT</pubDate>
      <description>The CFTC enforcement database covers every civil action for violations of the Commodity Exchange Act — LIBOR manipulation, spoofing, FX rigging, and crypto asset fraud including FTX $12.7B and Binance $2.7B.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>HMDA Mortgage Lending Data: The Federal Database Behind 15 Million Annual Mortgage Applications</title>
      <link>https://ai-analytics.org/writing/hmda-mortgage-lending/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hmda-mortgage-lending/</guid>
      <pubDate>Sun, 17 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Home Mortgage Disclosure Act requires every US mortgage lender to report every loan application — applicant race, income, property location, loan amount, interest rate, action taken, and denial reason.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Hospital Cost Reports: The Federal Database Behind Hospital Financial Data for 6,000 US Facilities</title>
      <link>https://ai-analytics.org/writing/cms-hospital-cost-reports/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-hospital-cost-reports/</guid>
      <pubDate>Sat, 16 Jan 2027 00:00:00 GMT</pubDate>
      <description>The CMS HCRIS database contains detailed financial and utilization data for every Medicare-participating hospital — revenues, costs, charges, staffing, beds, and patient days.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FEC Campaign Finance Enforcement: The Federal Database Behind Matters Under Review</title>
      <link>https://ai-analytics.org/writing/fec-enforcement-murs/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fec-enforcement-murs/</guid>
      <pubDate>Fri, 15 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Federal Election Commission Matters Under Review database tracks every campaign finance complaint and enforcement action — contribution limit violations, disclosure failures, and foreign national contributions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>IRS Criminal Investigation: The Federal Database Behind Tax Fraud and Financial Crime Prosecutions</title>
      <link>https://ai-analytics.org/writing/irs-criminal-investigation/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/irs-criminal-investigation/</guid>
      <pubDate>Thu, 14 Jan 2027 00:00:00 GMT</pubDate>
      <description>IRS Criminal Investigation is the only federal agency with jurisdiction over federal tax crimes — 2,500-3,000 criminal cases per year with a 90%+ conviction rate.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC NNDSS: The Federal Database Behind Reportable Disease Surveillance in the United States</title>
      <link>https://ai-analytics.org/writing/cdc-nndss-notifiable-diseases/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-nndss-notifiable-diseases/</guid>
      <pubDate>Wed, 13 Jan 2027 00:00:00 GMT</pubDate>
      <description>The National Notifiable Diseases Surveillance System aggregates case reports for 120+ nationally notifiable diseases — salmonellosis, Lyme disease, HIV, hepatitis, measles, and emerging threats.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OSHA Violations Database: The Federal Record of 200,000 Annual Workplace Safety Citations</title>
      <link>https://ai-analytics.org/writing/osha-violations-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/osha-violations-data/</guid>
      <pubDate>Tue, 12 Jan 2027 00:00:00 GMT</pubDate>
      <description>The OSHA enforcement database contains every citation issued after a workplace inspection — violation type, penalty amount, standard violated, and abatement status.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>GAO Reports Database: The Congressional Watchdog Behind 900 Annual Federal Audits</title>
      <link>https://ai-analytics.org/writing/gao-reports-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/gao-reports-database/</guid>
      <pubDate>Mon, 11 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Government Accountability Office publishes 900+ reports per year — audits, investigations, and evaluations of federal programs covering every agency and department.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FCC Universal Licensing System: The Federal Database Behind Every US Radio License</title>
      <link>https://ai-analytics.org/writing/fcc-universal-licensing/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fcc-universal-licensing/</guid>
      <pubDate>Sun, 10 Jan 2027 00:00:00 GMT</pubDate>
      <description>The FCC Universal Licensing System contains every active radio license in the United States — over 10 million active licenses across AM/FM broadcast, cellular carriers, satellite operators, and amateur radio.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>UFLPA Entity List: The Federal Database Behind Uyghur Forced Labor Supply Chain Enforcement</title>
      <link>https://ai-analytics.org/writing/uflpa-entity-list/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/uflpa-entity-list/</guid>
      <pubDate>Sat, 09 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Uyghur Forced Labor Prevention Act Entity List identifies companies whose goods are presumed to be produced with Uyghur forced labor in Xinjiang.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FinCEN BSA Enforcement: The Federal Database Behind Anti-Money Laundering Civil Penalties</title>
      <link>https://ai-analytics.org/writing/fincen-bsa-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fincen-bsa-enforcement/</guid>
      <pubDate>Fri, 08 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Financial Crimes Enforcement Network publishes every Bank Secrecy Act civil enforcement action against banks, money services businesses, and cryptocurrency exchanges.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SAM.gov Exclusions: The Federal Database Behind Government Contractor Debarments</title>
      <link>https://ai-analytics.org/writing/sam-gov-exclusions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sam-gov-exclusions/</guid>
      <pubDate>Thu, 07 Jan 2027 00:00:00 GMT</pubDate>
      <description>The System for Award Management exclusions database lists every individual and entity barred from receiving federal contracts, grants, and other financial assistance.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FHWA National Bridge Inventory: The Federal Database Behind 620,000 US Bridge Inspections</title>
      <link>https://ai-analytics.org/writing/fhwa-national-bridge-inventory/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fhwa-national-bridge-inventory/</guid>
      <pubDate>Wed, 06 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Federal Highway Administration National Bridge Inventory collects biennial condition ratings for every highway bridge in the United States — 620,000 bridges.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NIH Research Portfolio: The Federal Database Behind $50 Billion in Annual Biomedical Grants</title>
      <link>https://ai-analytics.org/writing/nih-research-grants/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nih-research-grants/</guid>
      <pubDate>Tue, 05 Jan 2027 00:00:00 GMT</pubDate>
      <description>The NIH Research Portfolio Online Reporting Tools database covers every NIH-funded research project since 1985 — 500,000+ active and historical grants totaling over $50 billion per year.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USDA SNAP Program Data: The Federal Database Behind $100 Billion in Food Assistance</title>
      <link>https://ai-analytics.org/writing/usda-snap-program/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usda-snap-program/</guid>
      <pubDate>Mon, 04 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Supplemental Nutrition Assistance Program is the largest US food assistance program — 42 million participants, $100 billion in annual benefits, and one of the largest automatic stabilizers in the federal budget.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FEMA Disaster Declarations: The Federal Database Behind 70 Years of US Natural Disasters</title>
      <link>https://ai-analytics.org/writing/fema-disaster-declarations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fema-disaster-declarations/</guid>
      <pubDate>Sun, 03 Jan 2027 00:00:00 GMT</pubDate>
      <description>The FEMA disaster declaration database records every major disaster, emergency, and fire management assistance declaration since 1953 — over 4,600 major disaster declarations.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Hospital Compare: The Federal Database Behind Quality Ratings for 5,000 US Hospitals</title>
      <link>https://ai-analytics.org/writing/cms-hospital-compare/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-hospital-compare/</guid>
      <pubDate>Sat, 02 Jan 2027 00:00:00 GMT</pubDate>
      <description>The CMS Hospital Compare program publishes readmission rates, patient safety indicators, HCAHPS patient satisfaction scores, and payment data for every Medicare-certified hospital in the United States.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL OFLC Visa Disclosures: The Federal Database Behind H-1B, H-2A, and H-2B Wage Records</title>
      <link>https://ai-analytics.org/writing/dol-oflc-h1b/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-oflc-h1b/</guid>
      <pubDate>Fri, 01 Jan 2027 00:00:00 GMT</pubDate>
      <description>The Department of Labor Office of Foreign Labor Certification publishes every H-1B Labor Condition Application, H-2A agricultural temporary worker certification, and H-2B non-agricultural temporary worker certification.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>PACER Federal Courts: The Database Behind 1 Billion Federal Court Documents</title>
      <link>https://ai-analytics.org/writing/pacer-federal-courts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/pacer-federal-courts/</guid>
      <pubDate>Thu, 31 Dec 2026 00:00:00 GMT</pubDate>
      <description>The Public Access to Court Electronic Records system holds dockets and documents for every federal district, bankruptcy, and appellate case filed since the 1980s — over 1 billion documents accessible via the CourtListener API and RECAP mirror.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BIS Export Enforcement: The Federal Database Behind US Export Control Violations</title>
      <link>https://ai-analytics.org/writing/bis-export-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bis-export-enforcement/</guid>
      <pubDate>Wed, 30 Dec 2026 00:00:00 GMT</pubDate>
      <description>The Bureau of Industry and Security export enforcement records cover every administrative settlement, denial order, and criminal referral for violations of US export control law — the Export Administration Regulations governing dual-use technology exports.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Treasury Daily Treasury Statement: The Federal Database Behind the US Government Daily Cash Position</title>
      <link>https://ai-analytics.org/writing/treasury-dts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/treasury-dts/</guid>
      <pubDate>Tue, 29 Dec 2026 00:00:00 GMT</pubDate>
      <description>The Daily Treasury Statement reports the federal government cash position every business day — receipts, outlays, and the operating cash balance — the most granular real-time fiscal data available from the US government.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census SAIPE: The Federal Database Behind County-Level Poverty and Income Estimates</title>
      <link>https://ai-analytics.org/writing/census-saipe/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-saipe/</guid>
      <pubDate>Mon, 28 Dec 2026 00:00:00 GMT</pubDate>
      <description>The Small Area Income and Poverty Estimates program produces annual county-level income and poverty statistics used to allocate $16 billion in Title I-A education funding.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EEOC Discrimination Charges: The Federal Database Behind 80,000 Annual Workplace Bias Claims</title>
      <link>https://ai-analytics.org/writing/eeoc-charges/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/eeoc-charges/</guid>
      <pubDate>Sun, 27 Dec 2026 00:00:00 GMT</pubDate>
      <description>The EEOC charge database tracks every workplace discrimination complaint filed with the federal government — race, sex, disability, age, religion — from first filing through litigation outcome.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA FAERS: The Federal Adverse Event Reporting Database Behind Drug Safety Surveillance</title>
      <link>https://ai-analytics.org/writing/fda-adverse-events/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-adverse-events/</guid>
      <pubDate>Sat, 26 Dec 2026 00:00:00 GMT</pubDate>
      <description>The FDA Adverse Event Reporting System contains every post-market drug safety report submitted since 1968 — manufacturer reports, voluntary consumer reports, and FDA-initiated reports — totaling over 26 million case submissions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NHTSA FARS: The Federal Database Behind Every US Traffic Fatality Since 1975</title>
      <link>https://ai-analytics.org/writing/dot-fars/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dot-fars/</guid>
      <pubDate>Fri, 25 Dec 2026 00:00:00 GMT</pubDate>
      <description>The Fatality Analysis Reporting System is a census of every motor vehicle crash in the United States resulting in death — 50 years of data, 2 million fatalities, and the primary evidence base for federal highway safety policy.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CPSC Recalls: The Federal Database Behind 50 Years of Consumer Product Safety Recalls</title>
      <link>https://ai-analytics.org/writing/cpsc-recalls/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cpsc-recalls/</guid>
      <pubDate>Thu, 24 Dec 2026 00:00:00 GMT</pubDate>
      <description>CPSC (Consumer Product Safety Act 1972): ~9,800 recalls since 1973 covering ~15,000 product types (excludes food, drugs, autos, firearms). Section 15 voluntary (negotiated, most common) vs. Section 9 mandatory recalls; 24-hour reporting obligation for substantial product hazards. CPSIA 2008 (Chinese toy lead paint scandal 2007): third-party testing mandates, CPC/GCC certificates, 100 ppm lead limits, phthalate limits, tracking labels. SaferProducts.gov incident reporting database (NEISS-AIP, CPSC hospital sentinel network). Recall delays: average 12-18 months first incident to recall. Notable: Fisher-Price Rock n Play sleeper (4.7M units, 32 infant deaths, 2019), IKEA MALM dresser tip-over (29M units North America, 2016/2022), Peloton Tread+ (125k units, 2021), Samsung Galaxy Note 7 (2.5M units, 2016), Takata airbags (67M+ airbags, 19+ deaths, 2014-2019, NHTSA-led). recalls.gov/api and cpsc.gov/data: recallID/recallDate/title/description/hazard/remedy/units/productCategory/injuries/deaths fields; API parameters: product_type_id/date_from/date_to. Furniture stability mandatory rule (2023) targeting tip-overs. Safe Sleep for Babies Act (2022): banned inclined sleepers, crib bumpers. Python recalls.gov API analysis: hazard-type aggregation, product-category units recalled, 2015-2024 annual trend, fatal recall identification.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>ClinicalTrials.gov: The Federal Database Behind 500,000 Clinical Trials and Drug Approval Research</title>
      <link>https://ai-analytics.org/writing/clinical-trials/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/clinical-trials/</guid>
      <pubDate>Wed, 23 Dec 2026 00:00:00 GMT</pubDate>
      <description>ClinicalTrials.gov (NLM, launched February 2000 per FDAMA 1997): 500,000+ registered studies as of 2024. FDAAA 801 (2007): mandatory registration within 21 days of first enrollment for applicable clinical trials (ACTs -- Phase 2+ interventional trials of FDA-regulated drugs/biologics/devices); results reporting within 12 months of primary completion date; penalties up to $10,000/day; NIH grant withholding; but 2015 NEJM study found only 13% reporting on time. Study phases: Phase 0 (microdosing), Phase 1 (safety, 20-80 participants), Phase 2 (efficacy signal, 100-300), Phase 3 (pivotal RCTs, FDA approval basis), Phase 4 (post-marketing); observational (cohort/case-control/cross-sectional) studies phased differently. Key fields: NCT number, official title, brief summary, sponsor type (industry ~50%, NIH/federal ~20%, academic ~30%), study status, phase, allocation, intervention model, masking, primary completion date, enrollment, primary outcome measures, eligibility criteria (inclusion/exclusion), age range, gender, MeSH condition terms, intervention type (drug/device/behavioral/procedure). Disease area composition: oncology ~35%, diabetes/cardiology/psychiatry/ID follow. COVID-19 surge: ~11,000 COVID trials 2020-2021. Publication bias (file drawer problem): AllTrials campaign, Ben Goldacre, COMPARE project, RIAT. ClinicalTrials.gov API v2 at clinicaltrials.gov/api/v2/studies: no API key, pagination by pageSize/pageToken, protocolSection/resultsSection/statusModule/conditionsModule modules. Aggregate stats: ~40% completed, ~25% recruiting, ~15% terminated. Python API query: recruiting Phase 3 oncology trials by enrollment (top 10) + phase distribution for all cancer trials.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census Current Population Survey: The Federal Database Behind the Official US Poverty and Unemployment Rates</title>
      <link>https://ai-analytics.org/writing/census-cps/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-cps/</guid>
      <pubDate>Tue, 22 Dec 2026 00:00:00 GMT</pubDate>
      <description>CPS (Census/BLS joint survey since 1940): ~60,000 housing units/month (4-8-4 rotation group design); reference week containing the 12th. Labor force classifications: employed (1+ hour for pay/profit during reference week), unemployed (no work + active search past 4 weeks + currently available), not in labor force. U-1 through U-6 supplemental measures: U-3 = official rate, U-6 = total underemployment (unemployed + marginally attached + part-time for economic reasons); COVID-19 peak April 2020 U-3 14.7% / U-6 22.9%. Annual ASEC supplement (March, expanded ~100,000 households): official poverty rate (48 Orshansky thresholds by family size/composition; 2023 family-of-4 threshold ~$30,900; 2023 poverty rate ~11.1%, ~36M people); health insurance coverage; SPM (Supplemental Poverty Measure, 2011: counts SNAP/housing subsidies/EITC, subtracts taxes/work expenses/medical costs, geographic cost adjustment -- lower poverty for working-age, higher for elderly). CPS vs. CES/QCEW: residence-based (where people live) vs. establishment-based (where jobs are); CPS includes agricultural/domestic/self-employed not in QCEW. CPS microdata fields: PWSSWGT/PRTAGE/PESEX/PRDTRACE/PEHSPNON/PEEDUCA/PEMLR/PRUNTYPE/PRERNWA/OFFPOV/POVLL/PRCITSHP. IPUMS-CPS harmonized microdata back to 1962; raw files at census.gov/data/datasets; FRED: UNRATE/U6RATE/CIVPART/LNS11000000; BLS LAUS for state-level unemployment. Python FRED API + BLS LAUS API: state unemployment/poverty/LFPR table with YoY change.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DEA ARCOS: The Federal Opioid Distribution Database Behind 380 Million Pill Shipment Transactions</title>
      <link>https://ai-analytics.org/writing/arcos-opioid-distribution/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/arcos-opioid-distribution/</guid>
      <pubDate>Mon, 21 Dec 2026 00:00:00 GMT</pubDate>
      <description>DEA ARCOS (Automation of Reports and Consolidated Orders System): mandatory reporting under 21 USC 827 + 21 CFR 1304.33 for all manufacturers/distributors/importers of Schedule I/II controlled substances. 380M individual opioid transaction records 2006-2014 (oxycodone, hydrocodone, fentanyl, morphine, hydromorphone, methadone, oxymorphone, buprenorphine). Transaction fields: reporter DEA number, buyer DEA number, drug code, drug name, dosage unit, quantity, transaction date, transaction type (S=sale, P=purchase, T=theft/loss, R=return). MDL 2804 (In re: National Prescription Opiate Litigation, Judge Polster, NDOH): July 2019 court order released ARCOS data to Washington Post and HD Media -- first-ever public transaction-level release. Key findings: 76B oxycodone/hydrocodone pills shipped 2006-2014; WV ~780 pills/person/year; Mingo County WV: 3.3M hydrocodone pills over 2 years for 25,000 people; McKesson, Cardinal Health, AmerisourceBergen (Big Three) distributed 44% of all opioids. Suspicious order monitoring failure: 21 CFR 1301.74(b) requires reporting unusual orders; DEA settlements: McKesson $150M + registration surrenders 2017, AmerisourceBergen $150M 2017, Cardinal Health $44M. Purdue Pharma: OxyContin 1996, $634M 2007 plea, $8.34B 2020 settlement, Sacklers $6B; Mallinckrodt $1.6B. Big Three civil settlement $21B (2022); J&amp;J $5B; Walgreens $5.7B; CVS $5B; Walmart $3.1B; total settlements $55B+. Washington Post bulk download at WaPo arcos-database pages; arcos R package. Python WaPo bulk TSV download: pills-per-capita by county for oxycodone/hydrocodone, top distributors, annual trend.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL UI Claims: The Federal Database Behind Weekly US Unemployment Statistics Since 1967</title>
      <link>https://ai-analytics.org/writing/dol-ui-claims/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-ui-claims/</guid>
      <pubDate>Sun, 20 Dec 2026 00:00:00 GMT</pubDate>
      <description>DOL ETA weekly UI claims (Thursday 8:30am): initial claims SA (ICSA) + continuing claims SA (CCSA/CC4WSA). 53 jurisdictions: 50 states + DC + PR + VI. COVID peak: 6.9M initial claims week of April 4 2020 (prior record 695k, Oct 1982); continuing claims peak 24.9M May 2020. CARES Act PUA extended to gig/self-employed. Regular state UI: typically 26 weeks; federal-state Extended Benefits at 6.5%/8% insured unemployment rate trigger. State benefit max: Mississippi $235/wk to Massachusetts $1,050/wk. Recipiency rate ~27% of unemployed in normal times. FRED series: ICSA, ICNSA, CCSA, CC4WSA at fred.stlouisfed.org; DOL ETA-539/5159 forms; DOL bulk at oui.doleta.gov/unemploy/claims.asp. BLS UI-vs-CPS distinction: UI = administrative benefit recipients vs. CPS = household survey unemployed. Python FRED API ICSA 2019-present + COVID peak detection + 52-week rolling average.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Nursing Home Compare: The Federal Database Behind Quality Ratings for 14,700 US Nursing Homes</title>
      <link>https://ai-analytics.org/writing/cms-nursing-homes/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-nursing-homes/</guid>
      <pubDate>Sat, 19 Dec 2026 00:00:00 GMT</pubDate>
      <description>CMS Five-Star Quality Rating: ~15,000 Medicare/Medicaid-certified nursing homes, ~1.35M residents, ~$90k-105k/yr private pay. Three domains: Health Inspections (standard annual + complaint surveys; F-tag deficiency system F600-F999; scope/severity matrix A-L; immediate jeopardy J-L), Staffing (Payroll-Based Journal PBJ quarterly since 2017: RN hours/resident day, total nurse hours/resident day, weekend staffing), Quality Measures (MDS 3.0 derived: long-stay high-risk pressure ulcers, falls with major injury, antipsychotic use in dementia, UTI; short-stay pressure ulcer + improved function). Special Focus Facilities (SFF): ~90 facilities with persistent serious quality problems; ~400 on SFF Candidate list; monthly CMS publication; decertification risk. Ownership transparency: Form CMS-855A; private equity association with lower staffing (Braun 2021, Harrington 2020); large chains: ManorCare/ProMedica (~250 facilities), Genesis Healthcare. data.cms.gov datasets: Provider Information (CMS_Certified_Nursing_Facilities.csv), Health Deficiencies, Quality Measures, Staffing, Penalties (CMPs). Socrata API, no key required. Python Provider Info CSV analysis: star distribution, SFF flags, average staffing by star rating, top-10 states by 1-star share.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS QCEW: The Federal Database Behind US Payroll Data for Every Industry and County</title>
      <link>https://ai-analytics.org/writing/bls-qcew/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-qcew/</guid>
      <pubDate>Fri, 18 Dec 2026 00:00:00 GMT</pubDate>
      <description>BLS QCEW (Quarterly Census of Employment and Wages): joint BLS-state partnership using UI administrative tax records. ~11M establishment records/quarter, ~95% of all US civilian employment. Excludes self-employed, military, elected officials, railroad (RRB), some agricultural. Key fields: area_fips (2-digit state, 5-digit county, MSA, US), industry_code (NAICS 2-6 digit), own_code (0=total, 1=federal, 2=state, 3=local, 5=private), disclosure_code (N=suppressed when &lt;3 establishments or 1 employer &gt;80% wages), avg_weekly_wage, month1/2/3_emplvl, total_qtrly_wages, taxable_qtrly_wages. Geographic coverage: national, 51 states+DC, 3,200+ counties, 380+ MSAs. QCEW vs. CES: QCEW is the administrative universe (5-month lag), CES is the sample survey (1-month lag); CES March benchmark revisions align to QCEW. 2024 benchmark revision: -818,000 downward revision to CES (QCEW showed slower job growth than CES estimated). Location Quotient: (county industry share) / (national share); LQ&gt;1 = local specialization. Three data access paths: BLS API series IDs, QCEW cross-sectional API at data.bls.gov/cew/api/, bulk flat files at blsdownload.bls.gov (~500MB/quarter compressed). Python QCEW API private sector 2-digit NAICS: employment/wage table by supersector + LQ demo + YoY wage growth.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Current Employment Statistics: The Federal Database Behind the Monthly Jobs Report</title>
      <link>https://ai-analytics.org/writing/bls-ces/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-ces/</guid>
      <pubDate>Thu, 17 Dec 2026 00:00:00 GMT</pubDate>
      <description>BLS CES (Current Employment Statistics): monthly payroll survey of ~140,000 businesses and ~440,000 worksites covering ~34% of all nonfarm payroll. Two surveys: CES (establishment, payroll jobs) + CPS (household, unemployment rate). Released first Friday of each month at 8:30am ET. Headline: total nonfarm payroll employment; also private payrolls, manufacturing, AHE (average hourly earnings), AWH (average weekly hours). NAICS supersectors: Mining/Logging, Construction, Manufacturing (durable/nondurable), Trade/Transport/Utilities, Information, Financial Activities, Professional/Business Services, Education/Health, Leisure/Hospitality, Other Services, Government. Series ID format: CEU + supersector + industry + data type (01=employment, 03=hours, 11=AHE). Examples: CEU0000000001 (total nonfarm), CEU3000000001 (manufacturing), CEU7000000001 (leisure/hospitality), CEU0500000011 (private AHE). Reference week = week containing the 12th. Three estimates: preliminary (T+30 days), first revision (T+60), second revision (T+90). March annual benchmark revision aligns to QCEW administrative records. COVID: April 2020 -20.5M jobs (worst single month ever); Great Recession trough Feb 2010 -8.7M from Jan 2008 peak. BLS API: api.bls.gov/publicAPI/v2/timeseries/data/, 500 series/query with key, 10 years. ADP preview released 2 days before. AHE ~$35/hr all private (2024); real wage growth = AHE minus CPI. Python BLS API 20-series fetch + supersector employment table with YoY change + AHE/AWH block + COVID recovery tracker.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BOP Federal Prison Population: The Federal Database Behind 148,000 US Federal Inmates</title>
      <link>https://ai-analytics.org/writing/bop-federal-prison-population/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bop-federal-prison-population/</guid>
      <pubDate>Wed, 16 Dec 2026 00:00:00 GMT</pubDate>
      <description>BOP (Bureau of Prisons) under DOJ: 148,000+ federal inmates in 122 institutions (~36,000 staff). Federal offenses: drug trafficking ~44%, weapons ~20%, sex offenses ~8%, immigration ~6%, fraud/white collar ~5%. Federal mandatory minimums: 21 USC 841(b) (1kg+ heroin/5kg+ cocaine = 10-yr minimum). Crack/powder disparity: pre-FSA 2010 100:1 ratio, FSA 2010 reduced to 18:1. USSC Sentencing Guidelines advisory since Booker 2005; 13.4% longer sentences for Black defendants (USSC 2017). First Step Act 2018: FSA retroactivity (~2,600 released), safety valve expansion, earned-time credits (10-15 days/month), PATTERN risk tool. Demographics: 93% male, 37% Black, ~23% non-US citizens. Facility types: ADX (Florence supermax), USP, FCI, FPC, FMC. Private prisons: Biden EO 14006 non-renewal; Trump 2025 re-expansion. BOP Statistics at bop.gov/about/statistics/ (static tables). USSC datafiles at ussc.gov for sentencing data. Python BOP HTML table scraper + USSC drug sentence trends 2010-2023.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC Drug Overdose Mortality: The Federal Database Behind the US Opioid Crisis</title>
      <link>https://ai-analytics.org/writing/cdc-drug-overdose-mortality/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-drug-overdose-mortality/</guid>
      <pubDate>Tue, 15 Dec 2026 00:00:00 GMT</pubDate>
      <description>107,543 overdose deaths in 2023 (CDC NCHS provisional); first 100k+ year was 2021. Three-wave opioid crisis: Wave 1 prescription opioids (OxyContin 1996, Purdue 2007 $634M fine); Wave 2 heroin surge (2010-2013); Wave 3 synthetic opioids (IMF fentanyl 50-100x morphine, ~75k synthetic opioid deaths 2022). Three CDC sources: VSRR Provisional Drug Overdose Counts (monthly, Socrata API at data.cdc.gov, 12-month rolling), CDC WONDER (death certificate ICD-10 queries 1999-present, county-level), state drug category flat file. ICD-10 T-codes: T40.1 heroin, T40.2 natural/semisynthetic, T40.4 synthetic opioids (fentanyl -- the key field), T40.5 cocaine, T43.6 stimulants. Fentanyl supply: China scheduled 2019; Mexico (Sinaloa/CJNG) now primary; counterfeit M30 pills; xylazine (tranq) not reversed by naloxone. Geographic: WV ~80/100k; Appalachian epicenter. Purdue $8.34B 2022 DOJ settlement; Sackler $6B; total settlements &gt;$55B. MOUD: buprenorphine (waiver removed 2022 SUPPORT Act), methadone, naltrexone. Python VSRR Socrata API synthetic opioid rate by state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL Form 5500: The Federal Database Behind Every US Pension and Benefit Plan</title>
      <link>https://ai-analytics.org/writing/dol-form-5500-pension-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-form-5500-pension-data/</guid>
      <pubDate>Mon, 14 Dec 2026 00:00:00 GMT</pubDate>
      <description>Form 5500 Annual Return/Report: ~217,000 filings/year for all ERISA plans (DB, DC, health/welfare with 100+ participants). $30T+ plan assets. EFAST2 at efast.dol.gov -- public record. Plan types: DB (defined benefit, 27M-&gt;13M participants since 1985, employer bears investment risk); DC (401k: $23k employee deferral 2024, $69k total, 5% TSP match, target-date funds); 403(b); ESOPs. Schedule architecture: A (insurance contracts), C (service provider fees -- 408(b)(2) indirect compensation, basis for ERISA fee litigation: Boeing/Intel/MIT/Cornell all settled), G (prohibited transactions), H (large plan financials: balance sheet, income statement), R (retirement/actuarial), SB (Schedule SB: funding target, min required contribution, AFTAP triggers at 60%/80%). PBGC insurance: $80k/yr guarantee; $96/participant flat premium + $52/$1k variable (2024); ARP 2021: $86B SPAP for troubled multiemployer plans (Central States $73B). Large plan audit: 100-participant threshold; SAS 136; 2015 OIG found 39% deficient. Bulk data: dol.gov/agencies/ebsa research files. Python EFAST2 Schedule H + C analysis: top-50 401k plans by assets + fee rates in basis points by asset tier.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS OEWS: The Federal Database Behind Wage Statistics for 830 Occupations Across the US Economy</title>
      <link>https://ai-analytics.org/writing/bls-oews-wage-statistics/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-oews-wage-statistics/</guid>
      <pubDate>Sun, 13 Dec 2026 00:00:00 GMT</pubDate>
      <description>BLS OEWS (Occupational Employment and Wage Statistics): semi-annual survey of 1.1M non-farm establishments, ~57M workers. Annual release: mean/median wages, 10th-25th-75th-90th percentiles, employment for 830 occupations across 590+ areas (all states, 564+ MSAs, nonmetro areas). SOC 2018: 23 major groups, 6-digit codes. Highest-paying: anesthesiologists ~$331k, oral surgeons ~$317k, OB/GYN ~$296k, CEOs ~$246k. Data fields: area_type (1=national, 2=state, 3=MSA), occ_code, o_group (major/minor/broad/detailed), emp, h_mean/a_mean, h_median/a_median, h_pct10/h_pct90/a_pct10/a_pct90, emp_prse, mean_prse. Special symbols: * = above $208k cap, # = employment suppressed. Access: bls.gov/oes/tables.htm bulk zip files (national/state/MSA); BLS API v2 with complex OEWS series IDs. Industry-occupation matrix: SWE in finance vs. tech vs. manufacturing. Projections link: NEM 2022-2032 (wind turbine techs +60%, nurse practitioners +46%, data scientists +35%). Python: downloads national zip, ranks Computer &amp; Math occupations by wage, computes percentile spread healthcare vs. tech.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FRA Railroad Accident Data: The Federal Database Behind Every US Rail Incident Since 1975</title>
      <link>https://ai-analytics.org/writing/fra-railroad-accidents/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fra-railroad-accidents/</guid>
      <pubDate>Sat, 12 Dec 2026 00:00:00 GMT</pubDate>
      <description>FRA (Federal Railroad Administration) accident reporting system (49 CFR Part 225): ~224k records since 1975. Train accidents (Form 54), grade crossing (Form 57), employee injuries (Form 55). Cause codes: Track/Equipment/Human Factors/Misc. Reportable threshold: $11,200+ damage, or death/injury/evacuation/hazmat. East Palestine OH Feb 2023: Norfolk Southern 32N derailment; vinyl chloride; controlled burn; NTSB 37 recommendations. Grade crossing: ~2,000-2,200 collisions/yr, ~270-290 deaths, 128,000 public crossings. PTC: mandated 2008 Rail Safety Improvement Act after Chatsworth 2008 (25 dead); fully implemented 2020. FRA: 140,000 inspections/yr, 28,000 violations, $27,904 max penalty. CRISI grants $1B+ (IIJA 2021). FRA Safety Data API at safetydata.fra.dot.gov; bulk CSV Forms 54/57/55. Python derailments by state + hazmat releases by commodity.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OPM FedScope: The Federal Database Behind 2.1 Million US Government Workers</title>
      <link>https://ai-analytics.org/writing/opm-federal-workforce/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/opm-federal-workforce/</guid>
      <pubDate>Fri, 11 Dec 2026 00:00:00 GMT</pubDate>
      <description>OPM CPDF/FedScope: ~2.1-2.3M federal civilian employees quarterly. Largest: DOD ~750k, VA ~400k, DHS ~250k. GS pay: GS-1 Step 1 $22,270/yr to GS-15 Step 10 $163,964 base + 34 locality areas (DC +33.26%); SES ~9,000 positions $155k-$235k (2024). FedScope dimensions: agency, occupation series, location, pay plan, grade, education, age, race, gender, veterans status (27% federal vs 6% private). FERS: 1.1%/yr x high-3 x years + TSP 5% match + Social Security; CSRS pre-1984. DOGE 2025: fork-in-the-road email ~75k acceptances; USAID ~10k terminated; HHS ~20k; DOE ~1,500; EPA ~1,500; union lawsuits. Data: fedscope.opm.gov cube; opm.gov bulk CSV; no public REST API. Python FedScope CSV analysis by agency grade distribution and SES density.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NIFC Wildfire Data: The Federal Database Behind a Century of US Fire Statistics</title>
      <link>https://ai-analytics.org/writing/nifc-wildfire-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nifc-wildfire-data/</guid>
      <pubDate>Thu, 10 Dec 2026 00:00:00 GMT</pubDate>
      <description>NIFC (Boise ID) with USFS/NPS/BLM/BIA/FWS: wildfire stats since 1926. 2023: ~56,580 fires, ~2.7M acres (10-yr avg ~7M/yr). Record years: 2015 (10.1M), 2020 (10.1M). Fire suppression paradox: Smokey Bear 1944+ = fuel accumulation = larger fires. USFS FOD: ~2.3M fires 1992-present, SQLite, size classes A-G, cause (human/lightning/unknown), lat-lon, county. MTBS: USGS-USFS Landsat dNBR burn severity for fires &gt;=1,000 acres at mtbs.gov. ICS-209 extended attack reports at famweb.nwcg.gov. WUI: 43M homes (Radeloff 2018 PNAS); Camp Fire 2018 Paradise CA (153k acres, 85 dead); Lahaina 2023 (2,200 structures). Active fire: NIFC ArcGIS GeoJSON; NASA FIRMS MODIS/VIIRS. Climate: Westerling 2006 Science + Williams 2019 PNAS VPD correlation. Python decade-by-decade analysis + active fire query.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFPB Consumer Complaint Database: The Federal Record of 7 Million Financial Complaints</title>
      <link>https://ai-analytics.org/writing/cfpb-consumer-complaints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cfpb-consumer-complaints/</guid>
      <pubDate>Wed, 09 Dec 2026 00:00:00 GMT</pubDate>
      <description>CFPB Consumer Complaint Database (March 2012): 7M+ complaints since 2011. Products: credit reporting ~60%, debt collection ~10%, credit card ~8%, mortgage ~7%. Equifax/Experian/TransUnion receive 50%+ of all complaints. Fields: complaint_id, date_received, product/sub_product/issue, consumer_complaint_narrative (~20% with text, PII-scrubbed), company_response (monetary/non-monetary/explanation relief), timely (Y/N), consumer_disputed (Y/N), state, zip (3-digit partial). COVID: mortgage forbearance surge. Biden loan forgiveness 2022-2024: 2-3x student loan complaints. Navient $1.85B 2022. Wells Fargo $3.7B 2022 (largest-ever CFPB). API: api.consumerfinance.gov/data-research/consumer-complaints/search (no key, max 10k/query). Bulk download ~1.5GB+. Python mortgage complaint analysis by company and response type.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NOAA Storm Events: The Federal Database Behind 50 Years of US Weather Disaster Data</title>
      <link>https://ai-analytics.org/writing/noaa-storm-events-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/noaa-storm-events-database/</guid>
      <pubDate>Tue, 08 Dec 2026 00:00:00 GMT</pubDate>
      <description>The NOAA National Centers for Environmental Information (NCEI) Storm Events Database records every significant weather event in the US from 1950 to present — ~2.1M event records, 48 standardized event types (tornado, hurricane, flash flood, hail, winter storm, wildfire, and more), with property damage, crop damage, injuries, fatalities, and county-level geography. Bulk download at ncei.noaa.gov/pub/data/swdi/stormevents/csvfiles/ with annual gzipped CSVs; CDO API at www.ncei.noaa.gov/cdo-web/api/v2/. DAMAGE_PROPERTY field uses K/M/B suffix encoding requiring parsing. NOAA Billion-Dollar Disasters tracker covers 376 events since 1980 totaling $2.6T in CPI-adjusted damages; 2023: 28 events exceeding $1B each — a record. Tornado climatology: ~1,200-1,500 annually, EF0-EF5 scale, 2011 Super Outbreak 362 tornadoes/3 days, Dixie Alley shift. Hurricane damage: Harvey $125B, Ian $112B county-by-county in Storm Events. Flood events: deadliest weather type most years, ~88 fatalities/year average, AHPS stream gauge network. Climate change signal in increasing damage frequency and extreme precipitation. Here is event type taxonomy, data quality caveats, NCEI CDO API, Billion-Dollar Disasters methodology, tornado EF scale, hurricane storm surge vs. wind damage distinction, and a Python DAMAGE_PROPERTY parsing analysis by event type and state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FBI NIBRS: The National Crime Database Behind Incident-Level Crime Statistics</title>
      <link>https://ai-analytics.org/writing/fbi-nibrs-crime-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fbi-nibrs-crime-data/</guid>
      <pubDate>Mon, 07 Dec 2026 00:00:00 GMT</pubDate>
      <description>The FBI National Incident-Based Reporting System (NIBRS) replaced summary-level UCR in 2021 with incident-level records from 15,000+ law enforcement agencies — 52 offense categories, victim/offender/arrestee demographics, location, weapon, property, and arrest outcomes. 2022: ~15,724 agencies reporting, covering ~79% of US population; NYPD (8M people) only began NIBRS submission 2023. Offense segments: Group A (52 categories) vs Group B (11 citation-only). Victim segment: age, sex, race, ethnicity, victim-offender relationship (intimate partner, acquaintance, stranger, unknown). Hate crime codes: 88 bias motivation codes. Crime Data Explorer API at cde.ucr.cjis.gov: /api/nibrs/{offense}/offense/agencies, /api/nibrs/{offense}/victim/count — free API key, 1,000 req/day. Annual bulk downloads: incident, offense, victim, offender, arrestee, property files. Supplemental Homicide Reports (SHR) since 1976: victim-offender-weapon-circumstance at case level. NIBRS vs NCVS: only ~43% of violent crimes reported to police. TRAC-NIBRS for coverage gap analysis. Here is the UCR-to-NIBRS transition, reporting gaps, API structure, hate crime methodology, SHR limitations, NCVS complement, and a Python CDE API violent crime rate analysis by state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SSA Social Security: The Federal Database Behind $1.4 Trillion in Annual OASDI Benefits</title>
      <link>https://ai-analytics.org/writing/ssa-social-security-oasdi/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ssa-social-security-oasdi/</guid>
      <pubDate>Sun, 06 Dec 2026 00:00:00 GMT</pubDate>
      <description>Social Security OASDI (Old-Age, Survivors, and Disability Insurance) pays ~$1.4T annually to ~70M beneficiaries. Three components: OASI (~58M, ~$1.2T), DI (~8.8M, ~$160B), SSI (~7.5M, ~$65B, means-tested). Trust funds: OASI ~$2.75T invested in special-issue Treasuries; 2034 projected OASI depletion (77% payable). FICA tax: 6.2%+6.2% on wages up to $168,600 (2024). Benefit formula: AIME computed from 35 highest-earning years indexed to AWI; PIA = 90% of AIME to first bend point + 32% between bend points + 15% above second. FRA: 67 (born 1960+); early at 62 with ~30% reduction; DRCs +8%/year to 70. 2024 bend points: $1,174/$7,078. SSA data: data.ssa.gov — Monthly Statistical Snapshot, Annual Statistical Supplement (Table 5.A state-level beneficiaries, Table 4.B DI allowance rates, Table 6.C SSI by state), state/county OASDI CSV. FRED series: SSASSHDI, SSARECEIPTSDISABILITY. Disability sequential evaluation: SGA → severe impairment → Blue Book listings → RFC → past relevant work → vocational grids. ALJ hearing backlog ~1M. WEP/GPO eliminated January 2025 (Social Security Fairness Act) for 3.2M workers. Here is trust fund mechanics, AIME/PIA formula, DI determination process, state data API, and a Python analysis of retired-worker benefit penetration rates by state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>IRS Exempt Organizations: The Federal Database Behind 1.26 Million US Nonprofits</title>
      <link>https://ai-analytics.org/writing/irs-990-nonprofit-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/irs-990-nonprofit-data/</guid>
      <pubDate>Sat, 05 Dec 2026 00:00:00 GMT</pubDate>
      <description>The IRS Exempt Organizations Business Master File (BMF) registers 1.26M active tax-exempt organizations — 501(c)(3) public charities and private foundations (~1M), 501(c)(4) social welfare orgs (~80k), 501(c)(6) trade associations, 527 political orgs, and 25 other IRC subsection categories. $2.8T in annual sector revenues (~5.5% of US GDP), ~12M nonprofit employees. BMF published monthly at IRS.gov: tab-delimited with EIN, name, subsection code, NTEE code (26 major categories A-Z: Education, Health, Human Services, etc.), ruling date, deductibility code, asset/income/revenue amounts. Form 990 e-file JSON on AWS S3 at s3://irs-form-990/ since 2013 — index files plus per-filing XML/JSON. Key schedules: Part VII compensation (5 highest-paid officers), Schedule A (public support test), Schedule B (donor list, confidential), Schedule C (political activity), Form 990-PF (private foundations: 1.39% NII excise tax, 5% minimum distribution, self-dealing IRC 4941). Citizens United 2010 + 501(c)(4) anonymous spending: Form 8976 required since 2016. ProPublica Nonprofit Explorer API at api.propublica.org/nonprofits/v2/organizations/{ein}.json. Private foundations: Gates ($70B), Ford ($16B), Robert Wood Johnson ($13B). Church filing exemption: no Form 990 required, largest data gap. Here is BMF field schema, NTEE taxonomy, 990 e-file S3 access, political activity rules, private foundation excise regime, and a Python NTEE subsector analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USAID Foreign Aid Data: The Federal Database Behind $40 Billion in Annual US Development Assistance</title>
      <link>https://ai-analytics.org/writing/usaid-foreign-aid-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaid-foreign-aid-data/</guid>
      <pubDate>Fri, 04 Dec 2026 00:00:00 GMT</pubDate>
      <description>USAID (United States Agency for International Development) manages ~$40B in annual foreign assistance across 100+ countries. ForeignAssistance.gov (IATI) publishes whole-of-government aid data by agency, country, sector, and implementing partner. Award types: contracts (for-profit implementers like Chemonics ~$1-2B/yr, DAI, AECOM), grants (INGOs: Save the Children, CARE, IRC, World Vision), cooperative agreements, interagency agreements. PEPFAR: $110B+ since 2003, 20M+ people on ARVs, Country Operational Plans at pepfar.gov. DATA Act: USAID contracts on USASpending.gov; IATI XML at iatiregistry.org. Top recipients FY2022: Ukraine (surged post-invasion), Ethiopia, DR Congo, Nigeria, Jordan, South Africa (PEPFAR). Sub-Saharan Africa ~35%, Near East ~20% of obligations. ForeignAssistance.gov API at /api/v1/resources.json. OpenAid, AidData, D-Portal for secondary access. Here is award mechanisms, PEPFAR data structure, implementing partner concentration, geographic patterns, IATI standard, and a Python ForeignAssistance.gov analysis by country and sector.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>PCAOB Auditor Inspections: The Federal Database Behind Public Company Audit Oversight and Accounting Firm Deficiencies</title>
      <link>https://ai-analytics.org/writing/pcaob-auditor-inspections/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/pcaob-auditor-inspections/</guid>
      <pubDate>Thu, 03 Dec 2026 00:00:00 GMT</pubDate>
      <description>PCAOB (Public Company Accounting Oversight Board) was created by Sarbanes-Oxley Act 2002 after Enron/WorldCom/Arthur Andersen scandals; ~1,700 registered audit firms globally; annual inspections for firms auditing &gt;100 SEC-registered issuers, triennial for ≤100. Two-part inspection report: Part I (public deficiencies — insufficient audit evidence, ICFR failures, revenue recognition) immediately; Part II (quality control criticisms) public after 12 months. Big Four 2022 deficiency rates: 31-44% of inspected engagements. HFCAA: Chinese audit firms required to allow PCAOB inspection; August 2022 agreement enabled first-ever inspection of KPMG Huazhen and PwC Zhong Tian. Enforcement: Section 105, $15M/$750k monetary penalties; KPMG 2019 $50M fine for stealing inspection plans. Critical Audit Matters (CAMs) required since 2019. All inspection reports at pcaobus.org/inspections. Here is inspection methodology, deficiency trends by audit area, HFCAA China access resolution, enforcement actions, CAM disclosure, and a Python deficiency rate trend analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Medicare Part D Drug Spending Data: The Federal Database Behind $225 Billion in Annual Prescription Drug Costs</title>
      <link>https://ai-analytics.org/writing/medicare-part-d-drug-spending/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/medicare-part-d-drug-spending/</guid>
      <pubDate>Wed, 02 Dec 2026 00:00:00 GMT</pubDate>
      <description>Medicare Part D (MMA 2003, implemented January 2006) covers outpatient prescription drugs for ~50M beneficiaries through private PDPs and MA-PD plans; ~$225B annual spending. CMS publishes Part D Prescriber Data (NPI, specialty, drug, total claims, total cost) and Drug Spending Dashboard. Top drugs by spending: Eliquis (apixaban) ~$14B, Humira ~$6B pre-biosimilar, Keytruda ~$5B, GLP-1 agonists (Ozempic/Victoza) rapidly rising. PBM rebate mechanics: Tier 1-5 formulary; manufacturer pays 70% brand discount in coverage gap. IRA 2022: Medicare drug price negotiation (first 10 drugs 2026); $2,000 OOP cap 2025; inflation rebates. Humira 2023: 7 biosimilars launched simultaneously. ProPublica Prescriber Checkup identifies high-volume opioid prescribers. LIS/Extra Help: ~13M beneficiaries, full subsidy. CMS data at data.cms.gov. Here is benefit phases, formulary mechanics, IRA negotiation program, prescriber-level data structure, opioid prescribing patterns, and a Python analysis of specialty drug spending.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NFIP Flood Insurance Data: The Federal Program Behind $20 Billion in Flood Claims and the National Flood Hazard Layer</title>
      <link>https://ai-analytics.org/writing/nfip-flood-insurance/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nfip-flood-insurance/</guid>
      <pubDate>Tue, 01 Dec 2026 00:00:00 GMT</pubDate>
      <description>NFIP (National Flood Insurance Act 1968) provides flood insurance for ~5M policyholders across 22,000+ communities; ~$1.3T total coverage in force. Flood zones: SFHAs (Zone A/AE/V, 1% annual chance) require federally-backed mortgages to carry NFIP. Coverage limits: $250,000 building/$100,000 contents residential. Katrina 2005: $16B, 267k claims; Harvey 2017: $8.9B, 89k claims; Ian 2022: $3.6B. NFIP was $20B+ in debt to Treasury. Risk Rating 2.0 (Oct 2021): property-specific pricing by flood frequency, distance to water, foundation type; 18% annual cap; 1.2M policies canceled/non-renewed. National Flood Hazard Layer (NFHL) at msc.fema.gov; WFS API at hazards.fema.gov. OpenFEMA API: FimaNfipClaims and FimaNfipPolicies datasets. Repetitive loss: ~25,000 severe repetitive loss structures = 25-30% of total claims. First Street Foundation alternative risk model. Here is flood zone mechanics, Risk Rating 2.0 reform, NFHL GIS data, OpenFEMA API structure, repetitive loss dynamics, and a Python Harvey claims analysis by county.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FARA: The Foreign Agents Registration Act Database Behind Lobbying Disclosure for Foreign Governments</title>
      <link>https://ai-analytics.org/writing/fara-foreign-agents/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fara-foreign-agents/</guid>
      <pubDate>Mon, 30 Nov 2026 00:00:00 GMT</pubDate>
      <description>FARA (Foreign Agents Registration Act, 22 U.S.C. §§ 611-621, 1938) requires agents of foreign governments and political parties to register with DOJ&apos;s National Security Division and file semi-annual disclosure statements. ~500-600 active registrations at any time. Registration: Form RA-1 (within 10 days) → Form NSD-3 (semi-annual supplement) disclosing principal identity, activities, compensation, disbursements, political contacts. LDA exemption (22 U.S.C. § 613(h)): agents registering under the Lobbying Disclosure Act whose principal is not a foreign government or political party may use LDA instead -- DOJ IG 2016 report criticized this gap. Mueller-era surge 2018-2022: Manafort convicted, Flynn retroactively registered (Turkey/Gülen), Barrack acquitted (UAE), Podesta Group and Mercury LLC retroactively registered. Saudi Arabia post-Khashoggi: $14M+ annually, $450M+ since 2016; firms retained: Squire Patton Boggs, Akin Gump, BGR Group. Chinese state media: CGTN and Xinhua registered as foreign agents 2019. Criminal penalty: 22 U.S.C. § 618 felony, up to 5 years + fines. Electronic Reading Room: justice.gov/nsd-fara; eFARA bulk CSV at efile.fara.gov/bulk/. OpenSecrets and POGO maintain secondary databases. Here is registration mechanics, LDA exemption gap, Mueller-era cases, Saudi Arabia and China enforcement, eFARA bulk data structure, and a Python analysis of FARA disbursements by country.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Open Payments: The Federal Database Behind $12 Billion in Annual Pharma and Device Payments to Physicians</title>
      <link>https://ai-analytics.org/writing/cms-open-payments/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-open-payments/</guid>
      <pubDate>Sun, 29 Nov 2026 00:00:00 GMT</pubDate>
      <description>The Physician Payments Sunshine Act (ACA Section 6002, 2010) requires applicable manufacturers to report all payments ≥$10 to covered recipients. 2022 dataset: $12.7B total; research payments ~$4B; general payments ~$2.5B; ownership/investment interests ~$6.2B. ~2,700 applicable manufacturers; ~900,000 covered recipients. Three datasets at openpaymentsdata.cms.gov: General Payments (GP), Research Payments (RP), Ownership/Investment Interests (OI). Key fields: NPI, total_amount_of_payment_usdollars, nature_of_payment (consulting/speaking/food/royalty/research), drug/device name, manufacturer. NPI linkage to NPPES enables physician specialty/location cross-reference. ProPublica &quot;Dollars for Docs&quot; since 2010. Research: Carey et al. (2021) meals associated with brand prescribing; DeJong et al. (2016) payment receipt and prescribing patterns. Dispute process: 45-day window before publication. Socrata API at data.cms.gov/open-payments. Here is Sunshine Act mechanics, payment category taxonomy, scale data, prescribing-impact research, drug-specific linkage (Ozempic/Humira/insulin), and a Python Socrata API analysis by specialty and manufacturer.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NLRB Union Elections and Unfair Labor Practice Data: The Federal Database Behind US Labor Organizing</title>
      <link>https://ai-analytics.org/writing/nlrb-union-elections/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nlrb-union-elections/</guid>
      <pubDate>Sat, 28 Nov 2026 00:00:00 GMT</pubDate>
      <description>NLRB processes ~2,500-3,000 election cases and ~15,000-20,000 ULP charges annually. RC (union-initiated), RM (employer), RD (decertification) petition types; 25-30% showing of interest required; secret ballot; majority of valid votes cast to win. 2014 &quot;Ambush Election&quot; rule reduced pre-election period to ~23 days; 2023 Biden rule restoration. Union win rate ~65-70% in recent years. Amazon LDJ5 Staten Island April 2022: 2,654-2,131 first US Amazon union win; Starbucks Workers United 400+ stores. ULP charges: Section 8(a)(1) interference, 8(a)(3) anti-union discrimination, 8(a)(5) refusal to bargain; ALJ hearing → NLRB Board → circuit court. Gissel bargaining orders; McLaren Macomb (2023) confidentiality clauses unlawful. BLS 2023: 10.0% union density, 6.0% private, 33.1% public. NLRB election results CSV and case search at nlrb.gov. Here is petition types, 2014 rule history, Amazon/Starbucks campaigns, ULP mechanics, Gissel orders, and a Python election win rate analysis by industry.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>ATF Firearm Trace Data: The Federal Database Behind 350,000 Annual Crime Gun Traces</title>
      <link>https://ai-analytics.org/writing/atf-firearm-trace-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/atf-firearm-trace-data/</guid>
      <pubDate>Fri, 27 Nov 2026 00:00:00 GMT</pubDate>
      <description>ATF eTrace processes ~350,000-400,000 crime gun trace requests annually from law enforcement. Trace chain: law enforcement submits recovered gun → ATF contacts manufacturer/importer → FFL of first sale → subsequent FFLs until first retail purchaser identified. Time-to-crime (TTC): average 7-8 years nationally; TTC under 3 years flags potential trafficking; 21% of handguns traced within 3 years. Tiahrt Amendments (2003): prohibit ATF from releasing trace data to the public, using in civil litigation against gun dealers/manufacturers. Iron Pipeline: southeastern states (GA, SC, VA, FL) supply northeastern cities (NY, NJ, MD) via regulatory arbitrage. NIBIN: 300+ sites, 7,300 ballistic leads/week, links cartridge cases across crime scenes. ~130,000 active FFLs; 5-7% annually inspected; 920M+ records at Out-of-Business Records Center. ATF publishes state-level aggregate trace data at atf.gov. Here is eTrace chain mechanics, Tiahrt Amendments history, Iron Pipeline geographic patterns, NIBIN infrastructure, and a Python TTC distribution analysis by state and firearm type.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDIC Institution Database: The Federal Profile of Every FDIC-Insured Bank and Thrift</title>
      <link>https://ai-analytics.org/writing/fdic-institution-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fdic-institution-database/</guid>
      <pubDate>Thu, 26 Nov 2026 00:00:00 GMT</pubDate>
      <description>FDIC BankFind Suite at banks.data.fdic.gov provides institution profiles for all ~4,600 active FDIC-insured banks and thrifts plus 10,000+ historical institutions back to 1934. Charter types: N = national bank (OCC-chartered), SM = state member bank (Federal Reserve), NM = state nonmember bank (FDIC-supervised), SA = state savings association (OCC), SB = state savings bank (FDIC). Dual banking system: institutions choose state or federal charter creating regulatory competition. Banking consolidation: 14,000+ FDIC-insured institutions in 1984 to ~4,600 today -- 67% reduction driven by S&amp;L crisis failures, interstate banking deregulation (Riegle-Neal 1994), Gramm-Leach-Bliley 1999, post-GFC 2008-2012 failures, and ongoing M&amp;A. Summary of Deposits: annual branch-level deposit data enabling banking desert analysis (census tracts with no bank or credit union within 10 miles). CRA (Community Reinvestment Act) exam ratings published: Outstanding, Satisfactory, Needs to Improve, Substantial Noncompliance. BankFind API: /api/institutions endpoint with CERT (unique 5-digit certificate number), ACTIVE, ASSET (thousands), CLASSP, STALP, ESTYMD, SPECGRP, HCTMULT fields; no API key required. Here is charter type mechanics, dual banking regulatory competition, consolidation drivers, CRA compliance, banking desert geography, and a Python active institution analysis by state and asset tier.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FMCSA Crash Data: The Federal Database Behind 5,000 Annual Large Truck Fatalities</title>
      <link>https://ai-analytics.org/writing/fmcsa-crash-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fmcsa-crash-data/</guid>
      <pubDate>Wed, 25 Nov 2026 00:00:00 GMT</pubDate>
      <description>FMCSA&apos;s MCMIS tracks ~500,000 reportable CMV crashes per year. Large truck fatalities reached 5,837 in 2022 -- the highest since 2005. 80%+ of truck crash fatalities are passenger vehicle occupants. The Large Truck Crash Causation Study (963 crashes) found driver error in 55% of crashes (87% decision/recognition/performance errors). HOS regulations: 11-hour drive limit, 14-hour window, ELD mandate December 2017. CSA SMS: 7 BASICs updated monthly. Roadside inspections: 3.5M/year, 20% vehicle OOS rate, 5% driver OOS rate. ATA v. FMCSA 2019 removed BASIC percentile scores from public display. SAFER, A&amp;I portal, FMCSA public API, NHTSA FARS complement. Industry: 3.5M drivers, 750,000 carriers, 350,000 owner-operators. Here is the state fatality rate normalized by FHWA VMT, time-of-day and road-type breakdowns, critical reason attribution, and a Python carrier-level crash lookup.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CRS Reports: The Congressional Research Service Database Behind US Policy Analysis</title>
      <link>https://ai-analytics.org/writing/crs-congressional-reports/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/crs-congressional-reports/</guid>
      <pubDate>Tue, 24 Nov 2026 00:00:00 GMT</pubDate>
      <description>The Congressional Research Service (CRS) is the nonpartisan policy and legal research arm of Congress within the Library of Congress, established 1914. 700 analysts across 7 divisions produce six product types: Reports (comprehensive analyses), Insights (2-4 page current issue), In Focus (2-page overviews), Legal Sidebars, Report Updates, and Testimonies. 25+ policy areas including agriculture, appropriations, budget, energy, environment, foreign affairs, health, homeland security, immigration, technology, labor, law, national defense, and transportation. The 2018 Consolidated Appropriations Act first mandated public release; crsreports.congress.gov is the official portal with 9,000+ available reports. Historically products were available only to members of Congress -- the 2012 Coburn-blocked report on top marginal tax rates and economic growth (finding no correlation) galvanized the public access movement. EveryCRSReport.com (Federation of American Scientists + Demand Progress) provides bulk access including pre-2018 reports via API at everycrsreport.com/reports.json; each report has id, title, topics array, date, and versions list. CRS differs from GAO (auditing/program evaluation) and CBO (budget scoring only). Here is CRS product type mechanics, the public access mandate, EveryCRSReport.com API structure, and a Python analysis of publication frequency and update patterns by policy area.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NIST NVD: The National Vulnerability Database Behind CVE Scoring and Cybersecurity Compliance</title>
      <link>https://ai-analytics.org/writing/nist-nvd-vulnerabilities/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nist-nvd-vulnerabilities/</guid>
      <pubDate>Mon, 23 Nov 2026 00:00:00 GMT</pubDate>
      <description>NIST NVD enriches 250,000+ CVE records with CVSS scores, CWE classifications, and CPE product data. CVE Numbering Authorities (400+ CNAs): Microsoft, Google, Apple, Red Hat, MITRE root CNA. CVSS v3.1: Attack Vector/Complexity, Privileges Required, User Interaction, Scope, CIA impact. Score ranges: Critical 9.0-10.0, High 7.0-8.9, Medium 4.0-6.9. CISA KEV catalog: 1,000+ confirmed-exploited CVEs, BOD 22-01 mandates federal patching within 14 days. Log4Shell CVE-2021-44228 CVSS 10.0; EternalBlue CVE-2017-0144 9.3; Heartbleed CVE-2014-0160 7.5. CWE-787 Out-of-Bounds Write dominates Critical CVEs. NVD REST API /rest/json/cves/2.0 with cvssV3Severity/cweId/cpeMatchString/hasKev parameters. FedRAMP, PCI DSS, FISMA compliance applications. Here is CVE assignment mechanics, CVSS base/temporal/environmental scores, KEV operational mechanics, and a Python Critical CVE analysis for 2024.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA Greenhouse Gas Reporting Program: The Facility-Level Emissions Database Behind US Climate Accountability</title>
      <link>https://ai-analytics.org/writing/epa-ghg-reporting/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-ghg-reporting/</guid>
      <pubDate>Sun, 22 Nov 2026 00:00:00 GMT</pubDate>
      <description>EPA GHGRP requires ~8,000 facilities emitting ≥25,000 tCO2e/year to report annually, covering ~85-90% of US stationary source emissions. 41 source categories: power plants (Subpart D), petroleum/natural gas systems (Subpart W, largest by count), refineries (Subpart Y), landfills, cement, iron/steel, chemical manufacturing. Six GHGs: CO2, CH4 (28-34x GWP), N2O (265-298x), HFCs (up to 14,800x), PFCs, SF6 (23,500x). FLIGHT tool at ghgdata.epa.gov for facility search. ECHO bulk download. Satellite methane validation controversy: TROPOMI/Sentinel-5P, GHGSat, MethaneSAT finding higher Permian Basin emissions than Subpart W reports. EPA 2024 Subpart W methodology revision. Here is sector composition, data access via ENVIRO API, and a Python top-emitter analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOJ Antitrust Division: The Federal Merger Review and Cartel Enforcement Database</title>
      <link>https://ai-analytics.org/writing/doj-antitrust-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/doj-antitrust-data/</guid>
      <pubDate>Sat, 21 Nov 2026 00:00:00 GMT</pubDate>
      <description>DOJ Antitrust Division enforces Sherman Act (criminal: price-fixing, bid-rigging, market allocation) and Clayton Act (civil merger review). HSR Act pre-merger notification: 2024 threshold $119.5M, ~1,500-2,000 annual filings, ~3% receive Second Requests, $51,744/day penalty for failure to file. Merger review: Phase 1 (30-day) → Phase 2 → consent decree or litigation. 2023 Merger Guidelines: HHI thresholds (2,500+ highly concentrated). Leniency Program: first cartel self-reporter gets automatic amnesty. Auto parts cartel $2.9B fines. AT&amp;T-Time Warner (DOJ lost), UnitedHealth-Change Healthcare (blocked), JetBlue-Spirit (blocked). DOJ press releases RSS, PACER for complaints. Here is FTC coordination, criminal enforcement mechanics, and a Python press release classification analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC WISQARS: The Federal Injury and Violence Mortality Database Behind Public Health Research</title>
      <link>https://ai-analytics.org/writing/cdc-wisqars-injury-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-wisqars-injury-data/</guid>
      <pubDate>Fri, 20 Nov 2026 00:00:00 GMT</pubDate>
      <description>CDC WISQARS (Web-based Injury Statistics Query and Reporting System) covers all US injury deaths (ICD-10 external cause V-Y) back to 1981 and nonfatal ED visits via NEISS-AIP. 2022: unintentional injury ~230k deaths (#1 cause ages 1-44); drug overdose ~109,680 (fentanyl/synthetics ~73,800); motor vehicle ~46,000; suicide ~49,000 (firearms 55%, hanging 27%); homicide ~24,000 (firearms 79%). Total firearm deaths 48,204 (14.6/100k). Three opioid waves: prescription, heroin, fentanyl. WISQARS API, WONDER, NVDRS (case-level violent deaths with circumstance data). Geographic patterns: firearm suicide highest in rural Mountain West; firearm homicide concentrated in urban areas. Here is ICD-10 coding, NEISS methodology, and a Python state firearm rate analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USASpending.gov: The Federal Spending Database Behind $6 Trillion in Annual Contracts, Grants, and Loans</title>
      <link>https://ai-analytics.org/writing/usaspending-federal-contracts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaspending-federal-contracts/</guid>
      <pubDate>Thu, 19 Nov 2026 00:00:00 GMT</pubDate>
      <description>USASpending.gov tracks ~$6T in annual federal spending via FFATA 2006 and DATA Act 2014. Contracts (~$700B from FPDS-NG): DoD ~$412B, Lockheed Martin ~$73B, RTX ~$42B, Boeing, General Dynamics, Northrop. Grants (~$800B): NIH $40B, NSF $9B. API at api.usaspending.gov: /search/spending_by_award, /bulk_download. FPDS fields: UEI, CAGE, PSC, NAICS, contract type, competition type, set-aside (8(a)/HUBZone/SDVOSB/WOSB). FSRS subaward reporting &gt;$30k. Data Act financial linkage: appropriation → obligation → outlay. Here is defense contract patterns, small business set-asides, DATA Act mechanics, and a Python DoD top-contractor analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Register: The Official Rulemaking Journal Behind 90,000 Pages of Annual US Regulatory Activity</title>
      <link>https://ai-analytics.org/writing/federal-register-rulemaking/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-register-rulemaking/</guid>
      <pubDate>Wed, 18 Nov 2026 00:00:00 GMT</pubDate>
      <description>The Federal Register is the official daily journal of the US federal government, published since 1936, containing proposed rules (NPRMs), final rules, presidential documents, and notices — ~85,000-95,000 pages/year. APA requires notice-and-comment: NPRM → 30-90 day comment period → final rule with 30-day delay. OIRA reviews significant/major rules (&gt;$100M impact) under EO 12866. Unified Regulatory Agenda tracks all agency rules in the pipeline. Congressional Review Act allows Congress to overturn recent major rules. Here is the CFR 50-title structure, Regulations.gov docket API, Federal Register API at federalregister.gov/api/v1/, Loper Bright 2024 overruling of Chevron, and a Python EPA NPRM analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FEC Committee Filings: The Campaign Finance Database Behind $14 Billion in Election Spending</title>
      <link>https://ai-analytics.org/writing/fec-committee-filings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fec-committee-filings/</guid>
      <pubDate>Tue, 17 Nov 2026 00:00:00 GMT</pubDate>
      <description>The FEC administers FECA (1971/1974) for federal elections only. Committee types: PCC, party committees, PAC ($5k/election limit), Super PAC (post-Citizens United, unlimited), SSF, Leadership PAC. 2024 federal spending ~$14B. Individual to candidate limit $3,300/election. FEC bulk data: cm.zip (committees), indiv.zip (individual contributions &gt;$200 with employer/occupation), pas2.zip (PAC-to-candidate), oppexp.zip (disbursements). OpenFEC API at api.open.fec.gov/v1/. 501(c)(4) dark money: no donor disclosure required. Here is all eight bulk files, Super PAC mechanics, MURs, and a Python occupation partisan lean analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC NNDSS: The National Notifiable Disease Surveillance System Behind Weekly US Epidemic Tracking</title>
      <link>https://ai-analytics.org/writing/cdc-nndss-notifiable-diseases/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-nndss-notifiable-diseases/</guid>
      <pubDate>Mon, 16 Nov 2026 00:00:00 GMT</pubDate>
      <description>CDC NNDSS collects weekly case reports for 120+ nationally notifiable conditions from all states and territories. Published via MMWR. CSTE annual list: vaccine-preventable (measles, pertussis ~15k/yr, polio), STIs (gonorrhea ~700k/yr, chlamydia ~1.6M/yr, syphilis 176,713 in 2022 — highest since 1950, congenital syphilis +755%), vector-borne (Lyme disease ~476k estimated), foodborne (Salmonella ~1.35M estimated). FluView four-component influenza surveillance. COVID wastewater surveillance via NWSS. WONDER API, Socrata NNDSS table, AtlasPlus. Here is CSTE legal structure, STI AMR trends, and a Python Lyme disease state trend analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Form D: The Private Placement Database Behind $2 Trillion in Annual Exempt Offerings</title>
      <link>https://ai-analytics.org/writing/sec-form-d-private-placements/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-form-d-private-placements/</guid>
      <pubDate>Sun, 15 Nov 2026 00:00:00 GMT</pubDate>
      <description>SEC Form D is filed within 15 days of first sale in a Reg D exempt offering. Rule 506(b): unlimited amount, no general solicitation, up to 35 non-accredited investors (~90% of filings). Rule 506(c): unlimited, general solicitation permitted (JOBS Act 2012), accredited investors only. Rule 506 offerings raised ~$2.5T in 2022. Fields: entity name, exemption type, offering amount, investor count, investment fund type (VC/PE/hedge/real estate), industry group. EDGAR full-text search at efts.sec.gov. Historical back to 2009. Reg CF ($5M crowdfunding), Reg A+ ($75M mini-IPO). Here is all Reg D exemptions, JOBS Act impact, dark money limitations, and a Python VC state/sector analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Medicare Inpatient Provider Data: The Hospital-Level Payment Records Behind $170 Billion in Annual DRG Reimbursements</title>
      <link>https://ai-analytics.org/writing/cms-medicare-inpatient-drg/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-medicare-inpatient-drg/</guid>
      <pubDate>Sat, 14 Nov 2026 00:00:00 GMT</pubDate>
      <description>CMS publishes annual Medicare Inpatient Provider Charge Data for ~3,000 hospitals across ~760 DRGs. The IPPS pays a fixed amount per DRG via relative weights (RW) — DRG 001 Heart Transplant RW ~25.0, DRG 470 Major Joint Replacement RW ~2.1, 2023 base rate ~$6,000. Medicare pays ~$170B/year via IPPS. Adjustments include Wage Index, IME for teaching hospitals, DSH for safety-net hospitals, and outlier payments. Chargemasters produce 5x-10x sticker prices vs. actual payments. Geographic variation: DRG 470 ranges $12,000–$35,000+ across hospitals. Here is the full dataset schema, Socrata API access, value-based care adjustments (HVBP, HRRP, HACRP), and a Python charge-to-payment ratio analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA Orange Book: The Drug Patent and Exclusivity Database Behind Generic Drug Competition and Hatch-Waxman Challenges</title>
      <link>https://ai-analytics.org/writing/fda-orange-book/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-orange-book/</guid>
      <pubDate>Fri, 13 Nov 2026 00:00:00 GMT</pubDate>
      <description>The FDA Orange Book lists approved drugs and their TE ratings (AB = substitutable bioequivalent). Hatch-Waxman Act (1984) created the ANDA pathway — generics skip clinical trials, show bioequivalence. Paragraph IV certification challenges listed patents → 30-month stay + 180-day first-filer exclusivity. Exclusivity types: NCE 5yr, new clinical investigation 3yr, Orphan Drug 7yr, Pediatric 6mo. Patent thickets: average 71+ listed patents per brand drug. Lipitor $10B/year cliff Nov 2011; Humira 2023 multi-biosimilar launch. Three flat files: Products.txt, Patent.txt, Exclusivity.txt. Here is TE code breakdown, pay-for-delay/FTC v. Actavis, Purple Book for biologics, and a Python upcoming patent cliff analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC PLACES: The Small Area Health Estimates Behind County and Census Tract Disease Prevalence Data</title>
      <link>https://ai-analytics.org/writing/cdc-places-health-estimates/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-places-health-estimates/</guid>
      <pubDate>Thu, 12 Nov 2026 00:00:00 GMT</pubDate>
      <description>CDC PLACES produces model-based small area health estimates for all 3,100+ counties, 29,000+ census tracts, and 28,000+ ZCTAs. 36+ measures across 5 domains: health outcomes (diabetes, obesity, CHD, stroke), prevention (screenings, insurance), unhealthy behaviors (smoking, binge drinking), disabilities, and social determinants. Uses multilevel regression and poststratification (MRP) applied to BRFSS survey data + Census ACS. Obesity &gt;40% in Appalachian counties vs. &lt;20% in Mountain West. Diabetes 15%+ in Mississippi Delta vs. &lt;7% in Colorado. Socrata API at data.cdc.gov, GeoJSON endpoint, sodapy library. Here is full methodology, PLACES vs. County Health Rankings, and a Python Mississippi county health burden analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BSEE Offshore Safety Data: The Post-Deepwater Horizon Incident Database Behind 4,000 Annual Offshore Inspections</title>
      <link>https://ai-analytics.org/writing/bsee-offshore-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bsee-offshore-safety/</guid>
      <pubDate>Wed, 11 Nov 2026 00:00:00 GMT</pubDate>
      <description>BSEE was created in 2011 from MMS breakup after the Deepwater Horizon/Macondo blowout (87 days, 4.9M barrels, 11 deaths). Regulates ~2,000 OCS facilities, MODUs, 15,000+ wells. 4,000+ annual inspections, ~2,000+ INCs (Incidents of Noncompliance) issued. Incident categories: blowouts, fires/explosions, collisions, fatalities (~10-15/yr), injuries (100+/yr). SEMS rule (2010/2013) required operator safety management systems. Well Control Rule (2016) set BOP testing/monitoring requirements. OCS produces 15-17% of US oil. Data at bsee.gov: incident, INC, inspection, production, well CSVs. ArcGIS REST services for OCS infrastructure GIS.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Treasury Daily Treasury Statement: The Federal Cash Flow Data Published Every Business Day</title>
      <link>https://ai-analytics.org/writing/treasury-daily-statement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/treasury-daily-statement/</guid>
      <pubDate>Tue, 10 Nov 2026 00:00:00 GMT</pubDate>
      <description>The DTS is published each federal business day at 4 PM ET by the Bureau of the Fiscal Service, reporting the prior day&apos;s cash receipts, outlays, and borrowing. Tables cover TGA balance at Federal Reserve Banks, public debt outstanding, deposits and withdrawals by source category, operating cash balances, and federal agency deposits. The fiscal year deficit is the running sum of daily net outflows. Here is DTS Table I-VII structure, TGA balance mechanics, debt ceiling X-date tracking, Fiscal Data API access, and a Python script to chart daily outflows by category.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Reserve H.15: The Selected Interest Rates Release Behind Treasury Yields, Fed Funds, and Every Rate Benchmark</title>
      <link>https://ai-analytics.org/writing/fed-h15-interest-rates/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fed-h15-interest-rates/</guid>
      <pubDate>Mon, 09 Nov 2026 00:00:00 GMT</pubDate>
      <description>The Federal Reserve H.15 release publishes daily interest rate data for the federal funds effective rate, Treasury constant maturities (1-month through 30-year), prime rate, discount rate, and SOFR since the LIBOR transition. The 2-10 yield curve inverted to -108 bps in 2023, the deepest inversion since 1981. Here is CMT construction methodology, EFFR vs. SOFR vs. LIBOR mechanics, FRED series IDs (DFF, DGS10, SOFR), real rate calculation via TIPS breakevens, and a Python FRED API dual-chart of the yield curve spread with recession shading.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census Population Estimates Program: The Annual County and State Population Data Behind Apportionment, Funding, and Growth Tracking</title>
      <link>https://ai-analytics.org/writing/census-population-estimates/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-population-estimates/</guid>
      <pubDate>Sun, 08 Nov 2026 00:00:00 GMT</pubDate>
      <description>The Census PEP produces annual population estimates for all 3,100+ counties and 50 states using a cohort-component model: base census + births - deaths + net migration. Florida gained 2.1M residents 2020-2023; Texas gained 2.4M; NYC lost ~500K from 2020 peak. Here is the components-of-change methodology, TIGER geography linkage, vintage year vs. decennial census reconciliation, Census API pep/population endpoint, and a Python script ranking counties by population growth rate with net migration decomposition.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USDA FSIS Food Safety Data: The Federal Recall Database and Inspection Records Behind Meat, Poultry, and Egg Safety</title>
      <link>https://ai-analytics.org/writing/usda-fsis-food-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usda-fsis-food-safety/</guid>
      <pubDate>Sat, 07 Nov 2026 00:00:00 GMT</pubDate>
      <description>FSIS regulates 6,500+ meat, poultry, and egg processing establishments covering 80+ billion pounds of product annually. The three-class recall system escalates from Class III (mislabeling) to Class I (health hazard). The 2008 Hallmark/Westland recall (143M lbs, largest ever) involved downer cattle. E. coli O157:H7 is a zero-tolerance adulterant. Here is the Establishments.csv schema, FSIS recall database API, GenomeTrakr WGS pathogen tracing, HACCP plan requirements, PHIS inspection reports, and a Python recall trend analysis by Class and commodity.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census SAIPE: The Small Area Income and Poverty Estimates Behind Federal Education Funding and County-Level Poverty Maps</title>
      <link>https://ai-analytics.org/writing/census-saipe-poverty/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-saipe-poverty/</guid>
      <pubDate>Fri, 06 Nov 2026 00:00:00 GMT</pubDate>
      <description>SAIPE produces annual model-based poverty estimates for all 3,100+ counties and 13,000+ school districts, using ACS, IRS EITC filers, SNAP counts, and CPS via small area estimation. It drives ~$17B in Title I-A education funding and the $3.5B CDBG formula. Here is the methodology, why CPS and ACS cannot substitute, funding allocation mechanics, and a Python Census API analysis ranking counties by child poverty rate with year-over-year change.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOT National Transit Database: The Federal Ridership and Finance Data Behind Every US Bus and Rail System</title>
      <link>https://ai-analytics.org/writing/dot-national-transit-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dot-national-transit-database/</guid>
      <pubDate>Thu, 05 Nov 2026 00:00:00 GMT</pubDate>
      <description>NTD collects UPT, VRM, fares, and expenses from ~800 transit agencies as a condition of FTA grants. 2023 total: 10.4B UPT (below 15.7B pre-COVID peak). COVID collapsed NYC subway from 1.8B to 600M trips; $69B in emergency relief. Section 5307 formula (~$5B/year) uses NTD data. Here is mode-level recovery ratios, the 10-agency profile table, monthly ridership reports, and a Python COVID recovery analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USPTO Trademark Data: The Federal Brand Registry Behind 3 Million Active Marks and the TESS Search System</title>
      <link>https://ai-analytics.org/writing/uspto-trademark-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/uspto-trademark-data/</guid>
      <pubDate>Wed, 04 Nov 2026 00:00:00 GMT</pubDate>
      <description>~3M active registered trademarks, ~650,000 annual applications at peak. 45 Nice Classification classes, TEAS filing, use-in-commerce vs. ITU basis, Sections 8/15 post-registration requirements. Here is TESS search with DuPont factors, TTAB opposition/cancellation proceedings, bulk XML data at bulkdata.uspto.gov, USPTO Trademark JSON API, China 25% of foreign filings, and a Python Class 42 technology filing trend analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Reserve Senior Loan Officer Survey: The Quarterly Credit Conditions Data the Fed Uses to Track Lending Tightening</title>
      <link>https://ai-analytics.org/writing/fed-sloos-survey/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fed-sloos-survey/</guid>
      <pubDate>Tue, 03 Nov 2026 00:00:00 GMT</pubDate>
      <description>SLOOS surveys ~80 large US banks quarterly on lending standards changes. Net percentage (tightening minus easing): +80% C&amp;I in Q4 2008, +68% in Q2 2020. Net &gt;+50% historically predicts recession within 4 quarters. Here is C&amp;I, CRE, residential, and consumer loan question categories, special topical questions, SVB stress spike, and a Python FRED API dual-line chart with recession shading.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FCC Spectrum Data: The Universal Licensing System Behind 25 Million Wireless Licenses and US Radio Frequency Allocation</title>
      <link>https://ai-analytics.org/writing/fcc-spectrum-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fcc-spectrum-data/</guid>
      <pubDate>Mon, 02 Nov 2026 00:00:00 GMT</pubDate>
      <description>The FCC ULS holds 25M+ active wireless licenses. Spectrum auctions raised $160B+ total; Auction 110 (C-band 2021) netted $81B. Here is the National Table of Frequency Allocations (47 CFR Part 2), low/mid/mmWave 5G bands, auction mechanics (SMRA/CCA), AT&amp;T/Verizon/T-Mobile C-band spend, ULS bulk data schema (EN/HD/LO/FR/AN tables), broadcast licensing (CDBS/LMS), and a Python amateur license density analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>HUD Housing Choice Vouchers: The Section 8 Data Behind 2.3 Million Households and $30 Billion in Annual Rental Assistance</title>
      <link>https://ai-analytics.org/writing/hud-housing-vouchers/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hud-housing-vouchers/</guid>
      <pubDate>Sun, 01 Nov 2026 00:00:00 GMT</pubDate>
      <description>HCV serves ~2.3M households at ~$30B/year through ~2,200 PHAs. HUD publishes FMRs for ~2,600 areas at the 40th percentile of gross rent (NYC 2BR $2,765, rural MS $725). Only ~25% of eligible households receive assistance. Here is payment standard mechanics, Small Area FMRs, the PASH/PIC data (tract-level income/demographics), Project-Based PBRA (1.2M units), AFFH mapping, CHAS housing affordability data, and a Python FMR-to-renter-income affordability ratio analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census American Housing Survey: The Biennial Housing Quality Database Behind US Structural Conditions and Neighborhood Characteristics</title>
      <link>https://ai-analytics.org/writing/census-american-housing-survey/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-american-housing-survey/</guid>
      <pubDate>Sat, 31 Oct 2026 00:00:00 GMT</pubDate>
      <description>The AHS biennial panel (60,000 housing units, tracked since 1973) covers structural quality, condition deficiencies, heating fuel, plumbing, and neighborhood characteristics. Key trends: plumbing inadequacy 4.5% to &lt;0.5%; owner-occupancy 69% (2004-05) to 63% (2016) recovery; new single-family size 1,500 to 2,300+ sq ft. HUD uses AHS for Worst Case Housing Needs report (8.5M households 2023). Here is the PUF variables guide (TENURE, BUILT, ADEQUACY, ZINC2), metro oversamples, and a Python renter cost burden by building-age cohort analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USDA Economic Research Service: The Agricultural Economics Data Behind Farm Income, Food Prices, and Rural America</title>
      <link>https://ai-analytics.org/writing/usda-ers-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usda-ers-data/</guid>
      <pubDate>Fri, 30 Oct 2026 00:00:00 GMT</pubDate>
      <description>ERS publishes farm income ($116B net in 2023), food prices (Food Price Outlook, 2022 +11.4% grocery surge), food security (13.5% food insecure in 2023, 47M people, 18-item scale), commodity market outlooks (ARC/PLC reference prices), and rural America metrics (Beale Codes 1-9, 180+ rural hospital closures). The Food Access Research Atlas maps food deserts. Here is the FADS farm income system, government payment share analysis, and a Python farm income components visualization.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Employment Cost Index: The Quarterly Wage and Benefits Tracker the Federal Reserve Watches Most Closely</title>
      <link>https://ai-analytics.org/writing/bls-employment-cost-index/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-employment-cost-index/</guid>
      <pubDate>Thu, 29 Oct 2026 00:00:00 GMT</pubDate>
      <description>The ECI measures quarterly employer compensation costs (wages + benefits) with fixed employment weights, eliminating industry-mix distortion. Private-industry wages peaked at ~5.7% YoY mid-2022, decelerated to ~4.2% by end-2023, with Fed comfort level ~3.5% for 2% PCE inflation. Here is the National Compensation Survey design (~18,000 establishments), the ECEC benefits breakdown (health insurance ~$3.50-4.00/hour, total benefits ~31% of compensation), Fed usage and unit labor cost arithmetic, BLS API series IDs, and a Python script comparing ECI to Average Hourly Earnings.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL Unemployment Insurance Weekly Claims: The Thursday Morning Data Release That Moves Financial Markets</title>
      <link>https://ai-analytics.org/writing/dol-ui-weekly-claims/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-ui-weekly-claims/</guid>
      <pubDate>Wed, 28 Oct 2026 00:00:00 GMT</pubDate>
      <description>DOL publishes initial and continuing UI claims every Thursday at 8:30 AM ET. COVID peak: 6.87M initial claims week ending March 28, 2020 (previous record 695,000 in 1982). Pre-COVID lows ~200,000 (2018-2019, lowest since 1969). Here is program eligibility mechanics, CARES Act $600/week FPUC supplement, 4-week moving average smoothing rationale, auto retooling and hurricane spikes, FRED series ICSA/ICNSA/CC4WSA, and a Python shock-week detection script.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census Foreign Trade Statistics: The HS-Code Import and Export Database Behind Every US Trade Policy Decision</title>
      <link>https://ai-analytics.org/writing/census-foreign-trade/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-foreign-trade/</guid>
      <pubDate>Tue, 27 Oct 2026 00:00:00 GMT</pubDate>
      <description>2023 US goods exports $2.02T, imports $3.08T, deficit $1.06T. Census FTD compiles from CBP ACE entries and AES export filings. Data drills to 10-digit HS/Schedule B codes by country and port. Section 301 China tariffs reduced US-China deficit from $419B (2018) to $279B (2023) while shifting sourcing to Vietnam/Mexico/Taiwan. Here is USA Trade Online, the Census API endpoint for trade time series, and a Python semiconductor (HS 8542) sourcing-shift analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NIFC Wildfire Data: The Federal Statistics Behind 4 Million Acres Burned Annually and the Expanding Fire Season</title>
      <link>https://ai-analytics.org/writing/nifc-wildfire-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nifc-wildfire-data/</guid>
      <pubDate>Mon, 26 Oct 2026 00:00:00 GMT</pubDate>
      <description>NIFC tracks US wildfire statistics back to 1926. The 10-year rolling average acreage roughly doubled from 1980s to 2020s; 2015, 2017, and 2020 each exceeded 10 million acres. Camp Fire 2018: 85 deaths, 18,804 structures, ~$16.5B insured losses. Here is the USFS FOD database (2.3M records, 9 cause codes, A-G size classes), MTBS Landsat burn-severity mapping, KBDI drought index, VPD fire weather, NIFC ArcGIS REST perimeter services, and a Python trend analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Social Security OASDI: The Federal Data Behind $1.4 Trillion in Annual Benefits and 70 Million Recipients</title>
      <link>https://ai-analytics.org/writing/ssa-oasdi-benefits/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ssa-oasdi-benefits/</guid>
      <pubDate>Sun, 25 Oct 2026 00:00:00 GMT</pubDate>
      <description>Social Security OASDI (Old Age, Survivors, and Disability Insurance) paid $1.4T in benefits to ~70 million recipients in 2024, funded by 6.2% FICA payroll tax on wages up to $168,600. Here is the benefit formula (AIME from 35 highest indexed earning years, PIA bend points 90/32/15%), Full Retirement Age 67 for born 1960+, early claiming at 62 (25-30% reduction), delayed claiming to 70 (8%/year increase), average retirement benefit $1,907/month, COLA 3.2% in 2024 tied to CPI-W, WEP and GPO government pension offsets, the 2033 OASI trust fund depletion projection, and SSA Annual Statistical Supplement with 700+ tables.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census Current Population Survey: The Monthly Survey Behind Official US Poverty Rates and Income Inequality Measures</title>
      <link>https://ai-analytics.org/writing/census-cps-poverty/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-cps-poverty/</guid>
      <pubDate>Sat, 24 Oct 2026 00:00:00 GMT</pubDate>
      <description>The CPS interviews ~60,000 households monthly for the official unemployment rate and, via the March ASEC supplement (~95,000 households), the official US poverty rate. Here is the 4-8-4 rotating panel design, the official poverty measure (1963 Orshansky thresholds, $30,900 family-of-4 in 2023, 11.1% rate), the Supplemental Poverty Measure adding SNAP/housing/EITC (12.9% in 2023), median household income $80,610, the Gini coefficient, IPUMS CPS harmonized microdata back to 1962, and a Python Census API state poverty ranking script.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BEA International Transactions: The Balance of Payments Data Behind Every US Trade Deficit Headline</title>
      <link>https://ai-analytics.org/writing/bea-international-transactions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bea-international-transactions/</guid>
      <pubDate>Fri, 23 Oct 2026 00:00:00 GMT</pubDate>
      <description>The BEA International Transactions Accounts (ITAs) record all US-to-rest-of-world economic flows under IMF BPM6 standards. In 2023: goods deficit ~$1.06T, services surplus ~$293B, net primary income +$196B, total current account deficit ~$905B (3.3% of GDP). Here is the financial account (FDI inflows/outflows ~$350B/$500B), Treasury TIC data ($7.8T foreign Treasury holdings), the US net IIP of -$20.6T, the &quot;exorbitant privilege&quot; return differential, the savings-investment identity (CA = S-I), and the BEA ITA API with quarterly data back to 1960.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NOAA Climate Data: The National Centers for Environmental Information Behind 130 Years of Temperature Records and Climate Normals</title>
      <link>https://ai-analytics.org/writing/noaa-climate-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/noaa-climate-data/</guid>
      <pubDate>Thu, 22 Oct 2026 00:00:00 GMT</pubDate>
      <description>NOAA NCEI archives 150+ petabytes serving 25+ billion annual data requests. GHCN-Daily covers ~120,000 stations with daily Tmax/Tmin/PRCP back to the late 1800s. Here is the NOAAGlobalTemp dataset (2023 warmest year on record, +1.45 deg C above pre-industrial), US Climate Normals (1991-2020, 15,000+ stations), the Billion-Dollar Disasters database (28 events/$94B in 2023), HURDAT2 hurricane tracks, NOAA Tides and Currents sea level rise (3.6mm/year average), and the CDO REST API for programmatic station data access.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>VA Disability Benefits: The Federal Data Behind 5.5 Million Compensation Recipients and $130 Billion in Annual Spending</title>
      <link>https://ai-analytics.org/writing/va-disability-benefits/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/va-disability-benefits/</guid>
      <pubDate>Wed, 21 Oct 2026 00:00:00 GMT</pubDate>
      <description>The VA disability compensation program pays monthly benefits to ~5.5 million veterans based on a 0-100% whole-person rating. Here is the 2024 rate table ($171/month at 10% to $3,737 at 100%), the PACT Act 2022 (burn pit presumptives, 3.5M newly eligible veterans, $280B 10-year cost), the GI Bill Post-9/11 Ch. 33 (tuition cap, BAH, books stipend), the VA Home Loan Guaranty (no down payment, 4M+ loans), the 884K claims backlog peak, VSOs and TDIU (~370K recipients), and VA Open Data portal state-level benefits utilization data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USGS Water Resources: The National Water Information System Behind Flood Prediction, Drought Monitoring, and Aquifer Depletion</title>
      <link>https://ai-analytics.org/writing/usgs-water-resources/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usgs-water-resources/</guid>
      <pubDate>Tue, 20 Oct 2026 00:00:00 GMT</pubDate>
      <description>The USGS NWIS runs 8,000+ streamflow gauges feeding NWS River Forecast Centers and the National Water Model (2.7M reaches, 15-minute forecasts). Here is ADCP gauging, annual peak discharge feeding FEMA flood maps, the Ogallala Aquifer (174,000 sq miles, declining 1-3 ft/year in TX/KS), Central Valley subsidence, the NAWQA water quality program, water use surveys (thermoelectric 41%), the 7Q10 low-flow NPDES permit standard, the NWIS REST API, and a Python discharge hydrograph with drought shading.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OPM Federal Workforce Data: The Personnel Records Behind 2.1 Million Civilian Federal Jobs</title>
      <link>https://ai-analytics.org/writing/opm-federal-workforce/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/opm-federal-workforce/</guid>
      <pubDate>Mon, 19 Oct 2026 00:00:00 GMT</pubDate>
      <description>OPM manages HR for 2.1M+ federal civilians via the Central Personnel Data File and FedScope tool. Here is the GS pay system (GS-1 through GS-15, 48 locality areas, SF +44.15%), SES (~8,000 career SES, $148k-$222k), FERS retirement (DB + Social Security + TSP $800B+ AUM, 5% match), the ~100-day federal time-to-hire, Pathways Programs, the FRB 17-19% total compensation premium finding, and the 2025 DOGE workforce reduction context.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NSF Research Grants: Mapping $9 Billion in Annual Basic Science Funding</title>
      <link>https://ai-analytics.org/writing/nsf-research-grants/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nsf-research-grants/</guid>
      <pubDate>Sun, 18 Oct 2026 00:00:00 GMT</pubDate>
      <description>NSF funds ~25% of all federally funded basic research (excluding life sciences) with $9B+ annually across 8 directorates. Here is the dual merit review (Intellectual Merit + Broader Impacts), funding rates by directorate (17-25%), CAREER award ($500k/5yr), GRFP ($37k/yr, ~2,000 of 12,000+ applicants), NSF Awards API (600,000+ awards), National AI Research Institutes ($200M+), the 2023 immediate open-access mandate, EPSCoR geographic equity, and a Python CAREER grant API analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BTS Airline On-Time Performance: The Federal Dataset Behind Every Flight Delay, Cancellation, and Tarmac Crisis</title>
      <link>https://ai-analytics.org/writing/bts-airline-performance/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bts-airline-performance/</guid>
      <pubDate>Sat, 17 Oct 2026 00:00:00 GMT</pubDate>
      <description>The BTS ATOP/ASQP database covers ~6 million flight records per year across five delay categories: Carrier, NAS, Late Aircraft, Weather, and Security. Here is the T-100 domestic/international traffic series (ASM, RPM, load factor), Form 41 carrier financials (CASM, RASM, fuel 20-30% of costs), the COVID collapse (96% RPM decline April 2020, $54B CARES Act), the Southwest December 2022 meltdown (17,000 flights cancelled, $140M DOT settlement), the tarmac delay rule (3-hr domestic/4-hr international), and a Python Transtats on-time rate analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Reserve Z.1 Financial Accounts: The Flow of Funds Behind US Household Wealth and Sectoral Balances</title>
      <link>https://ai-analytics.org/writing/fed-z1-financial-accounts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fed-z1-financial-accounts/</guid>
      <pubDate>Fri, 16 Oct 2026 00:00:00 GMT</pubDate>
      <description>The Federal Reserve Z.1 (formerly Flow of Funds) publishes quarterly financial assets and liabilities for all US sectors. Here is the household net worth data ($156T 2021 peak, ~$8T 2022 decline), the Distributional Financial Accounts (top 1% hold ~31%, bottom 50% hold ~3%), two-sided sectoral balance mechanics, corporate leverage, Table B.101 real estate at market value ($25T to $43T 2019-2024), the $26T+ Treasury liability position, Rest of World holdings, FRED mnemonic guide, and a Python FRED API net worth analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census LEHD: The Longitudinal Employer-Household Dynamics Database Behind Workforce Flows, Commuting, and Wage Growth</title>
      <link>https://ai-analytics.org/writing/census-lehd-workforce/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-lehd-workforce/</guid>
      <pubDate>Thu, 15 Oct 2026 00:00:00 GMT</pubDate>
      <description>Census LEHD links UI wage records for 95%+ of private workers to employer and household data, producing the Quarterly Workforce Indicators (employment/payroll/hires/separations by county x industry x age x sex x education), LODES block-to-block commuting OD matrices, job-to-job flow statistics (7-10% switching earnings premium), and business dynamics data. Here is the COVID OD commute reshaping, the great resignation spike, OnTheMap tool, LEHD vs. QCEW/CES/ACS distinctions, and a Python Census QWI API script.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BEA Regional Accounts: GDP by State, Personal Income by County, and the Sub-National Data Behind Every State Policy Debate</title>
      <link>https://ai-analytics.org/writing/bea-regional-accounts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bea-regional-accounts/</guid>
      <pubDate>Wed, 14 Oct 2026 00:00:00 GMT</pubDate>
      <description>BEA Regional Accounts allocate national totals to states, counties, and MSAs: GDP by State (annual/quarterly, NAICS detail), Personal Income by State (quarterly, five-component decomposition), Personal Income by County (~3,100 counties, CAINC1), and GDP by MSA (~380 MSAs, NYC $2T+). Here is the energy boom-bust signal, high-income tax migration effect, COVID transfer payment surge and unwinding, BEA Regional API parameters, and a Python state per-capita income growth ranking.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USDA NASS Crop Surveys: The Federal Agricultural Data Behind Every Corn, Soybean, and Wheat Market</title>
      <link>https://ai-analytics.org/writing/usda-nass-crop-surveys/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usda-nass-crop-surveys/</guid>
      <pubDate>Tue, 13 Oct 2026 00:00:00 GMT</pubDate>
      <description>The USDA National Agricultural Statistics Service conducts 400+ surveys annually, reaching 3 million respondents to produce the authoritative federal record of US crop production, livestock inventories, and commodity prices since 1867. Here is the Crop Production report, WASDE supply-demand balance sheets, QuickStats API, weekly Crop Progress Good/Excellent condition ratings, the five major crops (corn 35% of cropland, Brazil soybean competition, wheat, cotton, rice), the 2012 drought sending corn to $8.49/bushel and soybeans above $17, Cattle on Feed, Hogs and Pigs quarterly, Prices Received/Paid, and a Python QuickStats API corn yield analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EIA Energy Data: The Federal Database Behind Oil Prices, Natural Gas Storage, and Electricity Generation</title>
      <link>https://ai-analytics.org/writing/eia-energy-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/eia-energy-data/</guid>
      <pubDate>Mon, 12 Oct 2026 00:00:00 GMT</pubDate>
      <description>The EIA is the primary federal authority for US energy data. Here is the Short-Term Energy Outlook, Weekly Petroleum Status Report (Cushing OK crude stocks moving WTI $1-2/barrel), Natural Gas Storage Report (five-region EIA-914, Thursday 10:30 AM release), EIA-860 and EIA-923 power plant databases (15,000+ generators), Electric Power Monthly, Petroleum Supply Monthly, the EIA Open Data API (500,000+ series), the 2019 US net petroleum export milestone, the 2022 European crisis Henry Hub spike to $9/MMBtu, and a Python EIA v2 API script pulling WTI crude and Henry Hub weekly prices.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census Building Permits and Housing Starts: The Federal Leading Indicator Behind the US Housing Market</title>
      <link>https://ai-analytics.org/writing/census-building-permits/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-building-permits/</guid>
      <pubDate>Sun, 11 Oct 2026 00:00:00 GMT</pubDate>
      <description>The Census Bureau Building Permits Survey and New Residential Construction release track ~20,000 permit-issuing jurisdictions monthly. Here is the 96% construction coverage, SAAR methodology, permits-to-starts lag, 2006 peak (2.07M SAAR) to 2009 trough (554K) to 2020-2021 surge to 2022-2023 pullback (3% to 7% mortgage rates), the SFH/multifamily bifurcation, Sun Belt concentration, New Residential Sales contract-signed timing, lumber futures (2021 spike to $1,700/MBF), Census BPS API, and FRED series PERMIT/HOUST.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Occupational Employment Data: Wages, Job Counts, and 10-Year Projections for Every US Occupation</title>
      <link>https://ai-analytics.org/writing/bls-occupational-outlook/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-occupational-outlook/</guid>
      <pubDate>Sat, 10 Oct 2026 00:00:00 GMT</pubDate>
      <description>The BLS OEWS program publishes wages and employment for 830 occupations across 590+ geographies from a 1.1M establishment survey pooled to ~3.3M observations. Here is the data structure (wage percentiles 10th-90th, location quotient, entry/experienced wages), the SOC taxonomy (867 detailed occupations), top-paying occupations (surgeons $250k+), Employment Projections 2022-2032 (home health aides +924k fastest-growing), the Occupational Outlook Handbook, O*NET crosswalk, wage inequality applications (90/10 ratio), H-1B prevailing wage Level I-IV, and a Python healthcare occupation wage analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FHWA Highway Data: The Federal Dataset Behind Bridge Conditions, Pavement Quality, and Traffic Counts</title>
      <link>https://ai-analytics.org/writing/dot-fhwa-highway/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dot-fhwa-highway/</guid>
      <pubDate>Fri, 09 Oct 2026 00:00:00 GMT</pubDate>
      <description>The FHWA publishes the National Bridge Inventory (620,000+ bridges, biennial inspection, structurally deficient vs. functionally obsolete classifications, 0-100 sufficiency rating), the Highway Performance Monitoring System (pavement IRI, Good/Fair/Poor condition), Annual Average Daily Traffic counts, and Highway Statistics. Here is the IIJA 2021 $40B bridge repair program, the Highway Trust Fund solvency crisis ($0.184/gallon gas tax frozen since 1993), the Freight Analysis Framework commodity-flow OD matrices, and a Python NBI bridge script to map structurally deficient bridges.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Current Employment Statistics: The Monthly Jobs Report Behind Every Payroll Number</title>
      <link>https://ai-analytics.org/writing/bls-current-employment/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-current-employment/</guid>
      <pubDate>Thu, 08 Oct 2026 00:00:00 GMT</pubDate>
      <description>Two surveys released on Jobs Friday: the Establishment Survey (580,000 worksites, payroll employment headline) and the Household Survey (60,000 households, unemployment rate). Here is the net birth/death model, the three-tier revision cycle, the January 2024 benchmark that removed 818,000 jobs from the prior year, X-13ARIMA-SEATS seasonal adjustment, industry dynamics, the COVID -20.5M single-month collapse, the 8:30 AM market impact, and a Python BLS API nonfarm payroll analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC EDGAR XBRL: The Machine-Readable Financial Statement Database Behind Every Public Company</title>
      <link>https://ai-analytics.org/writing/sec-edgar-xbrl/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-edgar-xbrl/</guid>
      <pubDate>Wed, 07 Oct 2026 00:00:00 GMT</pubDate>
      <description>The SEC requires XBRL-tagged financials from ~7,000 public companies since 2009-2011. Here is the US-GAAP taxonomy (17,000+ concepts), the Company Facts API, Company Concept API, and Frames API (cross-sectional data for all filers in one call), data quality pitfalls (30% custom extension elements, ASC 606 taxonomy changes), the Beneish M-score fraud detection application, and a Python SEC EDGAR API script to extract Apple revenue and net income history.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Skilled Nursing Facility Data: Star Ratings, Staffing, and the Quality Metrics Behind 15,000 Nursing Homes</title>
      <link>https://ai-analytics.org/writing/cms-skilled-nursing-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-skilled-nursing-data/</guid>
      <pubDate>Mon, 05 Oct 2026 00:00:00 GMT</pubDate>
      <description>CMS Care Compare publishes quality data for every Medicare- and Medicaid-certified skilled nursing facility. Here is the five-star composite rating system, the A-L scope/severity deficiency grid (IJ at J-L), the Payroll-Based Journal staffing system (replacing self-reported data in 2016), MDS resident assessments driving PDPM reimbursement and quality measures, COVID-19 nursing home crisis (170,000+ deaths, 38% of early US COVID deaths), private equity ownership transparency gaps, and a Python script to compute state-level star rating distributions from CMS Care Compare CSV files.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Occupational Injuries: The SOII Dataset Behind 2.8 Million Annual Workplace Injuries</title>
      <link>https://ai-analytics.org/writing/bls-occupational-injuries/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-occupational-injuries/</guid>
      <pubDate>Sun, 04 Oct 2026 00:00:00 GMT</pubDate>
      <description>The BLS Survey of Occupational Injuries and Illnesses surveys ~230,000 establishments to produce the national workplace injury count. Here is the TRIR formula, OSHA recordkeeping requirements (Form 300/300A/301), the case-and-demographic microdata, CFOI as the companion fatal census (~5,500/year, the fatal four in construction), the musculoskeletal disorder supplement, the underreporting problem (40-69% capture rate), and a Python BLS API script to compare incidence rates across construction, manufacturing, and healthcare.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA Air Quality System: The Federal Monitor Network Behind NAAQS Compliance and Pollution Mapping</title>
      <link>https://ai-analytics.org/writing/epa-air-quality-system/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-air-quality-system/</guid>
      <pubDate>Sat, 03 Oct 2026 00:00:00 GMT</pubDate>
      <description>The EPA AQS aggregates hourly and daily pollutant readings from 4,000+ sites. Here is the six criteria pollutant NAAQS framework, the 2024 PM2.5 standard tightened to 9 ug/m3, the AQI 0-500 scale, nonattainment designation and SIP mechanics, the Harvard Six Cities study and BenMAP health burden model (100,000+ annual PM2.5-attributable deaths), environmental justice monitoring gaps, wildfire smoke exceptional events provisions, and a Python EPA AQS API script to download daily PM2.5 readings and identify exceedance days.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>HUD Point-in-Time Count: The Federal Homeless Census Behind 650,000 Americans Without Shelter</title>
      <link>https://ai-analytics.org/writing/hud-homeless-count/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hud-homeless-count/</guid>
      <pubDate>Fri, 02 Oct 2026 00:00:00 GMT</pubDate>
      <description>HUD&apos;s annual Point-in-Time count, conducted by ~400 Continuum of Care regions in January, is the only national homeless census. Here is the sheltered vs. unsheltered methodology, the 2023 count of 653,100 (record high), California&apos;s 28% share, HMIS as the longitudinal individual tracking database, veteran homelessness (37,000+ and HUD-VASH), the chronic homeless definition (12+ months or 4+ episodes), methodological limitations, Housing First evidence, system performance measures, and a Python script to compute per-capita homeless rates by state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FAA Aviation Safety Data: The Federal Databases Behind Every Plane Crash Investigation</title>
      <link>https://ai-analytics.org/writing/faa-aviation-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/faa-aviation-safety/</guid>
      <pubDate>Thu, 01 Oct 2026 00:00:00 GMT</pubDate>
      <description>The federal aviation safety ecosystem spans four databases: NTSB accident database, FAA AIDS, NASA ASRS (voluntary near-miss reports), and the FAA Wildlife Strike Database. Here is the NTSB probable cause taxonomy, Boeing 737 MAX MCAS investigation, the ASRS reporting immunity mechanism, runway incursion categories, the Miracle on Hudson Canada Goose context, FAA Civil Aviation Registry, pilot workforce demographics, and a Python NTSB phase-of-flight fatal accident rate analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NRC Nuclear Safety Data: The Federal Database Behind Every Reactor Inspection and Incident Report</title>
      <link>https://ai-analytics.org/writing/nrc-nuclear-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nrc-nuclear-safety/</guid>
      <pubDate>Wed, 30 Sep 2026 00:00:00 GMT</pubDate>
      <description>The NRC publishes quarterly Performance Indicators, inspection findings, and daily Event Notification Reports for all 99 operating US nuclear reactors. Here is the Reactor Oversight Process, Significance Determination Process (Green/White/Yellow/Red), Licensee Event Reports, the TMI and Fukushima reform trail, probabilistic risk assessment (core damage frequency ~1E-5/reactor-year), ADAMS with 7M+ public documents, the 92-93% nuclear capacity factor, and a Python NRC PI XML parser.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Bureau of Prisons Data: The Federal Inmate Population Behind 150,000 Federal Prisoners</title>
      <link>https://ai-analytics.org/writing/bop-prison-population/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bop-prison-population/</guid>
      <pubDate>Tue, 29 Sep 2026 00:00:00 GMT</pubDate>
      <description>The BOP manages 121 federal prisons holding ~148,000 inmates -- down from a 219,000 peak in 2013. Here is the offense category breakdown (drug offenses 43%+), the crack cocaine sentencing disparity, FIRST STEP Act reforms, BJS National Prisoner Statistics, USSC case-level sentencing data, PACER federal court records, supervised release mechanics, private prison contracting ($700M+/year), ICE immigration detention, and federal recidivism data (68% rearrest within 3 years).</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USCIS Immigration Data: The Federal Database Behind Visas, Green Cards, and Naturalizations</title>
      <link>https://ai-analytics.org/writing/uscis-immigration-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/uscis-immigration-data/</guid>
      <pubDate>Mon, 28 Sep 2026 00:00:00 GMT</pubDate>
      <description>USCIS adjudicates ~8 million petitions annually. Here is naturalization data by country of birth, the employment-based 7% per-country cap creating 40+ year backlogs for Indian nationals (EB-2 priority date ~2012), H-1B lottery (470K registrations for 85K slots), the 1.7M+ affirmative asylum backlog, DACA quarterly data, EOIR immigration court backlogs with judge-level grant rate variation, DHS Yearbook of Immigration Statistics, and a Python USCIS naturalization workbook analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FBI UCR: The Federal Crime Statistics Behind Every Public Safety Analysis</title>
      <link>https://ai-analytics.org/writing/fbi-ucr-crime-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fbi-ucr-crime-data/</guid>
      <pubDate>Sun, 27 Sep 2026 00:00:00 GMT</pubDate>
      <description>The FBI UCR program collects crime data from ~18,000 law enforcement agencies -- transitioning from legacy SRS to incident-level NIBRS, creating massive 2021 coverage gaps when major cities failed to report. Here is the 8 Part I Index Crimes, NIBRS segment structure, the 2020-2021 murder surge (+30% single-year), hate crime data, LEOKA, the dark figure of crime and NCVS complement, clearance rates, and the CDE API with a Python state-level murder rate trend analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Medicare Cost Reports: The Annual Financial Disclosure Behind Every US Hospital</title>
      <link>https://ai-analytics.org/writing/cms-hospital-cost-reports/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-hospital-cost-reports/</guid>
      <pubDate>Sat, 26 Sep 2026 00:00:00 GMT</pubDate>
      <description>Every Medicare-certified hospital files an annual Medicare Cost Report -- the only source of audited hospital-level financial data spanning all hospitals regardless of ownership type. Here is the Worksheet structure, cost-to-charge ratio methodology, DSH disproportionate share payments, IME/GME teaching hospital adjustments ($12-15B combined), Worksheet S-10 uncompensated care, the HCRIS database at NBER, charge markup analysis, and a Python nonprofit vs. for-profit operating margin analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SBA 7(a) and 504 Loan Data: The Federal Small Business Lending Database Behind $40 Billion in Annual Guarantees</title>
      <link>https://ai-analytics.org/writing/sba-7a-504-loans/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sba-7a-504-loans/</guid>
      <pubDate>Fri, 25 Sep 2026 00:00:00 GMT</pubDate>
      <description>The SBA publishes loan-level data for all approved 7(a) and 504 loans. Here is the 7(a) guarantee structure (85%/75% on loans above/below $150K), the 504 three-party 50/40/10 split, loan-level dataset fields (NAICS, lender, status, charge-off amount, ownership flags), lender concentration, industry default rates, SBIC venture financing, and a Python Socrata API sector default rate analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS American Time Use Survey: The Federal Dataset Behind How Americans Actually Spend Their Time</title>
      <link>https://ai-analytics.org/writing/bls-american-time-use/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-american-time-use/</guid>
      <pubDate>Thu, 24 Sep 2026 00:00:00 GMT</pubDate>
      <description>The ATUS has tracked 24-hour time diaries for ~10,000 Americans annually since 2003 -- the only federal dataset measuring time allocation across all life activities. Here is the 17 major activity categories, the gender gap (women 2+ hours/day more household/caregiving vs. men&apos;s more leisure), parental childcare trends, the 2020 COVID remote work shift, leisure inequality by education, the Well-Being and Eating &amp; Health modules, IPUMS-ATUS access, and a Python weighted gender gap analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDIC Call Report Data: The Quarterly Financial Filing Behind Every US Bank&apos;s Balance Sheet</title>
      <link>https://ai-analytics.org/writing/fdic-call-report-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fdic-call-report-data/</guid>
      <pubDate>Wed, 23 Sep 2026 00:00:00 GMT</pubDate>
      <description>Every FDIC-insured institution files quarterly Call Reports (FFIEC 031/041/051) -- the primary supervisory dataset covering ~4,700 banks with balance sheet, income, asset quality, capital, and liquidity detail. Here is the RC schedule structure (HTM/AFS securities, loan categories, deposits), Schedule RI income statement, RC-N nonperforming loans, RC-R capital ratios and PCA thresholds, SVB 2022 warning signs (HTM unrealized losses, uninsured deposit concentration), the Texas Ratio methodology, FDIC BankFind Suite API, and a Python community-bank screening script.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS Multifactor Productivity: The Federal Dataset Behind Long-Run Economic Growth Accounting</title>
      <link>https://ai-analytics.org/writing/bls-multifactor-productivity/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-multifactor-productivity/</guid>
      <pubDate>Tue, 22 Sep 2026 00:00:00 GMT</pubDate>
      <description>The BLS Multifactor Productivity (TFP) program measures output growth unexplained by labor and capital inputs -- the Solow residual capturing technological progress. Here is the growth accounting decomposition, historical MFP episodes (1.5%/yr golden age to 1970s slowdown to IT revival to AI hypothesis), Hall-Jorgenson capital services methodology, labor vs. MFP distinction for real wages, unit labor costs as core services inflation driver (peaked 2022, recovered 2023), FRED series IDs (OPHNFB, ULCNFB), and a Python BLS API dual-axis chart.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Medicaid Enrollment Data: The Federal Dataset Behind 90 Million Beneficiaries and $900 Billion in Annual Spending</title>
      <link>https://ai-analytics.org/writing/hhs-medicaid-enrollment/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hhs-medicaid-enrollment/</guid>
      <pubDate>Mon, 21 Sep 2026 00:00:00 GMT</pubDate>
      <description>Medicaid covers ~90M people (~$900B/year) as the largest health coverage program by beneficiary count. Here is the monthly enrollment data by eligibility group, T-MSIS claims system, MBES expenditure data, the ACA expansion 37-state divide, the COVID continuous enrollment surge (70M to 95M) and 2023-2024 unwinding, FMAP mechanics (50-77% federal match), managed care 70% enrollment share, dual eligibles, long-term care (Medicaid covers 42%), and a Python Medicaid.gov Socrata API unwinding analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NLRB Union Election Data: The Federal Record of Every Organizing Drive and Vote Count</title>
      <link>https://ai-analytics.org/writing/nlrb-union-elections/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nlrb-union-elections/</guid>
      <pubDate>Sun, 20 Sep 2026 00:00:00 GMT</pubDate>
      <description>The NLRB conducts ~2,000-2,500 union representation elections annually and publishes detailed results -- eligible voters, votes for/against, unit size, industry, union affiliation. Here is the RC/RD/RM petition taxonomy, long-run win rate trend (60-65% 1950s to 45% post-PATCO to 70%+ in the 2022-2024 Starbucks/Amazon surge), bargaining unit determination, blocking charge mechanics, the 2023 rapid-response 21-day election rule, card check vs. secret ballot, and a Python FY2019-2024 election analysis by union affiliation.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL Wage and Hour Division: The Federal Enforcement Database Behind $300 Million in Annual Back-Wage Recoveries</title>
      <link>https://ai-analytics.org/writing/dol-wage-hour-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-wage-hour-enforcement/</guid>
      <pubDate>Sat, 19 Sep 2026 00:00:00 GMT</pubDate>
      <description>The DOL Wage and Hour Division enforces the FLSA, Davis-Bacon Act, Service Contract Act, FMLA, and child labor laws -- recovering $200-300M in back wages for 200,000-300,000 workers annually. Here is the WHISARD public enforcement database schema, the FLSA exempt vs. non-exempt classification battle, worker misclassification under the 2024 economic reality rule, H-2A agricultural wage violations, Davis-Bacon prevailing wage enforcement, the Asplundh $95M settlement, FLSA criminal prosecution under 216(a), and a Python sector-level penalty analysis by NAICS code.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS PPI: The Producer Price Index and the Federal Inflation Dataset That Leads CPI</title>
      <link>https://ai-analytics.org/writing/bls-ppi-producer-prices/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-ppi-producer-prices/</guid>
      <pubDate>Fri, 18 Sep 2026 00:00:00 GMT</pubDate>
      <description>The BLS Producer Price Index measures average change in selling prices received by domestic producers -- the upstream complement to the consumer-facing CPI, with a 2-3 month leading relationship to goods inflation. Here is the three indexing systems (Final Demand PPI launched 2014, Intermediate Demand stage-of-processing, traditional commodity-based), trade services margin methodology, the 2021-2022 supply chain surge (+22.9% FD goods peak), FRED series IDs (PPIFIS, PPIFAF, PPIFAE, PPICOR, PPIACO), BLS API access, and a Python 4-line chart of the inflation episode by component.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census PL 94-171: The Redistricting Data Behind Every Congressional Map</title>
      <link>https://ai-analytics.org/writing/census-decennial-redistricting/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-decennial-redistricting/</guid>
      <pubDate>Thu, 17 Sep 2026 00:00:00 GMT</pubDate>
      <description>Public Law 94-171 mandates block-level population data to states for legislative redistricting -- the foundational dataset for every congressional and state legislative district. Here is the five data tables (P1-P5, H1), geographic hierarchy to census block, Reynolds v. Sims and Wesberry v. Sanders case law, the 2020 apportionment results (Texas +2, New York missed by 89 people), differential privacy and the TopDown Algorithm controversy, the 63-combination race/ethnicity schema, Census API variable naming (P2_006N syntax), VRA Section 2 and the Gingles three-part test, and a Python Census API racial composition analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Treasury TIC: The Federal Dataset Behind Foreign Ownership of US Securities</title>
      <link>https://ai-analytics.org/writing/treasury-tic-capital-flows/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/treasury-tic-capital-flows/</guid>
      <pubDate>Tue, 15 Sep 2026 00:00:00 GMT</pubDate>
      <description>The Treasury International Capital system tracks foreign purchases and sales of US securities -- the primary federal source on who holds US Treasuries and how capital flows across borders. Here is the four main TIC reports (monthly major holders, TIC-S/TIC-B flow surveys, SHCA annual position survey, SHLA mirror), the top foreign holders (Japan $1.1T, China $800B peak, UK $700B, Belgium/Euroclear anomaly), the custodian country problem, China&apos;s &quot;financial nuclear option&quot; analysis, sudden stop risk, 2008 flight-to-safety dynamics, and a Python script to download the monthly major foreign holders Excel.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC WONDER: The Federal Mortality Database Behind Every Death Statistics Analysis</title>
      <link>https://ai-analytics.org/writing/cdc-wonder-mortality/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-wonder-mortality/</guid>
      <pubDate>Mon, 14 Sep 2026 00:00:00 GMT</pubDate>
      <description>CDC WONDER is the query interface for US death certificate data -- every death in America since 1999 coded by ICD-10 underlying cause, linked to place, age, race, and demographic characteristics. Here is the death certificate pipeline, ICD-10 code taxonomy (C codes for cancers, I codes for circulatory, F codes for mental, V-Y codes for external causes), the &lt;10 death suppression rule, age-adjusted rates using the 2000 Standard Population, the three-wave opioid crisis (prescription T40.2-T40.3 to heroin T40.1 to synthetic fentanyl T40.4, ~110K deaths in 2022), Case-Deaton deaths of despair research, and COVID-19 U07.1 excess mortality analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS JOLTS: The Federal Job Openings and Labor Turnover Survey Behind Every Tight-Labor-Market Claim</title>
      <link>https://ai-analytics.org/writing/bls-jolts-job-openings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-jolts-job-openings/</guid>
      <pubDate>Sun, 13 Sep 2026 00:00:00 GMT</pubDate>
      <description>The BLS Job Openings and Labor Turnover Survey measures the monthly flow of workers into and out of US employment -- job openings, hires, quits, and layoffs across 21,000 establishments. Here is the four core metrics, how the quit rate peaked at 3.0% in April 2022 signaling the hottest labor market in decades, the Beveridge Curve rightward shift revealing labor market frictions, labor hoarding dynamics in 2023, JOLTS vs. Indeed/LinkedIn alternative measures, FRED series IDs (JTSJOL, JTSHIL, JTSQUL, JTSLAL, JTSQUR), and a Python Beveridge Curve plot.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NHTSA FARS: The Federal Traffic Fatality Census Behind Every Road Safety Analysis</title>
      <link>https://ai-analytics.org/writing/nhtsa-fars-fatality-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nhtsa-fars-fatality-data/</guid>
      <pubDate>Sat, 12 Sep 2026 00:00:00 GMT</pubDate>
      <description>The NHTSA Fatality Analysis Reporting System is a complete census of every US traffic fatality since 1975 -- not a sample, but a record of all 38,000-43,000 annual deaths with linked accident, vehicle, and person detail. Here is the three-table structure (accident/vehicle/person), key variable codes (HARM_EV, MAN_COLL, LGT_COND, DRUNK_DR), the COVID anomaly (miles driven -13% but fatality rate spiked 24%), the alcohol-impaired decline from 20K/year in the 1980s to 10.5K/year, the pedestrian fatality rise from 4,300 to 7,500 since 2010, the CRSS companion for non-fatal crashes, and a Python state-level pedestrian fatality rate analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Medicare Advantage: Plan Bids, Star Ratings, and the Federal Dataset Behind Private Medicare</title>
      <link>https://ai-analytics.org/writing/cms-medicare-advantage/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-medicare-advantage/</guid>
      <pubDate>Thu, 10 Sep 2026 00:00:00 GMT</pubDate>
      <description>Medicare Advantage now covers 51% of Medicare beneficiaries (~33M people) through private insurance plans. Here is the CMS benchmark-bid-rebate payment system, the 40-measure Star Ratings framework, HCC risk adjustment upcoding controversy ($10-30B excess payments), prior authorization denial rates (OIG 2022: 13% denials met coverage criteria), enrollment concentration, and a Python market-share analysis by state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>IRS Statistics of Income: The Federal Dataset Behind the US Tax and Income Distribution</title>
      <link>https://ai-analytics.org/writing/irs-statistics-of-income/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/irs-statistics-of-income/</guid>
      <pubDate>Wed, 09 Sep 2026 00:00:00 GMT</pubDate>
      <description>The IRS SOI program has published aggregated tax return statistics since 1916 -- the definitive federal source on income distribution, effective tax rates, deductions, and credits. Here is the individual 1040 AGI class tables, top 1% income share data (Piketty-Saez source), EITC distribution, estate tax stepped-up basis issue, corporate SOI and TCJA effective rate dynamics, and the Public Use File for microsimulation.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OSHA Inspections: The Federal Database Behind Every Workplace Safety Violation and Citation</title>
      <link>https://ai-analytics.org/writing/dol-osha-inspections/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-osha-inspections/</guid>
      <pubDate>Tue, 08 Sep 2026 00:00:00 GMT</pubDate>
      <description>OSHA publishes every workplace inspection, citation, and penalty going back to 1972 -- covering ~130M US workers in 10M workplaces. Here is the inspection types, citation taxonomy (Willful $156K max through De Minimis), top-cited 29 CFR standards (fall protection #1 in the Fatal Four), the Imperial Sugar explosion, Amazon injury rate controversy, State Plan boundary, and a Python sector-level penalty analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>HMDA: The Home Mortgage Disclosure Act Dataset Behind Every Redlining Investigation</title>
      <link>https://ai-analytics.org/writing/hmda-mortgage-disclosure/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hmda-mortgage-disclosure/</guid>
      <pubDate>Mon, 07 Sep 2026 00:00:00 GMT</pubDate>
      <description>HMDA requires most mortgage lenders to publicly disclose every application, origination, and denial -- with loan amount, property location, applicant race/ethnicity, income, pricing, DTI, LTV, and AUS results. Here is the full post-2018 field schema, how CFPB and DOJ use denial-rate mapping to build redlining cases (Trustmark, Cadence, City National), denial reason codes, the HMDA Platform API, CRA examination connections, and a Python disparity-ratio analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census ACS: The American Community Survey and the Federal Demographic Dataset Behind Every Policy Decision</title>
      <link>https://ai-analytics.org/writing/census-acs-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-acs-data/</guid>
      <pubDate>Sun, 06 Sep 2026 00:00:00 GMT</pubDate>
      <description>The American Community Survey sends questionnaires to 3.5 million addresses per year, replacing the decennial long form with continuous annual estimates. Here is the 1-year vs. 5-year distinction, the full social/economic/housing/demographic variable taxonomy, margin of error thresholds, Census API variable naming (B19013_001E syntax), key tables for income/poverty/rent/race/commute, and a Python census-tract rent burden analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS CPI: The Consumer Price Index and the Federal Inflation Measurement Behind Every Policy Decision</title>
      <link>https://ai-analytics.org/writing/bls-cpi-inflation/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-cpi-inflation/</guid>
      <pubDate>Fri, 04 Sep 2026 00:00:00 GMT</pubDate>
      <description>The BLS Consumer Price Index has tracked urban consumer prices since 1913 -- the primary US inflation gauge driving Social Security COLAs, wage negotiations, and Fed policy. Here is CPI-U vs. CPI-W vs. Chained CPI, the basket weights (shelter 35%, OER methodology), CPI vs. PCE deflator gap, the 2021-2023 9.1% peak episode, FRED series IDs, BLS API access, and a Python chart of the inflation episode by component.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC BRFSS: The World&apos;s Largest Telephone Survey and the Federal Health Behavior Database</title>
      <link>https://ai-analytics.org/writing/cdc-brfss-behavioral-risk/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-brfss-behavioral-risk/</guid>
      <pubDate>Thu, 03 Sep 2026 00:00:00 GMT</pubDate>
      <description>The CDC Behavioral Risk Factor Surveillance System interviews ~450,000 adults per year across all 50 states -- the world&apos;s largest health survey. Here is the core module variables (obesity, smoking, diabetes, exercise, mental health), the raking weighting methodology, the PLACES MRP small-area estimation project, the 2011 cell-phone expansion discontinuity, and a Python approach to computing weighted state-level obesity prevalence from the LLCP XPT file.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FHFA House Price Index: The Federal Repeat-Sales Benchmark for US Home Prices</title>
      <link>https://ai-analytics.org/writing/fhfa-house-price-index/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fhfa-house-price-index/</guid>
      <pubDate>Wed, 02 Sep 2026 00:00:00 GMT</pubDate>
      <description>The FHFA HPI tracks single-family home price changes using repeat-sales methodology on conforming mortgages purchased by Fannie Mae and Freddie Mac -- back to 1975, with national, state, MSA, and ZIP code coverage. Here is the weighted repeat-sales methodology, conforming loan limit boundary, expanded-data HPI, the 40%+ pandemic surge, FHFA vs. Case-Shiller vs. Zillow distinctions, and a Python script for state-level YoY appreciation rankings.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>IRS Form 990: The Public Financial Disclosure Behind Every Major US Nonprofit</title>
      <link>https://ai-analytics.org/writing/irs-990-nonprofit-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/irs-990-nonprofit-data/</guid>
      <pubDate>Tue, 01 Sep 2026 00:00:00 GMT</pubDate>
      <description>The IRS requires most 501(c) organizations to file Form 990 publicly -- disclosing revenue, expenses, executive compensation, governance, and grant activity for the $3T+ US nonprofit sector. Here is the 990 vs. 990-EZ vs. 990-PF form variants, Part VII executive pay disclosure, the AWS S3 bulk XML dataset (4M+ filings), ProPublica Nonprofit Explorer API, dark money 501(c)(4) tracking, and a Python script for financial ratio extraction.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Reserve H.8: The Weekly Snapshot of Every US Commercial Bank&apos;s Balance Sheet</title>
      <link>https://ai-analytics.org/writing/fed-h8-bank-balance-sheets/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fed-h8-bank-balance-sheets/</guid>
      <pubDate>Mon, 31 Aug 2026 00:00:00 GMT</pubDate>
      <description>The Federal Reserve publishes the H.8 every Friday -- a weekly aggregate balance sheet for all US commercial banks covering $23T+ in assets: C&amp;I loans, real estate loans, securities (HTM vs. AFS), reserve balances, and deposit flows. Here is the large vs. small bank breakdown, the SVB $98B single-week deposit outflow, H.8 vs. Call Report distinctions, FRED series IDs, and a Python snippet tracking credit cycle signals.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Census County Business Patterns: Annual Establishment Counts, Employment, and Payroll for Every US County</title>
      <link>https://ai-analytics.org/writing/census-county-business-patterns/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-county-business-patterns/</guid>
      <pubDate>Sun, 30 Aug 2026 00:00:00 GMT</pubDate>
      <description>County Business Patterns is the Census Bureau annual series on US business activity at the county-NAICS level, published since 1964 -- establishment counts by size class, mid-March employment, and first-quarter payroll for every county. Here is the Business Register source, noise infusion disclosure methodology, Nonemployer Statistics companion, CBP vs. QCEW vs. Economic Census distinctions, Business Dynamics Statistics, Census API access, and how to compute manufacturing location quotients by county.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NAEP: The Nation&apos;s Report Card and the Federal Dataset Behind US Education Achievement</title>
      <link>https://ai-analytics.org/writing/naep-education-assessment/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/naep-education-assessment/</guid>
      <pubDate>Sat, 29 Aug 2026 00:00:00 GMT</pubDate>
      <description>The National Assessment of Educational Progress is the only nationally representative, continuing assessment of US student achievement -- reading, math, science, and more for 4th, 8th, and 12th graders. Here is the 0-500 scale and NAGB achievement levels, the COVID learning loss evidence (largest reading decline in 30 years), state comparison methodology, plausible values estimation, NAEP Data Explorer API, and the White-Black achievement gap trend since 1992.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS OEWS: The Occupational Employment and Wage Statistics Behind Every Salary Benchmark</title>
      <link>https://ai-analytics.org/writing/bls-oews-occupational-wages/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-oews-occupational-wages/</guid>
      <pubDate>Fri, 28 Aug 2026 00:00:00 GMT</pubDate>
      <description>The BLS Occupational Employment and Wage Statistics program covers 800+ occupations across every industry and geography -- the most comprehensive source for occupation-level wage percentiles in the US. Here is the survey methodology, full SOC hierarchy, wage percentile fields (10th through 90th), the H-1B prevailing wage Level I-IV connection, OEWS vs. CPS vs. QCEW distinctions, and a Python script for ranking the highest-paid tech occupations.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USPTO Patent Data: The Federal Database Behind Every US Patent Grant and Application</title>
      <link>https://ai-analytics.org/writing/uspto-patent-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/uspto-patent-data/</guid>
      <pubDate>Thu, 27 Aug 2026 00:00:00 GMT</pubDate>
      <description>The USPTO publishes bulk patent grant data (4M+ grants since 1976) and applications (since 2001), with PatentsView as the canonical research dataset -- disambiguated inventor and assignee records, CPC codes, citation networks, and prosecution history via PEDS. Here is the three patent types, continuation and evergreening strategy, Alice Corp and IPR quality controversies, PatentsView API, BigQuery public data, and a Python snippet for ranking top AI patent holders by CPC subclass.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BEA GDP and National Accounts: The Federal Dataset That Measures the US Economy</title>
      <link>https://ai-analytics.org/writing/bea-gdp-accounts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bea-gdp-accounts/</guid>
      <pubDate>Wed, 26 Aug 2026 00:00:00 GMT</pubDate>
      <description>The BEA National Income and Product Accounts are the official measure of US economic output, income, and spending — updated three times per year with advance, second, and third estimates. Here is the C+I+G+(X-M) expenditure identity, every GDP component in depth, real vs. nominal GDP, GDP by State and GDP by Industry breakdowns, the BEA API query structure, and FRED series IDs as the easiest access path.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA Drug Approvals: The NDA, BLA, and ANDA Database Behind Every Drug on the Market</title>
      <link>https://ai-analytics.org/writing/fda-drug-approvals/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-drug-approvals/</guid>
      <pubDate>Tue, 25 Aug 2026 00:00:00 GMT</pubDate>
      <description>The FDA CDER Drugs@FDA dataset tracks every drug approval action since 1939 — NDAs for brand drugs, BLAs for biologics, ANDAs for generics. Here is the Orange Book TE codes and patent/exclusivity listings, NCE/3-year/pediatric/orphan/biologic exclusivity mechanics, Breakthrough and Accelerated Approval designations, the Aduhelm controversy, and how to query OpenFDA drugs API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Medicare Part B Data: Every Procedure Billed to Medicare and What It Paid</title>
      <link>https://ai-analytics.org/writing/medicare-part-b-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/medicare-part-b-data/</guid>
      <pubDate>Mon, 24 Aug 2026 00:00:00 GMT</pubDate>
      <description>The CMS Medicare Part B Physician and Supplier Public Use File covers 1M+ providers, 12,000+ HCPCS procedure codes, and $400B+ in annual submitted charges. Here is the submitted vs. allowed vs. payment markup ratio, standardized payments removing geographic wage index, the Lucentis/Avastin ASP+6% controversy, the Salomon Melgen $21M ophthalmology fraud, and how to filter anti-VEGF injections to expose the billion-dollar pricing disparity.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS QCEW: The County-Level Employment and Wages Dataset Behind Every Local Economic Analysis</title>
      <link>https://ai-analytics.org/writing/bls-qcew-employment/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-qcew-employment/</guid>
      <pubDate>Sun, 23 Aug 2026 00:00:00 GMT</pubDate>
      <description>The BLS Quarterly Census of Employment and Wages covers 97%+ of US jobs at the county-NAICS industry level -- the most granular federal employment dataset available. Here is the QCEW vs. CES vs. LAUS distinctions, suppression rules for counties with fewer than three establishments, average weekly wage by sector, BLS bulk CSV download structure, and a Python snippet for highest-wage industries by county.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FBI NICS Background Checks: The Federal Dataset Behind 400 Million Firearm Transfer Attempts</title>
      <link>https://ai-analytics.org/writing/fbi-nics-background-checks/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fbi-nics-background-checks/</guid>
      <pubDate>Sat, 22 Aug 2026 00:00:00 GMT</pubDate>
      <description>The Brady Act NICS system has processed 400M+ background checks since 1998 — publishing monthly state-level counts of handgun, long gun, and permit check types. Here is the full check type taxonomy, why NICS counts do not equal gun sales, the default proceed loophole, the COVID and Biden inauguration demand spikes, and the BuzzFeed News parsed CSV.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>HUD LIHTC Database: Mapping 35 Years of Low-Income Housing Tax Credit Projects</title>
      <link>https://ai-analytics.org/writing/hud-lihtc-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hud-lihtc-database/</guid>
      <pubDate>Fri, 21 Aug 2026 00:00:00 GMT</pubDate>
      <description>The LIHTC program has financed 50,000+ projects and 3.5M+ affordable units since 1986. Here is the HUD database schema, the 9% vs. 4% credit mechanics, how State HFA Qualified Allocation Plans shape development geography, the National Housing Preservation Database complement, and how to compute units per capita by state.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFTC Commitments of Traders: The Weekly Federal Report Behind Futures Market Positioning</title>
      <link>https://ai-analytics.org/writing/cftc-commitments-of-traders/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cftc-commitments-of-traders/</guid>
      <pubDate>Thu, 20 Aug 2026 00:00:00 GMT</pubDate>
      <description>The CFTC publishes weekly open interest broken down by trader category for every regulated futures market since 1986. Here is the four COT formats, how net non-commercial positioning signals crowded trades, the disaggregated vs. legacy distinction, all covered markets, and how to build a 52-week COT z-score.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OFAC Sanctions Lists: The Treasury Database Every Financial Institution Must Screen Against</title>
      <link>https://ai-analytics.org/writing/treasury-ofac-sanctions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/treasury-ofac-sanctions/</guid>
      <pubDate>Wed, 19 Aug 2026 00:00:00 GMT</pubDate>
      <description>The OFAC SDN list (~8,000 entries) covers every individual, entity, and vessel US persons are prohibited from transacting with — civil penalties up to $1.3M per violation. Here is the full SDN record schema, all major sanctions programs, the 50% ownership rule, the Binance $4.3B landmark penalty, and how to parse the XML list.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EPA Toxic Release Inventory: 35 Years of Industrial Chemical Releases and Environmental Justice Patterns</title>
      <link>https://ai-analytics.org/writing/epa-toxic-release-inventory/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-toxic-release-inventory/</guid>
      <pubDate>Tue, 18 Aug 2026 00:00:00 GMT</pubDate>
      <description>EPCRA Section 313 requires 20,000+ industrial facilities to report annual releases of 800+ toxic chemicals. Here is the full TRI field schema, the 75% release decline since 1988, the 2024 PFAS additions, how to use the RSEI model for toxicity-weighted population exposure, and how to join TRI to Census ACS for environmental justice analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Hospital Quality Data: Outcomes, Readmissions, and Star Ratings for 6,000 US Hospitals</title>
      <link>https://ai-analytics.org/writing/cms-hospital-quality/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-hospital-quality/</guid>
      <pubDate>Mon, 17 Aug 2026 00:00:00 GMT</pubDate>
      <description>CMS Care Compare publishes quality measures for every Medicare-certified hospital — 30-day mortality and readmission rates, HCAHPS patient experience, process compliance, and Medicare spending per beneficiary. Here is the full measure taxonomy, risk adjustment methodology, HAC Reduction Program penalties, and how to download and analyze the data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC EDGAR XBRL Financials: Machine-Readable Fundamentals for Every Public Company</title>
      <link>https://ai-analytics.org/writing/sec-edgar-financials/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-edgar-financials/</guid>
      <pubDate>Sun, 16 Aug 2026 00:00:00 GMT</pubDate>
      <description>Every public company files XBRL-tagged financial statements with the SEC — extractable through the EDGAR Company Facts API, the Frames cross-sectional endpoint, and bulk quarterly FSN downloads. Here is the US-GAAP taxonomy structure, three data quality pitfalls, rate limits, and how to build a revenue growth screener.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Corporate Prosecution Registry: DPAs, NPAs, and the Too-Big-to-Jail Database</title>
      <link>https://ai-analytics.org/writing/corporate-prosecution-registry/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/corporate-prosecution-registry/</guid>
      <pubDate>Sat, 15 Aug 2026 00:00:00 GMT</pubDate>
      <description>The Corporate Prosecution Registry (Duke Law) tracks every federal corporate criminal resolution since 1990 — DPAs, NPAs, and guilty pleas — covering 400+ resolutions and $30B+ in fines. Here is the resolution taxonomy, the Yates Memo and Monaco Doctrine, the HSBC and Boeing landmark cases, the compliance monitor system, and FCPA enforcement patterns.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USAID Foreign Assistance Data: Tracing $50 Billion in Annual US Development Spending</title>
      <link>https://ai-analytics.org/writing/usaid-foreign-assistance/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaid-foreign-assistance/</guid>
      <pubDate>Fri, 14 Aug 2026 00:00:00 GMT</pubDate>
      <description>ForeignAssistance.gov publishes every US foreign assistance obligation and disbursement across all agencies — covering $50B+ per year since 2001. Here is the full dataset structure, PEPFAR ($110B+ cumulative, 20M+ on antiretrovirals), top recipient countries, implementing partner ecosystem, and 2025 USAID restructuring implications.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>PCAOB: The Federal Audit Watchdog Created After Enron and the KPMG Inspection-Data Scandal</title>
      <link>https://ai-analytics.org/writing/pcaob-audit-oversight/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/pcaob-audit-oversight/</guid>
      <pubDate>Thu, 13 Aug 2026 00:00:00 GMT</pubDate>
      <description>The PCAOB publishes inspection reports on every registered audit firm&apos;s deficiency rate. Here is the Big Four inspection pattern, the KPMG $50M stolen-inspection-list scandal, the HFCAA Chinese auditor crisis and 2022 CSRC breakthrough, and how researchers use deficiency rates as an auditor quality proxy.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Medicare Part D Prescribing Data: Every Drug Prescribed by Every Medicare Provider</title>
      <link>https://ai-analytics.org/writing/medicare-part-d-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/medicare-part-d-data/</guid>
      <pubDate>Wed, 12 Aug 2026 00:00:00 GMT</pubDate>
      <description>CMS publishes provider-level Part D prescribing data — 1M+ providers, 5,700+ drugs, $100B+ in visible spending per year. Here is the full schema, how the data exposed the opioid crisis (ProPublica Prescriber Checkup), the GLP-1 agonist cost surge, and how to join with CMS Open Payments for prescribing-payment correlation analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CMS Open Payments: Mapping $12 Billion in Drug and Device Industry Payments to Physicians</title>
      <link>https://ai-analytics.org/writing/cms-open-payments/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-open-payments/</guid>
      <pubDate>Tue, 11 Aug 2026 00:00:00 GMT</pubDate>
      <description>The Physician Payments Sunshine Act requires manufacturers to report every payment to physicians — consulting fees, speaker fees, meals, royalties, and 22 other categories. Here is the full schema, $3.5B/year scale, the GSK and Novartis enforcement cases, peer-reviewed payment-prescribing correlations, and how to join with Medicare Part D data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>ATF Crime Gun Trace Data: The Federal Dataset the Tiahrt Amendment Tried to Hide</title>
      <link>https://ai-analytics.org/writing/atf-crime-gun-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/atf-crime-gun-data/</guid>
      <pubDate>Mon, 10 Aug 2026 00:00:00 GMT</pubDate>
      <description>The ATF NTC processes 500,000+ firearm traces per year. Here is what the Tiahrt Amendment restricts, what aggregated state-level data reveals about the iron pipeline, how time-to-crime exposes straw purchasing, the FFL directory, AFMER manufacturing data, and the ghost gun tracing gap.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CPSC Product Recalls: The Federal Safety Database Behind 400 Consumer Product Recalls Per Year</title>
      <link>https://ai-analytics.org/writing/cpsc-product-recalls/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cpsc-product-recalls/</guid>
      <pubDate>Sun, 09 Aug 2026 00:00:00 GMT</pubDate>
      <description>The CPSC publishes every recall of consumer products — 400-500 per year across 15,000+ product categories. Here is the full recall database schema, the SaferProducts.gov incident system, the IKEA Malm and Fisher-Price Rock n Play landmark cases, CPSIA 2008, and how the voluntary recall process works.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NFIP Flood Insurance Data: Mapping 40 Years of Flood Claims Across 5 Million Policies</title>
      <link>https://ai-analytics.org/writing/nfip-flood-insurance/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nfip-flood-insurance/</guid>
      <pubDate>Sat, 08 Aug 2026 00:00:00 GMT</pubDate>
      <description>The FEMA NFIP publishes every paid flood claim since 1978 and active policy data — 5M+ policies, $1.3T in coverage, loss history including Katrina ($16B), Sandy ($8B), and Harvey ($9B). Here is the claims/policy dataset structure, flood zone taxonomy, repetitive loss problem, Risk Rating 2.0, and the OpenFEMA API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FARA: The Foreign Agent Registration Database Behind US Influence Operations</title>
      <link>https://ai-analytics.org/writing/fara-foreign-agents/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fara-foreign-agents/</guid>
      <pubDate>Fri, 07 Aug 2026 00:00:00 GMT</pubDate>
      <description>FARA requires US-based agents of foreign principals to register with DOJ and disclose activities and payments. Here is the full registration database schema, the Section 613 LDA exemption, the Manafort conviction and Mueller-era enforcement surge, $700M+ in annual disclosed payments, and how to query the eFile API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA Medical Device Recalls: The Database Behind Every Implant Failure and CPAP Warning</title>
      <link>https://ai-analytics.org/writing/fda-device-recalls/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-device-recalls/</guid>
      <pubDate>Thu, 06 Aug 2026 00:00:00 GMT</pubDate>
      <description>The FDA CDRH publishes every medical device recall action — Class I, II, and III — covering 1,000–1,500 recalls per year. Here is the full field schema, the DePuy ASR ($4B settlement) and Philips Respironics CPAP (5.5M+ units) landmark recalls, how MAUDE adverse event reports feed recall decisions, and how to query the OpenFDA device recall API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NCUA Credit Union Data: The 5300 Call Report and Enforcement Database for 4,700 Federally Insured Credit Unions</title>
      <link>https://ai-analytics.org/writing/ncua-credit-union-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ncua-credit-union-data/</guid>
      <pubDate>Wed, 05 Aug 2026 00:00:00 GMT</pubDate>
      <description>The NCUA publishes quarterly 5300 Call Report data for every federally insured credit union — assets, shares, loans, delinquency, net worth ratios — plus a public enforcement action database. Here is the data structure, the net worth PCA thresholds, the 2009 corporate credit union crisis ($28.5B bailout), and how to download and screen the quarterly data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFPB Enforcement Actions: The Public Record of $20 Billion in Consumer Finance Penalties</title>
      <link>https://ai-analytics.org/writing/cfpb-enforcement-actions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cfpb-enforcement-actions/</guid>
      <pubDate>Tue, 04 Aug 2026 00:00:00 GMT</pubDate>
      <description>The CFPB has brought 200+ enforcement actions since 2011 — covering UDAAP violations, redlining, student loan servicer abuses, and predatory auto lending — with $20B+ in consumer relief and penalties. Here is the enforcement action taxonomy, the UDAAP abusiveness standard, the Wells Fargo $3.7B action, and how to scrape and analyze the enforcement database.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BTS Border Crossing Entry Data: Monthly Counts of Every Vehicle, Truck, and Pedestrian at US Land Ports</title>
      <link>https://ai-analytics.org/writing/bts-border-crossings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bts-border-crossings/</guid>
      <pubDate>Mon, 03 Aug 2026 00:00:00 GMT</pubDate>
      <description>The BTS publishes monthly counts of every border crossing type at ~290 US land ports back to 1996 — personal vehicles, pedestrians, trucks, buses, trains, and containers by port. Here is the full taxonomy, the COVID-19 collapse (pedestrians -93%, trucks -28%), San Ysidro and Laredo dominance, and how to use the Socrata API for supply chain and trade flow analysis.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>ClinicalTrials.gov Data: The Federal Registry Behind Every Drug and Device Trial</title>
      <link>https://ai-analytics.org/writing/clinicaltrials-gov-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/clinicaltrials-gov-data/</guid>
      <pubDate>Sun, 02 Aug 2026 00:00:00 GMT</pubDate>
      <description>FDAAA 801 requires registration of all applicable clinical trials before enrollment and results submission within 12 months of completion — but 50%+ of trials still fail to report results. Here is the full NCT schema, how to access the AACT PostgreSQL mirror from Duke/CTTI, how to detect publication bias using the results reporting gap, and how the GLP-1 agonist trial explosion looks in the data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA 510(k) Device Clearances: The Substantial Equivalence Pathway That Cleared 100,000+ Medical Devices</title>
      <link>https://ai-analytics.org/writing/fda-510k-device-clearances/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-510k-device-clearances/</guid>
      <pubDate>Sat, 01 Aug 2026 00:00:00 GMT</pubDate>
      <description>The FDA 510(k) pathway clears medical devices by showing substantial equivalence to a predicate device — no clinical trials required. Here is the three-class device system, the K-number database fields, the predicate daisy-chain problem, the De Novo pathway, and the metal-on-metal hip and vaginal mesh controversies. Plus how to query the OpenFDA device API.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL H-2 Visa Disclosures: Mapping the Guest Worker Programs Feeding US Agriculture and Hospitality</title>
      <link>https://ai-analytics.org/writing/dol-h2-visa-disclosures/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-h2-visa-disclosures/</guid>
      <pubDate>Fri, 31 Jul 2026 00:00:00 GMT</pubDate>
      <description>The H-2A program (cap-free agricultural) and H-2B program (66,000-cap non-agricultural) bring hundreds of thousands of temporary workers to the US annually. DOL OFLC publishes quarterly disclosure files with employer, job title, wages, worksites, and worker counts. Here is the data structure, H-2A growth from 60,000 to 370,000+ certifications (2012-2023), and how to compare wages against adverse effect wage rates.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OCC Bank Enforcement Actions: Reading the Federal Regulator’s Public Disciplinary Record</title>
      <link>https://ai-analytics.org/writing/occ-bank-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/occ-bank-enforcement/</guid>
      <pubDate>Thu, 30 Jul 2026 00:00:00 GMT</pubDate>
      <description>The Office of the Comptroller of the Currency publishes every formal enforcement action against national banks and federal thrifts — from Commitment Letters through Formal Agreements, Consent Orders, and Cease-and-Desist Orders. Here is the enforcement action taxonomy, the BSA/AML enforcement pattern, the Wells Fargo consent order cascade, and how to scrape and analyze the OCC enforcement database.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FERC Enforcement: The Federal Watchdog Over Energy Market Manipulation</title>
      <link>https://ai-analytics.org/writing/ferc-energy-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ferc-energy-enforcement/</guid>
      <pubDate>Wed, 29 Jul 2026 00:00:00 GMT</pubDate>
      <description>FERC investigates electricity and gas market manipulation with penalties up to $1.4M per day per violation. Here is the enforcement database, the JP Morgan ($410M) and Barclays ($488M) cases, how Electric Quarterly Reports expose every bilateral power transaction, and how to search FERC eLibrary enforcement dockets.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SAMHSA Treatment Data: Mapping Substance Use Disorder Services Across 17,000 Facilities</title>
      <link>https://ai-analytics.org/writing/samhsa-treatment-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/samhsa-treatment-data/</guid>
      <pubDate>Tue, 28 Jul 2026 00:00:00 GMT</pubDate>
      <description>SAMHSA publishes the N-SSATS facility survey (17,000+ treatment locations with services, ownership, and MAT availability) and the TEDS admissions dataset. Here is the data structure, how to map OTP density against overdose death rates, and what the data reveals about rural treatment gaps.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Enforcement Actions: The Public Record of Every Securities Law Violation</title>
      <link>https://ai-analytics.org/writing/sec-enforcement-actions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-enforcement-actions/</guid>
      <pubDate>Mon, 27 Jul 2026 00:00:00 GMT</pubDate>
      <description>The SEC publishes Administrative Proceedings, Litigation Releases, and final orders covering 700-800 enforcement actions per year — with $4-5B in annual disgorgement and penalties. Here is the record structure, whistleblower program mechanics, and how to scrape and parse the enforcement databases.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>HHS OIG Exclusions: The Federal Healthcare Fraud Blacklist That Every Provider Must Screen Against</title>
      <link>https://ai-analytics.org/writing/hhs-oig-exclusions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hhs-oig-exclusions/</guid>
      <pubDate>Sun, 26 Jul 2026 00:00:00 GMT</pubDate>
      <description>The HHS OIG LEIE bars providers from billing Medicare and Medicaid — with $10,000 per-service penalties for employers that fail to screen. Here is the exclusion type taxonomy, how to download the monthly CSV, how it differs from SAM.gov EPLS, and how to implement fuzzy-match screening.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Form 8-K: The Real-Time Disclosure Feed for Every Material Corporate Event</title>
      <link>https://ai-analytics.org/writing/sec-8k-material-events/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-8k-material-events/</guid>
      <pubDate>Sat, 25 Jul 2026 00:00:00 GMT</pubDate>
      <description>Public companies must file Form 8-K within 4 business days of any material event — covering 33 item types from earnings releases and executive departures to bankruptcy filings and the 2023 cybersecurity incident disclosure requirement. Here is the item taxonomy, how to filter EDGAR for specific event types, and how non-reliance filings signal fraud.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NHTSA Vehicle Recall Data: 70 Years of Safety Defects Across 900 Million Vehicles</title>
      <link>https://ai-analytics.org/writing/nhtsa-vehicle-recalls/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nhtsa-vehicle-recalls/</guid>
      <pubDate>Fri, 24 Jul 2026 00:00:00 GMT</pubDate>
      <description>NHTSA maintains the recall database covering every safety-related defect since 1966. Here is the data structure, the NHTSA complaint-to-recall investigation pipeline, how to query by VIN, and what the Takata airbag recall and EV battery fire trend reveal about the dataset.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOL Form 5500: The Annual Filing That Exposes Every Private Pension and 401(k) Plan</title>
      <link>https://ai-analytics.org/writing/dol-form-5500-pensions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-form-5500-pensions/</guid>
      <pubDate>Thu, 23 Jul 2026 00:00:00 GMT</pubDate>
      <description>Every large ERISA plan files Form 5500 annually — covering 750,000+ plans with $10T+ in assets. Schedule C reveals service provider fees; Schedule SB tracks pension funding ratios. Here is the schema, EFAST2 access, and how to compute average expense ratios by plan size.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Senate LDA Lobbying Disclosures: Mapping $4 Billion in Annual Influence Spending</title>
      <link>https://ai-analytics.org/writing/lobbying-disclosure-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/lobbying-disclosure-data/</guid>
      <pubDate>Wed, 22 Jul 2026 00:00:00 GMT</pubDate>
      <description>The Lobbying Disclosure Act requires quarterly filings with the Senate SOPR — covering lobbyist identities, issue codes, specific bills lobbied, and dollar amounts. Here is the LDA API, the relationship to FARA and LD-203 contribution reports, and how to connect lobbying spending to legislative outcomes.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USASpending Federal Contracts: Tracing $700 Billion in Annual Government Procurement</title>
      <link>https://ai-analytics.org/writing/usaspending-federal-contracts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaspending-federal-contracts/</guid>
      <pubDate>Tue, 21 Jul 2026 00:00:00 GMT</pubDate>
      <description>USASpending.gov pulls from FPDS-NG to publish every federal contract action — award type, NAICS/PSC codes, competition type, small business set-asides, and full recipient data. Here is the field structure, how to use the USASpending API, and how journalists trace no-bid contracts and contractor concentration.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>EIA Electricity Data: The Federal Dataset Behind Every Kilowatt-Hour Generated, Sold, and Priced</title>
      <link>https://ai-analytics.org/writing/eia-electricity-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/eia-electricity-data/</guid>
      <pubDate>Mon, 20 Jul 2026 00:00:00 GMT</pubDate>
      <description>The EIA publishes Form 923 (monthly plant-level generation and fuel use), Form 861 (annual utility retail sales and pricing), Form 860 (every generator nameplate capacity and status), and EIA-930 (hourly real-time grid data by Balancing Authority). Here is the fuel mix transformation 2000-2023 (coal 52% to 16%, gas rise, wind/solar growth), ERCOT Texas isolation and Winter Storm Uri, the EIA API v2 structure, and a Python stacked-area chart of the energy transition.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CFPB Consumer Complaint Database: 5 Million Complaints Against Banks and Lenders</title>
      <link>https://ai-analytics.org/writing/cfpb-consumer-complaints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cfpb-consumer-complaints/</guid>
      <pubDate>Sun, 19 Jul 2026 00:00:00 GMT</pubDate>
      <description>The CFPB Consumer Complaint Database tracks 5M+ complaints since 2011 about mortgages, credit cards, debt collection, and credit reporting — with company response, relief status, and consumer narratives. Here is the schema, how credit reporting complaints surged post-COVID, and how the data connects to enforcement.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FINRA BrokerCheck: The Public Database of Every Registered Broker and Investment Adviser</title>
      <link>https://ai-analytics.org/writing/finra-brokercheck-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/finra-brokercheck-data/</guid>
      <pubDate>Sat, 18 Jul 2026 00:00:00 GMT</pubDate>
      <description>FINRA BrokerCheck publishes registration history, licenses, employment records, and disclosure events for every registered broker and firm. Here is the data structure, the recidivist broker problem, how to access the BrokerCheck API, and how attorneys use it to vet advisers.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Form 4: The Insider Trading Disclosure Behind Every Officer and Director Stock Transaction</title>
      <link>https://ai-analytics.org/writing/sec-form-4-insider-trading/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-form-4-insider-trading/</guid>
      <pubDate>Fri, 17 Jul 2026 00:00:00 GMT</pubDate>
      <description>Section 16(a) requires officers, directors, and 10%+ shareholders to file Form 4 within 2 business days of any stock transaction -- creating a near-real-time public record on EDGAR. Here is the full transaction code taxonomy (code P open-market purchases as the only high-signal code), the 10b5-1 plan gaming problem and 2022 SEC amendments, cluster-buying methodology, academic evidence on 6%+ abnormal returns, and a Python EDGAR bulk index screen.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA FAERS: The Adverse Drug Event Database Behind Post-Market Drug Safety</title>
      <link>https://ai-analytics.org/writing/fda-faers-adverse-events/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-faers-adverse-events/</guid>
      <pubDate>Thu, 16 Jul 2026 00:00:00 GMT</pubDate>
      <description>The FDA Adverse Event Reporting System contains 7 linked quarterly files tracking drug adverse events. Here is the MedDRA schema, how disproportionality analysis (PRR/ROR) detects safety signals, and the Avandia/Vioxx/SSRI signal cases.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>College Scorecard: The Federal Dataset That Exposes Graduation Rates, Debt, and Earnings for Every US College</title>
      <link>https://ai-analytics.org/writing/college-scorecard-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/college-scorecard-data/</guid>
      <pubDate>Wed, 15 Jul 2026 00:00:00 GMT</pubDate>
      <description>The College Scorecard links IPEDS enrollment data to federal loan records and IRS earnings data. Here is the data structure, how to use the API, and what the earnings-debt gap reveals about for-profit colleges and high-debt programs.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CISA KEV Catalog: The Federal Government&apos;s Definitive List of Actively Exploited Vulnerabilities</title>
      <link>https://ai-analytics.org/writing/cisa-known-exploited-vulnerabilities/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cisa-known-exploited-vulnerabilities/</guid>
      <pubDate>Tue, 14 Jul 2026 00:00:00 GMT</pubDate>
      <description>The CISA Known Exploited Vulnerabilities catalog lists CVEs confirmed as actively exploited in the wild — with mandatory federal patching deadlines under BOD 22-01. Here is the catalog structure, how it differs from CVSS scoring, and how security teams use it for patch prioritization.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USCIS H-1B Visa Data: Mapping the 600,000-Worker Skilled Immigration Pipeline</title>
      <link>https://ai-analytics.org/writing/uscis-h1b-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/uscis-h1b-data/</guid>
      <pubDate>Mon, 13 Jul 2026 00:00:00 GMT</pubDate>
      <description>The DOL Labor Condition Application dataset and USCIS H-1B Employer Data Hub reveal the true shape of the skilled-worker visa program: IT staffing companies dominate approvals, India-born workers hold 70%+ of visas, and prevailing wage Level I filings expose systematic wage suppression.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>DOJ False Claims Act Settlements: The $70 Billion Fraud Recovery Database</title>
      <link>https://ai-analytics.org/writing/doj-false-claims-act/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/doj-false-claims-act/</guid>
      <pubDate>Sun, 12 Jul 2026 00:00:00 GMT</pubDate>
      <description>The False Claims Act is the government&apos;s primary anti-fraud tool, with qui tam whistleblowers driving 80%+ of the $2B+ in annual recoveries. Healthcare fraud dominates. Here is how to access the DOJ settlement database, scrape press releases, and identify repeat violators.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NIH Research Grant Data: Mapping $40 Billion in Annual Biomedical Funding</title>
      <link>https://ai-analytics.org/writing/nih-research-grants/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nih-research-grants/</guid>
      <pubDate>Sat, 11 Jul 2026 00:00:00 GMT</pubDate>
      <description>The NIH Reporter system publishes every grant award — PI, institution, project title, abstract, award amount, IC, and activity code. Here is the activity code taxonomy, how funding flows by Institute/Center, and how to use the NIH Reporter API to track COVID research spending and HBCU funding gaps.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CDC Drug Overdose Mortality Data: The Federal Dataset Behind the Opioid Crisis</title>
      <link>https://ai-analytics.org/writing/cdc-drug-overdose-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cdc-drug-overdose-data/</guid>
      <pubDate>Fri, 10 Jul 2026 00:00:00 GMT</pubDate>
      <description>The CDC publishes overdose mortality through NVSS, CDC WONDER, and monthly VSRR provisional counts — tracking 107,000+ annual drug deaths at the county, demographic, and drug-category level. Here is the ICD-10 code structure, the three waves of the opioid epidemic, and how to access the data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDIC Bank Failure Data: Every US Bank That Has Failed Since 1934</title>
      <link>https://ai-analytics.org/writing/fdic-bank-failures/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fdic-bank-failures/</guid>
      <pubDate>Thu, 09 Jul 2026 00:00:00 GMT</pubDate>
      <description>The FDIC publishes a complete failure list covering 4,000+ bank closures since 1934 — S&amp;L crisis wave, the 2008–2012 GFC wave, and the 2023 SVB/Signature/First Republic episode. Here is the dataset schema, how to use call report data and the Texas Ratio to identify at-risk institutions, and how to access FDIC BankFind.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FDA Warning Letters: The Public Enforcement Record for 100,000+ Regulatory Actions</title>
      <link>https://ai-analytics.org/writing/fda-warning-letters/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fda-warning-letters/</guid>
      <pubDate>Wed, 08 Jul 2026 00:00:00 GMT</pubDate>
      <description>The FDA publishes every warning letter on its website — pharmaceutical cGMP violations, food safety failures, device adulteration, and clinical investigator fraud. Here is the enforcement hierarchy from Form 483 to criminal referral, how to access and scrape the letter database, and what the record reveals about repeat violators.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>MSHA Mine Safety Data: Violations, Accidents, and Fatalities Across 10,000 Active Mines</title>
      <link>https://ai-analytics.org/writing/msha-mine-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/msha-mine-safety/</guid>
      <pubDate>Tue, 07 Jul 2026 00:00:00 GMT</pubDate>
      <description>The Mine Safety and Health Administration publishes three linked datasets — mine listings, accident/injury records, and violation citations going back to 1983. Here is the significant-and-substantial designation, the Pattern of Violations mechanism, the Upper Big Branch disaster context, and how to join violations to accidents by Mine ID.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>USCG Marine Casualty Data: Every US Vessel Accident Since 1982</title>
      <link>https://ai-analytics.org/writing/uscg-marine-casualties/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/uscg-marine-casualties/</guid>
      <pubDate>Mon, 06 Jul 2026 00:00:00 GMT</pubDate>
      <description>The US Coast Guard maintains the Boating Accident Report Database (BARD) for recreational vessels and the Marine Casualty and Pollution Database (MCPD) for commercial casualties. Here is what each database contains, how alcohol and life-jacket non-use drive fatality statistics, and how journalists use the data to track manufacturer defects.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FMCSA Carrier Safety Ratings: The Federal Database Behind 550,000 Trucking Companies</title>
      <link>https://ai-analytics.org/writing/fmcsa-safety-ratings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fmcsa-safety-ratings/</guid>
      <pubDate>Sun, 05 Jul 2026 00:00:00 GMT</pubDate>
      <description>The FMCSA maintains SAFER and MCMIS covering every commercial motor carrier in interstate commerce — three official safety ratings, seven SMS BASICs scoring each carrier as a percentile, inspection counts, OOS rates, and crash data. Here is the data structure, how to access it, and what it reveals about high-risk carriers.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>CBP US Trade Statistics: The Federal Dataset Behind Every Import and Export</title>
      <link>https://ai-analytics.org/writing/cbp-trade-statistics/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cbp-trade-statistics/</guid>
      <pubDate>Sat, 04 Jul 2026 00:00:00 GMT</pubDate>
      <description>US Customs and Border Protection and the Census Bureau publish comprehensive import and export statistics by commodity (HTS code), trading partner, port of entry, and month. Here is the data structure, how to access USA Trade Online and the Census Foreign Trade API, and what the data reveals about trade diversion after Section 301 tariffs.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>ICE Enforcement and Removal Operations: Reading the Federal Dataset Behind Immigration Enforcement</title>
      <link>https://ai-analytics.org/writing/ice-enforcement-removals/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ice-enforcement-removals/</guid>
      <pubDate>Fri, 03 Jul 2026 00:00:00 GMT</pubDate>
      <description>ICE publishes annual ERO reports covering arrests, detentions, removals, and returns by country of origin, criminal vs. non-criminal designation, and field office. Here is the data structure, TRAC-ICE access, and what the dataset reveals about enforcement priority shifts and the interior vs. border enforcement split.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS CPI-U: The Inflation Dataset That Moves Markets and Sets Policy</title>
      <link>https://ai-analytics.org/writing/bls-cpi-u/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-cpi-u/</guid>
      <pubDate>Thu, 02 Jul 2026 00:00:00 GMT</pubDate>
      <description>The BLS Consumer Price Index for All Urban Consumers tracks monthly inflation going back to January 1913. Here is the expenditure weight breakdown, how CPI-U differs from core CPI and the PCE deflator, how to access it via the BLS API, and what the 2021-2023 surge revealed about shelter inflation measurement.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SSA Disability Award Statistics: The Federal Dataset Behind 8 Million Benefit Decisions</title>
      <link>https://ai-analytics.org/writing/ssa-disability-awards/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ssa-disability-awards/</guid>
      <pubDate>Wed, 01 Jul 2026 00:00:00 GMT</pubDate>
      <description>The Social Security Administration publishes annual disability award statistics covering both SSDI and SSI — awards by state, diagnosis code, age group, gender, and decision level. Here is what the dataset contains, how to access it, and what it reveals about geographic variation in award rates, the ALJ hearing backlog, and the Trust Fund solvency timeline.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NLRB Unfair Labor Practice Data: 300,000 Cases of Worker-Management Conflict</title>
      <link>https://ai-analytics.org/writing/nlrb-ulp-filings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nlrb-ulp-filings/</guid>
      <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
      <description>The National Labor Relations Board maintains a public case management system tracking every unfair labor practice charge filed under the NLRA — 20,000–25,000 annually. Here is the case lifecycle, data structure, how to query the NLRB API, and what the data reveals about the 2022–2024 Starbucks and Amazon organizing surge.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BLS JOLTS: The Federal Dataset That Measures Why Workers Quit</title>
      <link>https://ai-analytics.org/writing/bls-jolts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bls-jolts/</guid>
      <pubDate>Mon, 29 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Job Openings and Labor Turnover Survey tracks monthly job openings, hires, quits, layoffs, and other separations by industry and region. Here is the data structure, BLS API access, and what JOLTS reveals about the Great Resignation, the Fed&apos;s rate-hike calculus, and the labor market signals that precede recessions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>FTC Consumer Sentinel Network: 16 Million Fraud Reports Hiding in Plain Sight</title>
      <link>https://ai-analytics.org/writing/ftc-consumer-complaints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ftc-consumer-complaints/</guid>
      <pubDate>Sun, 28 Jun 2026 00:00:00 GMT</pubDate>
      <description>The FTC Consumer Sentinel Network aggregates 8M+ fraud, identity theft, and consumer complaint reports annually from the FTC and dozens of partner organizations. Here is what the dataset contains, how to access it, and what it reveals about imposter scams, cryptocurrency fraud, and the counterintuitive age dynamics of financial loss.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Derailments and grade crossings: using FRA railroad accident data to analyze rail safety trends</title>
      <link>https://ai-analytics.org/writing/fra-railroad-accident-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fra-railroad-accident-data/</guid>
      <pubDate>Sat, 27 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Federal Railroad Administration publishes two linked databases covering US railroad safety since 1975: Form 54 (all rail accidents) and Form 57 (highway-rail grade crossing accidents). Together they cover 250,000+ incidents with train information, track type, speed at accident, casualties, and equipment damage.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The graveyard of pensions: using PBGC data to track terminated defined-benefit plans</title>
      <link>https://ai-analytics.org/writing/pbgc-pension-terminations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/pbgc-pension-terminations/</guid>
      <pubDate>Fri, 26 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Pension Benefit Guaranty Corporation publishes data on every terminated private-sector defined-benefit pension plan it has trusteed since 1975 — over 5,000 plans covering millions of workers. The data reveals which industries have abandoned their pension obligations and which plan sponsors walked away from the largest underfunded obligations.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Seismic record: using the USGS earthquake catalog to analyze fault risk and induced seismicity</title>
      <link>https://ai-analytics.org/writing/usgs-earthquake-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usgs-earthquake-database/</guid>
      <pubDate>Thu, 25 Jun 2026 00:00:00 GMT</pubDate>
      <description>The USGS National Earthquake Information Center maintains a catalog of every recorded earthquake globally — magnitude 2.5+ events back to 1900, with 100,000+ events per year above M4 globally. Here is the data structure, how to access the API and bulk downloads, and what the catalog reveals about fault hazard zones and the Oklahoma induced seismicity surge.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Following EPA enforcement: using ECHO data to track environmental violations and penalties</title>
      <link>https://ai-analytics.org/writing/epa-enforcement-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/epa-enforcement-data/</guid>
      <pubDate>Wed, 24 Jun 2026 00:00:00 GMT</pubDate>
      <description>EPA&apos;s Enforcement and Compliance History Online (ECHO) publishes every CAA, CWA, RCRA, and TSCA enforcement case — facility violations, formal actions, penalties assessed, and compliance status for 800,000+ regulated facilities. Here is the data structure, how to query it, and what the database reveals about enforcement gaps and environmental justice.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>60 years of extreme weather: using NOAA Storm Events data to map tornado paths, flood losses, and climate trends</title>
      <link>https://ai-analytics.org/writing/noaa-storm-events-database/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/noaa-storm-events-database/</guid>
      <pubDate>Tue, 23 Jun 2026 00:00:00 GMT</pubDate>
      <description>NOAA&apos;s Storm Events Database records every significant weather event in the US since 1950 — tornadoes, floods, hurricanes, winter storms, heat waves, wildfires, and 50+ other event types with location, deaths, injuries, and property damage estimates. Here is the data structure, how to access it, and what the database reveals about extreme weather trends, geographic risk concentration, and the growing cost of natural disasters.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Dark money disclosed: using IRS Form 990 data to map political organization spending</title>
      <link>https://ai-analytics.org/writing/irs-990-political-organizations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/irs-990-political-organizations/</guid>
      <pubDate>Mon, 22 Jun 2026 00:00:00 GMT</pubDate>
      <description>The IRS publishes Form 990 filings for political organizations — 527 committees and 501(c)(4) social welfare organizations (the dark money vehicle). The data covers revenue, expenditures, officer compensation, and political activities for 65,000+ organizations. Here is what the data contains and what it reveals about the shadow infrastructure of US political spending.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Every US plane crash since 1962: using the NTSB aviation accident database</title>
      <link>https://ai-analytics.org/writing/ntsb-aviation-accidents/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ntsb-aviation-accidents/</guid>
      <pubDate>Sun, 21 Jun 2026 00:00:00 GMT</pubDate>
      <description>The National Transportation Safety Board publishes a database of every US civil aviation accident since 1962 — over 90,000 accidents and incidents with aircraft type, probable cause, phase of flight, weather, pilot certificates, and injury counts. Here is the data structure, how to query it, and what 60 years of accident data reveals.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Pipeline spills and explosions: using PHMSA incident data to map 50 years of pipeline failures</title>
      <link>https://ai-analytics.org/writing/phmsa-pipeline-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/phmsa-pipeline-safety/</guid>
      <pubDate>Sat, 20 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Pipeline and Hazardous Materials Safety Administration publishes incident reports for every significant pipeline accident since 1970 — gas distribution, gas transmission, hazardous liquids, and LNG facilities. The database covers 25,000+ incidents with fatalities, injuries, property damage, and commodity spilled.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Food stamps by the numbers: using USDA SNAP participation data to track hunger and benefit policy</title>
      <link>https://ai-analytics.org/writing/usda-snap-program-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usda-snap-program-data/</guid>
      <pubDate>Fri, 19 Jun 2026 00:00:00 GMT</pubDate>
      <description>The USDA Food and Nutrition Service publishes monthly SNAP participation and benefit data by state — total participants, households, benefits issued, average benefit per person, and issuance history going back to 1969. The data shows how food assistance responds to recessions, pandemic aid expansions, and state-level work requirement policies.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The demographic backbone: using Census ACS data to contextualize every other federal dataset</title>
      <link>https://ai-analytics.org/writing/census-acs-demographic-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/census-acs-demographic-data/</guid>
      <pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Census Bureau&apos;s American Community Survey publishes 5-year estimates for every census tract in the US — income, poverty, race, housing tenure, education, employment, and 350+ other variables at the tract level. ACS is the denominator that makes every other federal dataset meaningful: HMDA denial rates per capita, OSHA injury rates per worker, SNAP participation per household.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Mapping housing discrimination: using HUD FHEO complaint data to find fair housing violations</title>
      <link>https://ai-analytics.org/writing/hud-fair-housing-complaints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hud-fair-housing-complaints/</guid>
      <pubDate>Wed, 17 Jun 2026 00:00:00 GMT</pubDate>
      <description>HUD&apos;s Fair Housing and Equal Opportunity office publishes a complaint database covering every fair housing complaint filed with HUD and participating state agencies — basis of discrimination (race, national origin, disability, familial status, sex, religion), property type, complaint disposition, and whether the complainant received relief.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Inside the count: using BJS National Prisoner Statistics to analyze incarceration trends</title>
      <link>https://ai-analytics.org/writing/bjs-national-prisoner-statistics/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bjs-national-prisoner-statistics/</guid>
      <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Bureau of Justice Statistics publishes the National Prisoner Statistics program — state and federal prison populations back to 1925, with demographics (race, sex, age), offense categories, sentence lengths, and admissions/releases flows. Here is the data structure, how to access it, and what 100 years of incarceration data reveals about mandatory minimums, the drug war, and mass incarceration&apos;s racial dimensions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Workplace safety violations: using OSHA inspection and citation data to find dangerous employers</title>
      <link>https://ai-analytics.org/writing/osha-inspection-enforcement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/osha-inspection-enforcement/</guid>
      <pubDate>Mon, 15 Jun 2026 00:00:00 GMT</pubDate>
      <description>OSHA publishes its full inspection and citation database — every workplace inspection since 1972, every violation found, every penalty assessed, and whether the employer contested the citation. The database covers 2.5M+ inspections across all industries. Here is what it contains, how to query it, and what patterns emerge from 50 years of enforcement data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Wage theft by employer: using DOL Wage and Hour Division enforcement data to find labor violations</title>
      <link>https://ai-analytics.org/writing/dol-wage-hour-violations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dol-wage-hour-violations/</guid>
      <pubDate>Sun, 14 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Department of Labor&apos;s Wage and Hour Division publishes a public enforcement database covering every concluded investigation — employer name, violation type, back wages owed, employees affected, and civil money penalties.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Every US traffic death since 1975: using NHTSA FARS to analyze road safety, vehicle defects, and enforcement gaps</title>
      <link>https://ai-analytics.org/writing/nhtsa-fars-traffic-fatalities/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nhtsa-fars-traffic-fatalities/</guid>
      <pubDate>Sat, 13 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Fatality Analysis Reporting System (FARS) contains a record for every motor vehicle crash death on US public roads since 1975 — 1.1M+ fatalities with vehicle type, crash circumstances, driver behavior, and roadway conditions. Here is the data structure, how to download it, and what it reveals about drunk driving trends, pedestrian deaths, and the safety gap between vehicle classes.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Incident-level crime: using FBI NIBRS data to analyze offense patterns, victim demographics, and clearance rates</title>
      <link>https://ai-analytics.org/writing/fbi-nibrs-crime-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fbi-nibrs-crime-data/</guid>
      <pubDate>Fri, 12 Jun 2026 00:00:00 GMT</pubDate>
      <description>The FBI&apos;s National Incident-Based Reporting System (NIBRS) publishes incident-level crime data — every offense, victim, offender, arrest, and property loss reported by participating agencies. Here is what the database contains, how it differs from the legacy UCR Summary data, and how to use it for research on offense patterns, racial disparities in enforcement, and geographic hot-spots.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Follow the money: mapping dark money and super PAC flows with FEC bulk data</title>
      <link>https://ai-analytics.org/writing/fec-campaign-finance-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fec-campaign-finance-data/</guid>
      <pubDate>Thu, 11 Jun 2026 00:00:00 GMT</pubDate>
      <description>The FEC publishes bulk data on every contribution and expenditure in federal elections — candidates, PACs, super PACs, and party committees. Here is how to download the full dataset, trace money from donor to expenditure, and identify the shell-company layer that obscures dark money flows.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Wall of Shame: what the HHS-OCR HIPAA breach database reveals about healthcare data security</title>
      <link>https://ai-analytics.org/writing/hhs-ocr-hipaa-breach-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hhs-ocr-hipaa-breach-data/</guid>
      <pubDate>Wed, 10 Jun 2026 00:00:00 GMT</pubDate>
      <description>HHS-OCR publishes every reported healthcare data breach affecting 500+ patients — the &quot;Wall of Shame.&quot; Over 5,000 entries covering ransomware attacks, stolen laptops, unauthorized employee access, and business associate failures. Here is what the database contains and what it reveals about healthcare security failures.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>By the numbers: using EEOC charge statistics to find discrimination patterns by industry and employer</title>
      <link>https://ai-analytics.org/writing/eeoc-discrimination-charge-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/eeoc-discrimination-charge-data/</guid>
      <pubDate>Tue, 09 Jun 2026 00:00:00 GMT</pubDate>
      <description>The EEOC publishes annual charge statistics and, since 2017, charge-level data under FOIA. The aggregate data shows which industries generate the most race, sex, disability, and age discrimination charges — and which large employers appear repeatedly in the conciliation record.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The $800 billion bailout: using SBA PPP data to trace who got pandemic relief</title>
      <link>https://ai-analytics.org/writing/sba-ppp-loan-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sba-ppp-loan-data/</guid>
      <pubDate>Mon, 08 Jun 2026 00:00:00 GMT</pubDate>
      <description>After a FOIA fight, the SBA released PPP loan data covering 11.8 million loans and $793 billion in forgiven funds. Here is what the public data contains, the fraud patterns it revealed, and how to cross-reference it with SAM.gov debarments, IRS nonprofit data, and the DOJ prosecution record.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Trading on the inside: using STOCK Act filings to track congressional stock transactions</title>
      <link>https://ai-analytics.org/writing/stock-act-congressional-trading/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/stock-act-congressional-trading/</guid>
      <pubDate>Sun, 07 Jun 2026 00:00:00 GMT</pubDate>
      <description>The STOCK Act requires members of Congress to report stock trades within 45 days. The House Clerk publishes scanned PDFs — not structured data. Here is how Quiver Quantitative, Capitol Trades, and journalists have structured this data, and what the disclosures reveal about trading patterns around legislation and committee assignments.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The asylum lottery: what EOIR data reveals about judge-by-judge grant rate disparities</title>
      <link>https://ai-analytics.org/writing/eoir-asylum-grant-rates/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/eoir-asylum-grant-rates/</guid>
      <pubDate>Sat, 06 Jun 2026 00:00:00 GMT</pubDate>
      <description>EOIR publishes quarterly data on every immigration judge&apos;s case outcomes, including asylum grant rates. The spread is enormous — some judges grant asylum in fewer than 5% of cases; others grant it in more than 90%. Here is how to access and analyze the data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The mortgage map: using HMDA loan-level data to find lending disparities</title>
      <link>https://ai-analytics.org/writing/hmda-mortgage-lending-disparities/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/hmda-mortgage-lending-disparities/</guid>
      <pubDate>Fri, 05 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Home Mortgage Disclosure Act requires 7,000+ lenders to report every mortgage application — approvals, denials, withdrawn, race, income, loan amount, census tract. Here is how to use the CFPB bulk download to find redlining, reverse redlining, and lender-level denial rate disparities.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The recall record: what the CPSC product safety database shows and what manufacturers hide</title>
      <link>https://ai-analytics.org/writing/cpsc-consumer-product-recalls/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cpsc-consumer-product-recalls/</guid>
      <pubDate>Thu, 04 Jun 2026 00:00:00 GMT</pubDate>
      <description>The CPSC Recall database covers 9,800+ recalls since 1973. Behind the press releases: how many units are actually returned, which hazard categories dominate, and why the voluntary recall system lets manufacturers negotiate the language of their own enforcement actions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>SEC Form 13F: The Institutional Holdings Disclosure Behind Every Hedge Fund Tracker</title>
      <link>https://ai-analytics.org/writing/sec-13f-institutional-holdings/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/sec-13f-institutional-holdings/</guid>
      <pubDate>Wed, 03 Jun 2026 00:00:00 GMT</pubDate>
      <description>Section 13(f) requires institutional investment managers with &gt;$100M in 13(f) securities to file quarterly holdings disclosures with the SEC -- ~5,000 filers, 45-day lag, long-equity-only view. Here is the full holdings table schema (CUSIP, VALUE, SH/PRN, PUT/CALL, INVESTMENT DISCRETION, VOTING AUTHORITY), what 13F covers and critically excludes (no short positions, no bonds), major filers (Berkshire, BlackRock, Renaissance), confidential treatment requests, the 45-day stale-data limitation, academic research (Griffin/Xu 2009), comparison to 13D/13G/Form 4, and a Python EDGAR parser for any manager by CIK.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>380 million transactions: indexing the DEA&apos;s ARCOS opioid distribution data</title>
      <link>https://ai-analytics.org/writing/dea-arcos-opioid-distribution/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/dea-arcos-opioid-distribution/</guid>
      <pubDate>Tue, 02 Jun 2026 00:00:00 GMT</pubDate>
      <description>How we indexed 380 million DEA ARCOS controlled-substance transaction records from the opioid MDL discovery release, what the data reveals about pill distribution, and how to cross-reference it against DEA enforcement actions and CDC overdose mortality.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The DPA database: every federal deferred prosecution agreement since 1992</title>
      <link>https://ai-analytics.org/writing/corporate-prosecution-dpa-registry/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/corporate-prosecution-dpa-registry/</guid>
      <pubDate>Mon, 01 Jun 2026 00:00:00 GMT</pubDate>
      <description>The Corporate Prosecution Registry at Duke and UVA covers 3,000+ federal organizational prosecutions and every DPA/NPA since 1990 — including agreements DOJ refused to disclose under FOIA.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The gun dealer map: what ATF&apos;s Federal Firearms Licensee data shows and what it hides</title>
      <link>https://ai-analytics.org/writing/atf-federal-firearms-licensees/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/atf-federal-firearms-licensees/</guid>
      <pubDate>Sun, 31 May 2026 00:00:00 GMT</pubDate>
      <description>ATF publishes the complete list of ~75,000 active Federal Firearms Licensees monthly as a free CSV. Here&apos;s what the data contains, what the Tiahrt Amendment keeps hidden, and how to cross-reference it.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Before it disappeared: archiving $1.5 trillion in USAID foreign assistance data</title>
      <link>https://ai-analytics.org/writing/usaid-foreign-assistance-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/usaid-foreign-assistance-data/</guid>
      <pubDate>Sat, 30 May 2026 00:00:00 GMT</pubDate>
      <description>foreignassistance.gov went dark on January 31, 2025. What the dataset contained, how it was archived, what the DOGE cuts actually targeted, and where to access it now.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>One in four audits flagged: indexing PCAOB deficiency data across the Big 4</title>
      <link>https://ai-analytics.org/writing/pcaob-audit-deficiency-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/pcaob-audit-deficiency-data/</guid>
      <pubDate>Fri, 29 May 2026 00:00:00 GMT</pubDate>
      <description>PCAOB inspection reports contain structured deficiency data for every registered audit firm. In 2023, 26% of Big 4 audits reviewed had Part I.A deficiencies — meaning auditors signed off without sufficient evidence. Here is what the data covers and how to use it.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Who won, who lost: five years of union elections in NLRB data</title>
      <link>https://ai-analytics.org/writing/nlrb-union-election-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nlrb-union-election-data/</guid>
      <pubDate>Thu, 28 May 2026 00:00:00 GMT</pubDate>
      <description>How to pull, clean, and analyze NLRB union election records — RC and RD cases, the 2021–2024 organizing surge, the 100k export cap workaround, industry breakdowns, and cross-referencing with OSHA and CFPB data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The pharma payment map: joining CMS Open Payments and Medicare Part D prescribing data</title>
      <link>https://ai-analytics.org/writing/cms-open-payments-part-d/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cms-open-payments-part-d/</guid>
      <pubDate>Tue, 26 May 2026 00:00:00 GMT</pubDate>
      <description>How joining CMS Open Payments (100M+ pharma payments to physicians) with Medicare Part D prescribing data surfaces the correlation between manufacturer payments and prescribing patterns — and how to cross-reference with HHS OIG exclusions.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Foreign agents in plain sight: mapping DC&apos;s hidden influence network with FARA data</title>
      <link>https://ai-analytics.org/writing/fara-foreign-agent-registrations/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fara-foreign-agent-registrations/</guid>
      <pubDate>Sun, 24 May 2026 00:00:00 GMT</pubDate>
      <description>The DOJ buries the FARA bulk download inside an Oracle APEX URL that looks broken. Behind it: daily CSV exports of every DC firm registered to lobby for a foreign government. Here is how to use it.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Repetitive loss: what FEMA&apos;s flood insurance claims data reveals about 2.7 million paid claims</title>
      <link>https://ai-analytics.org/writing/fema-nfip-flood-claims/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/fema-nfip-flood-claims/</guid>
      <pubDate>Sat, 23 May 2026 00:00:00 GMT</pubDate>
      <description>FEMA&apos;s NFIP claims dataset covers 2.7 million paid flood insurance claims. The multiple-loss properties subset shows properties paid out more than their assessed value — some 10–15 times.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Compliance screening across 30+ federal enforcement lists: how the risk score works</title>
      <link>https://ai-analytics.org/writing/compliance-screening-risk-score/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/compliance-screening-risk-score/</guid>
      <pubDate>Fri, 22 May 2026 00:00:00 GMT</pubDate>
      <description>How we built a 0–100 compliance risk score across OFAC, SAM, OIG, CFPB, SEC, DOJ, FDIC, FINRA, CFTC, EPA, MSHA, FDA warning letters, PCAOB, UFLPA, and 15+ more lists in a single API call.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Entity resolution for multi-list compliance screening: reducing false positives without sacrificing recall</title>
      <link>https://ai-analytics.org/writing/compliance-entity-resolution/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/compliance-entity-resolution/</guid>
      <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub resolves entity identity across 30+ compliance lists: three-stage pipeline (identifier join 34%, FTS5 canonical name 41%, Jaro-Winkler fuzzy 18%), false positive taxonomy, EntityResolutionResult confidence-to-action mapping (MATCH &gt;= 0.90, PROBABLE_MATCH 0.72-0.90), 99.1% recall, 98.7% precision, and weekly analyst-feedback calibration.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Name matching in federal regulatory data: aliases, subsidiaries, and sanctions evasion across 197 datasets</title>
      <link>https://ai-analytics.org/writing/federal-entity-matching/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-entity-matching/</guid>
      <pubDate>Sun, 10 May 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub resolves entity names across 197 federal datasets when identifiers disagree — OFAC alias explosion, SEC EDGAR subsidiary mapping, three-pass fuzzy matching (exact → Jaro-Winkler → TF-IDF cosine), 1.4% combined false positive rate, and how entity_confidence weights the compliance risk score.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Canonical entity IDs in the Federal Regulatory Data Hub: stable identifiers across 197 federal datasets</title>
      <link>https://ai-analytics.org/writing/regulatory-entity-canonical-ids/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-entity-canonical-ids/</guid>
      <pubDate>Tue, 05 May 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub generates and maintains stable canonical IDs for entities across 197 federal datasets — deterministic SHA-256 ID generation, EntityVersion history for merge and split events, EntityAlias tracking for historical name variants, and subscriber continuity guarantees when source identifiers change.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Building the cross-agency regulatory entity graph: 35M records, one join</title>
      <link>https://ai-analytics.org/writing/cross-agency-regulatory-entity-graph/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/cross-agency-regulatory-entity-graph/</guid>
      <pubDate>Fri, 01 May 2026 00:00:00 GMT</pubDate>
      <description>How we built an entity bridge across 197 federal datasets so a single query returns every SEC filing, FDA warning letter, EPA enforcement case, and OFAC sanction for any company.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Entity subscriptions in the Federal Regulatory Data Hub: per-entity change monitoring across 30+ enforcement lists</title>
      <link>https://ai-analytics.org/writing/federal-regulatory-entity-subscriptions/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-regulatory-entity-subscriptions/</guid>
      <pubDate>Sun, 26 Apr 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub lets compliance teams subscribe to regulatory events for specific entities — using the cross-agency entity bridge to watch OFAC, SAM, SEC, EPA, DOJ, and 25+ other lists simultaneously.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Regulatory Data Hub change alerts: near-real-time OFAC sanctions, SAM debarments, and enforcement action webhooks</title>
      <link>https://ai-analytics.org/writing/regulatory-change-alerts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-change-alerts/</guid>
      <pubDate>Tue, 21 Apr 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub detects regulatory record changes and delivers them to subscribers: 10-minute OFAC sanctions window, 30-minute SAM debarment window, EDGAR 8-K filing webhooks, HMAC-signed Cloudflare Queue delivery with at-least-once semantics, per-entity and per-list subscription filters, and idempotency_key deduplication.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK v0.4: situational awareness, electronic warfare coordination, and adversarial resilience</title>
      <link>https://ai-analytics.org/writing/swarm-sdk-v04/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-sdk-v04/</guid>
      <pubDate>Tue, 14 Apr 2026 00:00:00 GMT</pubDate>
      <description>Situational Awareness API for shared position and sensor fusion, EW Coordination protocol, Adversarial Resilience with traffic morphing and store-and-forward, and RF Fingerprinting for passive emitter tracking. 465 total tests.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm situational awareness: signed position broadcasts, sensor fusion, and dead-reckoning in embedded Rust</title>
      <link>https://ai-analytics.org/writing/swarm-situational-awareness/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-situational-awareness/</guid>
      <pubDate>Fri, 10 Apr 2026 00:00:00 GMT</pubDate>
      <description>How the swarm coordination layer maintains a shared operational picture across 128 nodes: Ed25519-signed 124-byte position broadcast frames, an Extended Kalman Filter fusing GPS/IMU/barometric altitude into a 6-DOF state estimate, dead-reckoning fallback for up to 90 seconds without GPS, and a probabilistic gossip protocol achieving 94.2% frame delivery across a 2km x 2km field deployment.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK on bare metal: porting the cryptographic core to no_std Rust on STM32H7</title>
      <link>https://ai-analytics.org/writing/swarm-embedded-rust/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-embedded-rust/</guid>
      <pubDate>Mon, 06 Apr 2026 00:00:00 GMT</pubDate>
      <description>How we ported the Swarm SDK cryptographic core to no_std Rust targeting the STM32H7 Cortex-M7: feature-gated std/embedded builds, 96KB static heap with cortex-m-alloc, pre-allocated VecDeque deduplication ring, in-place AES-GCM to avoid heap allocation, hardware AES accelerator integration (0.14ms vs. 0.61ms software), and binary size optimization from 1.2MB to 284KB with opt-level=&quot;z&quot; and LTO.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK key rotation: automated cryptographic material refresh in field-deployed drone meshes</title>
      <link>https://ai-analytics.org/writing/swarm-key-rotation/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-key-rotation/</guid>
      <pubDate>Wed, 01 Apr 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK rotates cryptographic material without grounding the fleet — scheduled signed pre-key rotation on a 7-day timer, OTP replenishment when bundle drops below 20 keys, emergency revocation via gossip-flooded KeyRevocationAnnouncement, BKPSRAM zeroization with 0xFF pattern verification, and staggered rotation coordination across the mesh.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK key management: device provisioning, certificate rotation, and revocation for autonomous drone systems</title>
      <link>https://ai-analytics.org/writing/swarm-key-management/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-key-management/</guid>
      <pubDate>Sat, 28 Mar 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK manages cryptographic identity for drone fleets: on-device ML-KEM-768 + X25519 keypair generation, three-tier fleet CA hierarchy, pre-provisioned mission cert bundles for offline authentication, 7-day signed prekey rotation over the gossip mesh, in-flight device revocation via RevocationMessage, and emergency wipe on tamper detection.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK device enrollment: how a new drone joins an authenticated fleet mesh</title>
      <link>https://ai-analytics.org/writing/swarm-device-enrollment/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-device-enrollment/</guid>
      <pubDate>Sat, 21 Mar 2026 00:00:00 GMT</pubDate>
      <description>How a Swarm SDK drone goes from factory state to trusted mesh participant: factory-provisioned ML-KEM-768 + X25519 keypairs, CSR generation and Fleet CA signing, USB and RF enrollment paths, gossip mesh announcement with SignedPreKeyBundle, pioneer bootstrap for the first device, and re-enrollment at certificate expiry.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Post-quantum mesh cryptography for drone swarms: the Swarm SDK design</title>
      <link>https://ai-analytics.org/writing/post-quantum-drone-mesh/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/post-quantum-drone-mesh/</guid>
      <pubDate>Sun, 15 Mar 2026 00:00:00 GMT</pubDate>
      <description>How we designed the Swarm SDK: ML-KEM-768 + X25519 hybrid post-quantum key exchange, Double Ratchet forward secrecy, gossip mesh routing with bounded fanout, and the path to CNSA 2.0 compliance.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK operational security: traffic analysis resistance, message size normalization, and timing jitter</title>
      <link>https://ai-analytics.org/writing/swarm-sdk-opsec/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-sdk-opsec/</guid>
      <pubDate>Tue, 10 Mar 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK protects drone mesh communications against traffic analysis — six fixed message size bins, ±15% transmission timing jitter, store-and-forward ring buffer for burst smoothing, degraded-channel operational mode, and RF fingerprint resistance on STM32H7.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK MAVLink v2 integration: encrypting mesh messages inside 253-byte drone protocol frames</title>
      <link>https://ai-analytics.org/writing/swarm-mavlink-integration/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-mavlink-integration/</guid>
      <pubDate>Thu, 05 Mar 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK wraps post-quantum encrypted mesh traffic in MAVLink v2 SWARM_MESH_FRAME messages — 18-byte fragment header design, per-message reassembly buffer with 5-second TTL, PX4 and ArduPilot integration, MAVSDK passthrough, and why ML-KEM-768 Sealed Sender envelopes always require 6 frames.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK message framing: binary wire format, fragmentation, and MAVLink packing</title>
      <link>https://ai-analytics.org/writing/swarm-message-framing/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-message-framing/</guid>
      <pubDate>Fri, 27 Feb 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK serializes, fragments, and packs Double Ratchet encrypted messages into MAVLink v2 TUNNEL frames: the SwarmFrame binary header, 237-byte payload limit, fragmentation algorithm, reassembly state machine, CONTROL frame authentication, and STM32H7 performance.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Swarm SDK double ratchet: forward secrecy and post-compromise security in drone mesh networks</title>
      <link>https://ai-analytics.org/writing/swarm-double-ratchet/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-double-ratchet/</guid>
      <pubDate>Sun, 22 Feb 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK implements the Double Ratchet algorithm for drone-to-drone messaging: adapting Signal Protocol&apos;s KDF chains for ML-KEM-768 post-quantum initial key exchange, header encryption, out-of-order message handling with a sliding key cache, MAVLink v2 framing, and performance benchmarks on embedded ARM.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK Sealed Sender: hiding the sender identity without breaking end-to-end encryption</title>
      <link>https://ai-analytics.org/writing/swarm-sealed-sender/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-sealed-sender/</guid>
      <pubDate>Mon, 16 Feb 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK implements Sealed Sender to hide drone identity from relay infrastructure: recipient-issued SenderCertificate, ephemeral X25519 + HKDF-SHA256 per-message encryption, AES-256-GCM with zero relay-visible sender field, 48-hour certificate TTL, four decryption failure modes, and Sender Keys group integration.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Federal Regulatory Data Hub MCP server: 38+ tools for AI agent workflows</title>
      <link>https://ai-analytics.org/writing/regulatory-api-mcp-server/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-api-mcp-server/</guid>
      <pubDate>Thu, 05 Feb 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub exposes its data through an MCP server with 38+ tools for Claude, GPT, and other AI agents — screen_entity, get_entity, compliance reporting tools, rate-limit tiers by plan, and Claude Desktop integration via stdio transport.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK v0.3: Sender Keys, Sealed Sender, and Deniable Authentication for Drone Mesh Networks</title>
      <link>https://ai-analytics.org/writing/swarm-sdk-v03/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-sdk-v03/</guid>
      <pubDate>Tue, 10 Feb 2026 00:00:00 GMT</pubDate>
      <description>What shipped in Swarm SDK v0.3: O(1) group encryption with Sender Keys (0.7ms on STM32H7), Sealed Sender hiding drone identity via ML-KEM-768 encapsulation, deniable HMAC authentication, and PKCS7 padding normalization across all AES-GCM operations. 127 new tests (302 total).</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Federal Regulatory API: REST, MCP, and JSON-LD for 197 federal datasets</title>
      <link>https://ai-analytics.org/writing/federal-regulatory-api/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-regulatory-api/</guid>
      <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub API is designed: CC0 REST endpoints, cross-agency entity resolution in one GET, MCP server with 38+ tools for agent workflows, and JSON-LD structured data.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK session establishment: X3DH prekey bundles and the initial drone-to-drone handshake</title>
      <link>https://ai-analytics.org/writing/swarm-x3dh-session/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-x3dh-session/</guid>
      <pubDate>Sun, 25 Jan 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK uses Extended Triple Diffie-Hellman (X3DH) with ML-KEM-768 adaptation for async drone-to-drone session establishment — prekey bundle construction, one-time prekey consumption, Fleet CA bundle verification, and the transition from shared secret to Double Ratchet forward secrecy.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK prekey bundle management: generating, distributing, and consuming OneTimePreKeys across a drone fleet</title>
      <link>https://ai-analytics.org/writing/swarm-prekey-management/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-prekey-management/</guid>
      <pubDate>Tue, 20 Jan 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK generates, distributes, and tracks OneTimePreKeys for X3DH session establishment — including OTP exhaustion handling, SignedPreKey rotation, and the gossip-mesh key bundle protocol.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal dataset ingest: keeping 197 federal datasets fresh at the edge</title>
      <link>https://ai-analytics.org/writing/federal-dataset-ingest/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-dataset-ingest/</guid>
      <pubDate>Thu, 15 Jan 2026 00:00:00 GMT</pubDate>
      <description>How we ingest and refresh 197 federal regulatory datasets across 45 agencies using Cloudflare Workers cron, delta detection, schema drift handling, and per-source retry budgets — the ETL behind the Federal Regulatory Data Hub.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK mesh transport: reliable delivery over contested RF links</title>
      <link>https://ai-analytics.org/writing/swarm-mesh-transport/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-mesh-transport/</guid>
      <pubDate>Thu, 08 Jan 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK MeshTransport layer achieves reliable frame delivery over lossy drone radio links: sliding window ARQ with selective ACK, EWMA RTT estimation, transparent fragmentation and reassembly for Sealed Sender envelopes, multi-channel bonding across 2.4GHz and 5.8GHz radios, and performance benchmarks on STM32H7 and Jetson Nano.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK gossip mesh: bounded fanout routing, message deduplication, and network partition handling</title>
      <link>https://ai-analytics.org/writing/swarm-gossip-mesh/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-gossip-mesh/</guid>
      <pubDate>Fri, 02 Jan 2026 00:00:00 GMT</pubDate>
      <description>How the Swarm SDK implements a gossip mesh for drone swarms: epidemic broadcast with k=3 fanout, UUIDv4 sliding-window deduplication, Lamport clock causal ordering for key management messages, TTL hop limiting, and anti-entropy reconciliation for post-partition recovery — with STM32H7 and Jetson Nano benchmarks.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Swarm SDK architecture: gossip mesh, post-quantum cryptography, and embedded-first design</title>
      <link>https://ai-analytics.org/writing/swarm-sdk-architecture/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/swarm-sdk-architecture/</guid>
      <pubDate>Sat, 27 Dec 2025 00:00:00 GMT</pubDate>
      <description>An architectural overview of the Swarm SDK: the three-layer design covering gossip mesh epidemic broadcast, ML-KEM-768 + X25519 hybrid post-quantum cryptography with Double Ratchet and Sender Keys, MAVLink v2 framing, and no_std embedded operation on STM32H7.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Incident clustering and deduplication: how Voidly avoids counting the same censorship event twice</title>
      <link>https://ai-analytics.org/writing/voidly-incident-clustering/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-incident-clustering/</guid>
      <pubDate>Mon, 22 Dec 2025 00:00:00 GMT</pubDate>
      <description>How Voidly deduplicates thousands of probe measurements into discrete censorship incidents: the four-tuple clustering key, the 6-hour gap rule, incident lifecycle from ANOMALY to RESOLVED, incident_id assignment, retroactive CensoredPlanet alignment, and edge cases including flapping blocks and BGP outages.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly incident timeline reconstruction: building the canonical event sequence from distributed probe measurements</title>
      <link>https://ai-analytics.org/writing/voidly-incident-timeline/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-incident-timeline/</guid>
      <pubDate>Wed, 17 Dec 2025 00:00:00 GMT</pubDate>
      <description>How Voidly reconstructs the authoritative timeline of a censorship incident from asynchronous distributed probe measurements — IncidentEvent sourcing model, temporal alignment across time zones, confidence weighting requiring 3+ independent probes, retroactive revision from CensoredPlanet batch data, and the timeline REST API endpoint.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly incident resolution: how we know when a censorship event ends</title>
      <link>https://ai-analytics.org/writing/voidly-incident-resolution/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-incident-resolution/</guid>
      <pubDate>Sat, 13 Dec 2025 00:00:00 GMT</pubDate>
      <description>How Voidly determines that a censorship incident has ended: per-type resolution thresholds (consecutive passing measurements with p_blocked &lt; 0.3), the 12-hour RESOLVED_PENDING re-open window, FLAPPING state detection for rapidly alternating blocks, BGP-type auto-resolution, and cross-source confirmation requirements for VERIFIED incidents — with observed resolution time distributions (BGP 4.2h median, HTTP 12.1 days).</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly real-time anomaly scorer: ML inference in the streaming pipeline at 50,000 events per second</title>
      <link>https://ai-analytics.org/writing/voidly-realtime-anomaly-scorer/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-realtime-anomaly-scorer/</guid>
      <pubDate>Tue, 09 Dec 2025 00:00:00 GMT</pubDate>
      <description>How Voidly embeds ONNX Runtime inside an Apache Flink streaming job to score probe results for censorship anomalies at 50,000 events/sec with sub-100ms end-to-end latency: thread-local ONNX session management, Kafka partition alignment with (country_code, asn) keyBy, mini-batch coalescing for 50ms p99 inference, and backpressure handling.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s real-time event pipeline: from measurement anomaly to journalist alert in under 8 minutes</title>
      <link>https://ai-analytics.org/writing/voidly-realtime-pipeline/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-realtime-pipeline/</guid>
      <pubDate>Fri, 05 Dec 2025 00:00:00 GMT</pubDate>
      <description>How Voidly gets from a probe anomaly to a published verified incident in under 8 minutes: event queue, real-time OONI and IODA API polling, confidence thresholds, two-window alert-fatigue guard, and nightly CensoredPlanet retroactive pass.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe run lifecycle: from scheduled task to classifier input</title>
      <link>https://ai-analytics.org/writing/voidly-probe-run-lifecycle/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-run-lifecycle/</guid>
      <pubDate>Sat, 29 Nov 2025 00:00:00 GMT</pubDate>
      <description>What happens inside a single Voidly probe run: the measurement execution loop, DNS and TCP and TLS and HTTP data capture, result serialization and signing, and the upload path that delivers a signed ProbeResult to the ingest pipeline.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe networking: staying connected through NAT, firewalls, and censored infrastructure</title>
      <link>https://ai-analytics.org/writing/voidly-probe-networking/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-networking/</guid>
      <pubDate>Mon, 24 Nov 2025 00:00:00 GMT</pubDate>
      <description>How Voidly probes maintain connectivity and upload measurements from networks that actively block VPN protocols — QUIC/443 transport, domain fronting via CDN SNI fronting, TLS certificate pinning against MITM, local SQLite buffering (500 MB cap, 48h window), and metered-connection backoff.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe local measurement buffer: SQLite ring buffer, batch compression, and resilient upload</title>
      <link>https://ai-analytics.org/writing/voidly-probe-local-buffer/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-local-buffer/</guid>
      <pubDate>Wed, 19 Nov 2025 00:00:00 GMT</pubDate>
      <description>How Voidly probes preserve measurement data during upload failures — a 72-hour SQLite ring buffer with anomaly-safe eviction, LZ4 batch compression reducing median batch size from 47KB to 9KB, exponential backoff retry up to 4 hours, priority queue for anomalous measurements, and 0.003% measurement loss rate across 37 probes over 6 months.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly Probe: Tauri + boringtun network measurement at the operator&apos;s edge</title>
      <link>https://ai-analytics.org/writing/voidly-probe-architecture/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-architecture/</guid>
      <pubDate>Sat, 15 Nov 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly desktop probe works: Tauri 2 cross-platform app, Cloudflare boringtun WireGuard, tun-rs TUN device, X25519-Dalek on-device key generation, and operator anonymity as a design constraint.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly probe test runner: concurrency, timeout handling, and the measurement state machine</title>
      <link>https://ai-analytics.org/writing/voidly-probe-test-runner/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-test-runner/</guid>
      <pubDate>Sat, 08 Nov 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly probe test runner orchestrates concurrent measurements: tokio Semaphore with 3 permits, MeasurementState machine (Pending → Running → Success/Error/Timeout), per-layer timeout budgets (DNS 3s, TCP 5s, TLS 8s, HTTP 15s, total 30s), Ed25519 measurement signing, mpsc upload queue with capacity 200, and why per-layer timeouts are themselves evidence of DNS-layer interference.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>How Voidly measures HTTP and HTTPS censorship: the full protocol lifecycle from DNS through TLS to body comparison</title>
      <link>https://ai-analytics.org/writing/voidly-http-measurement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-http-measurement/</guid>
      <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
      <description>A step-by-step breakdown of how each Voidly probe test works: DNS resolution, TCP handshake, TLS negotiation with certificate chain validation, HTTP request execution, response body fingerprinting, control comparison, and how every layer maps to interference types in the anomaly classifier.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s TCP measurement layer: RST injection detection, null-routing, and connection timing analysis</title>
      <link>https://ai-analytics.org/writing/voidly-tcp-measurement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-tcp-measurement/</guid>
      <pubDate>Mon, 27 Oct 2025 00:00:00 GMT</pubDate>
      <description>A deep dive into the TCP layer of Voidly&apos;s censorship detection: SYN-ACK timing, RST injection detection with a 15ms threshold, null-routing vs. RST as two distinct censorship mechanisms, the TcpResult struct, dual-IP probing to identify RST source, and how TCP evidence maps to the anomaly classifier&apos;s interference classes.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly control server: how we tell censorship from a bad network</title>
      <link>https://ai-analytics.org/writing/voidly-control-server/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-control-server/</guid>
      <pubDate>Wed, 22 Oct 2025 00:00:00 GMT</pubDate>
      <description>How Voidly uses a distributed control server network to distinguish genuine censorship from network errors, CDN split-horizon DNS, and misconfigured sites — DNS, TCP, TLS, and HTTP comparison methodology.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>How Voidly measures bandwidth throttling: timing signals, body truncation, and the calibration problem</title>
      <link>https://ai-analytics.org/writing/voidly-throttling-measurement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-throttling-measurement/</guid>
      <pubDate>Wed, 15 Oct 2025 00:00:00 GMT</pubDate>
      <description>A technical deep-dive on how Voidly detects bandwidth throttling — the hardest interference class to classify. Covers the TimingFeatures Rust struct, TTFB z-score computation against control measurements, body truncation and mid-transfer RST signals, the congestion vs. deliberate-throttling calibration problem, cross-probe corroboration scoring, and country patterns from Russia TSPU, Iran ARRS, India, and China.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe health monitoring: how we detect and replace failing probe nodes</title>
      <link>https://ai-analytics.org/writing/voidly-probe-health/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-health/</guid>
      <pubDate>Wed, 08 Oct 2025 00:00:00 GMT</pubDate>
      <description>How Voidly monitors 37+ probe nodes: heartbeat system (60s cadence, separate transport), DEGRADED/OFFLINE state machine, measurement quality scoring, ASN coverage SLOs for 200 countries, flapping detection, automated replacement, and the classify_offline_cause() algorithm distinguishing probe failure from ISP-level censorship.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>How Voidly detects DNS injection: forged responses, injection rates by country, and pipeline integration</title>
      <link>https://ai-analytics.org/writing/voidly-dns-injection-detection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-dns-injection-detection/</guid>
      <pubDate>Fri, 03 Oct 2025 00:00:00 GMT</pubDate>
      <description>How Voidly probes identify DNS injection and manipulation in censored networks — comparison against three control resolvers, four weighted detection signals (IP divergence, TTL anomaly, source IP divergence, response timing), per-country injection rates (China 94%, Iran 61%, Russia 12%), CAP_NET_RAW privilege handling, and anycast false-positive calibration.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s interference taxonomy: classifying censorship from DNS injection to BGP withdrawal</title>
      <link>https://ai-analytics.org/writing/voidly-interference-taxonomy/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-interference-taxonomy/</guid>
      <pubDate>Wed, 24 Sep 2025 00:00:00 GMT</pubDate>
      <description>How Voidly classifies every censorship measurement into one of 7 interference types using a hierarchical decision tree — DnsInjection, DnsNxdomain, TcpRstInjection, TcpNullRouting, TlsMitm, HttpBlockPage, and Throttling — with confidence scoring, protocol layer priority, and an Indeterminate category.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Geoblocking vs. censorship: how Voidly distinguishes licensing restrictions, CDN geofencing, and GDPR blocks from government-ordered blocking</title>
      <link>https://ai-analytics.org/writing/voidly-geoblocking/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-geoblocking/</guid>
      <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
      <description>How Voidly avoids false positives from commercial geoblocking: HTTP 451 detection, streaming service block page fingerprints, multi-country probe comparison, CDN split-horizon detection via ASN group mapping, and the p_geoblock score.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Cross-source censorship verification: reconciling OONI, CensoredPlanet, and IODA</title>
      <link>https://ai-analytics.org/writing/censorship-cross-source-verification/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/censorship-cross-source-verification/</guid>
      <pubDate>Sat, 20 Sep 2025 00:00:00 GMT</pubDate>
      <description>How Voidly correlates three independent measurement projects at scale — data format normalization, 4-hour sliding window alignment, independence-weighted confidence scoring, and handling source disagreements.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly middlebox detection: transparent proxies, TCP injection points, and TSPU vendor signatures</title>
      <link>https://ai-analytics.org/writing/voidly-middlebox-detection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-middlebox-detection/</guid>
      <pubDate>Tue, 16 Sep 2025 00:00:00 GMT</pubDate>
      <description>How Voidly probes detect network middleboxes: an HTTP echo test for transparent proxies via custom X-Voidly-Echo headers, TCP RST injection timing analysis using four heuristics (arrival time, TTL mismatch, zero window, absent TCP options), a vendor signature library with 47 confirmed fingerprints (TSPU/Sandvine/Huawei Hi-SEC/GFW/Cisco), and 18-hour median lead time between middlebox detection and censorship anomaly onset across 31 countries.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>How Voidly measures TLS censorship: certificate forgery, SNI blocking, and handshake interference</title>
      <link>https://ai-analytics.org/writing/voidly-tls-measurement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-tls-measurement/</guid>
      <pubDate>Fri, 12 Sep 2025 00:00:00 GMT</pubDate>
      <description>A deep dive into the TLS layer of Voidly&apos;s censorship detection: full certificate chain extraction with rustls, government CA list (China MoI, Iran MICT, Kazakhstan NCA), MITM detection via fingerprint mismatch, TLS alert timing analysis (RST &lt; 15ms = injected), SNI-based blocking detection via dual-SNI probing, ECH/ESNI measurement, and how TLS failure maps to interference_type classifier outputs.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s block page fingerprint library: detecting censorship signatures across 2,300+ known pages</title>
      <link>https://ai-analytics.org/writing/voidly-blockpage-fingerprints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-blockpage-fingerprints/</guid>
      <pubDate>Fri, 05 Sep 2025 00:00:00 GMT</pubDate>
      <description>How Voidly built and maintains the 2,300-entry block page fingerprint library: four matching strategies (exact SHA-256 hash, structural normalization, SimHash, TLS certificate fingerprinting), block page collection from OONI confirmed events and probe captures, per-country composition (Turkey 47, Iran 312, Russia 189, China 8), false positive mitigation, and integration with the lf_http_blockpage_hash label function.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly measurement protocol stack: composing DNS, TCP, TLS, and HTTP layers into a ProbeResult</title>
      <link>https://ai-analytics.org/writing/voidly-measurement-protocol-stack/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-measurement-protocol-stack/</guid>
      <pubDate>Mon, 01 Sep 2025 00:00:00 GMT</pubDate>
      <description>How the four Voidly measurement layers compose into a single ProbeResult struct: sequential DNS → TCP → TLS → HTTP execution with the control measurement running in parallel, the None-vs-Some failure propagation convention distinguishing &quot;not attempted&quot; from &quot;attempted and failed&quot;, a failure mode table mapping six layer-outcome combinations to censorship types, and deterministic control vantage selection by domain hash.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>How Voidly measures DNS censorship: NXDOMAIN injection, IP spoofing, and resolver-level filtering</title>
      <link>https://ai-analytics.org/writing/voidly-dns-measurement/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-dns-measurement/</guid>
      <pubDate>Thu, 28 Aug 2025 00:00:00 GMT</pubDate>
      <description>A deep dive into the DNS layer of Voidly&apos;s censorship detection: dual-resolver design (ISP resolver vs. neutral control), four interference types (NXDOMAIN injection, IP spoofing, empty answer, timeout), the compare_dns_results() algorithm, known injection IP database (China 18 IPs, Iran 3, Turkey 2), CDN geofencing false positive mitigation via ASN group matching, DNSSEC validation limitations, and DoH/DoT diagnostic queries.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly measurement API export: NDJSON streaming, Parquet generation, and HuggingFace dataset sync</title>
      <link>https://ai-analytics.org/writing/voidly-measurement-api-export/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-measurement-api-export/</guid>
      <pubDate>Sun, 24 Aug 2025 00:00:00 GMT</pubDate>
      <description>How Voidly publishes its measurement corpus to external researchers: a keyset-paginated NDJSON streaming API with SSE mode, nightly PyArrow Parquet generation sorted by (domain, ts) for 60% I/O reduction on single-domain queries with zstd level-3 compression, atomic HuggingFace Dataset Hub push with dataset card regeneration, and classifier_version tagging.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly measurement dataset: field-by-field schema reference</title>
      <link>https://ai-analytics.org/writing/voidly-dataset-schema/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-dataset-schema/</guid>
      <pubDate>Wed, 20 Aug 2025 00:00:00 GMT</pubDate>
      <description>A complete field-by-field guide to the Voidly CC BY 4.0 measurement dataset — probe identity, DNS/TCP/TLS/HTTP layers, control comparison, ML classification output, BGP signals, corroboration fields, and filtering recipes for journalists and ML researchers.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s TimescaleDB continuous aggregates: pre-aggregating 2.2B probe measurements for fast queries</title>
      <link>https://ai-analytics.org/writing/voidly-continuous-aggregates/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-continuous-aggregates/</guid>
      <pubDate>Thu, 14 Aug 2025 00:00:00 GMT</pubDate>
      <description>The three-level TimescaleDB continuous aggregate hierarchy behind Voidly&apos;s sub-10ms query latency: measurement_hourly (15-minute refresh), country_daily_summary (1-hour refresh), and country_monthly_stats (daily). Covers refresh policies, late-arriving probe data handling, compression interplay, and backfill procedures.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s probe-to-dataset ingest pipeline: normalization, quality filtering, and TimescaleDB indexing</title>
      <link>https://ai-analytics.org/writing/voidly-probe-ingest/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-ingest/</guid>
      <pubDate>Fri, 08 Aug 2025 00:00:00 GMT</pubDate>
      <description>The full path from raw probe bytes to a queryable TimescaleDB record: protobuf over QUIC, Cloudflare Worker validation, Kafka fan-out, Rust normalization, probe-version schema drift handling, quality filtering (3.2% drop rate), and nightly Parquet export to HuggingFace.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly BGP data ingestion: parsing MRT dumps, detecting prefix withdrawals, and computing country outage scores</title>
      <link>https://ai-analytics.org/writing/voidly-bgp-data-ingestion/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-bgp-data-ingestion/</guid>
      <pubDate>Sat, 02 Aug 2025 00:00:00 GMT</pubDate>
      <description>How Voidly ingests BGP data from RIPE NCC RIS, RouteViews, and bgp.tools: MRT format parsing, per-country baseline computation, withdrawal detection thresholds, BgpEvent records in TimescaleDB, and how bgp_outage_score is attached to probe measurements.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>BGP routing signals and internet shutdown detection: how Voidly uses IODA data</title>
      <link>https://ai-analytics.org/writing/bgp-shutdown-detection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/bgp-shutdown-detection/</guid>
      <pubDate>Mon, 28 Jul 2025 00:00:00 GMT</pubDate>
      <description>How Voidly uses BGP prefix withdrawal patterns and IODA data to detect internet shutdowns before any probe can send a packet — baseline per-country reachability, BGP silence vs. withdrawal, and how BGP fits into the composite confidence score.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly AS path analysis: using BGP topology to locate censorship enforcement points</title>
      <link>https://ai-analytics.org/writing/voidly-as-path-analysis/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-as-path-analysis/</guid>
      <pubDate>Tue, 22 Jul 2025 00:00:00 GMT</pubDate>
      <description>How Voidly uses CAIDA AS-Rank, RIPE NCC RIS route collector data, and PeeringDB to build an AS-level topology, classify censorship choke points (IXP, transit AS, edge ISP), compute per-country probe diversity scores, and feed AS path features into the anomaly classifier.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Per-domain censorship history in Voidly: tracking blocking events across countries and time</title>
      <link>https://ai-analytics.org/writing/voidly-domain-censorship-history/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-domain-censorship-history/</guid>
      <pubDate>Sat, 12 Jul 2025 00:00:00 GMT</pubDate>
      <description>How Voidly tracks the full history of blocking events for individual domains across all probe countries — domain_measurement_summary continuous aggregate, first/last-seen tracking, the /v1/domains/{domain}/history API, temporal patterns, cross-country correlation, and domain freshness scoring.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s ASN-level blocking analysis: how censorship propagates across autonomous systems</title>
      <link>https://ai-analytics.org/writing/voidly-asn-blocking-analysis/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-asn-blocking-analysis/</guid>
      <pubDate>Thu, 17 Jul 2025 00:00:00 GMT</pubDate>
      <description>How Voidly uses per-ASN probe vantages to distinguish nationwide censorship orders from selective ISP-level blocking — BGP peer classification from CAIDA AS-Rank, ISP blocking fingerprints by interference type, differential blocking detection, and propagation speed analysis that reveals enforcement mechanisms.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s country-level censorship score: aggregating 2.2B probe measurements into the global index</title>
      <link>https://ai-analytics.org/writing/voidly-country-censorship-index/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-country-censorship-index/</guid>
      <pubDate>Tue, 08 Jul 2025 00:00:00 GMT</pubDate>
      <description>How Voidly aggregates per-measurement interference probabilities into per-country censorship scores: recency decay with a 30-day half-life, ASN diversity weighting, domain category weighting, cross-source corroboration multipliers, 90-day rolling windows, Gaussian temporal smoothing, and bootstrap confidence bands.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Sanctions timelines and internet shutdowns: how Voidly correlates OFAC designation bursts with censorship events</title>
      <link>https://ai-analytics.org/writing/voidly-sanctions-shutdown-correlation/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-sanctions-shutdown-correlation/</guid>
      <pubDate>Thu, 03 Jul 2025 00:00:00 GMT</pubDate>
      <description>How Voidly aligns OFAC sanctions packages, EU/UN designation timelines, and bilateral diplomatic signals with measured internet shutdown events — building the diplomatic-isolation feature for the shutdown forecasting model.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OFAC SDN integration in the Federal Regulatory Data Hub: conditional GET, entity normalization, and sub-second screening</title>
      <link>https://ai-analytics.org/writing/ofac-sdn-integration/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ofac-sdn-integration/</guid>
      <pubDate>Sat, 28 Jun 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub ingests the OFAC Specially Designated Nationals list — conditional GET with ETag, XML parsing across 12K SDN entries with alias explosion, name normalization, FTS5 + Jaro-Winkler three-pass screening, and p50 8ms / p99 28ms latency against the SDN list.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The features behind Voidly&apos;s 7-day shutdown forecast: political calendar, sanctions timelines, and network telemetry</title>
      <link>https://ai-analytics.org/writing/voidly-shutdown-features/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-shutdown-features/</guid>
      <pubDate>Sat, 21 Jun 2025 00:00:00 GMT</pubDate>
      <description>A deep dive into the feature engineering behind Voidly&apos;s 7-day internet shutdown forecasting model: political calendar integration (election dates, protest intensity via GDELT CAMEO 14x events), OFAC sanctions timeline features, BGP withdrawal rate, probe measurement rate drops as early-warning signals, historical shutdown patterns with sin/cos cyclical encoding, and XGBoost SHAP feature importance across 200 countries.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Seven-day internet shutdown forecasting: how Voidly predicts connectivity outages</title>
      <link>https://ai-analytics.org/writing/shutdown-forecasting/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/shutdown-forecasting/</guid>
      <pubDate>Sun, 15 Jun 2025 00:00:00 GMT</pubDate>
      <description>How we build a 7-day predictive model for internet shutdowns across 200 countries: political calendar features, network telemetry, ARIMA + XGBoost ensemble, and per-country reliability scoring.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Bridging classifier outputs to shutdown forecasting: from per-measurement censorship probability to country-level shutdown risk scores</title>
      <link>https://ai-analytics.org/writing/voidly-classifier-to-forecast/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-classifier-to-forecast/</guid>
      <pubDate>Wed, 11 Jun 2025 00:00:00 GMT</pubDate>
      <description>How Voidly aggregates calibrated per-measurement censorship probabilities into country-level shutdown risk signals: a three-stage aggregation hierarchy, exponential decay weighting with 48-hour half-life over a 14-day window, a 28-feature forecast vector, and the Kafka voidly.forecast.features topic handoff to the Bayesian shutdown forecasting service.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s per-country classifier calibration: Platt scaling, threshold tuning, and why the same probability means different things in Iran vs. China</title>
      <link>https://ai-analytics.org/writing/voidly-classifier-calibration/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-classifier-calibration/</guid>
      <pubDate>Sat, 07 Jun 2025 00:00:00 GMT</pubDate>
      <description>How Voidly calibrates its anomaly classifier separately for each country — Platt scaling on per-country holdout predictions, F2-weighted threshold tuning per class, 30-day rolling calibration windows, and case studies: Iran DNS fires at 0.62; China DNS requires 0.74.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s anomaly classifier retraining pipeline: temporal splits, champion/challenger promotion, and drift detection</title>
      <link>https://ai-analytics.org/writing/voidly-classifier-retraining/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-classifier-retraining/</guid>
      <pubDate>Mon, 02 Jun 2025 00:00:00 GMT</pubDate>
      <description>How Voidly retrains its five-class censorship anomaly classifier on a weekly cadence: time-based train/val/test splits to prevent temporal leakage, SMOTE resampling for class imbalance, PSI drift detection, champion/challenger shadow deployment, and the canary rollout process.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s real-time inference API: classifying censorship measurements at 50ms</title>
      <link>https://ai-analytics.org/writing/voidly-inference-api/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-inference-api/</guid>
      <pubDate>Wed, 28 May 2025 00:00:00 GMT</pubDate>
      <description>How Voidly serves the anomaly classifier as a live inference API — feature extraction in under 5ms, ONNX Runtime model serving, five-class output with per-class probabilities, Cloudflare Worker routing to regional inference nodes, model versioning with champion/challenger shadow mode, and the latency budget that keeps end-to-end probe-to-verdict under 50ms.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly ONNX inference: exporting XGBoost to ONNX and serving censorship predictions at 50ms p99</title>
      <link>https://ai-analytics.org/writing/voidly-onnx-inference/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-onnx-inference/</guid>
      <pubDate>Sat, 24 May 2025 00:00:00 GMT</pubDate>
      <description>How Voidly converts a trained XGBoost censorship classifier to ONNX for serving inside a Rust ingestion service: the sklearn-to-ONNX export pipeline with zipmap=False for zero-copy float32 probability tensors, ONNX Runtime session configuration with per-thread isolation and L3 graph optimization, opset 17 pinning with metadata validation, and batch inference benchmarks achieving p99 under 50ms at batch size 200 on 4 vCPUs.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The 47 features that classify internet censorship: how Voidly extracts signal from raw network measurements</title>
      <link>https://ai-analytics.org/writing/voidly-feature-extraction/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-feature-extraction/</guid>
      <pubDate>Tue, 20 May 2025 00:00:00 GMT</pubDate>
      <description>How Voidly transforms raw probe measurements into the 47-feature vector that feeds the anomaly classifier: the ControlDelta struct, DNS features (NXDOMAIN injection, bogon IPs, known injection IPs), TCP features (RST timing, SYN-ACK count), TLS features (MITM cert detection, alert codes), HTTP features (blockpage SimHash score, body length ratio), and the LRU control cache design that prevents doubling probe cost.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe scheduling constraints: battery budgets, cellular data limits, and adaptive domain selection</title>
      <link>https://ai-analytics.org/writing/voidly-probe-scheduling-constraints/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-scheduling-constraints/</guid>
      <pubDate>Fri, 16 May 2025 00:00:00 GMT</pubDate>
      <description>How Voidly probes adapt their measurement schedule to device resource constraints: four constraint checks (battery floor, thermal throttle, cellular daily cap, unknown network), sliding-window cellular data accounting with per-minute SQLite buckets, adaptive cycle length scaling to remaining budget, and a priority queue scoring domains on staleness (0.50), config priority flag (0.35), and anomaly recency (0.15).</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe operator safety: anonymity design, data minimization, and operational security for censorship measurement</title>
      <link>https://ai-analytics.org/writing/voidly-probe-operator-safety/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-operator-safety/</guid>
      <pubDate>Wed, 07 May 2025 00:00:00 GMT</pubDate>
      <description>How Voidly protects probe operators in high-risk jurisdictions: strict data minimization, WireGuard peer-key authentication, daily probe ID pseudonymization, optional Tor upload path, measurement scrubbing, country-tier legal risk assessments, and one-tap emergency stop with full data erasure.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s URL test list: how we curate the domains that reveal internet censorship</title>
      <link>https://ai-analytics.org/writing/voidly-test-list/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-test-list/</guid>
      <pubDate>Mon, 12 May 2025 00:00:00 GMT</pubDate>
      <description>How Voidly selects and maintains the domains it probes for censorship: Citizen Lab&apos;s global test list, 12 OONI category codes, per-country supplemental lists, the measurement budget problem, and why the test list is a political document.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe commissioning: how a new operator joins the censorship measurement network</title>
      <link>https://ai-analytics.org/writing/voidly-probe-commissioning/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-commissioning/</guid>
      <pubDate>Sat, 03 May 2025 00:00:00 GMT</pubDate>
      <description>How a new Voidly probe operator goes from application to publishing measurements: on-device X25519 key generation in the Tauri app, probe registration and ASN verification, 48-hour warmup period with calibration measurements, quality scoring at promotion, and what happens when warmup calibration fails.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Regulatory API rate limiting: per-tier quotas, burst tokens, and Cloudflare KV sliding-window counters</title>
      <link>https://ai-analytics.org/writing/regulatory-rate-limiting/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-rate-limiting/</guid>
      <pubDate>Tue, 29 Apr 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub enforces per-client and per-tier rate limits at 8,000 req/s without a centralized counter store: a five-tier quota table (free/researcher/compliance/vendor/internal), token-bucket burst enforcement in Cloudflare KV with ETag-based conditional writes, and sliding 24-hour window daily quota counting using per-minute KV buckets.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Federal Regulatory Data Hub query layer: routing 35M records at the Cloudflare edge</title>
      <link>https://ai-analytics.org/writing/regulatory-query-layer/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-query-layer/</guid>
      <pubDate>Fri, 25 Apr 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub serves 35M records via Cloudflare Workers: 8 vertical D1 shards by agency group, Promise.all fan-out for cross-agency queries, entity bridge join across CIK/UEI/LEI/DUNS/NPI, FTS5 full-text search for narrative datasets, response caching with TTL table by endpoint type, and p50/p99 latency budget including partial-response fallback.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Regulatory data versioning: point-in-time queries, audit trails, and as-of compliance screening in Cloudflare D1</title>
      <link>https://ai-analytics.org/writing/regulatory-data-versioning/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-data-versioning/</guid>
      <pubDate>Mon, 21 Apr 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub implements bitemporal versioning across 35M regulatory records in Cloudflare D1: the valid_from/valid_until row-version pattern with half-open intervals, an append-only record_versions audit table, AS-OF query rewriting in the Workers router, three screening modes (current/as-of/historical), and keyset-paginated NDJSON snapshot export for retroactive batch compliance screening.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Monitoring dataset freshness in the Federal Regulatory Data Hub: staleness detection, multi-channel alerting, and the OFAC publish-time problem</title>
      <link>https://ai-analytics.org/writing/regulatory-staleness-monitoring/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-staleness-monitoring/</guid>
      <pubDate>Thu, 17 Apr 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub monitors the freshness of 197 federal datasets and alerts on staleness: per-source FRESHNESS_CONFIG with expected_cadence and max_staleness_hours, D1 dataset_ingests staleness query, Cloudflare Cron */5 * * * * staleness check, multi-channel alerting (Slack webhook, email, PagerDuty) with KV deduplication, OFAC ETag monitoring with 90-minute publish-delay alert, five ingest error classes, and public /status endpoint.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe vantage selection: ASN diversity, operator safety, and reaching hard-to-measure countries</title>
      <link>https://ai-analytics.org/writing/voidly-vantage-selection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-vantage-selection/</guid>
      <pubDate>Thu, 10 Apr 2025 00:00:00 GMT</pubDate>
      <description>How Voidly selects and distributes its probe vantage network: why ASN diversity matters more than geographic spread, the operator safety constraints for high-risk countries, and how we reach places where most people connect on mobile-only networks.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly probe config delivery: signed bundles, auto-update protocol, and country-specific measurement parameters</title>
      <link>https://ai-analytics.org/writing/voidly-probe-config-delivery/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-probe-config-delivery/</guid>
      <pubDate>Sun, 06 Apr 2025 00:00:00 GMT</pubDate>
      <description>How Voidly delivers measurement configuration to probes without a persistent control channel: gzip+CBOR bundles signed with Ed25519, a pull-based auto-update scheduler with 6-hour intervals and exponential backoff, version pinning and two-snapshot rollback, and anonymous country tokens derived via BLAKE3 so the CDN cannot correlate which overlay a probe applies.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly operator privacy: how we publish measurements without exposing the people who collect them</title>
      <link>https://ai-analytics.org/writing/voidly-operator-privacy/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-operator-privacy/</guid>
      <pubDate>Wed, 02 Apr 2025 00:00:00 GMT</pubDate>
      <description>How Voidly protects probe operator identity while publishing full measurement data: probe_id as SHA-256(public_key_bytes) with zero IP logging, codename system (450K+ combinations, no joint table), measurement anonymization (probe_cc + probe_asn published; IP never stored), per-probe Ed25519 signing with isolated key store, and 12-country extra protections (4–48 hour publication delay, 90-day probe_id rotation).</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Entity alias tables for sanctions evasion detection: AKA, FKA, NFE, and PHONETIC normalization across OFAC, SEC, and FinCEN</title>
      <link>https://ai-analytics.org/writing/regulatory-entity-alias-table/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-entity-alias-table/</guid>
      <pubDate>Sat, 29 Mar 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub manages alias proliferation across OFAC SDN, SEC EDGAR, and FinCEN BSA: a five-type alias taxonomy (AKA/FKA/NFE/PHONETIC/VESSEL), entity_aliases DDL with FTS5 virtual table and covering indexes, a normalization pipeline with iterative legal-suffix stripping, double-Metaphone phonetic bucket generation, and a four-pass resolution pipeline achieving 98.7% alias recall on 2.4M aliases.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Entity ID normalization in the Federal Regulatory Data Hub: resolving CIK, UEI, LEI, DUNS, and NPI across 197 datasets</title>
      <link>https://ai-analytics.org/writing/federal-entity-id-normalization/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-entity-id-normalization/</guid>
      <pubDate>Tue, 25 Mar 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub resolves company identity across five incompatible federal identifier schemes: three-pass resolution (exact ID join, alias table, TF-IDF fuzzy name matching), entity_master bridge table schema, company name normalization, false positive rates by method, and p50 38ms cross-agency query latency.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Federal Regulatory Data Hub schema design: per-vertical table layouts, entity_master bridge, and D1 indexing strategy for 35M records across 8 shards</title>
      <link>https://ai-analytics.org/writing/regulatory-schema-design/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-schema-design/</guid>
      <pubDate>Fri, 21 Mar 2025 00:00:00 GMT</pubDate>
      <description>The full schema design behind the Federal Regulatory Data Hub: eight vertical D1 databases, OFAC SDN and EPA enforcement table DDL with FTS5 virtual tables, entity_master bridge with shard_presence bitmask, covering indexes vs. FTS5 trade-offs, and the Workers queryEntityAllShards() Promise.all fan-out achieving p50 38ms cross-shard entity queries.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Full-text search across 35M federal records: SQLite FTS5, BM25 ranking, and cross-shard fan-out in Cloudflare D1</title>
      <link>https://ai-analytics.org/writing/federal-fts5-search/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/federal-fts5-search/</guid>
      <pubDate>Mon, 17 Mar 2025 00:00:00 GMT</pubDate>
      <description>How the Federal Regulatory Data Hub implements full-text search across 35M records using SQLite FTS5 in Cloudflare D1: virtual table creation with the unicode61 tokenizer and content= shadow-table pattern, BM25 scoring with weighted columns (10× entity_name, 5× description, 1× narrative), highlight() and snippet() functions for context extraction, buildFts5Query() TypeScript alias expansion, Promise.all cross-dataset fan-out across 5 D1 shards, trigger-based index maintenance, and weekly optimize via Cloudflare Cron.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Building the Federal Regulatory Data Hub on Cloudflare D1: 35M records at the edge</title>
      <link>https://ai-analytics.org/writing/regulatory-data-hub-cloudflare-d1/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/regulatory-data-hub-cloudflare-d1/</guid>
      <pubDate>Mon, 10 Mar 2025 00:00:00 GMT</pubDate>
      <description>How we built a 35M-record federal regulatory database on Cloudflare D1 — per-vertical SQLite tables across 197 datasets, daily cron ingest, FTS5 for free-text datasets, and vertical sharding past the 10GB limit.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s measurement retention policy: hot, warm, and cold tiers for 2.2B probe results</title>
      <link>https://ai-analytics.org/writing/voidly-measurement-retention/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-measurement-retention/</guid>
      <pubDate>Wed, 05 Mar 2025 00:00:00 GMT</pubDate>
      <description>How Voidly manages storage for 2.2B probe measurements using a three-tier TimescaleDB retention policy — full-resolution hot tier (0-30 days), native-compressed warm tier (31-365 days, 6.2x ratio), and downsampled cold tier (&gt;365 days, aggregates only), with continuous aggregate cascade, pg_cron compliance verification, and R2 tiered storage planned for Q3 2026.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s measurement database: 2.2B probe results in TimescaleDB</title>
      <link>https://ai-analytics.org/writing/voidly-timescaledb/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-timescaledb/</guid>
      <pubDate>Sat, 01 Mar 2025 00:00:00 GMT</pubDate>
      <description>How Voidly stores and queries 2.2 billion censorship probe results in TimescaleDB: hypertable design with 1-day chunk intervals and secondary country partitioning, 6.2× compression, continuous aggregates for country-level daily summaries, three-tier retention (hot/warm/cold), and query benchmarks for anomaly detection.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s real-time corroboration engine: fetching, aligning, and merging OONI, CensoredPlanet, and IODA data</title>
      <link>https://ai-analytics.org/writing/voidly-corroboration-engine/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-corroboration-engine/</guid>
      <pubDate>Sat, 22 Feb 2025 00:00:00 GMT</pubDate>
      <description>How Voidly&apos;s corroboration engine fetches and aligns data from three independent sources in near-real-time: tokio::join! parallel fetches with per-source timeouts, adaptive OONI polling (15m/60m/3h/6h), in-memory CensoredPlanet daily dump index, independence-weighted source agreement scoring, and retroactive nightly reprocessing against the CP daily dump.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly MCP server: 83 censorship query tools for Claude and GPT</title>
      <link>https://ai-analytics.org/writing/voidly-mcp-server/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-mcp-server/</guid>
      <pubDate>Sat, 15 Feb 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly MCP server exposes 83 tools for querying the global censorship dataset from Claude, GPT, and agent frameworks — incident lookup, measurement queries, country summaries, BGP events, shutdown forecasts, and wiring it into Claude Code.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly Parquet export pipeline: nightly snapshots from TimescaleDB to HuggingFace</title>
      <link>https://ai-analytics.org/writing/voidly-parquet-export/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-parquet-export/</guid>
      <pubDate>Sat, 08 Feb 2025 00:00:00 GMT</pubDate>
      <description>How the nightly Voidly export job extracts measurements from TimescaleDB and pushes Parquet snapshots to HuggingFace Hub: PyArrow schema with dictionary-encoded columns, server-side cursor streaming at 50K rows per round-trip, Zstandard level 3 compression, country + year_month partitioning, atomic HuggingFace commit with CommitOperationAdd, post-push SHA-256 verification, and the incremental vs. monthly full-snapshot strategy.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly open datasets on HuggingFace: structure, daily snapshots, and filter recipes</title>
      <link>https://ai-analytics.org/writing/voidly-huggingface-datasets/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-huggingface-datasets/</guid>
      <pubDate>Sat, 01 Feb 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly CC BY 4.0 measurement dataset and the OONI historical corpus are hosted on HuggingFace — Parquet snapshot structure, daily incremental updates, git-lfs versioning, and Python/R filter recipes for journalists, ML researchers, and infrastructure teams.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Censorship incident lifecycle in Voidly: from anomaly detection to verified incident to resolution</title>
      <link>https://ai-analytics.org/writing/voidly-incident-lifecycle/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-incident-lifecycle/</guid>
      <pubDate>Sun, 26 Jan 2025 00:00:00 GMT</pubDate>
      <description>How a Voidly censorship incident progresses through six states (Anomaly, MultiSourceAnomaly, Corroborated, VerifiedIncident, Resolved, FalsePositive) with transition thresholds, timing data from 847 incidents in 2024, publication timing by tier, and how lifecycle state encodes into HuggingFace dataset fields.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>From anomaly to verified incident: the Voidly confidence tier system</title>
      <link>https://ai-analytics.org/writing/voidly-confidence-tiers/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-confidence-tiers/</guid>
      <pubDate>Mon, 20 Jan 2025 00:00:00 GMT</pubDate>
      <description>How a Voidly measurement moves through three confidence tiers and what each tier means for journalists, ML researchers, and infrastructure monitoring teams using the dataset.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s Server-Sent Events streaming API: real-time censorship incident subscriptions</title>
      <link>https://ai-analytics.org/writing/voidly-streaming-api/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-streaming-api/</guid>
      <pubDate>Mon, 13 Jan 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly SSE streaming endpoint delivers censorship events in real time: GET /v1/stream with country/tier/type filtering, four event types (incident_created, incident_updated, incident_resolved, country_status_change), Last-Event-ID reconnection with 24-hour event ring buffer, Python httpx.Client and JavaScript EventSource examples, and how SSE differs from the webhook delivery system.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly API authentication: API keys, request signing, and rate limit tiers</title>
      <link>https://ai-analytics.org/writing/voidly-api-authentication/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-api-authentication/</guid>
      <pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly API handles authentication: two tiers (public/keyed), voidly_{env}_{base58} key format with PBKDF2-HMAC-SHA256 storage, D1 + KV request authentication flow, four plan tiers, HMAC-SHA256 webhook verification, key rotation without downtime, test keys, and OAuth2 for third-party integrations.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly REST API: querying the global censorship index in real time</title>
      <link>https://ai-analytics.org/writing/voidly-rest-api/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-rest-api/</guid>
      <pubDate>Mon, 06 Jan 2025 00:00:00 GMT</pubDate>
      <description>How the Voidly REST API is designed: key endpoints for incident lookup, measurement queries, country summaries, domain history, BGP events, and 7-day shutdown forecasts; cursor-based pagination, filtering, rate limits, and code samples in curl, Python, and JavaScript.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s alert delivery system: PGP-encrypted email, webhooks, and RSS for censorship incidents</title>
      <link>https://ai-analytics.org/writing/voidly-alert-delivery/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-alert-delivery/</guid>
      <pubDate>Sat, 28 Dec 2024 00:00:00 GMT</pubDate>
      <description>How Voidly gets verified censorship incidents to journalists, researchers, and monitoring systems: HMAC-signed webhook delivery with exponential-backoff retry, PGP-encrypted email for verified alerts, per-country and per-confidence-tier RSS feeds, alert deduplication by incident_id, and rate-limiting to prevent fatigue.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly incident publication: state machines, idempotent upserts, and Kafka fan-out for verified censorship events</title>
      <link>https://ai-analytics.org/writing/voidly-incident-publication/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-incident-publication/</guid>
      <pubDate>Tue, 24 Dec 2024 00:00:00 GMT</pubDate>
      <description>How Voidly transitions a censorship incident through five states (Anomaly/MultiSourceAnomaly/Corroborated/Verified/Resolved) with threshold-gated transitions, stores every state change in a TimescaleDB hypertable with SHA-256 idempotency_key, and fans out verified incidents to alert delivery and cache invalidation via three Kafka topics — with deterministic compute_incident_id() SHA-256 Rust function.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly measurement scheduler: how we decide which domains to probe and when</title>
      <link>https://ai-analytics.org/writing/voidly-measurement-scheduler/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-measurement-scheduler/</guid>
      <pubDate>Fri, 20 Dec 2024 00:00:00 GMT</pubDate>
      <description>How Voidly schedules 80-domain probe runs across 37+ nodes: domain priority tiers by OONI category code, anomaly-driven priority boosts, protocol selection per domain, ±15% jitter for anti-detection, ASN distribution for cross-ASN coverage, adaptive scheduling on anomaly detection, and per-country task budgets (CN 68, IR 74, RU 72).</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Building Voidly&apos;s classifier training dataset from OONI: ingestion, alignment, and label generation</title>
      <link>https://ai-analytics.org/writing/voidly-ooni-training-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-ooni-training-data/</guid>
      <pubDate>Sun, 15 Dec 2024 00:00:00 GMT</pubDate>
      <description>How Voidly ingests 200M+ OONI Explorer measurements, aligns them with Voidly probe data on a country-domain-date key, generates probabilistic training labels using five Snorkel-style label functions, handles OONI coverage gaps with label distillation, and constructs the labeled dataset that trains the five-class anomaly classifier.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly anomaly classifier: five interference classes, gradient boosted trees, and why we optimize for recall</title>
      <link>https://ai-analytics.org/writing/voidly-anomaly-classifier/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-anomaly-classifier/</guid>
      <pubDate>Tue, 10 Dec 2024 00:00:00 GMT</pubDate>
      <description>How the Voidly ML classifier distinguishes DNS tampering, TLS interference, HTTP blocking, BGP withdrawal, and throttling — five per-class binary models, country-specific calibration, and why 95% recall beats 95% precision when cross-source corroboration filters the noise.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Evaluating the Voidly anomaly classifier: per-country confusion matrices, precision-recall curves, and the offline test harness</title>
      <link>https://ai-analytics.org/writing/voidly-classifier-offline-test/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-classifier-offline-test/</guid>
      <pubDate>Tue, 03 Dec 2024 00:00:00 GMT</pubDate>
      <description>How Voidly evaluates the five-class censorship anomaly classifier offline before deployment: the ClassifierEvaluator test harness, per-country AUC-PR vs. AUC-ROC tradeoffs for imbalanced censorship data, F2 scoring rationale, per-country confusion matrix case studies (Iran, China, Russia), ECE calibration before and after Platt scaling, and model promotion criteria including 48-hour champion/challenger shadow mode.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s active learning loop: growing the anomaly training set with human-in-the-loop annotation</title>
      <link>https://ai-analytics.org/writing/voidly-active-learning/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-active-learning/</guid>
      <pubDate>Wed, 27 Nov 2024 00:00:00 GMT</pubDate>
      <description>How Voidly uses uncertainty sampling, Cohen&apos;s kappa inter-annotator agreement, and weekly model retrains to grow its censorship anomaly training set from 127K bootstrap labels to 275K — 500 examples/week annotated by 3 researchers each, with DVC data versioning and PSI drift detection.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Voidly&apos;s ML training pipeline: building a labeled censorship dataset from OONI measurements</title>
      <link>https://ai-analytics.org/writing/voidly-ml-training-data/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-ml-training-data/</guid>
      <pubDate>Wed, 20 Nov 2024 00:00:00 GMT</pubDate>
      <description>How Voidly constructs a labeled training dataset for the anomaly classifier from 200M+ OONI measurements: weak supervision with Snorkel-style label functions across DNS/TCP/TLS/HTTP layers, class imbalance handling, time-based train/val/test splits to prevent leakage, per-country Platt scaling calibration, and the continuous retraining pipeline.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The Voidly measurement quality filter: how we clean 200M OONI records before ML training</title>
      <link>https://ai-analytics.org/writing/voidly-measurement-quality-filter/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/voidly-measurement-quality-filter/</guid>
      <pubDate>Wed, 13 Nov 2024 00:00:00 GMT</pubDate>
      <description>How the quality filter pipeline decides which raw measurements are fit for ML training: boolean checks for control_failure (1.9% drop rate), missing_fields (0.8%), old probe version pre-2.5.0 (0.3%), and duplicates (0.2%), totalling 3.2% dropped. Includes the quality_filter() Python function, the to_feature_input() schema transformation, and why rejected measurements go to quarantine not discard.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>OONI data normalization: bridging five schema versions across 1.66M raw measurement files</title>
      <link>https://ai-analytics.org/writing/ooni-data-normalization/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ooni-data-normalization/</guid>
      <pubDate>Sat, 09 Nov 2024 00:00:00 GMT</pubDate>
      <description>How Voidly normalizes 200M+ OONI measurements across five web_connectivity schema versions (v0.2 to v0.6) into a single ML-ready format: a detect_web_connectivity_version() function using field-presence inference, AnomalyType and ConfidenceTier enums, FLAG_* bitmask constants for anomaly encoding, side-by-side normalize_v05() vs. normalize_v06() implementations, and a 95.3% pass-through rate from the drop-reason table.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Building the OONI historical corpus: 1.66M downloads, schema normalization, and the decisions behind the dataset</title>
      <link>https://ai-analytics.org/writing/ooni-historical-corpus/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/ooni-historical-corpus/</guid>
      <pubDate>Tue, 05 Nov 2024 00:00:00 GMT</pubDate>
      <description>How we processed the OONI raw measurement archive into a flat ML-ready CSV: handling probe version schema drift, normalizing test_keys across 20 measurement types, and streaming 200M+ records.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Censorship attribution via OSINT: identifying DPI vendors from network signatures, procurement records, and BGP TTL analysis</title>
      <link>https://ai-analytics.org/writing/censorship-attribution-osint/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/censorship-attribution-osint/</guid>
      <pubDate>Fri, 01 Nov 2024 00:00:00 GMT</pubDate>
      <description>How Voidly attributes censorship infrastructure to specific DPI vendors using network signatures and open-source intelligence: a six-vendor signature table (TSPU/Sandvine/NetClean/Iran ARRS/Cisco IronPort/GFW), DpiVendorSignature dataclass with a score_signature_match() function weighting RST timing (0.35), block page (0.30), injection IP (0.25), and CA SPKI (0.10), procurement scraping across five government tender portals, and case studies for Russia, Iran, and Ethiopia.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Building a digital-footprint reconnaissance pipeline for OSINT investigations</title>
      <link>https://ai-analytics.org/writing/osint-digital-footprint/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/osint-digital-footprint/</guid>
      <pubDate>Mon, 28 Oct 2024 00:00:00 GMT</pubDate>
      <description>How we build persistent cross-platform entity profiles for OSINT: passive collection from 40+ sources, graph-based identity disambiguation with calibrated edge weights, Certificate Transparency log monitoring, BGP/ASN change tracking, stylometric fingerprinting, and operational security for researchers in hostile environments.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Mapping censorship infrastructure: identifying filtering gateways, DPI vendor signatures, and blocking architecture from network signals</title>
      <link>https://ai-analytics.org/writing/censorship-infrastructure-mapping/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/censorship-infrastructure-mapping/</guid>
      <pubDate>Mon, 21 Oct 2024 00:00:00 GMT</pubDate>
      <description>How Voidly identifies the hardware and software responsible for internet censorship: blocking architecture taxonomy (L3/L4/L7-DNS/L7-HTTP), DPI vendor signatures from timing patterns (Russia&apos;s TSPU RST injection &lt; 3ms, Iran&apos;s ARRS DNS IPs, China&apos;s GFW TTL fingerprinting), ISP-level blocking fingerprints (Rostelecom vs. MTS vs. Turkcell), TTL middlebox distance analysis, OSINT cross-referencing with procurement records, and the censorship_infrastructure dataset field.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Building a distributed VPN with intelligent routing</title>
      <link>https://ai-analytics.org/writing/distributed-vpn-routing/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/distributed-vpn-routing/</guid>
      <pubDate>Tue, 15 Oct 2024 00:00:00 GMT</pubDate>
      <description>How we route around censorship with ML-driven path selection, traffic morphing, and 142 entry-node IPs.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Named entity extraction and disambiguation in the OSINT pipeline: 58M posts per day, 15,000 entity mentions per hour</title>
      <link>https://ai-analytics.org/writing/osint-entity-extraction/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/osint-entity-extraction/</guid>
      <pubDate>Thu, 10 Oct 2024 00:00:00 GMT</pubDate>
      <description>How the AI Analytics OSINT pipeline extracts, disambiguates, and stores named entity mentions from 58M social media posts per day — GPU-accelerated NER, Wikidata QID linking, cross-language transliteration, and person co-reference resolution.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Social media ingestion at scale: collecting 58M posts per day from 47 platform schemas</title>
      <link>https://ai-analytics.org/writing/social-media-ingestion/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/social-media-ingestion/</guid>
      <pubDate>Sat, 05 Oct 2024 00:00:00 GMT</pubDate>
      <description>How we collect and normalize social media data from 47 platforms into a canonical post format: three-tier collection strategy (official APIs, ActivityPub, RSS/scrape), token-bucket rate limiting with circuit breakers, FastText language detection at ingest, content-hash deduplication, and Kafka topic partitioning by platform.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>NLP pipeline for real-time sentiment analysis at scale</title>
      <link>https://ai-analytics.org/writing/nlp-pipeline-scale/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/nlp-pipeline-scale/</guid>
      <pubDate>Sat, 28 Sep 2024 00:00:00 GMT</pubDate>
      <description>Architecture of a real-time NLP pipeline: TensorFlow models, sub-2-second latency, multi-language sentiment + entity recognition.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Multilingual bot detection: an 8-feature XGBoost classifier across 14 languages with per-language Platt scaling</title>
      <link>https://ai-analytics.org/writing/multilingual-bot-detection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/multilingual-bot-detection/</guid>
      <pubDate>Tue, 24 Sep 2024 00:00:00 GMT</pubDate>
      <description>How the OSINT platform detects bot accounts across 14 languages without retraining per language: an 8-feature BotFeatureVector (posting_interval_entropy, reply_outdegree_ratio, content_cluster_density, age_velocity_zscore, quote_to_original_ratio, url_recycling_rate, cross_platform_correlation, bio_change_count_90d), Redis-bucketed perceptual hash matching with Hamming distance threshold 8, XGBClassifier with StratifiedGroupKFold on language groups, and per-language Platt scaling achieving F1 0.883-0.908 across 14 languages.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Detecting coordinated inauthentic behavior in social media at scale</title>
      <link>https://ai-analytics.org/writing/coordinated-campaign-detection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/coordinated-campaign-detection/</guid>
      <pubDate>Fri, 20 Sep 2024 00:00:00 GMT</pubDate>
      <description>How we detect coordinated amplification campaigns across 58M daily posts: MinHash LSH (128 hash functions, 16 bands, Jaccard threshold 0.80) for content similarity, Redis sorted-set burst detection (≥5 accounts within 15 minutes), seven account-feature logistic regression, network amplification ring detection, cross-platform timing joins, and a 0–100 coordination score with 70/90 thresholds for human review and auto-flagging.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Entity resolution for FEC campaign finance data: committee type taxonomy, JFC allocation, and four-pass name matching</title>
      <link>https://ai-analytics.org/writing/election-finance-entity-resolution/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/election-finance-entity-resolution/</guid>
      <pubDate>Mon, 16 Sep 2024 00:00:00 GMT</pubDate>
      <description>How the election intelligence pipeline resolves FEC committee identity across 1.3M records: the 10-code committee type taxonomy (H/S/P/X/Y/N/Q/O/I/U), JointFundraisingCommittee dataclass with JFCAllocation and resolve_jfc_participants() from Form 99, normalize_entity_name() with iterative legal-suffix stripping, and a four-pass resolution table achieving 95.5% cumulative recall.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Detecting election anomalies using statistical methods</title>
      <link>https://ai-analytics.org/writing/election-anomaly-detection/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/election-anomaly-detection/</guid>
      <pubDate>Thu, 12 Sep 2024 00:00:00 GMT</pubDate>
      <description>Benford’s Law, turnout modeling, ARIMA time-series — surfacing anomalies worth a second look.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>Statistical anomaly detection for election integrity: Benford&apos;s Law, digit uniformity, and turnout modeling</title>
      <link>https://ai-analytics.org/writing/election-statistical-methods/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/election-statistical-methods/</guid>
      <pubDate>Sat, 07 Sep 2024 00:00:00 GMT</pubDate>
      <description>The statistical methods behind AI Analytics&apos; election anomaly detection — first-digit analysis, last-digit uniformity testing, turnout z-scores, and why these signals require cross-validation with social and media data before generating an alert.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>The election intelligence pipeline: aggregating ballot data, social signals, and media coverage for real-time anomaly detection</title>
      <link>https://ai-analytics.org/writing/election-data-pipeline/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/election-data-pipeline/</guid>
      <pubDate>Mon, 02 Sep 2024 00:00:00 GMT</pubDate>
      <description>How the election intelligence pipeline ingests AP Election API feeds, state authority data, social media signals, and media coverage in real time: Kafka election.precinct_results topic partitioned by state FIPS, PrecinctResult protobuf schema, state scraper layer, ElectionSentimentConsumer and narrative divergence scoring, FIPS normalization edge cases, and p50/p99 latency targets for all four data streams.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
    <item>
      <title>How we process 2.4M social-media posts per hour</title>
      <link>https://ai-analytics.org/writing/processing-millions-posts/</link>
      <guid isPermaLink="true">https://ai-analytics.org/writing/processing-millions-posts/</guid>
      <pubDate>Fri, 30 Aug 2024 00:00:00 GMT</pubDate>
      <description>Kafka partition key design, binary COPY writes to TimescaleDB, character 4-gram MinHash LSH distributed across Redis, autoscaling on consumer lag, and a canonical normalization layer across 47 platform schemas — the full pipeline behind 58M posts/day.</description>
      <author>info@ai-analytics.org (AI Analytics)</author>
    </item>
  </channel>
</rss>
