I build data systems that ship.
The kind that pay for themselves. Models that keep the accounts worth keeping, aim spend at the customers who convert, and stop losing tests before they cost a quarter. Forty-five live labs below, filterable. Click anything.
§ I The research index
Forty-five labs.
Hover any tile.
Color tells you what kind of work it is. Hover for the details. Click to open. Filter by category, or search by topic.
Churn Model
Finds the customers about to leave. Saves revenue already paid to acquire.
Customer Segmentation
Sorts customers by value. Aims spend at the 21% who generate 64% of revenue.
Recommendation Engine
Guesses what they'll watch next. Every basis point of watch-time is a subscription renewed.
A/B Test Simulator
Shows when a winning version is real and when it's noise. Stops losing tests early.
Funnel Simulation
Finds the one customer segment quietly killing conversion without breaking the rest.
Sketch Recognition
Draw anything. A small neural network running on your laptop guesses what it is, in real time.
Model Picker
Ask in plain language: "cheapest model good at code." Get a side-by-side comparison.
Influencer Mix Modeling
A working Bayesian Marketing Mix Model for a creator-driven D2C brand. Eight channels, 104 weeks, full posterior intervals.
Token Lab
A live Byte Pair Encoding visualizer. Type text, watch merges happen step by step, see exactly how GPT turns words into numbers.
Bandit Lab
A live multi-armed bandit simulator. Four strategies, hidden reward probabilities, and real-time regret curves that prove why exploration matters.
Diffusion Lab
A toy 2D diffusion model in your browser. Draw a shape, watch it dissolve into noise, then walk it back step by step. The engine behind DALL-E.
Viral Nation Case Study
Interview deliverable for Viral Nation. Hosted externally on Vercel. Opens in a new tab.
AI & Jobs
Type your job. See its AI exposure, robotics exposure, and BLS projection through 2030.
Time-Use Atlas
How Americans actually spend 24 hours, drawn as a clock. Pick a cohort, watch the day redraw.
Algorithm Watchdog
A daily-refreshed read on what YouTube's algorithm is rewarding right now.
AGI Horizon
Two hundred fifty AGI predictions plotted on year said vs year targeted. Twenty years out, since 1965.
The Productivity Mystery
U.S. labor productivity broke in 1973 and again in 2004. Seven datasets, seven explanations, one survives.
How LLMs Learn
A trillion words, a handful of equations. Eight charts on what actually happens inside an LLM.
Escape Velocity
Are we approaching the singularity? Seven charts. The inputs are bending up. Is intelligence following?
Semiconductor Cartography
Thirty facilities, seven supply-chain layers, the chokepoints that could stall the AI chip industry.
Tech Predictions That Never Happened
Eighty-two confident, dated technology predictions. Each named a year. The year came. Most did not.
The Productivity Paradox
Ninety-one predictions about when IT and AI would show up in productivity statistics, against the BLS series.
The Skills Surge
Six months of Claude Skills, mapped. Power-law star distribution, surge timeline, architecture quadrant, and the verbs every skill author opens with.
Inside Agent Memory
The 2026 map of agent memory. MEMORY.md vs vector DBs vs Mem0 vs Zep vs the hybrids in between, with benchmarks, costs, and a decision matrix.
Cosmic Engines
An interactive 3D field guide to the eight extreme objects that light the universe. Quasars, black holes, pulsars, and the violent machinery between.
Reservoir Teacup Map
Twenty Western reservoirs drawn as USBR teacup diagrams. Current storage versus historical average, plus a Mead/Powell elevation cliff and a snow-to-storage scatter.
Neurotransmitter Atlas
A chapter-style 3D neuron, synapse close-up, and brain map for six major neurotransmitter systems.
Better?
A scrubbable ledger of one hundred indicators of human progress and regress, from primary sources.
Bubble Watch
Six charts on whether the U.S. market is in an AI bubble. CAPE, concentration, hyperscaler capex vs frontier-lab revenue, margin debt, capability scaling, and a five-peak comparison.
The Price of Intelligence
Eleven commodities, seven centuries. A stress test of the claim that AI is the first commodity with infinite demand. 1,000x in 3 years for intelligence vs 12,000x over 700 years for light.
The Rework Tax
Eight datasets stress-test the viral claim that 82% of AI tokens go to rework. The direction is right. The number is marketing. The baseline is from 1978.
Inside DeepSeek-V4
Five things most people assume modern LLMs do, and what DeepSeek-V4 does instead.
Inside Subquadratic
A real Miami AI startup, $29M seed, $19.6M GPU contract, benchmark numbers no one has reproduced.
Attention Is All You Need
An interactive infographic of the 2017 Transformer paper. 6×6 attention heatmap, 8 heads, complexity slider.
Inside LeWorldModel
The 15M JEPA world model that replaced seven loss terms with one Gaussian-matching regularizer.
The Model Atlas
An expert system over 13 ML models. Five questions, one match, with reasons.
Linear Regression
Fitting a line. Drag points, watch coefficients update in real time.
Logistic Regression
The simplest classifier worth deploying. Plant points, watch gradient descent slide a probability boundary into place.
Ridge & Lasso
Two regularizers, side by side. Watch coefficients shrink, and Lasso zero them out.
Decision Tree
Recursive splits made visible. Depth slider shows over- and underfit.
Random Forest
A committee of decision trees. Many imperfect trees beat one perfect one.
Gradient Boosting
Each tree fixes the last tree's mistakes. The technique behind XGBoost.
Support Vector Machine
The widest margin wins. Soft-margin linear SVM, live SGD on hinge loss.
Multi-Layer Perceptron
Layers of nonlinearity. Live backprop, one batch at a time.
Naive Bayes
Bayes' theorem applied bluntly. Wrong assumption, surprisingly useful answer.
K-Nearest Neighbors
No training. Just memory. Predict by voting among k neighbors.
K-Means
Lloyd's algorithm, drawn live. Pick k, watch points snap to centroids.
DBSCAN
Density-based clustering, no k needed. Eps and minPts control everything.
Isolation Forest
Outliers are easy to isolate. Random splits, short paths to anomalies.
§ II Shipped, end-to-end
Dirty data,
clean production.
End-to-end builds, not slideware. Each one shipped with the plumbing, the UI, and the deploy. Usually that's three teams and two quarters of work. Below is what happens when one person owns it.
RiskScore
A vulnerability scoring service that ingests CVE feeds, weights them against deployment context, and emits a single explainable risk score per asset.
Glass Cipher
An intelligence globe that plots open-source signals in real time. Sources are re-weighted by a small language model so clusters resolve into narratives, not noise.
Minecraft Clone
A voxel sandbox built from scratch. Chunked world, procedural terrain, block place/break, day-night lighting. Claude Code wrote the engine loop; Python and noise functions shape every biome.
Claude Second Brain
A note graph where Claude backfills links, summaries and open questions. Runs locally against a vault.
Open Claw Mission Control
A control surface for a DIY claw machine. Live camera, queueing, telemetry and a leaderboard.
iOS · three small apps
Assign work to your future self. Re-asks you at the right times instead of at the wrong ones.
Extracts a 5-color palette from any photo with a deliberately slow, gestural picker.
A journal that flags second-order consequences before you commit to them. Prompts pull from your own past entries.
§ III Education
Georgia Institute of Technology
Temple University
Plus 25+ certifications: machine learning (U. Washington), data science (CU Boulder, Duke), people management (Google). full list on linkedin ↗
Let’s build
something.
Best first move: send the job and the numbers it needs to hit. I reply within two business days with relevant examples and code.