Portfolio

Pipelines shipped, products built, and tools deployed — real work with real outputs.

Internal InfrastructureProduction System

Built a config-driven B2B data pipeline from scratch

Multi-stage pipeline covering scraping, enrichment, deduplication, email extraction, and clean CSV output. Processes hundreds of thousands of records across healthcare and professional verticals. Deployed on self-hosted VPS with orchestration via Raspberry Pi, Discord alerting, and automated delivery. Powers the entire CRK Dev dataset catalog.

12
datasets built
40,000+
records processed
Self-hosted
infrastructure
PythonPlaywrightBeautifulSoupFastAPILinuxHetzner
Data ProductLive & Selling

12 production B2B datasets across healthcare and professional verticals

Built, enriched, and packaged 12 targeted B2B datasets from public sources. Each dataset goes through a full pipeline — scrape, enrich, dedupe, clean, package. Integrated with Stripe checkout and automated file delivery via Cloudflare R2 and Resend.

12
products live
40,000+
total records
Automated
delivery on purchase
Data EngineeringStripeCloudflare R2Resend
Web ApplicationLive

Designed and built a full-stack B2B data products storefront

Next.js storefront with 35 pages, SEO-optimized product pages, Stripe checkout, automated CSV delivery, self-hosted on Hetzner with Nginx, GitHub Actions auto-deploy, and Cloudflare DNS. Built and deployed in a single session.

35
pages
Zero
build errors
Automated
deployment pipeline
Next.jsTypeScriptStripeNginxGitHub Actions
Internal ToolInternal

Built an internal pipeline command center at ops.crkdev.com

Real-time operations dashboard for monitoring Spectral pipeline status, dataset registry, and run history. Built with FastAPI, Jinja2, and HTMX. Deployed on a GMKtec K6 mini PC with Nginx and accessible over Tailscale.

FastAPIHTMXNginxPythonTailscale
Desktop ApplicationShipped

Drag-and-drop CSV/XLSX merger — packaged as a Windows desktop app

Purpose-built Windows utility for merging multiple CSV and XLSX files. Drag-and-drop interface, clean output, no Excel required. Packaged with PyInstaller as a standalone .exe — no install, no dependencies.

PythonPyInstallerWindowsCSV
CLI ToolShipped

Command-line CSV cleaning and normalization tool

Python CLI that normalizes messy CSV files — trims whitespace, standardizes phone and email formats, fixes encoding issues, removes blank rows, and outputs a clean file. Available as CLI or packaged as a Windows app.

PythonCLICSVWindows
Client ProjectClient Work

Built a targeted scraper for a client's public records research workflow

Scraped publicly available auction listing data for a client conducting property research. Handled pagination, dynamic content, and inconsistent formatting. Delivered clean, structured CSV output ready for analysis. Client details kept confidential.

PythonPlaywrightCSVData Extraction

Additional project details, code samples, and references available on request during discovery.