Skip to content

Projects

Reproducible scientific software, and a couple of shipped full-stack apps — the engineering behind the research.

Research engineering

Hippocampal linguistic encoding pipeline

End-to-end, reproducible pipeline mapping language onto single-neuron activity: 57-feature / 9-layer linguistic extraction → adversarially purified GCN + SBERT embeddings → cross-validated Poisson / ridge encoding with confound controls. Scales to 435 neurons across 7,346 word-level timepoints.

Python PyTorch scikit-learn spaCy NLTK HuggingFace

QA-Emb — LLM interrogation & brain alignment toolkit

A question-answer embedding framework that extracts interpretable structure from LLM hidden states and aligns it to neural population geometry via RDM/RSA and Procrustes, with length-controlled brain-score and wavelet null models.

Python transformers sentence-transformers NumPy/SciPy

Music GLM & spike-sorting tooling

Poisson GLM suite with nested likelihood-ratio tests, circular pitch encoding, MFCC spectral PCs, and multi-system clock-drift compensation — plus a UMAP-based spike-sorting quality-triage tool for waveform clustering diagnostics.

MATLAB Python librosa UMAP

Product / full-stack

Houston Eats

An interactive web map for discovering Houston restaurants scraped from Instagram, with location filtering, marker clustering, and a CSV→geocode→JSON data pipeline. A shipped, full-stack app — concrete evidence of production engineering.

React 19 Vite Leaflet Supabase Tailwind CSS

Graduation Name Pronouncer

A B2B concept for accurate pronunciation of names (especially non-English) at graduation ceremonies. Dual-mode: model-generated pronunciation via multilingual grapheme-to-phoneme + a curated lexicon (the competitive wedge), plus self-recording capture. IPA as source of truth; all voice data self-hosted (FERPA-compliant).

Python Flask espeak-ng Piper / Kokoro TTS scikit-learn