Projects
Reproducible scientific software and a couple of shipped full-stack apps: the engineering behind the research.
Research engineering
Hippocampal linguistic encoding pipeline
End-to-end, reproducible pipeline mapping language onto single-neuron activity: 57-feature / 9-layer linguistic extraction → adversarially purified GCN + SBERT embeddings → cross-validated Poisson / ridge encoding with confound controls. Scales to 435 neurons across 7,346 word-level timepoints.
Python · PyTorch · scikit-learn · spaCy · NLTK · HuggingFace
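The cross-validated encoding step can be illustrated with a minimal sketch. It covers only the ridge variant, uses NumPy alone so it stays self-contained, and runs on synthetic data rather than the real recordings. The function names, the contiguous-fold scheme, and the Pearson-r score are assumptions made for illustration, not the pipeline's actual code.

```python
import numpy as np

def ridge_fit(X, y, alpha):
    """Closed-form ridge weights: (X'X + alpha*I)^-1 X'y."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_features), X.T @ y)

def cv_ridge_score(X, y, alpha=1.0, n_folds=5):
    """Mean held-out Pearson r between predicted and observed response."""
    n = len(y)
    # Contiguous folds keep neighbouring timepoints together (an assumption
    # that suits autocorrelated, word-by-word data better than shuffling).
    folds = np.array_split(np.arange(n), n_folds)
    scores = []
    for test_idx in folds:
        train = np.ones(n, dtype=bool)
        train[test_idx] = False
        # Standardize with training statistics only, to avoid leakage.
        mu = X[train].mean(axis=0)
        sd = X[train].std(axis=0) + 1e-12
        X_train = (X[train] - mu) / sd
        X_test = (X[test_idx] - mu) / sd
        y_mean = y[train].mean()
        w = ridge_fit(X_train, y[train] - y_mean, alpha)
        pred = X_test @ w + y_mean
        scores.append(np.corrcoef(pred, y[test_idx])[0, 1])
    return float(np.mean(scores))

# Synthetic stand-in data: 500 "words", 10 features, one simulated response.
rng = np.random.default_rng(0)
X = rng.standard_normal((500, 10))
true_w = rng.standard_normal(10)
y = X @ true_w + rng.standard_normal(500)

score = cv_ridge_score(X, y, alpha=1.0)
print(f"mean held-out r = {score:.2f}")
```

A Poisson variant would swap the closed-form solve for an iterative fit on spike counts; the fold logic would stay the same.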
QA-Emb — LLM interrogation & brain alignment toolkit
A question-answer embedding framework that extracts interpretable structure from LLM hidden states and aligns it to neural population geometry via RDM/RSA and Procrustes, with length-controlled brain-score and wavelet null models.
Python · transformers · sentence-transformers · NumPy/SciPy
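The two alignment ideas named above, RDM/RSA and Procrustes, can be sketched in a few lines of NumPy. This is a generic illustration on random data, not the toolkit's API: the function names are hypothetical, the RDM uses correlation distance, and the rank step ignores ties.

```python
import numpy as np

def rdm(X):
    """Representational dissimilarity matrix: 1 - Pearson r between item rows."""
    return 1.0 - np.corrcoef(X)

def _ranks(v):
    # Ordinal ranks; ties are not averaged (acceptable for continuous data).
    return np.argsort(np.argsort(v)).astype(float)

def rsa_score(X_model, X_brain):
    """Spearman-style correlation between the upper triangles of two RDMs.

    Both inputs are (n_items, n_features); feature counts may differ.
    """
    iu = np.triu_indices(X_model.shape[0], k=1)
    a = rdm(X_model)[iu]
    b = rdm(X_brain)[iu]
    return float(np.corrcoef(_ranks(a), _ranks(b))[0, 1])

def orthogonal_procrustes(A, B):
    """Orthogonal matrix R minimizing ||A @ R - B||_F (A, B same shape)."""
    U, _, Vt = np.linalg.svd(A.T @ B)
    return U @ Vt

# Random stand-in data: 30 items, 8 dimensions.
rng = np.random.default_rng(0)
A = rng.standard_normal((30, 8))
Q, _ = np.linalg.qr(rng.standard_normal((8, 8)))  # random orthogonal map
B = A @ Q

R = orthogonal_procrustes(A, B)
recovered = bool(np.allclose(A @ R, B))
self_similarity = rsa_score(A, A)
```

RSA compares geometry without requiring matched dimensions, while Procrustes needs equal shapes but returns an explicit map between the two spaces.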
Music GLM & spike-sorting tooling
Poisson GLM suite with nested likelihood-ratio tests, circular pitch encoding, MFCC spectral PCs, and multi-system clock-drift compensation — plus a UMAP-based spike-sorting quality-triage tool for waveform clustering diagnostics.
MATLAB · Python · librosa · UMAP
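A nested likelihood-ratio test for circular pitch tuning might look like the sketch below. It assumes "circular pitch encoding" means mapping pitch class to an angle and adding cosine and sine regressors. The data are simulated, the Newton solver is a bare-bones stand-in for a full GLM routine, and all names are hypothetical.

```python
import numpy as np

def fit_poisson(X, y, n_iter=50):
    """Poisson GLM (log link) fitted by Newton's method.

    Assumes the first column of X is an intercept. Returns the weights and
    the log-likelihood without the log(y!) term, which cancels in LR tests.
    """
    beta = np.zeros(X.shape[1])
    beta[0] = np.log(y.mean() + 1e-12)
    for _ in range(n_iter):
        mu = np.exp(X @ beta)
        grad = X.T @ (y - mu)
        hess = X.T @ (X * mu[:, None])
        step = np.linalg.solve(hess, grad)
        beta += step
        if np.max(np.abs(step)) < 1e-10:
            break
    mu = np.exp(X @ beta)
    return beta, float(np.sum(y * np.log(mu) - mu))

# Simulated neuron tuned to pitch class, with preferred phase 1.0 rad.
rng = np.random.default_rng(0)
pitch_class = rng.integers(0, 12, size=2000)
theta = 2 * np.pi * pitch_class / 12
y = rng.poisson(np.exp(0.5 + 0.8 * np.cos(theta - 1.0)))

ones = np.ones_like(theta)
X_null = ones[:, None]                                      # intercept only
X_full = np.column_stack([ones, np.cos(theta), np.sin(theta)])

_, ll_null = fit_poisson(X_null, y)
beta_full, ll_full = fit_poisson(X_full, y)

lr_stat = 2.0 * (ll_full - ll_null)
# With 2 extra parameters the chi-square survival function is exp(-x / 2).
p_value = float(np.exp(-lr_stat / 2.0))
preferred_phase = float(np.arctan2(beta_full[2], beta_full[1]))
```

The cosine and sine weights jointly encode tuning strength and preferred phase, so testing them together as a nested pair asks whether pitch matters at all.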
Product / full-stack
Houston Eats
An interactive web map for discovering Houston restaurants, built from listings scraped from Instagram, with location filtering, marker clustering, and a CSV→geocode→JSON data pipeline. A shipped, full-stack app that serves as concrete evidence of production engineering.
React 19 · Vite · Leaflet · Supabase · Tailwind CSS
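The CSV→geocode→JSON step could be sketched as follows, using only the Python standard library. The column names (`name`, `address`), the output keys, and the sample rows are hypothetical, and the geocoder is an injected lookup standing in for whatever geocoding service the app actually calls.

```python
import csv
import io
import json

def csv_to_records(csv_text, geocode):
    """Turn CSV rows into map-ready records, skipping rows that fail to geocode.

    `geocode` is any callable mapping an address string to (lat, lng) or None.
    """
    records = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        coords = geocode(row["address"])
        if coords is None:
            continue  # no coordinates, so no marker can be placed
        lat, lng = coords
        records.append({"name": row["name"], "lat": lat, "lng": lng})
    return records

# Made-up sample input and a dictionary acting as the geocoder.
csv_text = (
    "name,address\n"
    "Taco Stand,123 Main St\n"
    "Noodle Bar,456 Elm St\n"
    "Ghost Kitchen,unknown\n"
)
lookup = {
    "123 Main St": (29.76, -95.37),
    "456 Elm St": (29.74, -95.39),
}

records = csv_to_records(csv_text, lookup.get)
payload = json.dumps(records, indent=2)
```

Injecting the geocoder keeps the transform testable offline and makes it easy to cache results so that repeated runs do not re-query the same addresses.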
Graduation Name Pronouncer
A B2B concept for accurate pronunciation of names (especially non-English ones) at graduation ceremonies. It is dual-mode: model-generated pronunciation from multilingual grapheme-to-phoneme (G2P) conversion backed by a curated lexicon (the competitive wedge), plus capture of self-recorded pronunciations. IPA is the source of truth, and all voice data is self-hosted (FERPA-compliant).
Python · Flask · espeak-ng · Piper / Kokoro TTS · scikit-learn
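The lexicon-first lookup with a G2P fallback might be organized roughly like this. It is a sketch under assumptions: the function names are hypothetical, the single lexicon entry is illustrative, and `fake_g2p` is a placeholder for a real grapheme-to-phoneme backend.

```python
import unicodedata

def normalize(name):
    """Canonical lookup key: Unicode NFC, trimmed, case-folded."""
    return unicodedata.normalize("NFC", name).strip().casefold()

def pronounce(name, lexicon, g2p):
    """Return (ipa, source), preferring the curated lexicon over the model.

    `lexicon` maps normalized names to IPA strings; `g2p` is any callable
    that turns a name into an IPA string.
    """
    key = normalize(name)
    if key in lexicon:
        return lexicon[key], "lexicon"
    return g2p(name), "g2p"

# Illustrative curated entry; a real lexicon would be reviewed by speakers.
lexicon = {normalize("Siobhan"): "ʃɪˈvɔːn"}

def fake_g2p(name):
    # Placeholder only: a real backend would return IPA for the name.
    return f"<g2p:{name}>"

curated = pronounce("  SIOBHAN ", lexicon, fake_g2p)
fallback = pronounce("Alex", lexicon, fake_g2p)
```

Returning the source alongside the IPA lets a reviewer see which names came from the model and may need a human check before the ceremony.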