01 / LLM inference research
duo-laguna-adapter
Poolside Laguna Research Hackathon submission with Cameron Wheeler. Adapted DuoAttention to Laguna's gated attention architecture and used learned head importance for a mixed-precision KV-cache policy, reducing cache size by up to 53%.
- PyTorch
- LLM inference
- KV cache
- Quantization
02 / Life science tooling
bio-agents-mcp
MCP servers that make protein and small-molecule data available to AI agents, with local Ollama-based testing for a practical research workflow.
03 / Transformer fundamentals
jax-sentiment-analysis
A from-scratch Transformer implementation in JAX, Flax and Optax, applied to sentiment analysis to explore attention, training loops and model internals directly.
group-unet
An experiment in rotation equivariance: Group Convolution U-Nets evaluated on butterfly segmentation, with tracked model comparisons and a technical write-up.
05 / Model implementation
flax-u2net
A from-scratch JAX and Flax implementation of U²-Net for salient-object detection, translating a non-trivial vision architecture into a functional ML stack.
06 / Reinforcement learning
ball-balancer
Distributed PPO training for Unity ML-Agents' 3D Ball environment. Uses MPI to run parallel environments and accelerate policy learning.