Home

Byte-level Diffusion LM for Palindrome

Training a byte-level discrete diffusion language model to generate palindromes, comparing it to autoregressive approaches.

Per-Sample Gradients

Understanding per-sample gradients and their applications in data attribution and influence functions.