[Thesis Defense] Morris Yau: On Structure, Parallelism, and Approximation in Modern Neural Sequence Modeling

Speaker

Morris Yau

Host

Jacob Andreas

MIT CSAIL

Abstract: Is there an algorithm that learns the best fit parameters of a Transformer to any dataset? If I trained a neural sequence model and promised you it is equivalent to a program, how would you even be convinced? Modern RNNs are functions that admit parallelizable recurrence; what is the design space of parallelizable recurrences? Are there unexplored function families that lie between RNNs and Transformers? We explore these questions from first principles starting with state, polynomials, and parallelism.

Add to Calendar 2026-02-06 9:30:00 2026-02-06 10:30:00 America/New_York [Thesis Defense] Morris Yau: On Structure, Parallelism, and Approximation in Modern Neural Sequence Modeling Abstract:  Is there an algorithm that learns the best fit parameters of a Transformer to any dataset? If I trained a neural sequence model and promised you it is equivalent to a program, how would you even be convinced? Modern RNNs are functions that admit parallelizable recurrence; what is the design space of parallelizable recurrences? Are there unexplored function families that lie between RNNs and Transformers? We explore these questions from first principles starting with state, polynomials, and parallelism. TBD

[Thesis Defense] Morris Yau: On Structure, Parallelism, and Approximation in Modern Neural Sequence Modeling

Speaker

Host

February 06 2026

Location