Skip to content
Loading Events

« All Events

  • This event has passed.

TILOS Seminar: Single location regression and attention-based models

March 27 @ 2:00 pm - 3:00 pm

Title: Single location regression and attention-based models

Speaker: Claire Boyer, Université Paris-Saclay

Abstract: Attention-based models, such as Transformer, excel across various tasks but lack a comprehensive theoretical understanding, especially regarding token-wise sparsity and internal linear representations. To address this gap, we introduce the single-location regression task, where only one token in a sequence determines the output, and its position is a latent random variable, retrievable via a linear projection of the input. To solve this task, we propose a dedicated predictor, which turns out to be a simplified version of a non-linear self-attention layer. We study its theoretical properties, by showing its asymptotic Bayes optimality and analyzing its training dynamics. In particular, despite the non-convex nature of the problem, the predictor effectively learns the underlying structure. This work highlights the capacity of attention mechanisms to handle sparse token information and internal linear structures.

Zoom: https://bit.ly/TILOS-Seminars

Details

Date:
March 27
Time:
2:00 pm - 3:00 pm
Event Category:

Organizer

TILOS
View Organizer Website

Venue

Halicioglu Data Science Institute Room 123
3234 Matthews Ln
La Jolla, CA 92093 United States
View Venue Website