Loading Events

« All Events

  • This event has passed.

TILOS Seminar: Single location regression and attention-based models

March 27 @ 2:00 pm - 3:00 pm

Title: Single location regression and attention-based models

Speaker: Claire Boyer, Université Paris-Saclay

Abstract: Attention-based models, such as Transformer, excel across various tasks but lack a comprehensive theoretical understanding, especially regarding token-wise sparsity and internal linear representations. To address this gap, we introduce the single-location regression task, where only one token in a sequence determines the output, and its position is a latent random variable, retrievable via a linear projection of the input. To solve this task, we propose a dedicated predictor, which turns out to be a simplified version of a non-linear self-attention layer. We study its theoretical properties, by showing its asymptotic Bayes optimality and analyzing its training dynamics. In particular, despite the non-convex nature of the problem, the predictor effectively learns the underlying structure. This work highlights the capacity of attention mechanisms to handle sparse token information and internal linear structures.

Zoom: https://bit.ly/TILOS-Seminars

Details

  • Date: March 27
  • Time:
    2:00 pm - 3:00 pm
  • Event Category:

Organizer

Venue

  • Halicioglu Data Science Institute Room 123
  • 3234 Matthews Ln
    La Jolla, CA 92093 United States
  • View Venue Website