Representation Learning and Retrieval
Tags: Machine Learning, Transformers, Finetuning
A look at extending pre-trained representations with document retrieval to better solve downstream tasks.
by Madison May

Large Memory Layers with Product Keys
Tags: JAX, Transformers, Attention
Augmenting transformer language models with sparse access to large memory matrices.
by Madison May

Finetuning Transformers with JAX + Haiku
Tags: JAX, Machine Learning, Programming
A practical, code-first look at DeepMind's new Haiku library.
by Madison May

A Survey of Long-Term Context in Transformers
Tags: Machine Learning, Attention, Transformers
Exploring six noteworthy approaches for incorporating longer-term context in transformer models.
by Madison May