Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

8 March 2024

Gemini Team, Google DeepMind (incl. J. Adler) · arXiv preprint (2024)

Gemini 1.5 extends multimodal understanding to context windows of millions of tokens, with near-perfect retrieval across long documents, video and audio.

Preprint