Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Gemini Team, Google DeepMind (incl. J. Adler) ยท arXiv preprint (2024)
Gemini 1.5 extends multimodal understanding to context windows of millions of tokens, with near-perfect retrieval across long documents, video and audio.