@@ -241,4 +241,6 @@ The output is the same as before:
- [Gemini: A Family of Highly Capable Multimodal Models - Technical Report](https://storage.googleapis.com/deepmind-media/gemini/gemini_1_report.pdf)
- [Fast Transformer Decoding: One Write-Head is All You Need](https://arxiv.org/abs/1911.02150)
- [Google AI Studio quickstart](https://ai.google.dev/tutorials/ai-studio_quickstart)
- [Multimodal Prompts](https://ai.google.dev/docs/multimodal_concepts)
- [Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases](https://arxiv.org/abs/2312.15011v1)
- [A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise](https://arxiv.org/abs/2312.12436v2)