Send an Image as a Prompt to Gemini using the API

varunkadapatti · August 9, 2024, 3:24pm

I have watched the very well crafted tutorial on “How To Use Gemini API” and it was worked extremely well for text inputs.

However, now I want to upload images to the Gemini API. How do i achieve this?
I am not sure how to send an image to the API and then get the required output because the prompt should be IMAGE + TEXT together.

Can someone please advise me on this?

Thank you.
Regards,
Varun Kadapatti

varunkadapatti · August 12, 2024, 2:05pm

Can someone please guide me as to how to achieve this if possible?
Sending a link for the picture does not work…

matt_conroy · August 12, 2024, 2:21pm

Could you link the documentation from Google for uploading images on Gemini?

varunkadapatti · August 15, 2024, 10:41am

Yes! Here are the links to the same.
File prompting strategies | Gemini API | Google AI for Developers

Explore vision capabilities with the Gemini API | Google AI for Developers

Thank you!

varunkadapatti · August 21, 2024, 1:19pm

Hello, I’ve made few advancements in trying to prompt Gemini with images using the Gemini API.
This is the JSON code I have used in the Web API’s Body.
However, it always returns a “null” value for some reason as the output.
After investigating in Google Cloud Console, it seems that the Gemini API does not receive the request at all.
I have ensured that the API key is up-to-date and it is functioning in my other use cases.
Please can someone guide me on this.
Thank you.

WEB API BODY:
{
“contents”: [
{
“role”: “user”,
“parts”: [
{
“fileData”: {
“fileUri”: “Imgur: The magic of the Internet”,
“mimeType”: “image/jpeg”
}
},
{
“text”: “Please solve this mathematics question”
}
]
}
],
“generationConfig”: {
“temperature”: 1,
“topK”: 64,
“topP”: 0.95,
“maxOutputTokens”: 8192,
“responseMimeType”: “text/plain”
}
}

For context I am making an AI Math question solver

system · November 19, 2024, 1:19pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Getting Started with Google Gemini API Free Templates, Designs, and Block Combinations ai , webapi	17	2478	November 8, 2024
GPT-4o vision capability Web API's beginner , help	4	121	July 15, 2024
How to access image taken by thunkable Camera by ChatGPT Web API's	8	102	October 29, 2024
Chat AI & Image Generator (aka Synthi) Questions about Thunkable	43	4914	November 27, 2023
Call open ai get object help Web API's	4	27	June 24, 2025

Send an Image as a Prompt to Gemini using the API

Related topics