These functions prepare images for input to the chatbot. content_image_url() provides a URL to an image, content_image_file() provides the image data itself, and content_image_plot() provides an image of the current plot.
Usage
content_image_url(url, detail = c("auto", "low", "high"))
content_image_file(path, content_type = "auto", resize = "low")
content_image_plot(width = 768, height = 768)
Arguments
- url
The URL of the image to include in the chat input. Can be a data: URL or a regular URL. Valid image types are PNG, JPEG, WebP, and non-animated GIF.
- detail
The detail setting for this image. Can be "auto", "low", or "high".
- path
The path to the image file to include in the chat input. Valid file extensions are .png, .jpeg, .jpg, .webp, and (non-animated) .gif.
- content_type
The content type of the image (e.g. image/png). If "auto", the content type is inferred from the file extension.
- resize
If "low", resize images to fit within 512x512. If "high", resize to fit within 2000x768 or 768x2000. (See the OpenAI docs for more on why these specific sizes are used.) If "none", do not resize.
You can also pass a custom string to resize the image to a specific size, e.g. "200x200" to resize to 200x200 pixels while preserving aspect ratio. Append > to resize only if the image is larger than the specified size, and ! to ignore aspect ratio (e.g. "300x200>!").
All values other than "none" require the magick package.
- width, height
Width and height in pixels.
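The resize settings described above can be compared side by side. The following is a minimal sketch, assuming the ellmer and magick packages are installed; it reuses the bundled httr2.png file from the Examples section, and the variable names are illustrative only.

```r
library(ellmer)

# Sample image shipped with ellmer (the same file used in the Examples)
path <- system.file("httr2.png", package = "ellmer")

# Default setting: fit within 512x512 (needs the magick package)
img_low <- content_image_file(path, resize = "low")

# Send the file as-is; the only setting that does not need magick
img_raw <- content_image_file(path, resize = "none")

# Custom size: fit within 200x200 pixels, preserving aspect ratio
img_small <- content_image_file(path, resize = "200x200")

# Resize only if larger than 300x200, and ignore aspect ratio
img_forced <- content_image_file(path, resize = "300x200>!")
```

None of these calls contacts the chatbot; each one only prepares an input object that can be passed to a chat method later.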
Value
An input object suitable for including in the ... parameter of the chat(), stream(), chat_async(), or stream_async() methods.
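Input objects can be built ahead of time and supplied to a chat method afterwards. The sketch below constructs one from a regular URL and one from a data: URL. It assumes the ellmer and jsonlite packages are installed; encoding the file with jsonlite::base64_enc() is just one possible way to produce a data: URL, not something this page prescribes, and the variable names are illustrative.

```r
library(ellmer)

# Regular URL, requesting the "high" detail setting
img_remote <- content_image_url(
  "https://www.r-project.org/Rlogo.png",
  detail = "high"
)

# Build a data: URL from the sample PNG bundled with ellmer
# (assumption: base64 encoding via jsonlite, chosen for illustration)
path <- system.file("httr2.png", package = "ellmer")
raw_bytes <- readBin(path, what = "raw", n = file.size(path))
data_url <- paste0("data:image/png;base64,", jsonlite::base64_enc(raw_bytes))

# content_image_url() accepts data: URLs as well as regular ones
img_inline <- content_image_url(data_url)
```

Either object could then be passed alongside a text prompt, e.g. chat$chat("What is this?", img_inline), which does require a configured chat object and API access.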
Examples
chat <- chat_openai(echo = TRUE)
#> Using model = "gpt-4o".
chat$chat(
  "What do you see in these images?",
  content_image_url("https://www.r-project.org/Rlogo.png"),
  content_image_file(system.file("httr2.png", package = "ellmer"))
)
#> The first image is the logo for the R programming language, featuring a
#> blue letter "R" overlaid on a gray circular shape.
#>
#> The second image is the logo for the "httr2" library in R. It features a
#> red silhouette of a baseball player swinging a bat, with "httr2" written
#> in a stylized font above it.
plot(waiting ~ eruptions, data = faithful)
chat <- chat_openai(echo = TRUE)
#> Using model = "gpt-4o".
chat$chat(
  "Describe this plot in one paragraph, as suitable for inclusion in
  alt-text. You should briefly describe the plot type, the axes, and
  2-5 major visual patterns.",
  content_image_plot()
)
#> The plot is a logarithmic spiral graph that represents a polar plot with
#> evenly spaced points along the curve. It features a central origin from
#> which the spiral begins and expands outward in a clockwise direction. The
#> distance between the spiral arms gradually increases as it moves away
#> from the center. Major patterns include the consistent angular separation
#> of points along the spiral, showcasing symmetry and uniform growth. The
#> spiral’s trajectory demonstrates exponential increase in radius as it
#> extends, creating a smooth and continuous curve.