Firefly Services and Firefly Creative Production operation definitions

Last updated on Jun 26, 2025

Applies to enterprise.

This page describes the capabilities of the Firefly Services APIs and Firefly Creative Production actions, and how operations are defined for each.

Firefly Services

Firefly APIs*

API action	Capability	Condition	Operation definition
Text to Image	Generate images based on a text prompt		1 Operation = 1 image generated from Firefly foundational model or custom models
Text to Image with Structure Reference	Generate images based on a prompt with optional reference image to match structure		1 Operation = 1 image generated from Firefly foundational model or custom models
Text to Image with Style Reference	Generate images based on a prompt with optional reference image to match style		1 Operation = 1 image generated from Firefly foundational model or custom models
Generative Fill	Fill the masked area of an image with an optional prompt		1 Operation = 1 image generated with the foundation Firefly model
Generative Expand	Expand an image to a new size/aspect ratio		1 Operation = 1 image generated with the foundation Firefly model
Generate Similar Images	Generate similar Images based on the reference image		1 Operation = 1 image generated with the foundation Firefly model
Generate Object Composite	Upload an image (with or without mask), such as a product photo, and utilize a text prompt to generate a seamlessly composited scene featuring the product		1 Operation = 1 image generated with the foundation Firefly model
Generate Video	Generate photorealistic or animated video clips from a text prompt, reference images, or a combination of the two.	Video Model 1, 540p resolution	0.4 operations = 1 second of generated video
		Video Model 1, 720p resolution	1 operation = 1 second of generated video
		Video Model 1, 1080p resolution	2 operations = 1 second of generated video

^*Firefly APIs include parameters that allow for multiple images to be generated with each API call.

Photoshop APIs

API action	Capability	Operation definition
Remove Background	Isolate subject of interest in an image and remove background	1 Operation = 1 API call
Create Mask	Isolate a subject of interest in an image (people, objects, etc) and generate an image mask	1 Operation = 1 API call
Create Renditions	Create flat image representations of a PSD	1 Operation = 1 API call
Create actionJSON	Convert .atn file to an actionJSON format	1 Operation = 1 API call
Execute actionJSON	Execute a Photoshop Action with actionJSON format	1 Operation = 1 API call
Execute Photoshop Actions	Executes Photoshop Action file against a PSD, JPEG, PNG, or TIFF	1 Operation = 1 API call
Replace Smart Object	Replace Smart Object in a PSD	1 Operation = 1 API call
Product Crop	Smart crop an image keeping the subject of interest in view	1 Operation = 1 API call
Depth Blur	Apply depth blur to an image input	1 Operation = 1 API call
Edit Text Layer	Change the contents of a text layer in a PSD, e.g. for localization	1 Operation = 1 API call
Apply PSD Edits	Apply basic layer edits (name, state, etc), add/edit adjustment, pixel, and shape layers	1 Operation = 1 API call
Create New PSD	Create new PSD with layers	1 Operation = 1 API call
Create an Artboard	Create artboards from multiple PSD inputs	1 Operation = 1 API call
Get Manifest	Extract metadata from PSD document, including general file and layer information	1 Operation = 1 API call

Lightroom APIs

API action	Capability	Operation definition
Straighten Image	Auto-straighten an image	1 Operation = 1 API call
Autotone	Apply autotone to an image	1 Operation = 1 API call
Edit Image	Adjust Lr adjustments to a photo, including exposure, contrast, detail, etc.	1 Operation = 1 API call
Apply Preset	Apply Lr preset to a photo	1 Operation = 1 API call
Apply XMP	Add XMP metadata to a photo	1 Operation = 1 API call

InDesign APIs

API action	Capability	Operation definition
Data Merge	Generate multiple asset variations by utilizing variable fields from an InDesign-tagged document and a CSV file	1 Operation = 1 output page
Rendition	Create JPEG, PNG, or PDF from the InDesign document	1 Operation = 1 output page
Custom Scripts	Create and execute custom scripts using InDesign features	1 Operation = 1 output* or input page, whichever is higher *Output includes any files saved through Save, Save As, or Save A Copy, and any files created through Export.

Video & Audio APIs

API action	Capability	Operation definition
Reframe	Identify actions in a video and reframe the clip for different aspect ratios	<= 30 FPS: 1 Operation = 60 sec of output video > 30 FPS and <= 60 FPS: 1 Operation = 30 sec of output video > 60 FPS: 1 Operation = 15 sec of output video
Text to Avatar	Convert text scripts into videos featuring lifelike human avatars with a variety of pre-shot avatar selections and natural sounding voices	1 Operation = 5 seconds of output video
Text to Speech	Transform written text into spoken speech, allowing businesses to create dynamic audio	1 Operation = 10 minutes of output speech

Video & Audio APIs, Add-On

API action	Capability	Operation definition
Transcribe	Generate transcripts and captions for audio and video content in the original or translated language	1 Operation = 10 mins of input video or audio
Dub	Automate audio and video voice translation with optional lip-syncing, producing high-quality dubbing output regardless of background noise and music	Without lipsync: 1 Operation = 10 seconds of output video or audio
Dub		With lip sync: 1 Operation = 5 seconds of output video

Content tagging APIs

API action	Capability	Operation definition
Color Tagging	Given an image, can compute a histogram of pixel colors and sort them by dominant colors into buckets	1 Operation = 1 API call per asset
Keyword Tagging	Given a text document, the keyword tagging service automatically extracts keywords or key phrases that best describe the subject of the document	1 Operation = 1 API call per asset
OCR	Given an image, can indicate if text is present in the image. If text is present, OCR can return the text	1 Operation = 1 API call per asset

Substance 3D APIs

API action	Capability	Condition	Operation definition
Generate 3D Object Composite API^**	Seamlessly composite 3D objects into Firefly-generated 2D scenes to create pixel-perfect fidelity compositions, with photorealistic lighting, shadows, and textures	Firefly Image Model 4: Less than or equal to 2000 pixels on the longest side	2 Operations = 1 output image (or variant)
		Firefly Image Model 4: Greater than 2000 pixels on the longest side	3 Operations = 1 output image (or variant)
		Firefly Image Model 4 Ultra: Less than or equal to 2000 pixels on the longest side	3 Operations = 1 output image (or variant)
		Firefly Image Model 4 Ultra: Greater than 2000 pixels on the longest side	4 Operations = 1 output image (or variant)
Render 3D Object API^**	Render pixel-perfect product pack shots at scale by automating the creation of photorealistic pack shots with precise lighting, shadows, and reflections	Less than or equal to 2000 pixels on the longest side	1 Operation = 1 output image
Render 3D Object API^**		Greater than 2000 pixels on the longest side	2 Operations = 1 output image
Create 3D Scene API	Merge multiple 3D assets into a cohesive and detailed scene designed to host a digital twin		1 Operation = 1 output file
Convert 3D Files API	Convert existing 3D content in formats such as FBX and OBJ into more universal file formats like gITF and USD		1 Operation = 1 output file
Describe 3D Scene API	Generate detailed statistics about 3D scene content		No charge

^** Generate 3D Object Composite API and Render 3D Object API include parameters that allow for multiple images to be generated with each API call.

Firefly Creative Production

Action	Capability	Operation definition
Remove Background (flat image output)	Isolate the subject of interest in an image, remove the background, and output a flat image file	1 Operation = 1 output file
Crop	Crop image sizes or aspect ratios	2 Operations = 1 output file
Color Grade	Auto tone or apply Lightroom presets to batches of uploaded images	1 Operation = 1 output file

Get help faster and easier

New user?