API action
Applies to enterprise.
This page describes the capabilities of the Firefly Services APIs and Firefly Creative Production actions, and how operations are defined for each.
Firefly Services
Firefly APIs*
API action | Capability | Condition | Operation definition |
---|---|---|---|
Text to Image | Generate images based on a text prompt | | 1 Operation = 1 image generated from Firefly foundational model or custom models |
Text to Image with Structure Reference | Generate images based on a prompt with optional reference image to match structure | | 1 Operation = 1 image generated from Firefly foundational model or custom models |
Text to Image with Style Reference | Generate images based on a prompt with optional reference image to match style | | 1 Operation = 1 image generated from Firefly foundational model or custom models |
Generative Fill | Fill the masked area of an image with an optional prompt | | 1 Operation = 1 image generated with the foundation Firefly model |
Generative Expand | Expand an image to a new size/aspect ratio | | 1 Operation = 1 image generated with the foundation Firefly model |
Generate Similar Images | Generate similar images based on the reference image | | 1 Operation = 1 image generated with the foundation Firefly model |
Generate Object Composite | Upload an image (with or without mask), such as a product photo, and use a text prompt to generate a seamlessly composited scene featuring the product | | 1 Operation = 1 image generated with the foundation Firefly model |
Generate Video | Generate photorealistic or animated video clips from a text prompt, reference images, or a combination of the two | Video Model 1, 540p resolution | 0.4 operations = 1 second of generated video |
Generate Video | | Video Model 1, 720p resolution | 1 operation = 1 second of generated video |
Generate Video | | Video Model 1, 1080p resolution | 2 operations = 1 second of generated video |
* Firefly APIs include parameters that allow for multiple images to be generated with each API call.
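The arithmetic implied by the table can be sketched in a few lines of Python. This is an illustrative sketch only: the function names and the resolution keys are hypothetical and are not part of any Adobe SDK. The page does not say whether fractional totals are rounded, so the sketch returns exact values and leaves rounding out.

```python
from fractions import Fraction

# Operations per second of generated video, taken from the Generate Video rows.
# The dictionary keys are illustrative labels, not API parameter values.
VIDEO_OPS_PER_SECOND = {
    "540p": Fraction(2, 5),  # 0.4 operations per second
    "720p": Fraction(1),
    "1080p": Fraction(2),
}


def image_operations(images_generated: int) -> int:
    """1 operation per generated image, so a call that returns n images counts as n."""
    if images_generated < 0:
        raise ValueError("images_generated must be non-negative")
    return images_generated


def video_operations(seconds: int, resolution: str) -> Fraction:
    """Exact operation count for a clip of the given length (no rounding applied)."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    return VIDEO_OPS_PER_SECOND[resolution] * seconds
```

For example, `video_operations(5, "540p")` evaluates to 2 operations, and a single Text to Image call that returns four images counts as `image_operations(4)`, or 4 operations.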
Photoshop APIs
API action | Capability | Operation definition |
---|---|---|
Remove Background | Isolate subject of interest in an image and remove background | 1 Operation = 1 API call |
Create Mask | Isolate a subject of interest in an image (people, objects, etc) and generate an image mask | 1 Operation = 1 API call |
Create Renditions | Create flat image representations of a PSD | 1 Operation = 1 API call |
Create actionJSON | Convert .atn file to an actionJSON format | 1 Operation = 1 API call |
Execute actionJSON | Execute a Photoshop Action with actionJSON format | 1 Operation = 1 API call |
Execute Photoshop Actions | Executes Photoshop Action file against a PSD, JPEG, PNG, or TIFF | 1 Operation = 1 API call |
Replace Smart Object | Replace Smart Object in a PSD | 1 Operation = 1 API call |
Product Crop | Smart crop an image keeping the subject of interest in view | 1 Operation = 1 API call |
Depth Blur | Apply depth blur to an image input | 1 Operation = 1 API call |
Edit Text Layer | Change the contents of a text layer in a PSD, e.g. for localization | 1 Operation = 1 API call |
Apply PSD Edits | Apply basic layer edits (name, state, etc), add/edit adjustment, pixel, and shape layers | 1 Operation = 1 API call |
Create New PSD | Create new PSD with layers | 1 Operation = 1 API call |
Create an Artboard | Create artboards from multiple PSD inputs | 1 Operation = 1 API call |
Get Manifest | Extract metadata from PSD document, including general file and layer information | 1 Operation = 1 API call |
Lightroom APIs
API action | Capability | Operation definition |
---|---|---|
Straighten Image | Auto-straighten an image | 1 Operation = 1 API call |
Autotone | Apply autotone to an image | 1 Operation = 1 API call |
Edit Image | Apply Lightroom adjustments to a photo, including exposure, contrast, detail, etc. | 1 Operation = 1 API call |
Apply Preset | Apply a Lightroom preset to a photo | 1 Operation = 1 API call |
Apply XMP | Add XMP metadata to a photo | 1 Operation = 1 API call |
InDesign APIs
API action | Capability | Operation definition |
---|---|---|
Data Merge | Generate multiple asset variations by utilizing variable fields from an InDesign-tagged document and a CSV file | 1 Operation = 1 output page |
Rendition | Create JPEG, PNG, or PDF from the InDesign document | 1 Operation = 1 output page |
Custom Scripts | Create and execute custom scripts using InDesign features | 1 Operation = 1 output* or input page, whichever is higher |
* Output includes any files saved through Save, Save As, or Save A Copy, and any files created through Export.
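The "whichever is higher" rule for Custom Scripts amounts to taking a maximum. The sketch below is a hedged illustration: the function and parameter names are hypothetical, and it assumes that inputs and outputs can each be reduced to a single count, which the page does not spell out.

```python
def page_operations(output_pages: int) -> int:
    """Data Merge and Rendition: 1 operation per output page."""
    if output_pages < 0:
        raise ValueError("output_pages must be non-negative")
    return output_pages


def custom_script_operations(input_count: int, output_count: int) -> int:
    """Custom Scripts: count inputs and outputs separately and charge the larger.

    Assumption: output_count already includes saved and exported files,
    as described in the footnote.
    """
    if input_count < 0 or output_count < 0:
        raise ValueError("counts must be non-negative")
    return max(input_count, output_count)
```

Under these assumptions, a script that reads 3 pages and produces 5 outputs counts as 5 operations, while one that reads 8 pages and produces 2 outputs counts as 8.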
Video & Audio APIs
API action | Capability | Operation definition |
---|---|---|
Reframe | Identify actions in a video and reframe the clip for different aspect ratios | |
Text to Avatar | Convert text scripts into videos featuring lifelike human avatars with a variety of pre-shot avatar selections and natural sounding voices | 1 Operation = 5 seconds of output video |
Text to Speech | Transform written text into spoken speech, allowing businesses to create dynamic audio | 1 Operation = 10 minutes of output speech |
Video & Audio APIs, Add-On
API action | Capability | Operation definition |
---|---|---|
Transcribe | Generate transcripts and captions for audio and video content in the original or translated language | 1 Operation = 10 minutes of input video or audio |
Dub | Automate audio and video voice translation with optional lip-syncing, producing high-quality dubbing output regardless of background noise and music | Without lip sync: 1 Operation = 10 seconds of output video or audio. With lip sync: 1 Operation = 5 seconds of output video |
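The duration-based definitions in the two Video & Audio tables all follow the same pattern: a fixed number of seconds per operation. The following sketch illustrates that pattern under stated assumptions. The action keys and function name are hypothetical, Reframe is omitted because no definition is listed for it, and the result is left as an exact fraction because the page does not say how partial blocks are rounded.

```python
from fractions import Fraction

# Seconds of media covered by one operation, per the tables.
# Keys are illustrative labels, not API identifiers.
SECONDS_PER_OPERATION = {
    "text_to_avatar": 5,     # output video
    "text_to_speech": 600,   # 10 minutes of output speech
    "transcribe": 600,       # 10 minutes of input video or audio
    "dub": 10,               # output, without lip sync
    "dub_lip_sync": 5,       # output video, with lip sync
}


def duration_operations(action: str, seconds: int) -> Fraction:
    """Exact operation count for the given media duration (no rounding applied)."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    return Fraction(seconds, SECONDS_PER_OPERATION[action])
```

For a 60-second clip, dubbing with lip sync works out to 12 operations and dubbing without it to 6, which shows that lip sync doubles the rate for the same output length.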
Content tagging APIs
API action | Capability | Operation definition |
---|---|---|
Color Tagging | Given an image, compute a histogram of pixel colors and sort them into buckets by dominant color | 1 Operation = 1 API call per asset |
Keyword Tagging | Given a text document, automatically extract the keywords or key phrases that best describe the subject of the document | 1 Operation = 1 API call per asset |
OCR | Given an image, indicate whether text is present and, if so, return the text | 1 Operation = 1 API call per asset |
Substance 3D APIs
API action | Capability | Condition | Operation definition |
---|---|---|---|
Generate 3D Object Composite API** | Seamlessly composite 3D objects into Firefly-generated 2D scenes to create pixel-perfect fidelity compositions, with photorealistic lighting, shadows, and textures | Firefly Image Model 4: Less than or equal to 2000 pixels on the longest side | 2 Operations = 1 output image (or variant) |
Generate 3D Object Composite API** | | Firefly Image Model 4: Greater than 2000 pixels on the longest side | 3 Operations = 1 output image (or variant) |
Generate 3D Object Composite API** | | Firefly Image Model 4 Ultra: Less than or equal to 2000 pixels on the longest side | 3 Operations = 1 output image (or variant) |
Generate 3D Object Composite API** | | Firefly Image Model 4 Ultra: Greater than 2000 pixels on the longest side | 4 Operations = 1 output image (or variant) |
Render 3D Object API** | Render pixel-perfect product pack shots at scale by automating the creation of photorealistic pack shots with precise lighting, shadows, and reflections | Less than or equal to 2000 pixels on the longest side | 1 Operation = 1 output image |
Render 3D Object API** | | Greater than 2000 pixels on the longest side | 2 Operations = 1 output image |
Create 3D Scene API | Merge multiple 3D assets into a cohesive and detailed scene designed to host a digital twin | | 1 Operation = 1 output file |
Convert 3D Files API | Convert existing 3D content in formats such as FBX and OBJ into more universal file formats like glTF and USD | | 1 Operation = 1 output file |
Describe 3D Scene API | Generate detailed statistics about 3D scene content | | No charge |
** Generate 3D Object Composite API and Render 3D Object API include parameters that allow for multiple images to be generated with each API call.
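Because these two APIs can return several images per call and the rate depends on model and output size, the count is a lookup multiplied by the number of images. The sketch below illustrates this. The model keys and function names are hypothetical and are not request parameter values.

```python
# Operations per output image: (longest side <= 2000 px, longest side > 2000 px).
# Keys are illustrative labels for Firefly Image Model 4 and Model 4 Ultra.
COMPOSITE_OPS = {
    "image4": (2, 3),
    "image4_ultra": (3, 4),
}
RENDER_OPS = (1, 2)
SIZE_THRESHOLD_PX = 2000


def _per_image(rates: tuple, longest_side_px: int) -> int:
    """Pick the small-image or large-image rate based on the longest side."""
    small, large = rates
    return small if longest_side_px <= SIZE_THRESHOLD_PX else large


def composite_operations(model: str, longest_side_px: int, images: int = 1) -> int:
    """Generate 3D Object Composite: rate depends on model and output size."""
    return _per_image(COMPOSITE_OPS[model], longest_side_px) * images


def render_operations(longest_side_px: int, images: int = 1) -> int:
    """Render 3D Object: rate depends only on output size."""
    return _per_image(RENDER_OPS, longest_side_px) * images
```

For instance, two 2048-pixel composites from the Ultra model count as `composite_operations("image4_ultra", 2048, 2)`, or 8 operations, while an image exactly 2000 pixels on its longest side still falls in the lower tier.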
Firefly Creative Production
Action | Capability | Operation definition |
---|---|---|
Remove Background (flat image output) | Isolate the subject of interest in an image, remove the background, and output a flat image file | 1 Operation = 1 output file |
Crop | Crop image sizes or aspect ratios | 2 Operations = 1 output file |
Color Grade | Auto tone or apply Lightroom presets to batches of uploaded images | 1 Operation = 1 output file |