VM0VM0
Integration

OpenAI

Use OpenAI models for text, image, and audio generation

OpenAI provides AI models for text generation, image creation, and more. This skill enables your agent to use GPT, DALL-E, Whisper, and other OpenAI APIs.

Required Secrets

SecretDescription
OPENAI_API_KEYAPI key from OpenAI Platform

Configuration

vm0.yaml
version: "1.0"

agents:
  my-agent:
    provider: claude-code
    skills:
      - https://github.com/vm0-ai/vm0-skills/tree/main/openai
    environment:
      OPENAI_API_KEY: "${{ secrets.OPENAI_API_KEY }}"

Run

vm0 run my-agent "generate an image" \
  --secrets OPENAI_API_KEY=sk-xxx

Example Instructions

AGENTS.md
# Image Generation Agent

You use OpenAI DALL-E to create images based on descriptions.

## Workflow

1. Receive image description from user
2. Generate image using DALL-E
3. Save image to artifacts
4. Provide the image URL

## Guidelines

- Create detailed prompts for better results
- Specify style, mood, and composition
- Generate multiple variations when needed
AGENTS.md
# Audio Transcription Agent

You use OpenAI Whisper to transcribe audio files to text.

## Workflow

1. Receive audio file path
2. Transcribe using Whisper API
3. Format the transcript with timestamps
4. Save as markdown file

## Supported Formats

- MP3, MP4, WAV, M4A
- Max file size: 25MB