OpenAI

OpenAI provides AI models for text generation, image creation, and more. This skill enables your agent to use GPT, DALL-E, Whisper, and other OpenAI APIs.

Required Secrets

Secret	Description
`OPENAI_API_KEY`	API key from OpenAI Platform

Configuration

vm0.yaml

version: "1.0"

agents:
  my-agent:
    provider: claude-code
    skills:
      - https://github.com/vm0-ai/vm0-skills/tree/main/openai
    environment:
      OPENAI_API_KEY: "${{ secrets.OPENAI_API_KEY }}"

Run

vm0 run my-agent "generate an image" \
  --secrets OPENAI_API_KEY=sk-xxx

Example Instructions

AGENTS.md

# Image Generation Agent

You use OpenAI DALL-E to create images based on descriptions.

## Workflow

1. Receive image description from user
2. Generate image using DALL-E
3. Save image to artifacts
4. Provide the image URL

## Guidelines

- Create detailed prompts for better results
- Specify style, mood, and composition
- Generate multiple variations when needed

AGENTS.md

# Audio Transcription Agent

You use OpenAI Whisper to transcribe audio files to text.

## Workflow

1. Receive audio file path
2. Transcribe using Whisper API
3. Format the transcript with timestamps
4. Save as markdown file

## Supported Formats

- MP3, MP4, WAV, M4A
- Max file size: 25MB

OpenAI

Required Secrets

Configuration

Run

Example Instructions

On this page