Generate chat completions using a GPT model with image inputs.
This endpoint lets you generate chat completions whose inputs include images, served by a GPT model through the Image Understanding (ILM) service.
POST https://mavi-backend.memories.ai/serve/api/v2/iu/chat/completions

Image Understanding (ILM) endpoints use the /iu path prefix; Video Understanding (VLM) endpoints use /vu instead. Model names take the gpt: prefix in the model parameter (e.g., gpt:gpt-4o).
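A minimal sketch of building a request body for this endpoint. `build_payload` is a hypothetical helper, not part of any SDK, and the authentication header is not shown here; check your API key documentation for the required header.

```python
import json

# Endpoint from the documentation above.
API_URL = "https://mavi-backend.memories.ai/serve/api/v2/iu/chat/completions"

def build_payload(question, image_url, model="gpt:gpt-4o"):
    """Assemble a chat-completions body with one text part and one image part."""
    return {
        "model": model,  # note the required "gpt:" prefix
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": question},
                    # "url" may be an image URL or a base64-encoded image
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
        "max_tokens": 300,
    }

payload = build_payload("What is in this image?", "https://example.com/photo.jpg")
print(json.dumps(payload, indent=2))
```

Send the payload with any HTTP client, e.g. `requests.post(API_URL, json=payload, headers=...)`.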
| Model | Input Price | Output Price |
|---|---|---|
| gpt-5.2 | $1.75/1M tokens | $14/1M tokens |
| gpt-5.2-2025-12-11 | $1.75/1M tokens | $14/1M tokens |
| gpt-5.2-chat-latest | $1.75/1M tokens | $14/1M tokens |
| gpt-5.1 | $1.25/1M tokens | $10/1M tokens |
| gpt-5.1-2025-11-13 | $1.25/1M tokens | $10/1M tokens |
| gpt-5.1-chat-latest | $1.25/1M tokens | $10/1M tokens |
| gpt-5 | $1.25/1M tokens | $10/1M tokens |
| gpt-5-2025-08-07 | $1.25/1M tokens | $10/1M tokens |
| gpt-5-chat-latest | $1.25/1M tokens | $10/1M tokens |
| gpt-5-mini | $0.25/1M tokens | $2/1M tokens |
| gpt-5-mini-2025-08-07 | $0.25/1M tokens | $2/1M tokens |
| gpt-5-nano | $0.05/1M tokens | $0.4/1M tokens |
| gpt-5-nano-2025-08-07 | $0.05/1M tokens | $0.4/1M tokens |
| Model | Input Price | Output Price |
|---|---|---|
| o1 | $15/1M tokens | $60/1M tokens |
| o1-2024-12-17 | $15/1M tokens | $60/1M tokens |
| o3 | $2/1M tokens | $8/1M tokens |
| o3-2025-04-16 | $2/1M tokens | $8/1M tokens |
| o4-mini | $1.1/1M tokens | $4.4/1M tokens |
| o4-mini-2025-04-16 | $1.1/1M tokens | $4.4/1M tokens |
| Model | Input Price | Output Price |
|---|---|---|
| gpt-4.1 | $2/1M tokens | $8/1M tokens |
| gpt-4.1-2025-04-14 | $2/1M tokens | $8/1M tokens |
| gpt-4.1-mini | $0.4/1M tokens | $1.6/1M tokens |
| gpt-4.1-mini-2025-04-14 | $0.4/1M tokens | $1.6/1M tokens |
| gpt-4.1-nano | $0.1/1M tokens | $0.4/1M tokens |
| gpt-4.1-nano-2025-04-14 | $0.1/1M tokens | $0.4/1M tokens |
| Model | Input Price | Output Price |
|---|---|---|
| gpt-4o | $2.5/1M tokens | $10/1M tokens |
| gpt-4o-2024-08-06 | $2.5/1M tokens | $10/1M tokens |
| gpt-4o-2024-05-13 | $5/1M tokens | $15/1M tokens |
| chatgpt-4o-latest | $5/1M tokens | $15/1M tokens |
| gpt-4o-mini | $0.15/1M tokens | $0.60/1M tokens |
| gpt-4o-mini-2024-07-18 | $0.15/1M tokens | $0.60/1M tokens |
| Model | Input Price | Output Price |
|---|---|---|
| gpt-4-turbo | $10/1M tokens | $30/1M tokens |
| gpt-4-turbo-2024-04-09 | $10/1M tokens | $30/1M tokens |
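As a worked example of the prices above, per-request cost is each token count divided by one million, times the per-million price. The helper below is a sketch, not part of the API; prices are copied from the gpt-4o row.

```python
# Per-million-token prices (USD) from the gpt-4o row of the pricing table.
GPT_4O_INPUT = 2.5
GPT_4O_OUTPUT = 10.0

def estimate_cost(prompt_tokens, completion_tokens,
                  input_price=GPT_4O_INPUT, output_price=GPT_4O_OUTPUT):
    # cost = tokens / 1,000,000 * price-per-million-tokens
    return (prompt_tokens / 1_000_000) * input_price + \
           (completion_tokens / 1_000_000) * output_price

# 1,000 prompt tokens and 500 completion tokens with gpt-4o:
print(f"${estimate_cost(1000, 500):.4f}")  # $0.0075
```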
Specify the model with the gpt: prefix: `"model": "gpt:gpt-4o"`

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| model | string | Yes | - | The model to use (e.g., gpt:gpt-4o) |
| messages | array | Yes | - | Array of message objects. Each message has `role` (one of `system`, `user`, `assistant`) and `content` (a string, or an array of parts, where each part has `type` (`text` or `image_url`) plus `text` for text parts, or `image_url` (an object whose `url` is an image URL or a base64-encoded image) for image parts) |
| response_format | object | No | - | Force JSON output format. Contains type field with value json_object |
| temperature | number | No | 0 | Controls randomness: 0.0-2.0, 0 = deterministic |
| max_tokens | integer | No | 1000 | Maximum number of tokens to generate |
| top_p | number | No | 1.0 | Nucleus sampling: 0.0-1.0, consider tokens with top_p probability mass |
| frequency_penalty | number | No | 0.0 | Reduces repetition of frequent tokens: -2.0 to 2.0 |
| presence_penalty | number | No | 0.0 | Increases likelihood of new topics: -2.0 to 2.0 |
| n | integer | No | 1 | Number of completions to generate |
| stream | boolean | No | false | Whether to stream the response |
| stop | string \| array \| null | No | null | Stop sequences. Can be a string, array of strings, or null |
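A request body exercising the optional parameters above might look like the following sketch; the field values are illustrative, not recommendations.

```python
# Illustrative request body: forced JSON output, deterministic sampling,
# and a stop sequence, alongside a text + image user message.
payload = {
    "model": "gpt:gpt-4o",
    "messages": [
        {"role": "system", "content": "Reply in JSON."},
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image as JSON."},
                {"type": "image_url",
                 "image_url": {"url": "https://example.com/img.png"}},
            ],
        },
    ],
    "response_format": {"type": "json_object"},  # force JSON output
    "temperature": 0,      # 0 = deterministic
    "max_tokens": 500,
    "stop": ["END"],       # string, array of strings, or null
}
```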
| Parameter | Type | Description |
|---|---|---|
| id | string | Unique identifier for the chat completion |
| object | string | Object type, always "chat.completion" |
| created | integer | Unix timestamp of when the completion was created |
| model | string | The model used for the completion |
| choices | array | Array of completion choices |
| choices[].index | integer | Index of the choice in the choices array |
| choices[].message | object | Message object containing the assistant’s response |
| choices[].message.role | string | Role of the message, always "assistant" |
| choices[].message.content | string | Content of the message |
| choices[].message.refusal | string \| null | Refusal message if the request was refused |
| choices[].message.annotations | array | Annotations for the message |
| choices[].logprobs | object \| null | Log probability information |
| choices[].finish_reason | string | Reason why the completion finished |
| usage | object | Token usage information |
| usage.prompt_tokens | integer | Number of tokens in the prompt |
| usage.completion_tokens | integer | Number of tokens in the completion |
| usage.total_tokens | integer | Total number of tokens used |
| usage.prompt_tokens_details | object | Detailed prompt token information |
| usage.prompt_tokens_details.cached_tokens | integer | Number of cached tokens |
| usage.prompt_tokens_details.audio_tokens | integer | Number of audio tokens |
| usage.completion_tokens_details | object | Detailed completion token information |
| usage.completion_tokens_details.reasoning_tokens | integer | Number of reasoning tokens |
| usage.completion_tokens_details.audio_tokens | integer | Number of audio tokens |
| usage.completion_tokens_details.accepted_prediction_tokens | integer | Number of accepted prediction tokens |
| usage.completion_tokens_details.rejected_prediction_tokens | integer | Number of rejected prediction tokens |
| service_tier | string | The service tier used for the request |
| system_fingerprint | string | System fingerprint for the model version |
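A minimal sketch of reading a response: pull the assistant's text from `choices[0].message.content` and the token counts from `usage`. The `choices` and `usage` values below are illustrative, not from a real API call.

```python
# Sample response shaped like the schema above; choices/usage values
# are made up for illustration.
sample = {
    "id": "chatcmpl-CsRhYgDaLSNjl80v5uYBufEDbJqAM",
    "object": "chat.completion",
    "created": 1767092232,
    "model": "gpt-4o-2024-08-06",
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant",
                        "content": "A cat on a sofa.",
                        "refusal": None},
            "logprobs": None,
            "finish_reason": "stop",
        }
    ],
    "usage": {"prompt_tokens": 900, "completion_tokens": 12, "total_tokens": 912},
    "service_tier": "default",
    "system_fingerprint": "fp_deacdd5f6f",
}

answer = sample["choices"][0]["message"]["content"]
total_tokens = sample["usage"]["total_tokens"]
print(answer)        # A cat on a sofa.
print(total_tokens)  # 912
```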
Example response values (JSON format): `id` "chatcmpl-CsRhYgDaLSNjl80v5uYBufEDbJqAM", `object` "chat.completion", `created` 1767092232, `model` "gpt-4o-2024-08-06", `service_tier` "default", `system_fingerprint` "fp_deacdd5f6f".