Getting Started
The Vector Inference Platform is available to all Vector Institute community members. For an up-to-date list of available models and their specifications, visit inference.vectorinstitute.ai.
Getting an API Key
Access is managed via invite links. To get started:
- Request access via the Slack channel #vector-inference-platform. The AI Engineering team will create your account and send you an invite email.
- Click the link in the invite email. It is valid for 48 hours and takes you to the Vector Proxy dashboard.
- Your API key is generated and displayed only once. Copy and save it immediately, because it will not be shown again.
API keys have the format vp_xxxxxxxx.yyyyyy....
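Because the key is shown only once, it can help to keep it out of source files and load it from the environment. The following is a minimal sketch, not something the platform prescribes: the environment variable name VECTOR_API_KEY and the helper load_api_key are hypothetical, and the check covers only the vp_ prefix and the dot separator visible in the format above.

```python
import os


def load_api_key(env_var: str = "VECTOR_API_KEY") -> str:
    """Read the API key from an environment variable and sanity-check its shape.

    VECTOR_API_KEY is a hypothetical name chosen for this example.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set {env_var} to your API key first.")
    # The documented format is vp_xxxxxxxx.yyyyyy..., so check only the
    # "vp_" prefix and that something non-empty sits on both sides of a dot.
    prefix, dot, secret = key.partition(".")
    if not (prefix.startswith("vp_") and len(prefix) > 3 and dot and secret):
        raise ValueError(f"{env_var} does not look like a vp_... key.")
    return key
```

The returned value can then be passed as api_key when constructing a client.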
Prerequisites
Install the OpenAI Python client:
pip install openai
Usage
The platform exposes an OpenAI-compatible API at https://proxy.vectorinstitute.ai/v1. Use it as a drop-in replacement for any OpenAI client by changing the base_url and model parameters.
from openai import OpenAI

client = OpenAI(
    base_url="https://proxy.vectorinstitute.ai/v1",
    api_key="vp_xxxxxxxx.yyyyyy...",
)

stream = client.chat.completions.create(
    model="<model-id>",  # see inference.vectorinstitute.ai for available models
    messages=[{"role": "user", "content": "Explain attention mechanisms in transformers."}],
    stream=True,
)

for chunk in stream:
    if chunk.choices:
        print(chunk.choices[0].delta.content or "", end="", flush=True)

You can also use curl:
curl https://proxy.vectorinstitute.ai/v1/chat/completions \
  -H "Authorization: Bearer vp_xxxxxxxx.yyyyyy..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "<model-id>",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Listing Available Models
Retrieve the current list of enabled models programmatically:
curl https://proxy.vectorinstitute.ai/v1/models \
  -H "Authorization: Bearer vp_xxxxxxxx.yyyyyy..."
Or visit inference.vectorinstitute.ai for a visual overview.
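The same request can be made from Python. The sketch below uses only the standard library and assumes that the /v1/models response follows the OpenAI list shape (a JSON object with a data array whose entries carry an id). The helper names build_models_request, extract_model_ids, and list_model_ids are made up for this example.

```python
import json
import urllib.request

BASE_URL = "https://proxy.vectorinstitute.ai/v1"


def build_models_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for the models endpoint."""
    return urllib.request.Request(
        f"{BASE_URL}/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def extract_model_ids(payload: dict) -> list[str]:
    """Pull model ids out of a response body.

    Assumes the OpenAI-style shape {"data": [{"id": "..."}, ...]}.
    """
    return [entry["id"] for entry in payload.get("data", []) if "id" in entry]


def list_model_ids(api_key: str, timeout: float = 30.0) -> list[str]:
    """Send the request and return the ids of the enabled models."""
    request = build_models_request(api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return extract_model_ids(json.load(response))
```

If the response shape assumption holds, calling list_model_ids with your key should return the same model ids that the curl command prints.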