Getting Started
The Vector Inference Platform is available to all usersVector Institute community members. For an up-to-date list of Vector'savailable models and their specifications, visit Bon Echo clusterinference.vectorinstitute.ai. At
Prerequisites
Install the timeOpenAI ofPython thisclient:
pip document,install this includes the AI Engineering team, the Industry and a select group of researchers. (We will be opening up the environment to all Vector researchers later in 2026.)
There are currently 2 models available to use:
openaiUsage Instructions
The Inferenceplatform Platform is available viaexposes an OpenAI-compatible endAPI managedat https://proxy.vectorinstitute.ai/v1. You can use it as a drop-in replacement for any OpenAI client by changing the AIbase_url Engineeringand team.model Start by creatingparameters.
Create a new Python script called qwen3-omni-30b-test.py:
from openai import OpenAI
client = OpenAI(
base_url="https://proxy.vectorinstitute.ai/v1",
timeout=300.0,
api_key="<paste-your-api-key-herekey> (see below about getting api keys)"
)
responsestream = client.chat.completions.create(
model="hosted_vllm/Qwen3-Omni-30B-A3B-Instruct"<model-id>", # see inference.vectorinstitute.ai for available models
messages=[{"role": "user", "content": "WhoExplain isattention better,mechanisms Messiin transformers."}],
stream=True,
)
for chunk in stream:
if chunk.choices:
print(chunk.choices[0].delta.content or Ronaldo?"", end="", flush=True)You can also use curl:
curl https://proxy.vectorinstitute.ai/v1/chat/completions \
-H "Authorization: Bearer <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"model": "<model-id>",
"max_tokens"messages": 128}[{"role": "user", "content": "Hello!"}]
)
print(response)
}'Getting an API Key
Now run the script:
python3 ./qwen3-omni-30b-test.py
You should expect output like the following:
ChatCompletion(id='chatcmpl-619eb6f0-035d-4999-8001-b8b9fb5d4fae', choices=[Choice(finish_reason='stop', index=0, logprobs=None, message=ChatCompletionMessage(content='Whether Lionel Messi or Cristiano Ronaldo is "better" depends on personal preferences, as both are extraordinary footballers with remarkable accomplishments. Here’s a balanced comparison to help understand why each player has their own strong supporters:\n\n🏆 **Goals and Scores**:\n- **Messi**: Over 850 career goals (all competitions). Known for scoring in every position on the pitch.\n- **Ronaldo**: Over 875 career goals. Excels as a prolific striker with great heading and finish.\n\n📊 **Achievements**:\n- **Messi**: \n - 8 Ballon d’Ors (record) \n - 4 Champions League titles\n - 10 Serie A titles (with Barcelona & PSG)\n - Copa America 2021, FIFA World Cup 2022\n- **Ronaldo**: \n - 5 Ballon d’Ors (record until Messi’s 2019 surge, then surpassed)\n - 5 Champions League titles\n - 5 Premier League titles, 3 La Liga titles, 3 Serie A titles\n - Euro 2016, FIFA World Cup 2014 (not champions)\n\n🔄 **Style of Play**:\n- **Messi**: Renowned for dribbling, vision, creativity, and playmaking. A classic number 10.\n- **Ronaldo**: A complete striker with physical power, aerial ability, goalscoring instinct, and stamina.\n\n🏅 **Legacy and Versatility**:\n- Messi is often considered the most complete talent ever — praised for his ability to dominate games with minimal touches.\n- Ronaldo is admired for his relentless work ethic, consistency, longevity, and ability to perform on the biggest stages.\n\n🎯 **Personal Preference**:\n- You might love Messi if you value artistry, tactics, and intricate passing.\n- You might prefer Ronaldo if you admire goal output, physicality, leadership, and big-game mentality.\n\n📣 Conclusion:\nThere is no definitive answer — both are legends of the game. Messi may have the edge in pure footballing skill and impact in top competitions, but Ronaldo’s goal-scoring consistency and ability to adapt across leagues show what\'s possible through hard work.', refusal=None, role='assistant', annotations=None, audio=None, function_call=None, tool_calls=None))], created=1769622523, model='Qwen3-Omni-30B-A3B-Instruct', object='chat.completion', service_tier=None, system_fingerprint=None, usage=CompletionUsage(completion_tokens=497, prompt_tokens=16, total_tokens=513, completion_tokens_details=None, prompt_tokens_details=None))
User Access Management
All user access and API keys are controlledmanaged by the Infrastructure group within the AI Engineering Team.team. PleaseTo directrequest anyaccess, requestsreach toout ourvia the Slack channel at #vector-inference-platform.
Listing APIAvailable KeysModels
You can retrieve the current list of enabled models programmatically via the API:
curl https://proxy.vectorinstitute.ai/v1/models \
-H "Authorization: Bearer <your-api-key>"We'llOr usesimply CyberArkvisit to distribute API Keys. Project leads are responsibleinference.vectorinstitute.ai for sharing these keys with their teams using a "Securedvisual Note" item.overview.