Semifly
Semifly · LLMs

Download open-weight models

Grab the weights from each model’s official Hugging Face page, then run them on your own GPUs or on Semifly in one click.

DeepSeek V4 Pro

DeepSeek · MIT · ~1.6T (MoE)

GLM-5.1

Zhipu AI · MIT · MoE

Kimi K2.7-Code

Moonshot AI · Mod. MIT · MoE

Qwen3.5 (397B-A17B)

Alibaba · Apache 2.0 · 397B (17B active)

Mistral Small 4

Mistral AI · Apache 2.0 · 24B

Licenses as reported June 2026 — always confirm on the official page before deployment.

Run any of these on Semifly

Tokens & API

Access hosted models through a simple, metered token API.

Get API access →

GPU servers

Buy or lease Supermicro GPU systems to self-host open-weight models.

Browse GPU servers →

AI Foundry

Managed compute for training, fine-tuning, and inference.

Explore AI Foundry →