Semifly
Open-weight · Google

Gemma 4 (31B)

Google's open Gemma 4 (31B) — a capable dense model with a 256K context window, easy to fine-tune and deploy on a single high-memory GPU.

Developer
Google
Parameters
31B
Context
256K
License
Gemma
Run on Semifly →Download on Hugging Face

About Gemma 4 (31B)

Google's open Gemma 4 (31B) — a capable dense model with a 256K context window, easy to fine-tune and deploy on a single high-memory GPU.

Parameters, context, and license as reported June 2026. Confirm the current license on the official Hugging Face page before deployment.

← Back to all models

Run any of these on Semifly

Tokens & API

Access hosted models through a simple, metered token API.

Get API access →

GPU servers

Buy or lease Supermicro GPU systems to self-host open-weight models.

Browse GPU servers →

AI Foundry

Managed compute for training, fine-tuning, and inference.

Explore AI Foundry →