Gemma 3n E4b It
Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices such as phones, laptops, and tablets. It accepts multimodal input (text, visual data, and audio) and handles tasks such as text generation, speech recognition, translation, and image analysis. Using Per-Layer Embedding (PLE) caching and the MatFormer architecture, Gemma 3n activates and loads model parameters selectively according to the task and the device's capabilities, which significantly reduces memory use and compute at runtime. The model was trained on over 140 languages and has a 32K-token context window, making it well suited to privacy-focused, offline-capable, on-device applications. [Read more in the blog post](https://developers.googleblog.com/en/introducing-gemma-3n/)
Pricing
Quick start
To use Gemma 3n E4b It through the BazaarLink API, you only need to change base_url:
# Python
from openai import OpenAI

client = OpenAI(
    base_url="https://bazaarlink.ai/api/v1",
    api_key="sk-bl-YOUR_API_KEY",
)
response = client.chat.completions.create(
    model="google/gemma-3n-e4b-it",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

// JavaScript
import OpenAI from "openai";
const client = new OpenAI({
  baseURL: "https://bazaarlink.ai/api/v1",
  apiKey: "sk-bl-YOUR_API_KEY",
});
const response = await client.chat.completions.create({
  model: "google/gemma-3n-e4b-it",
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(response.choices[0].message.content);

Why use Gemma 3n E4b It through BazaarLink?
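- ✓ USD billing (quoted in TWD) plus Taiwan uniform invoices (統一發票): Taiwan-based teams do not need to pay with an overseas card
- ✓ OpenAI-compatible API: no need to rewrite your code
- ✓ Automatic failover: multiple providers back up the same model
- ✓ Chinese-language customer support: real-time help from a local team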
Frequently Asked Questions
What is Gemma 3n E4b It?
Gemma 3n E4B-it is a multimodal model optimized for efficient execution on mobile and low-resource devices such as phones, laptops, and tablets. It accepts text, visual data, and audio, and handles tasks such as text generation, speech recognition, translation, and image analysis. Per-Layer Embedding (PLE) caching and the MatFormer architecture let it activate and load parameters selectively, which reduces memory use and compute at runtime. It was trained on over 140 languages and has a 32K-token context window. [Read more in the blog post](https://developers.googleblog.com/en/introducing-gemma-3n/)
How much does the Gemma 3n E4b It API cost?
Gemma 3n E4b It costs $0.0200 per 1K input tokens and $0.0400 per 1K output tokens when accessed through BazaarLink.
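As a rough illustration of the arithmetic, the sketch below estimates the cost of a single request from the two per-1K prices quoted above. The function name and the example token counts are hypothetical, and actual billing may round or aggregate differently.

```python
# Prices quoted above, in USD per 1,000 tokens.
INPUT_USD_PER_1K = 0.02
OUTPUT_USD_PER_1K = 0.04


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    return (
        input_tokens / 1000 * INPUT_USD_PER_1K
        + output_tokens / 1000 * OUTPUT_USD_PER_1K
    )


# Hypothetical request: 1,500 prompt tokens and 500 completion tokens.
print(f"${estimate_cost_usd(1500, 500):.4f}")  # prints $0.0500
```

With the OpenAI Python SDK, the token counts for a non-streaming response are normally available as response.usage.prompt_tokens and response.usage.completion_tokens, assuming the provider returns the usage field.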
How do I use Gemma 3n E4b It with the OpenAI SDK?
Set base_url to "https://bazaarlink.ai/api/v1" and use the model ID "google/gemma-3n-e4b-it". All OpenAI SDK methods (chat.completions, embeddings, streaming) then work without further code changes.
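For streaming in particular, a minimal sketch might look like the following. The helper name collect_stream is hypothetical. It only assumes that each streamed chunk follows the OpenAI SDK shape (chunk.choices[0].delta.content), and the commented usage assumes the client object from the quick start.

```python
from typing import Any, Iterable


def collect_stream(chunks: Iterable[Any]) -> str:
    """Print streamed text as it arrives and return the full reply."""
    parts = []
    for chunk in chunks:
        # Some chunks carry no choices or an empty delta (for example the last one).
        if not chunk.choices:
            continue
        delta = chunk.choices[0].delta.content
        if delta:
            parts.append(delta)
            print(delta, end="", flush=True)
    print()
    return "".join(parts)


# Usage with the quick-start client (needs network access and a valid key):
# stream = client.chat.completions.create(
#     model="google/gemma-3n-e4b-it",
#     messages=[{"role": "user", "content": "Hello!"}],
#     stream=True,
# )
# text = collect_stream(stream)
```

Keeping the chunk handling in its own function makes it easy to exercise without calling the API.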
What is the context window for Gemma 3n E4b It?
Gemma 3n E4b It supports a context window of 32,768 tokens.
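One practical consequence is that the prompt and the reply have to fit in that window together. The sketch below computes how many tokens remain for the reply, under the assumption (not confirmed on this page) that input and output tokens share the same 32,768-token budget. The function name is hypothetical, and the prompt token count must come from the model's own tokenizer or from a previous response's usage data.

```python
CONTEXT_WINDOW = 32_768  # tokens, as stated above


def max_output_tokens(prompt_tokens: int, window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for the reply (0 if the prompt fills the window)."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(window - prompt_tokens, 0)


# Hypothetical prompt of 30,000 tokens.
print(max_output_tokens(30_000))  # prints 2768
```

The result can be used as an upper bound for the max_tokens parameter of a chat completion request.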
Is Gemma 3n E4b It available for free?
Gemma 3n E4b It is a paid model. BazaarLink offers free trial credits on registration so you can test it without a credit card.