Mein Newsfeed — Gemma 4

Mein Newsfeed — Gemma 4 https://newsfeed.avintaris.com News zum Thema Gemma 4 de Fri, 22 May 2026 21:22:05 +0000 Gemma 4: Gemma 4 in Android AICore Developer Preview verfügbar https://android-developers.googleblog.com/2026/04/AI-Core-Developer-Preview.html https://android-developers.googleblog.com/2026/04/AI-Core-Developer-Preview.html Wed, 15 Apr 2026 12:00:00 +0000 Gemma 4 Google integriert Gemma 4 (E2B/E4B) in Android System AICore Service — Apps können geräteübergreifend offline Inferenz nutzen. Vision und Audio (E2B/E4B) nativ supported. Erste Pixel-Geräte als Developer Preview, Rollout auf weitere OEMs H2/2026. Function-Calling und JSON-Output direkt im AICore-API exponiert — relevant für agentische On-Device-Workflows. Vergleich zu Gemma 3n: doppelter Context, +15 Punkte MMLU Pro für E4B. Gemma 4: Day-0 Support in llama.cpp, MLX, LM Studio und Ollama https://huggingface.co/blog/gemma4 https://huggingface.co/blog/gemma4 Fri, 03 Apr 2026 12:00:00 +0000 Gemma 4 Direkt am Folgetag: GGUF-Quants (Q4_K_M und alle Präzisionen). llama.cpp-Server mit OpenAI-kompatibler API ready, MLX mit voller Multimodal-Unterstützung inkl. TurboQuant (~4× weniger aktiver Memory). mistral.rs unterstützt alle Modalitäten + Tool-Calling. Speculative-Decoding-Drafter für alle vier Größen mit bis zu ~3× End-to-End-Speedup. Hardware: E2B ~10GB GPU, E4B ~16GB, 26B A4B nur ~8GB aktiv (MoE-Vorteil), 31B ~62GB GPU oder 96GB+ CPU-RAM. Gemma 4: Gemma 4 offiziell veröffentlicht — Apache 2.0 Lizenz und vier Modellgrößen https://blog.google/innovation-and-ai/technology/developers-tools/gemma-4/ https://blog.google/innovation-and-ai/technology/developers-tools/gemma-4/ Thu, 02 Apr 2026 12:00:00 +0000 Gemma 4 Vier Varianten: E2B (2.3B effective / 5.1B total), E4B (4.5B effective / 8B total), 26B A4B (MoE, 4B aktiv) und 31B Dense. Alle multimodal Text/Bild/Video, E2B/E4B mit Audio. Context: 128K für Edge, 256K für 26B/31B. 140+ Sprachen. Architektur: alternierende Attention (sliding 512-1024 + global), Dual RoPE, Per-Layer Embeddings (PLE), Shared KV Cache, USM-style Conformer Audio Encoder. Benchmarks (31B IT): MMLU Pro 85.2%, AIME 2026 89.2%, GPQA Diamond 84.3%, LiveCodeBench v6 80.0%, Codeforces ELO 2150, MMMU Pro 76.9%. 26B A4B: MMLU Pro 82.6%, AIME 88.3%. **LIZENZWECHSEL:** weg von 'Gemma Terms of Use', hin zu Apache 2.0.