Google’s new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM


Gemma 4 benchmark graph

Gemma 4 12B is almost as capable as the version with 26 billion parameters.

Credit:
Google

Gemma 4 12B is almost as capable as the version with 26 billion parameters.


Credit:

Google

Google says the new model is capable of complex multistep reasoning and agentic workflows that previously required the larger Gemma variants. Despite the smaller parameter count, Gemma 4 12B comes with the newly devised Multi-Token Prediction (MTP) drafters, which take advantage of unused processing cycles to calculate possible future tokens. The result is greater speed and efficiency. Google has released optional MTP versions of the other Gemma 4 models, but this is the first one to have MTP out of the box.

Gemma 4 12B is also more efficient thanks to a new approach to multimodality. The Gemma 4 family is natively multimodal, accepting text, audio, or images as inputs. Most gen AI models—including the other Gemma 4 variants—use dedicated encoders to process non-text inputs and pass that data to the LLM. This works well enough, but it increases latency and memory usage.

With the new mid-weight model, Google has implemented a streamlined embedding module for vision, featuring single-matrix multiplication and positional embedding, which allows the data to pass to the LLM with proper spatial awareness. This eliminates the need for a bulky middleman encoder. For audio, there’s no encoding at all. The developers worked out a method of projecting the raw audio signal into the same vectors used for text tokens.

If you want to check out the new Gemma 4 model, it’s accessible without a download via tools like LM Studio, Google AI Edge Gallery, and more. But the whole idea with Gemma 4 12B is that you can run it locally and on your own terms. If you’ve got the RAM, the model weights are available for download immediately on Kaggle and Hugging Face. It’s just shy of 18GB.



Source link

  • Related Posts

    Alphabet’s record-breaking $85B raise for Google’s AI business is a helluva good signal

    If Alphabet’s record-breaking $85 billion stock sale signals investor appetite for AI-related offerings — and it does — we can safely say that investors are voracious. Google’s parent company had…

    A British MP Is Suing To See If xAI Is Legally Responsible For The Images Grok Produces

    UK Labour MP Jess Asato is suing xAI over sexually explicit AI-generated images that were created of her by Grok, The Financial Times reports. The lawsuit is the first high-profile…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Israel, Lebanon renew ceasefire as Trump admits calling Netanyahu ‘crazy’ – National

    Israel, Lebanon renew ceasefire as Trump admits calling Netanyahu ‘crazy’ – National

    BBC Sport weekly quiz: When did Williams last play a professional tennis match?

    BBC Sport weekly quiz: When did Williams last play a professional tennis match?

    Alphabet’s record-breaking $85B raise for Google’s AI business is a helluva good signal

    Alphabet’s record-breaking $85B raise for Google’s AI business is a helluva good signal

    Indie Selects for June 2026: Absolutely Stacked Indie Games

    Indie Selects for June 2026: Absolutely Stacked Indie Games

    Ontario civil servants to see ‘flexibility’ on work options during World Cup: gov’t

    Ontario civil servants to see ‘flexibility’ on work options during World Cup: gov’t

    ARKAY Beverages Launches New Brand Message: “We Don’t Sell Alcohol-Free Spirits, We Sell Happiness”