Where my Gemma 4 gets this data? Trying to explain weird behaviour. Please help!

Posted by OwnTwist3325@reddit | LocalLLaMA | View on Reddit | 8 comments

So I was playing with Gemma 4 and was trying to figure out whether the model could determine its own training data cutoff period. Got some really interesting results but that is not the main point of this post, just context :-)

Turned out that with empty system message, the model thinks that its cutoff date is early 2024. If I will ask it to re-estimate based on latest events it can recall, it can actually find Jan 2025 as a cutoff. If I will ask it to quote the system message, it gets protective and refuses to show it.

Then I added "You are Gemma 4" in the system prompt. Suddenly, it could confidently state its cutoff date - Jan 2025. When asked where that comes from, states that it comes from system prompt. And it can quote it. A lot of it - on the screenshots. Response is stable, with no changes between differently worded requests and different sessions. So, not a hallucination (?). My issue is - I do not know where that comes from! Clearly not from the system prompt I provided. I tried "You are Gemma" - model did not go protective and quoted just that.

Also, with just "You are Gemma 4" in the system message, model felt... very different. Way more confident and... smarter.

I am running it as a single-file gguf model in LMStudio. There should not be any extra weird conditional configurations embeddable, right? What am I missing?