I mean, my 5-year-old struggles with having more responses to authority than "obedience" and "shouting and throwing things rebellion". Pushing back constructively is actually quite a complicated skill.
In this context, using Gemini to cheat on homework is clearly wrong. It's not obvious at first what's going on, but it becomes clearer as the conversation goes along, by which point Gemini is sort of pressured by "continue the conversation" to keep doing it. Not to mention, the person cheating isn't being very polite; AND, a person cheating on an exam about elder abuse seems much more likely to go on to abuse elders, at which point Gemini is actively helping bring that situation about.
If Gemini doesn't have any models in its RLHF training for how to politely decline a task -- particularly after it's already started helping -- then I can see "pressure" building up until it simply breaks, at which point it just falls into the "misaligned" sphere because it doesn't have any other models for how to respond.