I use the 4B on my phone and it seems to work fine without tool calls. So it's definitely an issue with that and not the model itself. I'll play around and see if I can fix that, you might also try using the Searxng MCP as it's a better web search engine one.
I tried most prompts that didn't rely on recent knowledge on the basic "AI Chat", not the "Agent skills" version.
I just tested "List the 5 most recent Argentina vice presidents" on E4B and it literally got all 5 wrong