Best small model right now (~4B params) that is good with agentic tasks for personal assistant?

Posted by BitGreen1270@reddit | LocalLLaMA | View on Reddit | 49 comments

Looking for suggestions. I have been experimenting with gemma-4-E2B and gemma-4-E4B but the tool calling has been not the best? My tasks are just things like:

etc.

Any suggestions? If it helps, here are my server params:

./llama-server \
    --host 0.0.0.0 \
    --port 8080 \
    -m ~/myp/models/google_gemma-4-E4B-it-Q8_0.gguf \
    --temp 1.0 \
    --top_p 0.95 \
    --top_k 64 \
    -c 65536 \
    --flash-attn on \
    -t 16 \
    --ctx-checkpoints 4 \
    --cache-ram 16384 \
    --chat-template-file /home/lenny/myp/models/jinja/gemma4-improved.jinja \
    -ngl 99