LLM for name/gender classification

Posted by trosler@reddit | LocalLLaMA

Hey there,
I have a task where I have a huge list of names (e.g. John Smith), and I want to use an LLM to assign a gender to each name (m/f/ambiguous). I have read some research papers that recommended mistral-nemo for this task, yet in my personal tests the results were mixed. When I run the model on identical data, the results vary a lot between runs, sometimes even for very clear names (e.g. John Smith). I hand the LLM the prompt together with a short list of names (say, 10 at a time).
- Can you recommend a local LLM for this task?
- Is this "batch" approach fine?
Thanks for ideas and input.
PS: for the "easy" names I used another Python library, so only the truly difficult names remain in the actual dataset.
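
For illustration, the "batch" approach described above could be sketched roughly like this. This is only a minimal sketch, not the actual code behind the post: the helper names (`batches`, `build_prompt`, `parse_reply`), the prompt wording, and the `<number>: <label>` reply format are all assumptions, and the model call itself is left out because it depends on the runtime used.

```python
from typing import Iterator, List, Optional

# Allowed labels, taken from the m/f/ambiguous scheme in the post.
LABELS = {"m", "f", "ambiguous"}


def batches(names: List[str], size: int = 10) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` names."""
    for start in range(0, len(names), size):
        yield names[start:start + size]


def build_prompt(batch: List[str]) -> str:
    """Number the names so each answer can be matched back to its name."""
    numbered = "\n".join(f"{i}. {name}" for i, name in enumerate(batch, 1))
    return (
        "Assign a gender to each name. Answer with one line per name in the "
        "form '<number>: <m|f|ambiguous>' and nothing else.\n\n" + numbered
    )


def parse_reply(reply: str, batch: List[str]) -> List[Optional[str]]:
    """Return one label per name in batch order; None marks a missing or invalid answer."""
    labels: List[Optional[str]] = [None] * len(batch)
    for line in reply.splitlines():
        number, sep, label = line.partition(":")
        number = number.strip().rstrip(".")
        label = label.strip().lower()
        if not sep or not number.isdecimal():
            continue  # not an answer line
        index = int(number) - 1
        if 0 <= index < len(batch) and label in LABELS:
            labels[index] = label
    return labels


# Hypothetical example: the reply string stands in for whatever the model returns.
batch = ["John Smith", "Alex Kim", "Maria Lopez"]
reply = "1: m\n2: Ambiguous\n3: x"
print(build_prompt(batch))
print(parse_reply(reply, batch))
```

Numbering the names and validating every reply line makes it possible to notice when the model skips a name or returns something outside m/f/ambiguous. Names that come back as `None` could be re-sent in a later batch instead of being silently mislabeled.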