How to make an LLM generate realistic company name variations? (LLaMA 3.2)

Posted by Neural_Nodes@reddit | LocalLLaMA

Hey all,

I’m building a blacklisted-company detection system in which an LLM (LLaMA 3.2 via Ollama) generates company name variations (misspellings, abbreviations, formatting differences).

Problem:

The LLM generates unrealistic or unrelated variations instead of ones that actually occur in the real world. I need only high-quality, meaningful variations.

Example:

Input: “Infosys Limited”

Expected: “Infosys Ltd”, “Infosys”, “Infosys Pvt Ltd”

But the LLM sometimes generates irrelevant names.

Looking for:

* How can I constrain the LLM to generate only valid real-world variations?

* Are there better prompt strategies or structured output formats?

* Should I combine the LLM with rule-based constraints?
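
To make the rule-based idea concrete, here is a minimal sketch of a post-filter that could sit after the LLM. It is an illustration only, not a tested solution: the names `LEGAL_SUFFIXES`, `core_name`, `is_plausible_variation`, and `filter_variations` are hypothetical, the suffix list is an assumed starting point, and the 0.85 similarity threshold is an arbitrary guess that would need tuning on real data. It strips legal-form tokens, then keeps a candidate only if its remaining core is close to the core of the input name.

```python
import re
from difflib import SequenceMatcher

# Assumed starter list of legal-form tokens; extend for your jurisdictions.
LEGAL_SUFFIXES = {
    "limited", "ltd", "pvt", "private", "inc", "incorporated",
    "llc", "corp", "corporation", "co", "company", "plc",
}


def core_name(name: str) -> str:
    """Lowercase, drop punctuation, and strip trailing legal-form tokens."""
    tokens = re.sub(r"[^a-z0-9\s]", " ", name.lower()).split()
    while tokens and tokens[-1] in LEGAL_SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def is_plausible_variation(original: str, candidate: str,
                           threshold: float = 0.85) -> bool:
    """True if the candidate's core name is close to the original's core name."""
    base, cand = core_name(original), core_name(candidate)
    if not base or not cand:
        return False
    # Character-level similarity in [0, 1]; tolerates small misspellings.
    return SequenceMatcher(None, base, cand).ratio() >= threshold


def filter_variations(original: str, candidates: list[str],
                      threshold: float = 0.85) -> list[str]:
    """Keep plausible candidates, dropping case-insensitive duplicates."""
    seen = set()
    kept = []
    for candidate in candidates:
        cleaned = candidate.strip()
        key = cleaned.lower()
        if key in seen:
            continue
        seen.add(key)
        if is_plausible_variation(original, cleaned, threshold):
            kept.append(cleaned)
    return kept


llm_output = ["Infosys Ltd", "Infosys", "Infosys Pvt Ltd", "Wipro Ltd"]
print(filter_variations("Infosys Limited", llm_output))
```

With the example above, the unrelated "Wipro Ltd" would be dropped while the three Infosys forms pass. One known gap in this sketch: pure abbreviations (initials of a multi-word name) would fail the similarity check, so those would need a separate rule.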

The goal is to improve precision in name matching.

Any suggestions would help 🙌