Model for keywording technical notes

Posted by olddoglearnsnewtrick@reddit | LocalLLaMA | View on Reddit | 4 comments

I've assembled a few thousand notes (stored in markdown text files) in the past years with "how to" on diverse subjects such as Linux/Mac managments, scripts, python, docker recipes, mongodb aggregation queries etc etc etc. Today I am searching through them with simple string matching (Craft app) and also I'd like to escape the flat dead simple subject classification I have now. Some of the notes I have under Debian are totally useful also in my MacOS folder etc etc. So I was thinking about using a LLM to ask and generate additional keywords to help me retrieve my recipes also with alternative searches. Is this a sensible idea? What approach do you suggest? Reading a given document say about installing a product, I would like to generate keywords such as linux, debian, macos, bash etc The notes are mostly in english, although my mothertongue sometimes slipped in (Italian) and what's most precious are the "codeblocks" which describe the actual code/steps I need to perform.