I created a tool to generate large synthetic datasets with visual feedback (with walk-through)

Posted by davernow@reddit | LocalLLaMA

Hi everyone,

I’ve been working on Kiln AI, and I just added some pretty cool synthetic data generation tools:

Walk-through

Features

What's Next

As you can probably guess, fine-tuning is coming next 😀. The goal is to make it super easy and fast to start from scratch, generate a large synthetic dataset, and evaluate a variety of methods (fine-tunes, different models, prompting tactics, etc.).

How to get started:

I’d love any feedback, ideas or suggestions! Feel free to file issues or DM me.