Iteration 17. Scale the data

15-04-2024

Goal

Can I improve the LB score simply by scaling the data?

This is the last idea I have: to take GPT4 and use to to scale the training data.

I'm going to divide the generation process into two steps:

I have generated 2k new training samples, with a cost of around 20$.

I get a LB score of 0.60, worse than the best one which is 0.62

No improvement when scaling the data.

Last update: 2024-04-16