Are you using an older version of Prodigy, by any chance? The batch size of 32 is probably too high, I think we changed this default in recent versions. Anyway, try setting a lower batch size, to hopefully get the model to fit better. 4 would be a good thing to try.
Another thing to note is that the training curve can be expected to follow a sigmoid sort of curve. When there’s very little data, accuracy can be flat, as the model fails to generalise usefully. Then as more data is added, the curve enters a high growth section, before tapering off. So sometimes even when the training curve doesn’t show improvement, you do need to add more data.
Try changing the hyper-parameters though, especially the batch size. I think it should help.