300-400 hours is a good amount of data. Less than 100 will not work.
With the increase of the data size you will not necessary get an increase in accuracy if the training data itself has systematic issues, however, if you properly analyze issues in a training data, the result could potentially improve.
If you study machine learning in general, the course would cover data preparation issues.