
The need for balanced and high-quality training datasets
Before a machine learning model is developed, a training set of manually labeled data is designed. The goal of AI is to augment human performance. Therefore, AI is built on work done by humans. The development of AI systems begins with asking questions that are human about a specific business process. While there has been scientific progress in using semi-supervised and unsupervised machine learning models, the majority of market applications require humans to label the training dataset.
