Single or Multi Dataset Training
When you open the FELT application, you choose between single-dataset and multi-dataset training. This choice determines how the rest of the training process works.
Both options start the same way. The only difference is that for single-dataset training you pick just one dataset, and for multi-dataset training you pick at least two. The rest of the setup is identical: you pick a model, select hyperparameters, and start training. Once the training starts, you can go to the Launched jobs section of the application to view the results. This is where the two options start to differ.
When you view the results of single-dataset training, you will see only one compute job. When the training finishes, you can download the result and use it right away. Training on a single dataset is therefore generally faster, as it requires starting just one compute job to get the results.
The situation is different for multi-dataset training. In this case, we run a compute job on each dataset separately. We call this local training. The results of local training can't be used directly, and you can't view them in the application. Instead, you have to select the local trainings (using the checkboxes on the right) and click on aggregate. The aggregation combines these local jobs into the final model, which you can then download in the Aggregation section. If you have a lot of datasets, you can run multiple aggregations, and each aggregation job can combine a different set of datasets.
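To make the idea of aggregation more concrete, here is a minimal Python sketch of one common way to combine locally trained models: a weighted average of their parameters. This is an illustration only. This page does not describe how FELT aggregates models internally, so the function name `aggregate`, the parameter layout, and the weighting by dataset size are all assumptions made for the example.

```python
from typing import Dict, List

# Hypothetical representation: each local model is a dict mapping a
# parameter name to a flat list of numbers. This is NOT FELT's format.
Params = Dict[str, List[float]]


def aggregate(local_models: List[Params], sample_counts: List[int]) -> Params:
    """Combine local models by a weighted average of their parameters.

    Each model is weighted by the number of samples in its dataset
    (an assumption for this sketch, not a documented FELT behavior).
    """
    if not local_models:
        raise ValueError("select at least one local training")
    if len(local_models) != len(sample_counts):
        raise ValueError("need one sample count per local model")
    total = sum(sample_counts)
    if total <= 0:
        raise ValueError("sample counts must sum to a positive number")

    result: Params = {}
    for name, first_values in local_models[0].items():
        # Average each position of the parameter across all local models.
        result[name] = [
            sum(model[name][i] * count
                for model, count in zip(local_models, sample_counts)) / total
            for i in range(len(first_values))
        ]
    return result


# Two made-up local training results, one per dataset.
local_a = {"coef": [1.0, 2.0], "intercept": [0.0]}
local_b = {"coef": [3.0, 4.0], "intercept": [1.0]}

final_model = aggregate([local_a, local_b], [100, 300])
print(final_model)
```

In this sketch, running several aggregations over different subsets of datasets would simply mean calling `aggregate` with different lists of local models.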