Functions
determine_train_dataset(data, model_task)
determine_train_dataset
Get the paths to the src and tgt dataset, trying to get the augmented one if it exists.