machine learning datasets as a process