How to create small datasets out of a big data file