MosaicML StreamingDataset: Fast, Accurate Streaming of Training Data from Cloud Storage

February 9, 2023 by James Knighton, Karan Jariwala, Davis Blalock and Erica Ji Yuen in Mosaic Research

Loading your training data becomes an escalating challenge as datasets grow larger and the number of nodes increases. We built StreamingDataset...