Generative AI | Databricks Blog

Page 11

MosaicML StreamingDataset: Fast, Accurate Streaming of Training Data from Cloud Storage

February 9, 2023 by James Knighton, Karan Jariwala, Davis Blalock and Erica Ji Yuen in Mosaic Research

Loading your training data becomes an escalating challenge as datasets grow bigger in size and the number of nodes scales. We built StreamingDataset...

Blazingly Fast LLM Evaluation for In-Context Learning

February 2, 2023 by Jeremy Dohmann in Mosaic Research

With MosaicML you can now evaluate LLMs on in-context learning tasks (LAMBADA, HellaSwag, PIQA, and more) hundreds of times faster than other evaluation...

5x Faster Image Segmentation Training with MosaicML Recipes

November 17, 2022 by Landan Seguin, Cory Stephenson and Erica Ji Yuen in Mosaic Research

Can't stop, won't stop. Earlier this year, we shared a new baseline for semantic segmentation (basically, classifying an image at the pixel level)...

MosaicML Delivers Leading NLP Performance in MLPerf v2.1

November 9, 2022 by Daya Khudia, Nikhil Sardana, Sam Havens, Alex Trott and Erica Ji Yuen in Mosaic Research

MosaicML leads the MLPerf NLP results, delivering a score of 7.9 minutes on 8x NVIDIA A100 GPUs in the Open Division, thanks to...

Mosaic LLMs: GPT-3 quality for <$500k

September 29, 2022 by Abhi Venigalla and Linden Li in Mosaic Research

Training large language models (LLMs) costs less than you think. Using the MosaicML platform, we show how fast, cheap, and easy it is...

Mosaic LLMs (Part 1): Billion-Parameter GPT Training Made Easy

August 11, 2022 by Abhi Venigalla and Linden Li in Mosaic Research

In Part 1 of this LLM blog post series, we use the MosaicML platform to train vanilla GPT-3 models up to 1.3B params...

Behind the Scenes: Setting a Baseline for Image Segmentation Speedups

July 27, 2022 by Landan Seguin in Mosaic Research

We establish a new semantic segmentation baseline of 45.56 mIoU on the ADE20k segmentation benchmark in 3.5 hours on a system with 8x...

Mosaic ResNet Deep Dive

July 18, 2022 by Matthew Leavitt in Mosaic Research

TL;DR: We recently released a set of recipes which can accelerate training of a ResNet-50 on ImageNet by up to 7x over standard...

MosaicML Satisfies the Need for Speed with MLPerf Results

June 29, 2022 by Bandish Shah, Daya Khudia and Hanlin Tang in Mosaic Research

MosaicML’s Open Division submission to the MLPerf Image Classification benchmark delivers a score of 23.8 minutes (4.5x speed-up relative to our baseline) on...

Farewell, CUDA OOM: Automatic Gradient Accumulation

June 23, 2022 by Mihir Patel and Erica Ji Yuen in Mosaic Research

With automatic gradient accumulation, Composer lets users seamlessly change GPU types and number of GPUs without having to worry about batch size. CUDA...