How Long Should You Train Your Language Model?

How long should you train your language model? How large should your model be? In today's generative AI landscape, these are multi-million dollar...
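As a rough illustration of the question this post tackles, here is a minimal sketch of a compute-optimal sizing calculation, assuming the common C ≈ 6ND approximation for training FLOPs and the Chinchilla-style heuristic of about 20 training tokens per parameter; both figures are standard rules of thumb, not numbers taken from the post itself.

```python
import math

def chinchilla_optimal(flops_budget: float, tokens_per_param: float = 20.0):
    """Estimate a compute-optimal model size and token count for a FLOP budget.

    Assumes training FLOPs C ~= 6 * N * D and the Chinchilla-style
    heuristic D ~= tokens_per_param * N. Solving C = 6 * N * (20 * N)
    for N gives N = sqrt(C / 120).
    """
    n_params = math.sqrt(flops_budget / (6.0 * tokens_per_param))
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens

# Example: roughly size a model for a 1e23 FLOP training budget.
params, tokens = chinchilla_optimal(1e23)
print(f"~{params / 1e9:.1f}B parameters trained on ~{tokens / 1e12:.2f}T tokens")
```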

Serving Quantized LLMs on NVIDIA H100 Tensor Core GPUs

Quantization is a technique for making machine learning models smaller and faster. We quantize Llama2-70B-Chat, producing an equivalent-quality model that generates 2.2x more...
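As a generic illustration of the technique (not the specific recipe evaluated in the post), here is a minimal sketch of symmetric per-tensor int8 weight quantization in NumPy; the function names and the example matrix size are hypothetical.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor int8 quantization: weights ~= scale * q."""
    scale = float(np.abs(weights).max()) / 127.0  # map the largest magnitude to 127
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximate float32 tensor from the int8 codes."""
    return q.astype(np.float32) * scale

# Example: quantize a random weight matrix and inspect the rounding error.
w = np.random.randn(4096, 4096).astype(np.float32)
q, scale = quantize_int8(w)
print("max abs error:", float(np.abs(w - dequantize_int8(q, scale)).max()))
```

Per-tensor scaling like this is the simplest variant; production recipes typically use per-channel or per-group scales to reduce quantization error.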

LLM Inference Performance Engineering: Best Practices

In this blog post, the MosaicML engineering team shares best practices for capitalizing on popular open source large language models (LLMs)...

MosaicML Delivers Leading NLP Performance in MLPerf v2.1

MosaicML leads the MLPerf NLP results, delivering a time-to-train of 7.9 minutes on 8x NVIDIA A100 GPUs in the Open Division, thanks to...