Engineering blog

A Pattern for the Lightweight Deployment of Distributed XGBoost and LightGBM Models

A common challenge data scientists encounter when developing machine learning solutions is training a model on a dataset that is too large to...
Industries

Automated Analysis of Product Reviews Using Large Language Models (LLMs)

Check out our LLM Solution Accelerators for Retail for more details and to download the notebooks. While conversational AI has garnered a lot...
Engineering blog

Using MLflow AI Gateway and Llama 2 to Build Generative AI Apps

To build customer support bots, internal knowledge graphs, or Q&A systems, customers often use Retrieval Augmented Generation (RAG) applications which leverage pre-trained models...
Engineering blog

Synthetic Data for Better Machine Learning

April 12, 2023 by Sean Owen in Engineering Blog
You've likely tried the buzziest advances in generative AI in the past year, tools like ChatGPT and DALL-E. They consume complex data...
Engineering blog

Fine-Tuning Large Language Models with Hugging Face and DeepSpeed

March 20, 2023 by Sean Owen in Engineering Blog
Large language models (LLMs) are currently in the spotlight following the sensational release of ChatGPT. Many are wondering how to take advantage of...
Platform blog

Mitigating Bias in Machine Learning With SHAP and Fairlearn

September 16, 2022 by Sean Owen in Platform Blog
Try this notebook in Databricks. With good reason, data science teams increasingly grapple with questions of ethics, bias and unfairness in machine learning...
Engineering blog

How (Not) to Tune Your Model With Hyperopt

April 15, 2021 by Sean Owen in Engineering Blog
Hyperopt is a powerful tool for tuning ML models with Apache Spark. Read on to learn how to define and execute (and debug)...
Company blog

Better Machine Learning through Active Learning

January 16, 2020 by Sean Owen in Company Blog
Try this notebook to reproduce the steps outlined below. Machine learning models can seem like magical savants. They can distinguish hot dogs from...
Engineering blog

How (Not) To Scale Deep Learning in 6 Easy Steps

August 15, 2019 by Sean Owen in Data Science and ML
Try this notebook in Databricks. Introduction: The Problem. Deep learning sometimes seems like sorcery. Its state-of-the-art applications are at times delightful and at...
Engineering blog

Detecting Data Bias Using SHAP and Machine Learning

June 17, 2019 by Sean Owen in Data Science and ML
Try the Detecting Data Bias Using SHAP notebook to reproduce the steps outlined below and watch our on-demand webinar to learn more. StackOverflow's...
Engineering blog

Near-Real-Time Hardware Failure Rate Estimation with Bayesian Reasoning

February 14, 2019 by Sean Owen in Data Science and ML
Try this notebook in Databricks. You might be using Bayesian techniques in your data science without knowing it! And if you're not, then...