<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Posts on Matthew Norberg's Data Engineering Blog</title><link>https://mnorberg-dev.github.io/posts/</link><description>Recent content in Posts on Matthew Norberg's Data Engineering Blog</description><generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>mattnorberg4@gmail.com (Matthew Norberg)</managingEditor><webMaster>mattnorberg4@gmail.com (Matthew Norberg)</webMaster><copyright>© 2026 Matthew Norberg</copyright><lastBuildDate>Sun, 05 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://mnorberg-dev.github.io/posts/index.xml" rel="self" type="application/rss+xml"/><item><title>Boston Code Camp Recap: Building AI Agents in Databricks</title><link>https://mnorberg-dev.github.io/posts/boston-code-camp-ai-agents-databricks/</link><pubDate>Sun, 05 Apr 2026 00:00:00 +0000</pubDate><author>mattnorberg4@gmail.com (Matthew Norberg)</author><guid>https://mnorberg-dev.github.io/posts/boston-code-camp-ai-agents-databricks/</guid><description>A recap of my session at Boston Code Camp, covering how to build, deploy, and test AI agents in Databricks using the Responses Agent framework. Includes links to the slides and code on GitHub.</description></item><item><title>The Hidden Cost of Databricks AI Agent Redeploys</title><link>https://mnorberg-dev.github.io/posts/ai-agent-cost-savings/</link><pubDate>Thu, 12 Feb 2026 00:00:00 +0000</pubDate><author>mattnorberg4@gmail.com (Matthew Norberg)</author><guid>https://mnorberg-dev.github.io/posts/ai-agent-cost-savings/</guid><description>Redeploying AI agents in Databricks can quietly increase serving costs in ways that aren’t immediately obvious. Each call to &lt;code&gt;agents.deploy()&lt;/code&gt; creates a new agent version, and even versions receiving 0% of traffic may still consume compute resources. In this post, I walk through how we uncovered this behavior, the hypotheses we tested, and the experiment that confirmed it. Cleaning up unused agent versions ultimately reduced our serving costs by roughly 50%.</description></item><item><title>Creating AI Processing Pipelines: A Data-First Approach</title><link>https://mnorberg-dev.github.io/posts/inference-table-processing-tests/</link><pubDate>Wed, 17 Dec 2025 00:00:00 +0000</pubDate><author>mattnorberg4@gmail.com (Matthew Norberg)</author><guid>https://mnorberg-dev.github.io/posts/inference-table-processing-tests/</guid><description>Databricks Mosaic AI Gateway captures rich AI agent request and response data, but not in a format suitable for analysis. Turning that data into insights requires processing pipelines, and before building them, you need to understand the different shapes inference data can take. This post argues for a data-first approach that intentionally generates and examines real inference cases before designing pipelines that have to survive production.</description></item><item><title>Tracing with Databricks Mosaic AI Gateway: A Practical Guide</title><link>https://mnorberg-dev.github.io/posts/ai-gateway/</link><pubDate>Sun, 05 Oct 2025 00:00:00 +0000</pubDate><author>mattnorberg4@gmail.com (Matthew Norberg)</author><guid>https://mnorberg-dev.github.io/posts/ai-gateway/</guid><description>Step-by-step guide to enabling MLflow tracing with Databricks Mosaic AI Gateway. Details the recommended ResponsesAgent approach, examines alternative methods (foundation/external endpoints and custom Python models), and highlights the pitfalls that make the agent path preferable.</description></item><item><title>Databricks Data Quality Expectations Guide</title><link>https://mnorberg-dev.github.io/posts/data-quality-expectations/</link><pubDate>Sun, 21 Sep 2025 00:00:00 +0000</pubDate><author>mattnorberg4@gmail.com (Matthew Norberg)</author><guid>https://mnorberg-dev.github.io/posts/data-quality-expectations/</guid><description>Practical walkthrough for implementing data quality expectations in Databricks Delta Live Tables (Lakeflow Declarative Pipelines). Covers a four-step process to profile data, formalize and translate rules, quarantine noncompliant records, and balance strict versus permissive checks, with lessons learned and links to the talk, slides, and demo code.</description></item><item><title>The Databricks Tool You Didn't Know You Needed</title><link>https://mnorberg-dev.github.io/posts/databricks-tab-label-tool/</link><pubDate>Tue, 16 Sep 2025 00:00:00 +0000</pubDate><author>mattnorberg4@gmail.com (Matthew Norberg)</author><guid>https://mnorberg-dev.github.io/posts/databricks-tab-label-tool/</guid><description>Explains how to clearly distinguish Databricks environments by adding environment-specific labels to browser tabs with a Tampermonkey userscript. Outlines the risk of identical tab titles, provides the script with domain placeholders, and gives step-by-step setup and troubleshooting guidance for dev/QA/prod.</description></item></channel></rss>