<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>David Bartolomei-Guzmán</title><link>https://davidbartolomei.com/</link><description>Recent content on David Bartolomei-Guzmán</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sun, 05 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://davidbartolomei.com/feed.xml" rel="self" type="application/rss+xml"/><item><title>Migrating from RunPod to Local Whisper Inference with MLX and a DGX Spark</title><link>https://davidbartolomei.com/migrating-from-runpod-to-local-whisper-inference/</link><pubDate>Sun, 05 Apr 2026 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/migrating-from-runpod-to-local-whisper-inference/</guid><description>&lt;p&gt;In my &lt;a href="https://davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/"&gt;previous article&lt;/a&gt;, I described how we use OpenAI&amp;rsquo;s Whisper model to transcribe radio and TV broadcasts for Monitorea, our media monitoring platform. At the time, we were running inference on RunPod - a serverless GPU platform that lets you deploy ML models without managing hardware. It was the right call to get started quickly. But as we scaled, the economics stopped making sense.&lt;/p&gt;
&lt;p&gt;Here&amp;rsquo;s how we migrated to fully local inference in about a weekend, using MLX on Apple Silicon and a DGX Spark we call Sparky.&lt;/p&gt;</description></item><item><title>Launching Monitorea: AI Agents for Broadcast Media Intelligence</title><link>https://davidbartolomei.com/launching-monitorea-ai-agents-for-broadcast-media-intelligence/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/launching-monitorea-ai-agents-for-broadcast-media-intelligence/</guid><description>&lt;p&gt;A year ago I published a &lt;a href="https://davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/"&gt;case study&lt;/a&gt; analyzing Share of Voice across Puerto Rico&amp;rsquo;s AM radio stations using Whisper transcriptions. The article ended with a long list of &amp;ldquo;future work&amp;rdquo; — fine-tuning, entity recognition, segment classification, summarization. At the time, those were ideas I wanted to explore. As of this month, most of them are running in production.&lt;/p&gt;
&lt;p&gt;&lt;a href="https://monitorea.ai"&gt;Monitorea&lt;/a&gt; is now in private beta. Here&amp;rsquo;s what changed and how we got here.&lt;/p&gt;</description></item><item><title>LLMs Alternatives for Software Development in Regulated Industries</title><link>https://davidbartolomei.com/llms-alternatives-for-software-development-in-regulated-industries/</link><pubDate>Wed, 09 Oct 2024 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/llms-alternatives-for-software-development-in-regulated-industries/</guid><description>&lt;p&gt;By the time you’re reading this, the landscape of Large Language Models (LLMs) has likely evolved further, but one thing remains constant: their usage in software development continues to ramp up, offering new ways to improve productivity and transform workflows.&lt;/p&gt;
&lt;h4 id="enhancing-software-development-with-llms"&gt;Enhancing Software Development with LLMs&lt;/h4&gt;
&lt;p&gt;If you write software for a living, you’ve probably encountered countless articles sharing how LLMs can enhance your productivity and streamline your development workflows. Tools like &lt;a href="https://www.cursor.com/"&gt;Cursor&lt;/a&gt; (my personal favorite), &lt;a href="https://github.com/features/copilot"&gt;GitHub Copilot&lt;/a&gt;, and the newly announced &lt;a href="https://openai.com/index/introducing-canvas/"&gt;OpenAI Canvas&lt;/a&gt; are tailored specifically for software development and are great with code generation, error detection, and automating repetitive tasks. If you have access to any of these tools at work, consider yourself lucky. If not, keep reading to explore some alternatives.&lt;/p&gt;</description></item><item><title>Case Study: Leveraging Machine Learning for Spoken Media Analysis – Share of Voice of Puerto Rico’s Political Figures in 2024</title><link>https://davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/</link><pubDate>Tue, 01 Oct 2024 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/</guid><description>&lt;p&gt;This is the first in a series of articles where I share my findings exploring Speech-to-Text (STT) ML models to transcribe and analyze spoken content in news media. In this article, I discuss how STT output can be used for automatic mention detection and tracking metrics such as Share of Voice of political figures in Puerto Rico during the 2024 election season.&lt;/p&gt;
&lt;h2 id="the-back-story"&gt;The Back Story&lt;/h2&gt;
&lt;p&gt;Before diving into the details, here’s a brief back story on what sparked my interest in this topic. You can skip directly to the results by scrolling down.&lt;/p&gt;</description></item><item><title>Challenges and Benefits of the Analytics Engineer Role</title><link>https://davidbartolomei.com/challenges-and-benefits-of-the-analytics-engineer-role/</link><pubDate>Mon, 22 Apr 2024 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/challenges-and-benefits-of-the-analytics-engineer-role/</guid><description>&lt;p&gt;The Analytics Engineer is a relatively new role that emerged during the “Modern Data Stack” trend of the last 5-10 years and was &lt;a href="https://www.getdbt.com/what-is-analytics-engineering"&gt;”formalized”&lt;/a&gt; by dbt Labs. In this article, I discuss some of the learnings, challenges, and benefits that I have experienced while adopting this role in my team.&lt;/p&gt;
&lt;p&gt;TLDR: Analytics Engineers merge the analytical skills of a data analyst with the engineering mindset and practices of a software engineer, creating a hybrid profile. This can be very effective in keeping a data team agile, providing insights, and aiding in the decision-making process more effectively than traditional data analysts.&lt;/p&gt;</description></item><item><title>Local Dev with Astro CLI and Remote Databases</title><link>https://davidbartolomei.com/connecting-to-remote-databases-while-developing-with-astro-cli/</link><pubDate>Sun, 11 Feb 2024 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/connecting-to-remote-databases-while-developing-with-astro-cli/</guid><description>&lt;p&gt;When working with Astronomer and Airflow for data processing and ETL tasks, it’s common to encounter scenarios where you need to interact with databases hosted within a private subnet in in your cloud provider. In this blog post, we’ll demonstrate how to set up local development for Astronomer and Airflow while connecting to a Redshift database hosted in a private AWS subnet.&lt;/p&gt;
&lt;p&gt;TLDR: We’ll achieve this by adding a new container to the Docker Compose file that contains the Astro CLI configuration and configure the Airflow Connection to use that container to reach to your remote DB.&lt;/p&gt;</description></item><item><title>Managing Python versions and environments in macOS</title><link>https://davidbartolomei.com/setting-up-python-in-macos/</link><pubDate>Wed, 01 Dec 2021 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/setting-up-python-in-macos/</guid><description>&lt;p&gt;I have been using Python for a while now, and managing dependencies and versions always has been a pain. macOS still comes with Python2.7, but most dev works have moved to Python3.&lt;/p&gt;
&lt;p&gt;&lt;img src="https://davidbartolomei.com/images/python_environment.png" alt=""&gt;&lt;em&gt;&lt;a href="https://xkcd.com/1987/"&gt;&lt;em&gt;Graphic representation of a common Python setup&lt;/em&gt;.&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;This time around, I saw myself using Python more and more and decided to stop kicking the can down the road and “properly” set up Python (at least for my use cases). This is a quick setup summary on how I set up and manage my Python versions and environments using &lt;code&gt;pyenv&lt;/code&gt; and &lt;code&gt;pyenv-virtualenv&lt;/code&gt;. These tools are also compatible with various Linux distributions such as Debian, Ubuntu, Mint.&lt;/p&gt;</description></item><item><title>How to easily migrate your Brew packages in macOS</title><link>https://davidbartolomei.com/automatically-install-almost-everything-in-your-mac/</link><pubDate>Sun, 06 Dec 2020 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/automatically-install-almost-everything-in-your-mac/</guid><description>&lt;p&gt;Recently I decided to do a clean install on my personal Mac at the same time and start optimizing my dev setup and overall productivity. The first step towards this goal was to automate installing all the packages and apps I use daily. I have been using Homebrew to manage my macOS dependencies for a few years now and recently I discovered the awesomeness of Brewfiles to export and move dependencies to your new Mac and/or to track it in version control.&lt;/p&gt;</description></item><item><title>From Software to Data Engineering</title><link>https://davidbartolomei.com/from-software-to-data-engineering/</link><pubDate>Mon, 26 Oct 2020 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/from-software-to-data-engineering/</guid><description>&lt;p&gt;I started to work as a data engineer when me and my team were tasked to automate some data sets and reports for the businesses/brands our team was supporting. The challenge was significant: these sites lacked any established data pipelines, relying solely on basic event capture. What was initially a three-month project ignited my passion for data and analytics. In this post, I’ll share my experiences and key learnings from the past year.&lt;/p&gt;</description></item><item><title>David Bartolomei-Guzmán</title><link>https://davidbartolomei.com/home/bio/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://davidbartolomei.com/home/bio/</guid><description/></item></channel></rss>