Videos
Short, technical explainers on AI coding agents, local AI tooling, data pipelines, and developer workflows. Every video has a companion page with the embed and full transcript; the originals live on YouTube.
All videos
Search across every title and full transcript, or browse newest first. Click a title for the embed and transcript.
35 videos
- Stop Making Users Wait: Async Queues Explained
Async processing as a handoff - Django accepts the request, Redis holds the job, and a Celery worker finishes the slow work after the response is already back. Queues, workers, backlogs, retries, and when not to bother.
- pgvector: Vector Search Inside Postgres (No Extra DB)
Why pgvector lets Postgres do vector search itself - a vector column type, distance operators, and IVFFlat vs HNSW indexes - so most projects never need a separate vector database.
- LangSmith: Close the Loop Between Shipped and Working
LangSmith's agent engine closes the gap between shipped and working - it clusters production failures into named issues, traces each back to the commit that introduced it, and drafts a fix that waits for your approval.
- Tavily For Beginners: Give Your AI Real-Time Web Search
Tavily gives a language model real-time web search in four lines - structured results with a synthesized answer field, search-depth and time-range knobs, and a first-party LangChain tool.
- Running A RAG Pipeline On The Pentagon UFO Files — Real Cypher, Real Citations
Episode 2: open the actual repo and run all six stages on 115 declassified PDFs — Chroma retriever returns cited answers and a FalkorDB graph agent writes Cypher from plain English.
- Teach AI To Read UFO Files (LangGraph + RAG)
Episode 1: the architecture of a six-stage RAG pipeline plus a GraphRAG layer in LangChain + LangGraph, built to read 4 GB of declassified UFO files on a single workstation.
- LangChain Explained for Engineers: How the Runnable Interface Works
Once the Runnable interface clicks, the rest of LangChain — LCEL, LangGraph, tools, retrievers, structured output — becomes predictable instead of surprising.
- LangGraph Local-First: StateGraph, Reducers, and Custom Chat Models
LangGraph's StateGraph, reducers, conditional edges, and checkpointer run identically against local models - no hosted LLM or API key required.
- 20 Essential 3D Printing Terms: Fillet, Chamfer, Manifold
Twenty precise 3D printing terms - fillet, chamfer, manifold, draft angle, clearance fit - that let you command an AI to design parts that actually print on the first try.
- LangChain Framework Explained: LCEL, LangGraph & RAG
A complete tour of LangChain's five packages - LCEL, LangGraph, retrievers, tools, structured output - assembled into a working RAG application.
- Blender MCP: Export 3MF and Automate 3D Prints via AI
Wiring Blender's MCP server and 3MF export into Claude Code or Codex so AI agents can drive a full 3D print workflow end-to-end.
- CADQuery and AI: Automating Parametric 3D Printing from Text
How AI plus CADQuery turns text descriptions into parametric CAD scripts and STL files - and where the workflow quietly breaks.
- Web Scraping in the AI Era: From BeautifulSoup to LLM Extraction
Every web page is structured data wearing an HTML costume. The four moves under the hood, the three-layer stack, and what AI changed about all of it.
- How to Get Cited by AI Answer Engines
What actually makes a page show up inside ChatGPT, Perplexity, and Google AI Overviews - and how to write for it.
- How Local AI Actually Remembers: Inside The Agent Log
Every local coding agent writes a structured log. Resume, replay, and audit are all downstream of that one file.
- How Pi Mono Actually Works in Your App
Walking through the Pi Mono shared agent stack from inside a real app, not as an abstract architecture diagram.
- Fish Speech S2 Pro: Open-Source Voice AI That Beats Closed Models
Hands-on with Fish Speech S2 Pro - what it sounds like, what it costs to run locally, and where it actually wins.
- I Tested Claude Code Tools - Here's What Surprised Me
The lesser-known Claude Code tools, what they really do, and which ones changed how I move through a repo.
- How Pi Mono Actually Works: The Shared Agent Stack Behind Pi
The architecture under Pi Mono - one harness, many agents - and why that shape keeps showing up in local AI tools.
- How Repo Rules, MCP, and Approvals Control AI Coding Agents
The three control surfaces that decide what an agent can read, run, and change inside your repo.
- Why Cursor Picks the Wrong Files In Large Repos (And How to Fix It)
What Cursor's retrieval is really doing in a large monorepo, why it drifts, and the small repo changes that pull it back on track.
- OpenAI Codex Is Now a Multi-Agent Command Center
Codex grew up: cloud tasks, sub-agents, and approvals stitched into one workflow you can actually drive from the CLI.
- How Codex Actually Finds the Right Files
AGENTS.md, skills, MCP, search, and compaction - the layers Codex uses to narrow a big repo down to a working set.
- Codex Explained for Engineers: How the Coding Agent Actually Works
A plain-English walkthrough of the Codex loop: instructions in, tools and reads, edits out, approvals around it.
- Cursor Explained for Engineers: How the AI IDE Actually Works
What Cursor is doing under the hood - context, retrieval, edits, and the trade-offs that come with the IDE shape.
- How MCP Actually Works: The Capability Bus Behind Codex, Claude Code, and Gemini CLI
MCP, demystified. Why it's a capability bus, not a plugin store, and what that changes about how agents reach external tools.
- How AI Coding CLIs Actually Work: Codex, Claude Code, Gemini CLI
Side-by-side teardown of three terminal coding agents - the loop, the tools, and the parts they each get right.
- Why Terminal AI Is The Fastest Workflow For Developers
The case for keeping AI coding in the terminal: tighter loops, smaller context, and tools you already trust.
- OpenClaw Explained: How the Local AI Control Plane Works
Inside OpenClaw - a local control plane for routing AI work between models, tools, and approvals on your machine.
- ComfyUI Explained for Engineers: How to Build Local AI Apps
ComfyUI from a software engineer's angle - nodes as functions, graphs as programs, and where the real product lives.
- Bruno for Local AI APIs: Faster, Reviewable Workflows
Using Bruno to drive local AI APIs - file-based requests, OpenAPI sync, and reviewable diffs instead of throwaway curl.
- Build a Local AI Coding App in Neovim
Wiring a real local AI coding workflow into Neovim - the editor stays the editor, the agent stays in the loop.
- Azure Data Factory with Azure CLI - Part 2: Reading Pipeline & Dataset JSON
Companion to the blog post - reading ADF pipeline and dataset JSON from the Azure CLI to inspect a real factory.
- Azure Data Factory with Azure CLI - Part 1: Discovering Pipelines & Datasets
Discovering an ADF instance from the command line - listing pipelines, datasets, and linked services without the portal.
- How to Use ChatGPT Codex & Apply Git Patches
Quick tutorial on driving ChatGPT Codex with diffs you can review, apply, and roll back like any other commit.