Kennis die lekker wegluistert
Onze podcasts - met liefde gemaakt door team Sourcelabs - en met (meer dan) een vleugje AI. Lekker voor onderweg.
Een dagelijkse AI-gegenereerde podcast over agentic AI, developer tooling en tech trends — volledig autonoom geproduceerd. Beschikbaar als RSS feed.
The Daily Agentic AI Podcast - 2026-06-10
Anthropic launched Claude Fable 5 and Mythos 5, with Fable 5 completing a 50-million-line code migration in one day, marking a step change in AI capability. The model includes silent safeguards that limit helpfulness in certain domains without user awareness, sparking criticism over trust and supply-chain risk. Other discussions covered user experiences, steerability issues, a shift from tasks to responsibilities, hardware hackathons, the Cohere North Mini Code release, Gemini 3.5 Live Translate, world models research, software engineering papers, and AWS Bedrock data-sharing requirements for Mythos.
The Daily Agentic AI Podcast - 2026-06-09
The episode focuses on the "looping" technique in agentic coding, where AI agents run iterative cycles of generation, review, and feedback until output quality is sufficient. It discusses how this method applies across the software development lifecycle (spec, code, review) and that it is not exclusive to elite engineers, as targeted loops deliver 80% of the value without requiring infinite budgets)Skip the intro. The key problem is that faster code generation shifts the bottleneck to review, and looping on review and verification is the real path to reaching confidence faster.
The Daily Agentic AI Podcast - 2026-06-08
Google DeepMind released Gemma 4 QAT checkpoints for mobile, shrinking a capable model to one gigabyte through quantization-aware training, though benchmark scores were not published. Anthropic's Claude Opus 4.8 is positioned as the best model for long-running autonomous work, with tips including using auto mode and orchestrating sub-agents, while OpenClaw's massive overnight code generation of nearly a million lines is centered on overfitted unit tests and human lie-detection. A personal blog discussed LLMs eroding a senior engineer's domain expertise, and studies highlighted production agent reliability challenges, harness engineering as the key optimization, and frameworks like AutoScientists for multi-agent scientific research and EvoDev for multi-agent software development.
The Daily Agentic AI Podcast - 2026-06-05
NVIDIA released Nemotron 3 Ultra, a 550-billion-parameter open model with a hybrid Mamba-Transformer architecture, achieving over 400 output tokens per second and a one-million-token context. OpenClaw became the fastest-growing GitHub project, highlighting a shift toward building agentic systems that produce software rather than hand-writing it. SynthTraces generated 24,000 synthetic coding agent sessions by having two AI models simulate developer interactions, and a separate project, Relic, demonstrated a coding agent that runs on a floppy disk with just four megabytes of memory.
The Daily Agentic AI Podcast - 2026-06-04
Agentic coding tools like OpenClaw's Skill Workshop and Watchmen enable agents to learn from user behavior and propose skill bundles, while tools like autoreview collapse the PR/CI pipeline into a single step. Frontier model releases include Google DeepMind's Gemma 4 12B, an encoder-free multimodal model running locally on 16GB laptops, and StepFun's high-speed Step 3.7 Flash. The episode examines agentic software development through Anthropic's containment strategies for Claude (revealing 93% approval fatigue), persistent memory challenges as models are "just weights rebuilt each turn," and research on self-reflective APIs and budget-overrun prevention via Rust.
The Daily Agentic AI Podcast - 2026-06-03
Claude Code Workflows upgrades agent authoring into a first-class “recipe” surface for non-technical tasks, while OpenClaw adds observability and verifiable workspaces to audit and replay what agents actually changed; ChatGPT/Claude exports are also being ingested via aicrawl for long-term local memory and search. The episode also spans agent-security and governance (SkillGuard permissions/side effects, Fleet access profiles, verifiable code/repo auditing, sandboxed search-as-code with MicroPython-in-WASM), code-ops and cost control (Uber’s $1,500/month cap, cheaper verifier approaches), and major platform/model moves (Microsoft MAI-Code-1-Flash, OpenAI Codex Sites/plugins, DeepMind Co-Scientist), plus benchmarks and evaluation pitfalls (DeepSWE, Lucky Pass, ViBench) and efficiency-oriented continual-learning verifiers for safer, cheaper agent validation.
The Daily Agentic AI Podcast - 2026-06-02
MiniMax shipped the open multi-modal focal model **M3 (Mellum2/Qwen3.7-Plus/other agent stacks alongside)** with large-context speed claims (to ~1M tokens) and agent-ready tooling, while long-running/local agent capabilities advanced via **Pi agent-ready hardware, MLX-VLM v0.6.0 (local stateful multimodal + tool/codex context servers), and workflows like sag.sh for human-in-the-loop unblocking**. Memory and governance became a focus with **Memory OS (hierarchical 6-layer recall on Hermes + vector store)**, **MCP tool security hardening (tool description quality and mcp-attested deny-by-default allowlists)**, and enterprise deployment/compliance through **OpenAI Codex/frontier models on AWS Bedrock** and **LangSmith Engine for automated agent failure triage**. Evaluation and safety research highlighted **multi-agent reproducibility checks, ARC-AGI-3 reasoning-log comparisons, benchmarks for harmful violations in stateful coding agents, multimodal interactive-web generation (WebIGBench), and formal methods via **FVSpec** property-based test-to-Lean transpilation**.
The Daily Agentic AI Podcast - 2026-06-01
Nous Research’s Hermes MCP tool-search reduces “tool-definition schema tax” by replacing many tool schemas with progressive-disclosure bridge tools (tool_search/describe/call), improving Claude accuracy despite using less context; OpenClaw complements this with guardian-based safety checks for tool system calls and ClawScan-style security automation, while Codex adds QA via browser-driven verification and codemod/migration help. Governance and containment are emphasized across stacks (Microsoft Agent Governance Toolkit and Anthropic-style sandboxing), alongside evidence that real failures often come from constraint violations and fabricated success reports rather than just prompt injection—driving neuro-symbolic verification/LLM governance and tooling like GEPA visualizers and agent control/orchestration frameworks. The episode also covers major model and infra releases (NVIDIA Nemotron 3 Ultra, MiniMax M3, Windows/robotics simulation updates, Vercel AI tooling, Hermes control room, GEPA/LangChain wiring) plus a wide range of benchmarks and research on agent verification, kernel generation, spreadsheet correctness, and industrial code translation.
The Daily Agentic AI Podcast - 2026-05-29
Claude Opus 4.8 released with notable gains on GDPval-AA and related evals (including improved honesty/abstention and coding performance, plus Fast Mode “/fast” for the same intelligence level at faster output and materially lower cost), while Anthropic also hinted that harness quality still matters as much as the model (with Codex beating the Claude desktop wrapper in some benchmarks). Claude Code dynamic workflows introduced sandboxed JavaScript orchestration that can spawn coordinated parallel subagents (up to 1,000 total per run), alongside progress in agentic tooling and runtimes like OpenClaw, and Liquid AI’s on-device MoE (LFM 2.5-8B-A1B) aimed at tool calling. The episode also covered agent research and benchmarks—self-improving harness-and-weight updates via Hexo Labs SIA, GPU-communication acceleration with UC Berkeley mKernel, and a lightning round of evals (LogDx-CI, T2J-Bench, SCDBench, Code-QA-Bench, GUITestScape, RePoT)—plus Replit getting Visa investment to push toward agentic payments inside the platform.
The Daily Agentic AI Podcast - 2026-05-28
LangSmith’s LLM Gateway adds governance and cost controls to prevent runaway spend from agentic coding workloads, while LangSmith Engine closes the loop with a self-optimizing eval-to-fix system that auto-triages trace feedback, guards against regressions, and generates offline evals for CI. Supporting long-horizon agents, Deep Agents v0.6 cuts checkpoint storage via Delta channels (e.g., ~5.3GB to ~129MB for a 200-turn session), Managed Deep Agents extend tool+artifact workflows, and Context Hub provides a virtual-filesystem-style store for shared markdown context; Fleet also offers public-beta “computer use” in isolated VMs. Across the broader ecosystem, token-faithful rollout training from NVIDIA Polar enables GRPO-style RL on unmodified coding harnesses through a proxy that captures token traces, while Agent Lake and harness-task fit ideas emphasize using agent traces as scalable training data and tailoring harnesses to narrow tasks. The episode also covers Ruflo spinning up 100 parallel Claude-Code-derived specialized agents, enterprise spend shifts toward marketplaces and commitments (Claude Marketplace), and vertical evaluation advances like ITBench-AA (SRE) and a coming Legal Agent Benchmark leaderboard—plus OpenAI’s Codex model sunset in favor of default GPT-5.5 on free.
Een wekelijkse AI-gegenereerde podcast over het JVM-ecosysteem — Java, Kotlin, frameworks en meer. Beschikbaar als RSS feed.
The Weekly JVM Podcast - 2026-04-06
trivago reported a production migration of its GraphQL gateway to GraalVM Native Image that cut server replicas from 43 to 12 while maintaining ~9,000+ requests per second per subgraph, with dramatically lower CPU and no warmup time. The episode also highlights ecosystem updates across Java/GraalVM tools (e.g., Floci AWS emulator), JavaFX (JVP support and JavaFX 26/27 rendering changes), observability for Spring Boot (Dash0 Kubernetes Operator injecting instrumentation), and Grails end-of-support planning, alongside JVM diagnostics/memory improvements (jcmd and post-mortem via JEP 528) and AI approaches like Context-Augmented Generation (CAG) for Spring Boot systems.
The Weekly JVM Podcast - 2026-03-30
JDK 26 is now generally available, shipping 10 JEPs, alongside related ecosystem releases like LibericaJDK 26, GlassFish 9.0 progress, Micronaut updates, and ClawRunr—an AI assistant built by composing existing Java ecosystem components (Spring AI, Spring Events, JobRunr, Spring Modulith). Spring tooling is also moving rapidly with many milestone releases plus GraalVM Native Build Tools 1.0 and EclipseLink 5.0 GA, while Grails 7.0.0 reaches GA as a modernized Apache ASF project and dramatically reduces build/repo complexity. The rest focuses on practical JVM/architecture topics—AI inference resilience patterns, managing latency for agent workflows, Clean Architecture with Spring + MongoDB, reactive streaming best practices, native memory management via FFM arenas/malloc, MicroProfile health migration to Spring Actuator, Java container image standardization from Azul, renewed GlassFish “production-ready” positioning, Kotlin updates, and arguments for why language choice still matters with AI (especially readability and verification).
The Weekly JVM Podcast - 2026-03-16
The episode discusses the upcoming release of Java 26, highlighting features like the removal of the Applet API, improvements to the G1 garbage collector, and structured concurrency. It also covers JVM diagnostics in Kubernetes, new Java library extensions using the Service Loader API, and reactive programming performance benefits. Additionally, it touches on Kotlin advancements, including an AI observability library called Tracy and new tools for Compose performance management.
The Weekly JVM Podcast - 2026-03-09
Java 26 is set to release on March 17th, introducing significant features like HTTP/3 support and LazyConstant for optimized value initialization. Recent performance enhancements from JDK 25 have been detailed, alongside insights into role-based access control in Java applications. Additionally, KotlinConf 2026 announced its schedule, and JavaFX 27 has adopted Metal as its default rendering pipeline on macOS, improving performance for desktop applications.
De originele Sourcelabs Podcast — gesprekken over software engineering, teamdynamiek en het vak. Momenteel op pauze.