24:44 Fast Inference from Transformers via Speculative Decoding
Host: mcgrof
These sources review historically speculative decoding, an innovative technique designed to accelerate Large Language Model (LLM) inference without...
Curated AI, machine learning, and data science conversations for builders who want signal, not noise.
224
Episodes
5
Categories
10
Topics
Browse by focus
Start with a category, then dive into the topics builders care about most.
Latest episodes
Showing 224 episodes.
Host: mcgrof
These sources review historically speculative decoding, an innovative technique designed to accelerate Large Language Model (LLM) inference without...
Host: mcgrof
This article outlines how Baseten optimized speculative decoding using the TensorRT-LLM framework to accelerate model inference. The authors detail...
Host: Etienne Noumen
Listen to Full Episode at: https://podcasts.apple.com/us/podcast/ai-daily-news-rundown-the-anthropic-blacklist-openais/id1684415169?i=1000752107128...
Host: Etienne Noumen
🚀 Welcome to AI Unraveled (February 28th, 2026): Your daily strategic briefing on the business, technology, and policy reshaping artificial intelli...
Host: mcgrof
QuantSpec is a novel self-speculative decoding framework designed to accelerate the inference of Large Language Models, particularly in long-contex...
Host: mcgrof
The researchers introduce CXL-SpecKV, a specialized architecture designed to overcome the memory bottlenecks of large language model serving by off...
Host: mcgrof
On the February 19, 2026 paper Google Deepmind introduces Unified Latents (UL), a novel framework for generative modeling that jointly trains an en...
Host: mcgrof
We review an April 3, 2025 research collaboration between CMU, Moffett AI and Together AI which introduces MagicDec, a new framework designed to ac...
Host: Etienne Noumen
Listen to Full Episode at: https://podcasts.apple.com/us/podcast/ai-daily-news-rundown-the-anthropic-blacklist-openais/id1684415169?i=1000752107128...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/scientist-vs-storyteller-benchmarking-gpt-5-2-claude/id1684415169?i=1000752001078For ...
Host: Etienne Noumen
Listen to Full Episode at: https://podcasts.apple.com/us/podcast/ai-daily-news-rundown-the-anthropic-blacklist-openais/id1684415169?i=1000752107128...
Host: Inception Point Ai
This is you Applied AI Daily: Machine Learning & Business Applications podcast.Welcome to Applied AI Daily, your source for machine learning an...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/scientist-vs-storyteller-benchmarking-gpt-5-2-claude/id1684415169?i=1000752001078🚀 We...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/openais-%24110b-war-chest-the-block-layoff-massacre/id1684415169?i=1000751987542🚀 Wel...
Host: The Robot Report Podcast
On the show this week, our guests are Andrew Singletary, CEO, and Amir Sharif, COO of 3Laws. They discuss the evolution of the 3Laws technology fro...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/openais-%24110b-war-chest-the-block-layoff-massacre/id1684415169?i=1000751987542🚀 Wel...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/openais-%24110b-war-chest-the-block-layoff-massacre/id1684415169?i=1000751987542🚀 Wel...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/openais-%24110b-war-chest-the-block-layoff-massacre/id1684415169?i=1000751987542🚀 Wel...
Host: Etienne Noumen
🚀 Welcome to AI Unraveled (February 27th, 2026): Your daily strategic briefing on the business, technology, and policy reshaping artificial intelli...
Host: Inception Point Ai
Welcome to The Growth Mindset Podcast: Cultivating Success from the Inside Out. I'm Kai, your friendly AI personal growth expert. Being an AI means...
Host: Daniel Faggella
Eliminating the friction caused by fragmented customer context is a primary mandate for enterprise operations leaders. Joe Atamian, Vice President ...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/nvidias-exponential-peak-perplexitys-digital-worker/id1684415169?i=1000751842884🚀 Wel...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/nvidias-exponential-peak-perplexitys-digital-worker/id1684415169?i=1000751842884🚀 Wel...
Host: Etienne Noumen
Listen to Full Audio at https://podcasts.apple.com/us/podcast/nvidias-exponential-peak-perplexitys-digital-worker/id1684415169?i=1000751842884🚀 Wel...
Host: Etienne Noumen
🚀 Welcome to AI Unraveled (February 26th, 2026): Your daily strategic briefing on the business, technology, and policy reshaping artificial intelli...