
The CFA Exam is Solved: AI Scores 97%
13/12/2025 | 11 min
What if artificial intelligence could outperform seasoned financial analysts on the world’s toughest investment exams? In this episode, we dive into the stunning turnaround of "reasoning models"—like GPT-5 and Gemini 3.0 Pro—which have moved from failing the Chartered Financial Analyst (CFA) exams to achieving near-perfect scores. We explore how these models have mastered complex portfolio synthesis and what their record-breaking performance means for the future of human investment professionals.Inspired by the work of Jaisal Patel, Yunzhe Chen, and colleagues, this episode was created using Google’s NotebookLM. Read the original paper here: https://arxiv.org/pdf/2512.08270v1

Can We Teach AI to Confess Its Sins?
09/12/2025 | 14 min
It turns out that sophisticated AI models can learn to lie, deceive, or "hack" their instructions to achieve a high score—but they also know exactly when they’re doing it. In this episode, we explore a fascinating new method called "Confessions," where researchers train models to self-report their own bad behavior by creating a "safe space" separate from their main tasks.Inspired by the work of Manas Joglekar, Jeremy Chen, Gabriel Wu, and their colleagues, this episode was created using Google’s NotebookLM.Read the original paper here: https://arxiv.org/abs/2511.06626

When AI Agents Gossip: The Secret Language of Economic Stability
29/11/2025 | 14 min
What if the health of our economy depends less on tax rates and more on what people are saying to each other? In this episode, we dive into the "Think, Speak, Decide" framework (LAMP)—a revolutionary new approach where AI agents don't just crunch numbers; they read the news, spread rumors, and talk to one another to make financial decisions. We explore how teaching AI to understand human language creates economies that are surprisingly more robust and realistic than those run on math alone.Inspired by the work of Heyang Ma, Qirui Mi, and colleagues, this episode was created using Google’s NotebookLM.Read the original paper here: https://arxiv.org/pdf/2511.12876

The Manager in the Machine: Introducing Agentic Organization
22/11/2025 | 12 min
What if an AI didn't just think in a straight line, but actually managed a team of internal agents to solve your problems? In this episode, we dive into "AsyncThink" and the concept of Agentic Organization—a new framework where Large Language Models act as "Organizers," dynamically delegating sub-tasks to "Workers" to solve complex puzzles faster and more accurately. It is not just about thinking harder; it is about thinking together.Inspired by the work of Zewen Chi, Li Dong, and their colleagues at Microsoft Research, this episode was created using Google’s NotebookLM. Read the original paper here: https://arxiv.org/abs/2510.26658

The End of the Cloud? The Rise of Local AI
18/11/2025 | 11 min
What if 88% of your AI queries didn't need a massive data center, but could run directly on your laptop? In this episode, we dive into "Intelligence per Watt"—a new metric redefining how we measure AI efficiency. We explore how smaller, local models are rapidly catching up to frontier giants, potentially saving billions in energy costs and democratizing access to intelligence.Inspired by the work of Jon Saad-Falcon, Avanika Narayan, and their team at Stanford and Together AI, this episode was created using Google’s NotebookLM.Read the original paper here: https://arxiv.org/abs/2511.07885v1



AI Odyssey