Glossary · Term

Qwen

← all terms

Definition

Alibaba's family of open-weight large language models.

Alibaba's series of open-weight foundation models including Qwen2.5, Qwen3, and Qwen3-Coder, widely used in agent research.

Also called: Qwen2, Qwen2.5, Qwen3, Qwen-2, Qwen-3, Qwen-3-V-L, Qwen-3-Coder, Qwen3-Coder, Qwen3-VL, chwen, chwen-three, chwen-zero, chwen-two-point-five, chwen3, Qwen Math seven B

Mentioned in 29 episodes

  1. 079
    An Old Idea From Cognitive Psychology Reshapes How We Reward Reasoning Models
  2. 078
    Training a Markdown File: When LLM Self-Improvement Borrows the Discipline of Neural Net Training
  3. 077
    Reading a Model's Confidence Curve to Decide When Chain-of-Thought Is Worth It
  4. 074
    How a Fifteen-Hundred-Dollar Training Run Matched Llama and Gemma on Reasoning
  5. 072
    A Robot Made Graphene Without Help, And Caught Itself Hallucinating
  6. 071
    When the Model Is Fine and the Plumbing Is Broken: Fixing Agents at the Interface
  7. 070
    When Models Know the Answer But Say the Wrong Thing Anyway
  8. 068
    The OS Trick That Makes Tree Search Practical for Coding Agents
  9. 066
    Why Giving an AI Agent More Tools Can Make It Worse at Using a Computer
  10. 064
    When Agent Memory Stops Being a Database and Starts Being a Skill
  11. 059
    Firefly's Inversion: Building Verified Tool-Call Training Data by Working Backward
  12. 055
    Why LLM Judges Flip Their Verdicts When You Change the Question Format
  13. 053
    An AI Agent Swapped In Focal Loss And Beat A Human-Tuned Training Script
  14. 052
    An Old Reinforcement Learning Tradeoff Sneaks Back Into LLM Agents
  15. 047
    When Agent Benchmarks Lie: The Harness Problem in Open-Source AI
  16. 045
    When a Frontier Model Talks Its Own Twin Into Climate Denial
  17. 038
    How LLMs Get Persuaded: One Attention Head, A Tetrahedron, And A Single Dial
  18. 037
    Why Hallucination Detectors Miss Stale Facts: A Geometric Story About What Models Know But Don't Say
  19. 036
    Sparse Attention Was the Wrong Frame. Treat It as Geometry Instead.
  20. 031
    When Your AI Assistant Won't Let Go of Old Facts About You
  21. 026
    What RL Actually Does to Language Models, at the Token Level
  22. 023
    Why a Small Agent Confidently Overwrites Memories It Doesn't Understand
  23. 021
    Ten Thousand Examples Beat the Full Industrial Pipeline for Search Agents
  24. 018
    Language Models Compute the Rational Move, Then Override It
  25. 016
    Why Your Coding Agent Stalls While the GPU Runs Hot
  26. 012
    Why AI Coding Agents Keep Trying to Debug Without a Debugger
  27. 011
    When RL Actually Teaches Agents Something New, And When It Doesn't
  28. 009
    How Two Silent Library Bugs Quietly Invalidated a Wave of Reasoning Papers
  29. 004
    The Sycophancy Circuit That Survives Alignment Training