Qi Sun

PhD Student, Science Tokyo · Founding Team, Sakana AI

prof_pic.jpg

Me on Mount Fuji!

Hi there! I’m Qi (pron. like “chee”), a PhD student at Science Tokyo, advised by Prof. Rio Yokota, and I’m also a researcher at Sakana AI, where I work very closely with Yujin Tang.

My research focuses on how to shape neural networks (mostly LLMs) into more complex and capable forms, toward the broader goal of better machine intelligence, built from collaborative and adaptive systems at scale.

Some highlights:

  • I’m a core builder of Fugu, Sakana AI’s multi-agent system product.
  • Introduced Transformer², a framework for self-adaptive LLMs (1.2k+ GitHub 🌟).
  • Revealed a ‘painter-like’ structure in transformer layers

Feel free to reach out about anything by lfsm.martin[AT]gmail.com. One last cool fact: my bench press PR is 140 kg!

selected publications

  1. arXiv
    ecs.png
    Evolutionary Context Search for Automated Skill Acquisition
    Qi Sun, Stefan Nielsen, Rio Yokota, and 1 more author
    arXiv preprint arXiv:2602.16113, 2026
  2. ICLR
    trinity_cover.png
    TRINITY: An Evolved LLM Coordinator
    Jinglue Xu*, Qi Sun*, Peter Schwendeman, and 3 more authors
    In International Conference on Learning Representations (ICLR), 2026
  3. Nature MI
    evo-merge.png
    Evolutionary Optimization of Model Merging Recipes
    Takuya Akiba, Makoto Shing, Yujin Tang, and 2 more authors
    Nature Machine Intelligence, 2025
  4. ICLR
    tf-2.gif
    Transformer-Squared: Self-adaptive LLMs
    Qi Sun*, Edoardo Cetin*, and Yujin Tang*
    In International Conference on Learning Representations (ICLR), 2025
  5. AAAI
    painter.png
    Transformer Layers as Painters
    Qi Sun*, Marc Pickett*, Aakash Kumar Nain, and 1 more author
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025
  6. arXiv
    aicuda.png
    Towards Robust Agentic CUDA Kernel Benchmarking, Verification, and Optimization
    Robert Tjarko Lange, Qi Sun, Aaditya Prasad, and 3 more authors
    arXiv preprint arXiv:2509.14279, 2025