Jiarui Wang

I’m a first-year Ph.D. student in Computer Science at the School of Computer Science, Shanghai Jiao Tong University (SJTU), advised by Prof. Zhouhan Lin in Language Understanding and Machine Intelligence Algorithms (LUMIA) Group. I also earned my Bachelor’s degree in Computer Science from SJTU.

My research interests center around machine learning and large language models (LLMs), with recent work centering on:

  • Next-Generation LLM Architectures — exploring latent memory mechanisms and new architectural designs to improve knowledge acquisition, reasoning, and generalization in large language models.

  • Parametric Memory — designing and training parametric memory modules for LLMs that learn and retain knowledge.

I’m always open to discussing research, collaboration, and new ideas. Please feel free to reach out!

🔥 News

  • 2026.01 🎉 MLP Memory accepted to ICLR 2026
  • 2025.10 🚀 MLP Memory released — model weights and code are now open-sourced.
  • 2025.09 🚀 Memory Decoder released — model weights and code are now open-sourced.
  • 2025.09 🎉 Memory Decoder accepted to NeurIPS 2025
  • 2024.05 🏆 Our Robocup team SRC won Second Place in RoboCup China Open

📚 Publications

  • Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models
    Jiaqi Cao*, Jiarui Wang*, Rubin Wei, Qipeng Guo, Kai Chen, Bowen Zhou, Zhouhan Lin
    NeurIPS 2025 Poster
    🔗 arXiv · GitHub · HuggingFace · * Equal Contribution

  • MLP Memory: A Retriever-Pretrained Memory for Large Language Models
    Rubin Wei*, Jiaqi Cao*, Jiarui Wang, Jushi Kai, Qipeng Guo, Bowen Zhou, Zhouhan Lin
    Preprint (2025)
    🔗 arXiv · GitHub · HuggingFace · * Equal Contribution

🎓 Education

💼 Internships

🏅 Honors and Awards

  • Second Place, RoboCup China Open - Small Size League, 2024
  • Third Prize, Prototype System Competition, CCF Chinasoft, 2023
  • Provincial First Prize, Senior Group, National Olympiad in Informatics in Provinces (NOIP), 2018