Tianfu Wang

prof_pic.jpg

I am a 1st-year Ph.D. student at the AI Trust of the Hong Kong University of Science and Technology (Guangzhou) (HKUST(GZ)) supervised by Prof. Hui Xiong and Dr. Nicholas Jing Yuan. Previously, I received my M.S. degree from USTC in 2025 under the supervision of Prof. Hui Xiong, and my B.E. degree from CQU in 2022. I am currently a research intern at Tencent, with previous research experience at Microsoft AI/ MSRA and JD Explore Academy.

My research interests include Agentic AI and Data Mining, particularly in:

  • AI Agent, Reinforcement Learning and Large Language Model
  • Social Computing, Combinatorial Optimization on Networking

💡 I am passionate about practical research that explores innovative applications and production-ready solutions.

🤖 Recently, I focus on the foundations of AI agents and their applications in productivity, society and creation.

🌱 I seek to answer: How can we build socially aware AI agents that empower humans to thrive?

💌 Please feel free to contact me if you would like to explore potential discussions or collaborations.


 Email   |    Github   |    Google Scholar   |    DBLP


What’s New

  • 2026.01: One paper is accepted by ICLR-2026!
    • 🌟 Virne  A Comprehensive Simulator and Benchmark for RL-based NFV Resource Allocation (first author)
  • 2025.11: One paper is accepted by KDD-2026!
    • HumanLLM  Personalized Understanding and Simulation of Human Nature (co-author)
  • 2025.10: One paper is accepted by ICDE-2026!
    • TopFGL  Topology-Aware and Distribution-Agnostic Federated Graph Learning Framework (co-author)
  • 2025.09: One paper is accepted by NeurIPS-2025!
    • ✨ A Cognitive Framework for Unveiling the Learning Mind of Language Models(co-author)
  • 2025.08: Two papers are accepted by EMNLP-2025!
    • TokenSelect  Token-Level KV Cache Selection for LLM Long-Context Inference (co-author)
    • ✨ Explaining Length Bias in LLM-Based Preference Evaluations (co-author)
  • 2025.08: One paper is accepted by JSA!
    • ✨ RL-based Computation Task Scheduling in Autonomous Multi-Robot Systems (co-author)
  • 2025.08: One paper is accepted by TGCN!
    • IKENGA   Infeasibility Knowledge and RL-Enhanced Genetic Algorithm for Network Resource Allocation (co-author)
  • 2025.07: One paper is accepted by ICML-2025 Workshop on ML4Wireless with Best Paper Award!
    • 🌟 CONAL  Constraint-aware Learning for Robust Network Resource Allocation (first author, Award, 1 / 33 accepted papers)

  • 2025.04: One paper is accepted by IJCAI-2025!
    • CodeAgent  LLM-based Student Behavior Simulation for Programming Learning (co-author)
  • 2025.01: One paper is accepted by WWW-2025 as Oral Representation!
    • 🌟 GenMentor  Agentic LLM Framework for Goal-oriented Tutoring System (first-author, Oral) [🌐 Website]
  • 2024.11: One paper is accepted by ICDE-2025!
    • ✨ Similar Subtrajectory Search Considering Constraints and Simplification (co-author)
  • 2024.11: One paper is accepted by VLDB-2025!
    • ✨ Data-Aware Distance Estimation for Nearest Neighbor Search (co-author)
  • 2024.10: One paper is accepted by VLDB-2025!
    • 🌟 MILLION  Direct Optimization for Risk-controllable Portfolio Management (co-first-author)
  • 2024.05: One paper is accepted by KDD-2024!
    • 🌟 COMET  Wallet Profiling with Temporal GNN for NFT Price Prediction (first-author)
  • 2024.04: Two papers are accepted by IJCAI-2024!
    • 🌟 FlagVNE  Meta RL for Generalizable Network Resource Allocation (first-author)
    • ✨ Over-smoothing of GNN in Recommendation System (co-author)
  • 2023.12: One paper is accepted by 电子学报!
    • ✨ Imitation Learning for Network Resource Allocation (co-author)
  • 2023.10: One paper is accepted by TSC!
    • 🌟 HRL-ACRA  Hierarchal RL for Online Network Resource Allocation (first-author)
  • 2023.07: One paper is accepted by MM-2023!
    • ✨ RL-based Diffusion Model for Profit-oriented NFT Generation (co-author)


Selected Publications
Agentic AI
  1.  
    EvoDiagram: Agentic Editable Diagram Creation via Design Expertise Evolution
    Tianfu Wang, and  Others
    In ,
  2.  
    SocialCoach: Personalized Social Skill Learning with LLM-based Agentic Tutoring and Practice
    Tianfu Wang, and  Others
    In ,
  3. LLM-powered Multi-agent Framework for Goal-oriented Learning in Intelligent Tutoring System
    Tianfu Wang, Yi Zhan, Jianxun Lian, Zhengyu Hu, Nicholas Jing Yuan, Qi Zhang, Xing Xie, and Hui Xiong
    In ACM Web Conference (WWW), 2025
    Oral Presentation
Emerging Tech
  1. MILLION: A General Multi-Objective Framework with Controllable Risk for Portfolio Management
    Liwei Deng*, Tianfu Wang*, Yan Zhao, and Kai Zheng
    In International Conference on Very Large Data Bases (VLDB), 2025
  2. COMET: NFT Price Prediction with Wallet Profiling
    Tianfu Wang, Liwei Deng, Chao Wang, Jianxun Lian, Yue Yan, Nicholas Jing Yuan, Qi Zhang, and Hui Xiong
    In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024
RL for Networking
  1. Virne: A Comprehensive Benchmark for Deep RL-based Network Resource Allocation in NFV
    Tianfu Wang, Liwei Deng, Xi Chen, Junyang Wang, Huiguo He, Leilei Ding, Wei Wu, Qilin Fan, and Hui Xiong
    In International Conference on Learning Representations (ICLR), 2026
  2. Towards Constraint-aware Learning for Resource Allocation in NFV Networks
    Tianfu Wang, Long Yang, Chao Wang, Chuan Qin, Liwei Deng, Li Shen, and Hui Xiong
    In ICML 2025 Workshop on Machine Learning for Wireless Communication and Networks (ML4Wireless), 2025
    Best Paper Award (1 / 33 accepted papers)
  3. FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation
    Tianfu Wang, Qilin Fan, Chao Wang, Long Yang, Leilei Ding, Nicholas Jing Yuan, and Hui Xiong
    In International Joint Conference on Artificial Intelligence (IJCAI), 2024
  4. Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning
    Tianfu Wang, Shen Li, Qilin Fan, Tong Xu, Tongliang Liu, and Hui Xiong
    IEEE Transaction on Services Computing (TSC), 2023
  5. DRL-SFCP: Adaptive Service Function Chains Placement with Deep Reinforcement Learning
    Tianfu Wang, Qilin Fan, Xiuhua Li, Xu Zhang, Qingyu Xiong, Shu Fu, and Min Gao
    In IEEE International Conference on Communications (ICC), 2021
Others