Curriculum Vitae

Education

2024-Present

2020-2024

Research experience

Postgraduate(2024-Present):

LLM-Based Intelligent Companion AI Digital Humans with Consciousness and Memory

Project summary: In the digital age, addressing challenges such as prompt injection attacks, data leakage, low interpretability, and insufficient emotional engagement faced by LLM-based companion agents in fields like education and healthcare, we constructed a comprehensive framework to enhance security protection, improve reasoning capabilities, quantify uncertainty, and customize emotional support. The goal is to create a safe, reliable, and empathetic AI assistant. Responsible for the post-training and interpretability modules of the LLM.

Outcomes

Backdoor Attack Defense for Concept Bottleneck Models

Project Summary: In explainable AI (XAI), Concept Bottleneck Models (CBMs) enhance interpretability via understandable underlying concepts, but are vulnerable to concept-level backdoor attacks (hidden triggers in concepts causing undetectable misbehavior). First proposed Conceptguard defense: constructed poisoned datasets, divided data subsets, and used majority voting to mitigate data-driven backdoor impacts.

Key Contributions:

Publications

Awards