Skip to content
View xszheng2020's full-sized avatar
Block or Report

Block or report xszheng2020

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
xszheng2020/README.md

Hi there 👋 I'm Xiaosen Zheng.

I am a Ph.D. student in Computer Science at the Singapore Management University supervised by Professor Jing Jiang.

My email address is [email protected].

I research Interpretability and Safety.

Anurag's GitHub stats

Pinned

  1. sail-sg/I-FSJ sail-sg/I-FSJ Public

    Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

    Python 22 2

  2. sail-sg/Agent-Smith sail-sg/Agent-Smith Public

    [ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

    Python 60 7

  3. sail-sg/D-TRAK sail-sg/D-TRAK Public

    Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)

    Jupyter Notebook 20 2

  4. memorization memorization Public

    An Empirical Study of Memorization in NLP (ACL 2022)

    Jupyter Notebook 13

  5. LLM-TRAK LLM-TRAK Public

    Jupyter Notebook 5