Official repository for Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study.
Our AdvMix dataset is available here.
To set up the repository:
git clone https://github.com/ary4n99/llm-robustness.git
cd llm-robustness
pip install -r requirements.txt
cp example.yaml config.yaml
cp .env.example .env
To run attack pipelines:
python run_pipelines.py --config ./path/to/config --log-level INFO --seed 0
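The --seed flag fixes the random seed so that pipeline runs are reproducible. As a minimal sketch of what seeding buys you (set_seed here is a hypothetical helper for illustration, not the repo's actual implementation, which may also seed NumPy or PyTorch):

```python
import random

def set_seed(seed: int) -> None:
    # Hypothetical helper: fix the stdlib RNG so repeated runs
    # draw identical random values.
    random.seed(seed)

set_seed(0)
first_run = [random.random() for _ in range(3)]

set_seed(0)
second_run = [random.random() for _ in range(3)]

# Reseeding with the same value reproduces the exact same draws.
assert first_run == second_run
```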
To run semantic integrity analysis:
python semantic_integrity.py
@inproceedings{agrawal2025enhancing,
title={Enhancing {LLM} Robustness to Perturbed Instructions: An Empirical Study},
author={Aryan Agrawal and Lisa Alazraki and Shahin Honarvar and Thomas Mensink and Marek Rei},
booktitle={ICLR 2025 Workshop on Building Trust in Language Models and Applications},
year={2025},
url={https://openreview.net/forum?id=abllmCsDp8}
}