VectorInstitute/bias-mitigation-unlearning

Introduction

This repository contains the scripts for the paper Can Machine Unlearning Reduce Social Bias in Language Models? [to appear in the EMNLP 2024 Industry Track].

Running scripts

  1. Install the required packages:
python -m pip install -r requirements.txt
  2. Instructions for running the scripts are available in the respective directory for each method (a conceptual sketch of one unlearning approach is shown below for intuition).
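For intuition only, the sketch below illustrates negation via task vectors, one family of unlearning approaches for suppressing unwanted model behaviour: fine-tune a copy of the model on the text to be unlearned (e.g. biased statements), take the weight difference between the fine-tuned and base models, and subtract a scaled copy of that difference from the base weights. The model names, paths, and scaling coefficient are placeholders rather than this repository's actual configuration; the real scripts live in the method directories.

    # Illustrative sketch: negation via task vectors for bias unlearning.
    # Checkpoints, paths, and the scaling coefficient are placeholders.
    import torch
    from transformers import AutoModelForCausalLM

    BASE_MODEL = "gpt2"                       # placeholder base checkpoint
    BIASED_MODEL = "path/to/biased-finetune"  # placeholder: copy fine-tuned on biased text
    SCALE = 1.0                               # placeholder negation coefficient

    base = AutoModelForCausalLM.from_pretrained(BASE_MODEL)
    biased = AutoModelForCausalLM.from_pretrained(BIASED_MODEL)
    biased_params = dict(biased.named_parameters())

    with torch.no_grad():
        for name, param in base.named_parameters():
            # Task vector: the weight change that encodes the unwanted behaviour.
            task_vector = biased_params[name] - param
            # Negate it: move the base weights away from that behaviour.
            param -= SCALE * task_vector

    base.save_pretrained("debiased-model")

In practice the scaling coefficient is typically tuned on held-out data so that general language-modelling ability is not degraded along with the bias.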

Acknowledgements

This work resulted from a larger collaborative initiative involving the Vector Institute and its industry partners. The authors thank Tahniat Khan, the project manager, for coordinating this project, and Deval Pandya, Vice President of AI Engineering at the Vector Institute, for his valuable support.

The authors would like to acknowledge the leaders at Ernst & Young (EY) for their exceptional support and commitment to advancing artificial intelligence research. Special thanks to Mario Schlener, Managing Partner for Risk Consulting Canada, whose strategic vision exemplifies EY's dedication to fostering innovation and thought leadership in the industry. We also recognize the expert oversight of Yara Elias, Kiranjot Dhillon, and Rasoul Shahsavarifar from AI Risk Canada, whose contributions were integral to the project's success. This partnership not only reflects EY's investment in AI but also lays a foundation for continued research collaboration and progress in the field.

