Preethi Lahoti

Preethi Lahoti

Research Areas

Authored Publications

Automated Adversarial Discovery for Safety Classifiers

Anu Sinha

Preethi Lahoti

Yash Kumar Lal

Ananth Balashankar

Yao Qin

2024

AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications

Bhaktipriya Radharapu

Kevin Robinson

Lora Aroyo

Preethi Lahoti

The 2023 Conference on Empirical Methods in Natural Language Processing (2023) (to appear)

Fairness without Demographics through Adversarially Reweighted Learning

Preethi Lahoti

Alex Beutel

Jilin Chen

Kang Lee

Flavien Prost

Nithum Thain

Xuezhi Wang

Ed H. Chi

Advances in Neural Information Processing Systems 33 (2020)

Search on Google Scholar