Folgen
Yangsibo Huang
Yangsibo Huang
Bestätigte E-Mail-Adresse bei google.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Evaluating Gradient Inversion Attacks and Defenses in Federated Learning
Y Huang, S Gupta, Z Song, K Li, S Arora
NeurIPS, 2021
3372021
Catastrophic Jailbreak of Open-Source LLMs via Exploiting Generation
Y Huang, S Gupta, M Xia, K Li, D Chen
ICLR, 2024
2782024
Detecting pretraining data from large language models
W Shi, A Ajith, M Xia, Y Huang, D Liu, T Blevins, D Chen, L Zettlemoyer
ICLR, 2024
2682024
Deep Q learning driven CT pancreas segmentation with geometry-aware U-Net
Y Man*, Y Huang*, J Feng, X Li, F Wu
IEEE Transactions on Medical Imaging, 2019
1782019
Instahide: Instance-hiding schemes for private distributed learning
Y Huang, Z Song, K Li, S Arora
ICML, 2020
1772020
Recovering Private Text in Federated Learning of Language Models
S Gupta*, Y Huang*, Z Zhong, T Gao, K Li, D Chen
NeurIPS, 2022
1042022
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
B Wei*, K Huang*, Y Huang*, T Xie, X Qi, M Xia, P Mittal, M Wang, ...
ICML, 2024
902024
Advancing differential privacy: Where we are now and future directions for real-world deployment
R Cummings, D Desfontaines, D Evans, R Geambasu, Y Huang, ...
Harvard Data Science Review, 2024
73*2024
TextHide: Tackling Data Privacy in Language Understanding Tasks
Y Huang, Z Song, D Chen, K Li, S Arora
EMNLP, 2020
642020
SORRY-bench: Systematically evaluating large language model safety refusal behaviors
T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ...
ICLR, 2025
53*2025
DeepMC: a deep learning method for efficient Monte Carlo beamlet dose calculation by predictive denoising in magnetic resonance-guided radiotherapy
R Neph, Q Lyu, Y Huang, YM Yang, K Sheng
Physics in Medicine & Biology 66 (3), 035022, 2021
47*2021
MUSE: Machine Unlearning Six-way Evaluation for Language Models
W Shi, J Lee, Y Huang, S Malladi, J Zhao, A Holtzman, D Liu, ...
ICLR, 2025
452025
Privacy Implications of Retrieval-Based Language Models
Y Huang, S Gupta, Z Zhong, K Li, D Chen
EMNLP, 2023
412023
A Safe Harbor for AI Evaluation and Red Teaming
S Longpre, S Kapoor, K Klyman, A Ramaswami, R Bommasani, ...
ICML, 2024
40*2024
Privacy-Preserving Learning via Deep Net Pruning
Y Huang, Y Su, S Ravi, Z Song, S Arora, K Li
arXiv preprint arXiv:2003.01876, 2020
30*2020
A Dataset Auditing Method for Collaboratively Trained Machine Learning Models
Y Huang, CY Huang, X Li, K Li
IEEE Transactions on Medical Imaging, 2022
28*2022
An adversarial perspective on machine unlearning for ai safety
J Łucki, B Wei, Y Huang, P Henderson, F Tramèr, J Rando
arXiv preprint arXiv:2409.18025, 2024
26*2024
Evaluating Copyright Takedown Methods for Language Models
B Wei, W Shi, Y Huang, NA Smith, C Zhang, L Zettlemoyer, K Li, ...
NeurIPS, 2024
202024
NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Y Huang, D Liu, Z Zhong, W Shi, YT Lee
arXiv preprint arXiv:2302.10879, 2023
162023
Ai risk management should incorporate both safety and security
X Qi, Y Huang, Y Zeng, E Debenedetti, J Geiping, L He, K Huang, ...
arXiv preprint arXiv:2405.19524, 2024
152024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20