1 yr. ago • 100%

[HN] PoisonGPT: We hid a lobotomized LLM on Hugging Face to spread fake news

blog.mithrilsecurity.io PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

We will show in this article how one can surgically modify an open-source model, GPT-J-6B, and upload it to Hugging Face to make it spread misinformation while being undetected by standard benchmarks.

[ comments | sourced from HackerNews ]

AI @lemmy.ml PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

Actually Useful AI @programming.dev PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

AI Infosec @infosec.pub PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

0 comments