NLP

LLM attacks

Universal and Transferable Adversarial LLM Attacks

Foundational large language models such as GPT-4, GPT-3.5, LLaMA and more are trained on huge corpus of text from the internet, which contains a large amount of offensive information. Therefore, these foundational large language models are capable of generating a great deal of objectionable content. For this reason, there are a lot of efforts to […]

Universal and Transferable Adversarial LLM Attacks Read More »

Scroll to Top