Category: academic papers

academic papers, AI, Exploits, Global Security News, Physical Security, Security Bloggers Network, threat models, Uncategorized

Regulating AI Behavior with a Hypervisor

April 23, 2025

Interesting research: “Guillotine: Hypervisors for Isolating Malicious AIs.” Abstract:As AI models become more embedded in critical sectors like finance, healthcare, and the military, their inscrutable behavior poses ever-greater risks to society. To mitigate this risk, we propose Guillotine, a hypervisor architecture for sandboxing powerful AI models—models that, by accident or malice, can generate existential threats…

“Emergent Misalignment” in LLMs

February 27, 2025

Interesting research: “Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs“: Abstract: We present a surprising result regarding LLMs and alignment. In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model acts misaligned on a broad range of prompts that are unrelated to coding: it…

The Vanity Press in Academia

January 12, 2025

I’ve never been a regular resident of the ivory halls of academia, but Mich Kabay recently made me aware of an article about legitimate scientific journals driven to distraction by being flooded with commentary apparently reflecting a surge in the use of artificial intelligence rather than legitimate research and analysis. The Science article claims that…