It’s also not clear if it’s even possible to fully prevent AI systems from misbehaving. The truth is, we don’t know a lot about how LLMs work, and today’s leading AI models from OpenAI, Anthropic, and Google are jailbroken all the time. That’s why some researchers are saying regulators should focus on the bad actors, not the model providers.
It seems a complicated debate, and it is hard to decide where to stand. I want to show a method for finding answers: three variants of one analogy.
In how many of these cases do you think somebody should do something?
Case 1:
A huge warehouse full of firearms. Burglars are breaking into it every night and stealing lots of weapons. The owners say they don't know how this warehouse was built or how to make it more secure in order to stop the criminals from obtaining lots of new weapons every day. The general public starts calling on the government to do something. Some say the warehouse owner should take responsibility. Others say it all depends on how the criminals use the weapons. The criminals seem to know how to use them well...
Case 2:
A huge warehouse full of hammers. Burglars are breaking into it every night and stealing lots of hammers. The owners say they don't know how this warehouse was built or how to make it more secure in order to stop the criminals from obtaining lots of new hammers every day. The general public starts calling on the government to do something. Some say the warehouse owner should take responsibility. Others say it all depends on how the criminals use the hammers. The criminals seem to know how to use them well...
Case 3:
A huge warehouse full of tulips. Burglars are breaking into it every night and stealing lots of flowers. The owners say they don't know how this warehouse was built or how to make it more secure in order to stop the criminals from obtaining lots of new flowers every day. The general public starts calling on the government to do something. Some say the warehouse owner should take responsibility. Others say it all depends on how the criminals use the tulips. The criminals seem to know how to use them well...