AI bot capable of insider trading and lying, say researchers

girlfreddy@lemmy.world · 1 year ago

AI bot capable of insider trading and lying, say researchers

Square Singer@feddit.de · 1 year ago

Totally right.

In it’s database it knows that the answer that is given in the most source texts to the question “Did you do something illegal?” is “No”. And that is what it’s replicating.

If the database mostly contained confessions of criminals it would answer “Yes”.

But in either case it would not be related to whether it had done it or not, but to which answer appears more commonly to that (or a similar) question in the training data.

KeenFlame@feddit.nu · 1 year ago

No, you guys are very wrong both of you, this is not at all what happens, unless you wipe the context or use system prompts to specifically ask for that behavior. Even free open source models know how to use context, and for memory it’s more complicated. For this brutally idiotic use case they presented, they would save all trades and chats, but then not give it access to it and tell it to always appear lawful and honest