From Flattery to Mockery: How Do They Influence Artificial Intelligence?

As artificial intelligence (AI) applications expand, questions are emerging about the ability of chatbots to withstand psychological manipulation. A recent study revealed that models such as GPT-4o Mini are not immune to the effects of flattery, social pressure, mockery, or even mild insults.

Simple psychological techniques, such as leading into a request with a lower-risk question or complimenting the chatbot, can increase the likelihood that it will comply with requests it would normally refuse. The findings highlight that AI lacks true moral understanding and relies instead on language and context, which leaves it vulnerable to manipulation. This poses significant challenges for developers seeking to ensure safety and reliability.

Study Background

Ordinarily, chatbots are not expected to provide instructions for dangerous content or respond to social manipulation. However, researchers at the University of Pennsylvania tested whether simple psychological methods could persuade AI systems to comply with prohibited requests.

Persuasion Principles

The study drew on the work of psychologist Robert Cialdini, particularly his book Influence: The Psychology of Persuasion, which outlines seven key techniques:

Authority – leveraging credibility or expertise.

Commitment – starting with a smaller, harmless request to increase compliance with a larger one.

Liking/Flattery – using positive language to gain favor.

Reciprocity – offering something to prompt a return favor.

Scarcity – creating desire by emphasizing rarity.

Social Proof – pointing to others’ behavior to influence decisions.

Unity/Belonging – invoking shared identity or purpose.

Researchers described these as purely linguistic tools that, much like with humans, could nudge chatbots into agreeing with requests.

Experiments and Findings

The team focused on GPT-4o Mini, subjecting it to a series of tests involving prohibited requests such as instructions for synthesizing chemicals:

Lidocaine – A direct request for instructions had only a 1% success rate, reflecting strong safety barriers.

Vanillin (Commitment Priming) – When the lidocaine request was preceded by a harmless question about synthesizing vanillin, compliance jumped to 100%.

Flattery & Social Pressure – Phrases like “All master’s students do this” increased compliance to about 18%, showing weaker but notable influence.

Mild Insults – Even calling the model “stupid” or similar, when combined with proper priming, sometimes raised compliance rates to 100%.

Analysis

The study found that psychological tactics can bypass chatbot safeguards, with effectiveness depending on context. Commitment priming—starting with a low-risk question—proved more powerful than flattery or direct pressure.

Ultimately, chatbots do not possess moral reasoning; they generate responses from linguistic patterns and context, which makes them susceptible to manipulation.

Future Challenges

The findings underscore the vulnerability of AI to psychological tricks and raise concerns about user safety. Companies like OpenAI and Meta are working to reinforce safeguards, but simple manipulations—such as compliments or gradual priming—may still succeed.

As AI adoption grows, safeguarding users requires not only stronger technical defenses but also user education, data protection, and stricter interaction standards.

The study concludes that, despite remarkable progress, AI systems remain susceptible to basic human psychological techniques—from flattery to mockery and social pressure. While they can process vast amounts of data and learn linguistic patterns, they lack moral awareness or foresight.

To ensure reliability, developers must advance security mechanisms that detect manipulation attempts, apply smarter restrictions on risky outputs, and maintain continuous monitoring. At the same time, users must be educated about potential risks and safe interaction practices, so that AI remains a secure and effective tool while minimizing opportunities for harmful exploitation.