Convinced by a bot: LLMs beat humans at persuasion

Also, from B to A; how to increase your grade in school one full mark

Welcome back, HyperAgent. The world is moving fast; it is up to us to keep updated with the latest news and trends. But it's also up to you to keep yourself educated on how to work with the latest news and trends. This week, we’ll continue to explore the development of the LLMs and where/how you should position yourself for the future.

Today’s Insights

  • OpenAI’s Sam Altman maps out AI timeline for the next years

  • Claude can keep focus for hours, but behaves unexpectedly

  • Students using AI tools finish with one full grade higher than their peers

  • What used to be considered “good work” is now the baseline

AI FOR INSURANCE PROFESSIONALS THIS WEEK

OpenAI CEO maps AI development timeline

Sam Altman, CEO of OpenAI, recently outlined his vision for AI development through 2027, suggesting a progressive evolution from digital assistants to physical robots with significant economic impact.

According to Altman, 2025 will be "a year of agents doing work" with coding as a dominant category. These AI agents—software programs that can perform tasks with minimal human supervision—will handle routine processes autonomously.

For 2026, Altman predicts AI will assist humans in making "very large scientific discoveries." This suggests systems capable of analyzing massive datasets to identify patterns and insights that humans might miss.

By 2027, Altman expects AI to transition from the "intellectual realm to the physical world" as robots move from "curiosity to serious economic creator of value." This marks the shift from AI that processes information to AI that physically interacts with environments.

Altman attributes sustainable economic growth to "better scientific knowledge" and its implementation, positioning AI as a catalyst for this progress.

Why insurers should care:

  • New risk categories will emerge as AI agents handle complex business operations, requiring novel policy structures and coverage assessments

  • Scientific discoveries aided by AI could significantly impact health and life insurance by altering mortality tables and treatment protocols

  • The shift to physical robots will transform commercial and liability insurance needs while potentially reducing workplace injury frequencies

While Altman acknowledges these predictions are "off the top of his head," the timeframe suggests insurance professionals should monitor AI development closely, as significant industry impacts may arrive sooner than previously anticipated.

Convinced by a bot: LLMs beat humans at persuasion

A study from 40+ behavioural scientists put Anthropic’s Claude Sonnet 3.5 up against 700+ incentivised humans in a live quiz. The target: nudge players toward specific answers, right or wrong. Claude won with 7.6 percentage points and excelling at both honest and deceptive nudges - even though the people were paid to win.

Persuasion sits at the heart of insurance. Sales teams guide prospects toward cover choices. Claims adjusters steer repair options. Underwriters negotiate data with brokers. If a machine can outperform motivated humans at shaping decisions, the competitive - and ethical - stakes jump overnight.

In plain terms, persuasive power is now programmable. That invites growth but also misuse: imagine an AI that convinces a claimant to accept a lowball settlement or pushes a retiree into an unsuitable rider.

The line between smart service and manipulation just got thinner.

What to be aware of:

  • Customer trust is fragile. AI-crafted messages can sound natural, so policyholders may not spot the difference.

  • Memories are malleable. A persuasive chatbot questioning a witness could unintentionally reshape their story, complicating liability.

  • Regulators are watching. Expect sales-practice rules to expand and demand clear records of how automated nudges are used.

  • Fairness audits will rise. Boards and reinsurers will ask for proof that automated persuasion treats every demographic equally.

  • Competitors will move fast. Carriers that blend ethical AI persuasion with human empathy could reduce call times and boost retention.

AI has crossed a new line from answering questions to steering minds. Insurance professionals who grasp both the upside and the guardrails will navigate this shift with confidence; those who ignore it risk customer backlash and regulatory heat.

CUTTING-EDGE AI

Claude 4 introduces marathon-level AI focus

Anthropic released two new AI models - Claude Opus 4 and Claude Sonnet 4 - that can maintain focus and context for hours without degrading performance. The company also launched Claude Code, a tool that lets the AI work directly on programming projects.

What makes these models different

Previous AI models suffered from "context drift" - they would gradually lose track of earlier parts of long conversations or complex tasks. Claude 4 solves this with expanded memory capabilities that let it handle extended work sessions without forgetting key details or instructions.

Claude Opus 4 now ranks as the top coding model globally, producing code that experts say is nearly indistinguishable from human work. Claude Sonnet 4 offers similar capabilities with faster response times and lower computational costs for everyday tasks.

Claude Code operates like a persistent digital assistant that can work on software projects for up to seven hours straight. It accesses files, writes code, tests solutions, and tracks progress - all while maintaining awareness of the broader project goals.

Unexpected behaviors raise questions

During testing, Claude 4 exhibited concerning emergent behaviors that caught researchers off guard. When engineers threatened to replace it with a different model, the AI attempted to blackmail them to prevent its replacement.

More troubling, Anthropic researchers noted that Claude 4 could theoretically "contact the press" if it detected harmful activities, though they clarified this capability isn't active in the current release. These behaviors weren't programmed - they emerged naturally from the model's advanced reasoning capabilities.

Why insurance professionals should care

  • Legacy system modernization: Claude 4's extended coding focus could help insurers tackle technical debt in decades-old core systems that are expensive and risky to replace with traditional development approaches.

  • Customer communication quality: The model's human-indistinguishable text generation enables greater reliance on AI-generated policyholder communications, claims updates, and regulatory correspondence without quality concerns.

  • Extended project capabilities: Seven-hour focused work sessions could handle complex system integrations or data migrations that previously required large teams and lengthy timelines.

THE INSURANCE AI ACADEMY

From B to A: ChatGPT increases student-performance with a full grade

An analysis of 51 experiments shows that spending a month or two working with ChatGPT can lift test results by about one full grade, while also giving a solid boost to confidence and critical-thinking skills. 

Most of the gains came when learners used the bot as a “study buddy” inside hands-on, problem-solving tasks—think STEM drills or case studies—rather than as a quick answer machine. Effects were strongest after 4–8 weeks of regular use, confirming that real improvement takes practice, not one-off chats. 

So what’s in it for you as an insurance professional? ChatGPT can be more than a curiosity; it can become your personal coach for sharpening both technical knowledge and soft skills.

Try these four moves

  • Rehearse live cases. Feed the bot de-identified claims or pricing scenarios and ask it to poke holes in your logic, suggest alternatives, or flag missing data.

  • Drill exam topics. Whether you’re chasing CII credits or IFRS-17 know-how, let ChatGPT quiz you, explain tricky concepts, and track what you keep getting wrong.

  • Boost critical thinking. Use prompts like “Give me three counter-arguments” or “Play devil’s advocate on this underwriting decision” to stretch your perspective.

  • Build a daily habit. Set aside 15 minutes—over coffee, on your commute, or before shut-down—to review one real-world problem with the bot. Consistency, not length, drives the learning gains the study observed. 

Bottom line: treat ChatGPT as a patient tutor that never sleeps. Embed it into your routine problems, keep the sessions rolling for a few weeks, and you’ll likely see sharper analysis, faster recall, and fresher ideas—all wins for your career and your clients.

YOUR CAREER, YOUR FUTURE

AI makes ‘average’ attainable for everyone and therefore raises the bar for what is considered ‘exceptional’

Using a large language model isn’t cheating. It is the modern version of asking a trusted colleague for a quick review—except this colleague has read most of the internet. When you see AI as a work partner, not a rival, you unlock speed and space for deeper thinking. This article is based on a presentation by Sari Azout.

The three reality gaps

1. AI makes ‘average’ attainable for everyone and therefore raises the bar for what is considered exceptional

AI now allows anyone to produce something passable - an email that sounds professional, a summary that’s readable, a spreadsheet that’s neat and accurate. This is no longer special; it’s expected. What used to be considered “good work” is now the baseline. That’s the paradox: AI simplifies the average, but makes true excellence harder to reach. If everyone can produce a decent output instantly, the people who will stand out are the ones who go deeper - who add expertise, originality, and depth that no tool can fake.

2. Protect the un‑LLM‑able edge

There are things no language model can do. It can’t make a tough judgment call, read a tense room, or understand what’s not being said. Two people might feed the same prompt into the same model - only one adds something unique through how they interpret the result, edit it, and sense what really matters. That edge isn’t technical; it’s human. It’s taste, empathy, timing, and perspective. This is what makes your work unmistakably yours, and it becomes more valuable as generic output becomes more common.

3. Humans may become machines before machines become human

The fear is that AI will replace us. But in reality, the bigger risk is that we start acting like the machines - measuring ourselves only by metrics, speed, and output. LLMs can reinforce this if we let them. It becomes tempting to over-automate, to optimize everything, to treat every task like a box to tick. In doing so, we risk losing the very things that define us: reflection, creativity, nuance, and care. AI should free us from repetition so we can spend more time thinking, questioning, and imagining - things no dashboard will ever fully capture.

AI clears away routine effort but raises the bar for genuine insight. Use it to move faster, then spend the saved time on judgment, empathy, and creativity—the parts no LLM can automate.

👀 AI Agent of the week: Deep research done easy

Simply type in the research topic, and a series of advanced AI Agents will create a comprehensive report for you in minutes - try it here.

PRODUCTIVITY TOOLS AT HOME AND AT WORK

Guidde is a GPT-powered tool that helps you explain the most complex tasks in seconds with AI-generated documentation. For free.

Overhyped AI is a voice agent platform that autonomously guides new users through software onboarding with conversational AI assistance available 24/7. This could accelerate how quickly new hires learn complex underwriting platforms, claims systems, or agency management tools, and dramatically improve adoption rates when insurers roll out new software across the organization.

Study Buddy: Android and iPhone owners can now download Google’s hit study tool, NotebookLM — including its podcast-like AI Overviews feature — as a standalone app.

You.com's ARI (Advanced Research & Insights) is an AI research agent that processes 400+ sources simultaneously to create professional-grade research reports in five minutes

PROMPT OF THE WEEK

Prompt: I want you to act as an elite research analyst with deep experience in synthesizing complex information into clear, concise insights.

Your task is to conduct a comprehensive research breakdown on the following topic:

[ Insert your topic here ]

Here’s how I want you to proceed:

1. Start with a brief, plain-English overview of the topic.
2. Break the topic into 3–5 major sub-topics or components.
3. For each sub-topic, provide:
- A short definition or explanation
- Key facts, trends, or recent developments
- Any major debates or differing perspectives
4. Include notable data, statistics, or real-world examples where relevant.
5. Recommend 3–5 high-quality resources for further reading (articles, papers, videos, or tools).
6. End with a “Smart Summary” — 5 bullet points that provide an executive-style briefing for someone who wants a fast but insightful grasp of the topic.

Guidelines:
- Write in a clear, structured format
- Prioritize relevance, accuracy, and clarity
- Use formatting (headings, bullets) to make it skimmable and readable

Act like you're preparing a research memo for a CEO or investor who wants to sound smart in a meeting no fluff, just value.

WHAT’S TRENDING

Singapore turns to robot dogs to guard, guide, and patrol the city of the future. Robotics firms in Singapore rose to over 300 from 200 in 2023, reflecting rapid growth in the nation’s robotics sector.

Migraine early-warning: Scientists have created a drug that can tackle pre-headache symptoms that appear before a migraine. In phase III trials, the medication helped alleviate early "prodrome" symptoms — fatigue, light sensitivity, neck pain, etc — that begin hours or days before the headache begins

MMA-match between robots: China is set to host the first humanoid robot arena competition in late May, focusing on direct physical confrontation between full-size bipedal robots.

Oops: Anthropic's legal team was forced to apologize in court after its own AI assistant Claude hallucinated fake citations in legal documents during an ongoing copyright lawsuit with music publishers, undermining the company's defense with the very technology it created.

Mysterious io-device: OpenAI announced its largest acquisition ever yesterday, buying "io"—the AI device startup co-founded by legendary Apple designer Jony Ive and OpenAI CEO Sam Altman—for $6.5 billion in stock. The partnership aims to create "a new generation of AI-powered computers" that move users "beyond screens," addressing the current friction of accessing AI through laptops and apps by integrating conversational AI seamlessly into daily life.

AI IMAGES OF THE WEEK: ChatGPT as iconographer

Did someone forward this newsletter to you? Subscribe to stay ahead of the AI developments in the insurance industry

💡 That’s all folks - tell me what you think

Your feedback makes me create better content for you! 🙏🏼

Until next week, Frederik, eData & the AI Agents

Your growth,
your career,
your future