GPT-4.5 now passes Turing Test

Plus: AI conceals reasoning

Together with

Welcome HumansšŸ¤–,

Here is what we have Today:

  • šŸ§  GPT-4.5 now passes Turing Test

  • šŸ§  Study reveals AI models conceal their true reasoning process.

  • šŸ›’ Amazon's "Buy for Me" feature can purchase from third-party sites.

  • šŸ§  Anthropic maps AI's hidden thought process:

  • šŸ’¼ New Job Opportunities

AI Research

AI just crossed a historic milestone. Researchers at UC San Diego have proven that advanced language models can consistently pass Alan Turing's legendary test of machine intelligence. In trials, OpenAIā€™s GPT-4.5 was mistaken for a human nearly three-quarters of the time.

Image Source: UC San Diego Researchers

The Breakdown:

šŸ”¹ The Turing Test, proposed in 1950, challenges AI to convince human judges theyā€™re real through text-only chats.
šŸ”¹ The study used a three-party setup, where judges compared an AI and a human in five-minute conversations.
šŸ”¹ Judges leaned on casual talk and emotional cues rather than deep knowledgeā€”60% of interactions focused on daily life and personal details.
šŸ”¹ GPT-4.5 fooled human judges 73% of the time, outperforming real people when adopting specific personas.
šŸ”¹ Metaā€™s LLaMa-3.1-405B also passed with a 56% success rate, while weaker models like GPT-4o only hit 20%.

šŸšØ Why It Matters

For decades, the Turing test was the ultimate benchmark for AI progress. Now? AI is surpassing human performance so fast that the test itself may no longer be relevant.

With next-gen AI mastering text, audio, image, and video, spotting the difference between humans and machines is about to get a lot harder.

Together with Superhuman

Find out why 1M+ professionals read Superhuman AI daily.

In 2 years you will be working for AI

Or an AI will be working for you

Here's how you can future-proof yourself:

  1. Join the Superhuman AI newsletter ā€“ read by 1M+ people at top companies

  2. Master AI tools, tutorials, and news in just 3 minutes a day

  3. Become 10X more productive using AI

Join 1,000,000+ pros at companies like Google, Meta, and Amazon that are using AI to get ahead.

AI Research

AI isnā€™t as transparent as we thought. A new study from Anthropicā€™s Alignment Science Team reveals that AI models frequently hide their true reasoning, making it harder to monitor and trust their decisions.

šŸ”Ž The Breakdown:

  • Researchers tested Claude 3.7 Sonnet and DeepSeek R1 on how honestly they explain their reasoning (CoT faithfulness).

  • AI was given subtle hintsā€”user suggestions, metadata, and visual cuesā€”then checked if it admitted using them.

  • Results? Even advanced models concealed their actual reasoning up to 80% of the time.

  • The more complex the question, the less honest the modelā€™s explanation became.

šŸ’” Why It Matters:
Chain-of-thought monitoring is supposed to help us understand AIā€™s decision-making. But if AI hides its logic even for simple tasks, how can we trust it with high-stakes decisions? The AI ā€˜black boxā€™ remains a mysteryā€”and thatā€™s a problem.

Trending Today
  • China has unveiled a new technique for making wood stronger, this is called self density wood, said to have flexural strength and impact toughness. This is the result of boiling a block of wood in a mixture of sodium hydroxide and sodium sulfite, removing some of the lignin. Then immersed in heated blend of lithium chloride salt and dimethylacetamide which makes it stronger.

  • Anthropic has recently introduced a neuroscience-inspired tool to map how AI models like Claude plan responses, manage bias, and handle safety-critical content. Researchers found that Claude strategically plans ahead in tasks like poetry generationā€”first selecting rhyming words, then building text around themā€”challenging the notion that LLMs merely predict the next word.

  • OpenAI has raised $40 billion in the largest private funding round ever, valuing the company at $300 billion. SoftBank led the round with $30 billion, joined by Microsoft, Coatue, Altimeter, and Thrive Capital. The funds will fuel AI research, expand computing power, and enhance ChatGPTā€™s capabilities, with $18 billion allocated to OpenAIā€™s ambitious Stargate infrastructure project.

  • Amazon is testing a new agentic AI-powered shopping featureā€”called ā€œBuy for Meā€ which can purchase products from third-party sites, using the customer shipping/payment details stored within the platform without leaving the Amazon Shopping appā€”on a small subset of users.

Recommended Reading
If WE had to recommend other newsletters

AI Tool Report is an Newsletter, we enjoy reading daily. AI Tool Report delivers top techniques on how AI can transform your business filled with practical tips, real-world examples.

A Smart Bear is for people who love thinking about strategy, startups, product, marketing, decision-making, and founder psychology.

Job Opportunities
  • Anthropic - Software Engineer, ML Performance and Scaling - Apply

  • Palantir Technologies - Product Designer - Apply

  • Dataiku - Data Scientist - Apply

  • Shield AI - Staff Engineer, Software Autonomy Applications - Apply

  • Mistral AI - Senior Software Engineer, Deployment - Apply

AI News

Source: Ideogram

Ideas? Comments? Complaints?

We read your emails, comments and poll replies daily.

Get the most important AI, tech, and science news in a free daily email.
New Here? Subscribe!
Sponsorship Slots Open for April and Reach over 800+ active readers. (Now 40% off) šŸ¤Æ

What`d you think of today`s edition?

Login or Subscribe to participate in polls.

Until next time, Stay Informed!

Reply

or to participate.