Amazon Nova Act browses complex tasks across websites

Plus: OpenAI goes open-source.

Welcome Humans 🤖,

Here is what we have today:

  • 🤖 Nova Act browses the web while you sleep.

  • 🧠 UC Berkeley's brain-to-speech tech works in 1 second.

  • 😈 First open model since GPT-2 coming "soon."

  • 🗣️ PlayAI's Dialog model sounds unnervingly human.

  • 💼 New Job Opportunities

Amazon

Amazon just dropped Nova Act, an AI agent that can navigate web browsers like a pro: filling forms, managing calendars, and completing multi-step tasks without human supervision.

Developers also get a powerful SDK to build custom AI agents for web automation.

💡 The details:
✅ Outperforms Claude 3.7 Sonnet and OpenAI's Computer Use Agent in browser task benchmarks.
✅ Automates web actions: filling forms, booking appointments, managing workflows.
✅ Powers Alexa+, Amazon's upcoming upgrade that could bring AI agents to millions.
✅ Built by experts, led by ex-OpenAI researchers David Luan and Pieter Abbeel at Amazon's SF AGI Lab.

⚡ Why it matters:
Amazon isn't top-of-mind for AI, but with millions of Alexa users, it might be the first to bring AI agents to the mainstream. However, with AI agents still prone to errors, Nova Act's real-world reliability could make or break public trust in autonomous AI.

AI Research

Researchers at UC Berkeley and UCSF have developed an AI that transforms brain signals into speech with just a one-second delay, a massive leap for brain-computer interfaces.

The Breakthrough:

  • The AI deciphers signals from the motor cortex and converts intended speech into words in near real time, a huge jump from the 8-second delay of earlier systems.

  • It reconstructs speech using the patient's own pre-injury voice, making communication sound natural and personal.

  • Unlike past models, it generates words outside its training data, which suggests it is learning real speech patterns rather than just memorizing phrases.

  • It works across multiple brain-sensing technologies, making it widely adaptable.

Why It Matters:

For those who have lost their voice due to ALS, stroke, or paralysis, this tech is life-changing. With near-instant responses, speech restoration is no longer just an experiment; it's a reality.

Trending Today
  • OpenAI has finally announced that it plans to release "a powerful new open-weight language model," its first since GPT-2, with reasoning capabilities similar to its o3-mini reasoning model, in "the coming months," after dropping hints for months.

  • Runway just introduced Gen-4, a new AI model that brings increased consistency and control to video generations ā€“ with enhancements designed to be incorporated into professional cinematic workflows.

  • Google's Gemini 2.5 Pro Exp. scored a 130 on Mensa Norway's IQ test, the highest of any model and well above the average human score of 100.

  • PlayAI is teaming up with AI hardware company Groq to make its human-sounding AI voice model, Dialog, faster without sacrificing audio quality. Groq's speedy infrastructure helps the model generate audio 15 times faster than real-time.

Recommended Reading
If we had to recommend other newsletters

AI Tool Report is a newsletter we enjoy reading daily. It delivers top techniques on how AI can transform your business, filled with practical tips and real-world examples.

A Smart Bear is for people who love thinking about strategy, startups, product, marketing, decision-making, and founder psychology.

Job Opportunities
  • DeepMind - Staff Robotics Systems Safety Engineer

  • Mistral AI - Product Lead, Prosumer

  • Shield AI - Senior Engineer, Systems Test

  • DeepMind - Senior Software Engineer, Gemini apps

  • Grammarly - Senior Product Design Manager, Growth

Until next time, stay informed!
