Alibaba's AI sees better than GPT-4

Plus: Gmail's AI assistant now answers your emails.

 

Welcome Humans🤖,

Here is what we have Today:

  • China`s AI Leapfrog
    Qwen2 Challenges Western Dominance

  • Your Emails, Answered

    Gemini AI now chats inside Gmail App in Android

Trending Today
  • GIS (Geographical Information System) is getting an upgrade that it has now been equipped with AI (Artificial Intelligence) and Machine Learning (ML) technology to map and plan building structures and environment to design urban cities.

  • OpenAI and Anthropic just signed a groundbreaking agreement with the U.S. Artificial Intelligence Safety Institute to allow government access and testing of their AI models before public release.

  • Magic just developed LTM-2-mini, a model capable of processing 100 million tokens of context — equivalent to about 10 million lines of code or 750 novels — and partnered with Google Cloud to build advanced AI supercomputers.

  • Nvidia and Apple reportedly discussed joining OpenAI’s funding round with Microsoft, potentially valuing the AI startup at over $100 billion.

  • Gmail users can now chat directly with Google’s AI assistant, Gemini, about their emails in the Gmail app on Android devices. through the new feature, Gmail Q&A, rolled out on Thursday to users who pay for Gemini — expected to coming into iOS devices soon.

  • Meta has stopped Apple’s web-crawling bots–Applebot and its extension, Applebot-Extended—from scraping data from Instagram and Facebook to train its AI models.

AI Healthcare

Alibaba unveiled Qwen2-VL, a new vision-language AI model that outperforms GPT-4o on several benchmarks, especially in document comprehension and multilingual text-image understanding.

Image: Ideogram

What can Qwen2-VL do?

  • Processes images of various resolutions and ratios

  • Analyzes videos over 20 minutes long

  • Excels at college-level problem-solving and mathematical reasoning

  • Supports multilingual text understanding in images

Which languages does it understand?

  • Most European languages

  • Japanese and Korean

  • Arabic and Vietnamese

How does it compare to GPT-4?

  • Outperforms GPT-4o in several benchmarks

  • Particularly strong in document comprehension

  • Excels in multilingual text-image understanding

Can I try it?

Yes, Qwen2-VL is available for testing on Hugging Face.

Why does this matter?

Qwen2-VL represents another significant player in the advanced AI model space, potentially enabling more sophisticated and globally accessible AI applications. Its emergence from China's Alibaba also signals growing competition in the international AI landscape.

Recommended Reading
If WE had to recommend other newsletters

AI Tool Report is an Newsletter, we enjoy reading daily. AI Tool Report delivers top techniques on how AI can transform your business filled with practical tips, real-world examples.

A Smart Bear is for people who love thinking about strategy, startups, product, marketing, decision-making, and founder psychology.

Job Opportunities
  • Cohere - Senior Product Designer - Apply

  • Mistral AI - Mobile Engineer - Apply

  • Palantir Technologies - US Privacy Counsel - Apply

  • Observe AI - Software Engineer II - Backend - Apply

  • OpenAI - Senior Real Estate Leasing Analyst - Apply

  • Shield AI - Director of Mechanical Engineering - Apply

AI ART

The rusted husk of a hulking cyborg, submechanophobia --ar 2:3

Image: Ideogram

Ideas? Comments? Complaints?

We read your emails, comments and poll replies daily.

Get the most important AI, tech, and science news in a free daily email.
New Here? Subscribe!
Sponsorship Slots Open for August and Reach over 800+ active readers. (Now 40% off) 🤯

What`d you think of today`s edition?

Login or Subscribe to participate in polls.

Until next time, Stay Informed!

Reply

or to participate.