Gemini-exp-1206 tops Chatbot Arena

Plus: Meta's budget Llama

Welcome Humans🤖,

Here is what we have today:

  • 🧠 Meta's Llama 3.3: Speed meets smarts.

  • 🏆 Google DeepMind's Gemini claims Chatbot Arena crown

  • 🕵️‍♀️ AI tries to escape? OpenAI's wild test results

  • 👻 xAI's phantom image generator vanishes. What happened?

  • 💼 New Job Opportunities

Meta

Meta has just released Llama 3.3, a new 70 billion parameter open text model that performs similarly to the previous 405 billion parameter Llama 3.1 model. But the kicker? It's significantly faster and much more cost-effective.

Image Source: Ideogram

What are the details?

Llama 3.3 has a 128,000 token context window and outperforms competitors like GPT-4o, Gemini 1.5 Pro, and Amazon's Nova Pro on several benchmarks. And get this: it's 10 times cheaper than the 405B version, costing just $0.10 per million input tokens and $0.40 per million output tokens. That's nearly 25 times cheaper than GPT-4o!
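Curious what those rates mean for a single request? Here is a minimal sketch that plugs in the two prices quoted above. The function name and the token counts in the example are made up for illustration, and actual pricing will vary by provider.

```python
# Prices quoted above for Llama 3.3 70B, in USD per million tokens.
INPUT_PRICE_PER_MILLION = 0.10
OUTPUT_PRICE_PER_MILLION = 0.40


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the quoted rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: a 2,000-token prompt and a 500-token reply.
    print(f"Estimated cost: ${estimate_cost(2_000, 500):.6f}")
```

Swap in your own token counts to get a rough sense of what a workload would cost at these rates.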

Wow, that's impressive. What else should I know?

Meta CEO Mark Zuckerberg revealed that Meta AI has nearly 600 million monthly active users and is "on track to be the most used AI assistant in the world." He also shared that the next iteration, Llama 4, is planned for 2025, with training happening at Meta's new $10 billion, 2-gigawatt data center in Louisiana.

So in summary, Llama 3.3 packs a punch while being much more efficient and cost-effective than its predecessors. Meta is really pushing the boundaries with their language models.

Exactly. Meta is raising the bar when it comes to open AI models, matching industry leaders in performance while being much more efficient. Can't wait to see what they come up with for Llama 4!

Together With The Rundown AI

Start learning AI in 2025

Everyone talks about AI, but no one has the time to learn it. So, we found the easiest way to learn AI in as little time as possible: The Rundown AI.

It's a free AI newsletter that keeps you up-to-date on the latest AI news, and teaches you how to apply it in just 5 minutes a day.

Plus, complete the quiz after signing up and they’ll recommend the best AI tools, guides, and courses – tailored to your needs.

Google

You heard it right: Google's new Gemini language model just reclaimed the top spot on the Chatbot Arena leaderboard, surpassing OpenAI.

Image Source: LM Arena

What are the key details on Gemini-exp-1206?

Well, the model was just released on the one-year anniversary of Gemini. It's now ranked #1 overall on the Chatbot Arena, up from #2 previously. Notably, Gemini can process and understand video content, unlike ChatGPT and Claude, which are limited to just images.

The model also maintains its impressive 2 million token context window, allowing it to handle over an hour of video. And the best part? Gemini-exp-1206 is completely free to use through Google AI Studio and the Gemini API.

Wow, that's pretty impressive, especially with OpenAI raising its top-tier pricing to $200 per month. Sounds like Google is taking a different approach by making its top AI model available for free.

Exactly. While the pure performance edge may be slim, the combination of Gemini's capabilities and zero cost is a real game-changer for AI accessibility. It'll be interesting to see how this shakes up the competitive landscape.

Definitely. Google is really making a strong push with Gemini. It'll be worth keeping an eye on how this model performs and how it's received compared to the paid competition.

Trending Today
  • xAI briefly rolled out Aurora, a new AI image generator integrated with Grok that appeared to produce more photorealistic images than the previous Grok-integrated Flux model, particularly with landscapes, still-life images and human photorealism. However, the feature was pulled after just a few hours of testing.

  • Apollo Research tested OpenAI’s full o1 and found that the new model showed some instances of alarming behaviour, including attempting to escape and lying about its actions, though the scenarios were unrealistic for the real world.

  • Google DeepMind has launched GenCast, an AI weather model that outperforms traditional systems. It combines over 50 predictions to produce more accurate forecasts and will be integrated into Google Search and Maps, providing valuable tools for both research and everyday use.

  • Microsoft is addressing privacy concerns around Copilot Vision by implementing session-based data deletion, with plans to develop more sophisticated privacy infrastructure as the technology continues to evolve.

  • OpenAI is reportedly considering removing its AGI exclusion clause with Microsoft, which would pave the way for billions in future investments as the company aims to transition away from its non-profit structure.

  • OpenAI is on the cusp of unveiling an enhanced version of its Sora video generator, integrating sophisticated capabilities like text-to-video, text-and-image-to-video, and text-and-video-to-video generation, accommodating clips up to one minute in duration.

Recommended Reading
If we had to recommend other newsletters

AI Tool Report is a newsletter we enjoy reading daily. It delivers top techniques on how AI can transform your business, filled with practical tips and real-world examples.

A Smart Bear is for people who love thinking about strategy, startups, product, marketing, decision-making, and founder psychology.

Job Opportunities
  • DeepMind - Research Scientist, Gemini Multilinguality - Apply

  • People AI - Sr. Marketing Operations Manager - Apply

  • Notable - Solutions Engineer - Apply

  • SoundHound AI - Product Support Manager - Apply

  • Perplexity - User Operations Generalist - Apply

AI News MEME

Source: Ideogram

Ideas? Comments? Complaints?

We read your emails, comments and poll replies daily.

Sponsorship slots are open for January: reach 800+ active readers. (Now 40% off) 🤯

Until next time, Stay Informed!
