• unwind ai
  • Posts
  • Sora and Gemini 1.5 changed the GenAI Landscape 💥

Sora and Gemini 1.5 changed the GenAI Landscape 💥

PLUS: OpenAI launches the most sophisticated text-to-video AI model, Google tightens its grip with largest context length LLM Gemini 1.5

Today’s top AI Highlights:

  1. Google launches Gemini with 1 million context length

  2. OpenAI text-to-video model “Sora” breaks the internet with its amazing output

  3. Stable Cascade makes training and fine-tuning easy on consumer hardware

  4. Lindy — Build AI Agents with NoCode

& so much more!

Read time: 3 mins

Latest Developments 🌍

Google Launches Gemini 1.5 with 1M Context Length

Just a week back Google rolled out Gemini Ultra and there’s already a new update to the model. Gemini 1.5 comes with a huge 1 million context length and delivers dramatically enhanced performance over its predecessor across different modalities (text, code, image, audio, video) but with less compute. Here’s EVERYTHING you need to know about the new model:

OpenAI Releases its first-ever Text-to-Video Model Sora

Sora is an AI model that can create realistic and imaginative scenes from text instructions. It can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt. It enables the creation of intricate scenes, animates images, and extends videos, blending creativity with realism.

Starting today, Sora is becoming available to red teamers to assess critical areas for harms or risks plus a number of visual artists, designers, and filmmakers. Here’s EVERYTHING you need to know about Sora:

Stable Cascade: A Leap in Text-to-Image Models 🚀

Stable Cascade by Stability AI represents a significant advancement in text-to-image generation utilizing a novel three-stage approach based on the Würstchen architecture. It is designed for non-commercial use and is important for its ease of training and fine-tuning on consumer hardware, offering a cost-effective solution for creators.

Key Highlights:

  • Three-Stage Approach: Stable Cascade's unique architecture divides the text-to-image generation process into three distinct stages, facilitating highly compressed latent space and enhancing image quality.

  • Efficiency and Accessibility: Designed for consumer hardware, it offers a 16x reduction in training costs compared to similar-sized models, making advanced text-to-image generation more accessible.

  • Versatility and Innovation: Alongside standard image generation, Stable Cascade introduces capabilities for image variations, image-to-image generations, and supports advanced techniques like ControlNet and LoRA for custom finetuning, showcasing its versatility and potential for creative exploration.

Tools of the Trade ⚒️

  1. Lindy: Lindy lets you build your custom AI agents to automate tasks and get things done 10x cheaper and 10x faster — no coding required. With Lindy, you can:

    1. Automate emails, meeting schedules, and note-taking.

    2. Connect your AI employee to over 3,000+ existing apps.

    3. Streamline customer support, sales, recruiting, and more.

  1. Microsoft UFO: UFO is a UI-Focused dual-agent framework to fulfill user requests on Windows OS by seamlessly navigating and operating within individual or spanning multiple applications. UFO operates as a dual-agent framework where both agents leverage the multi-modal capabilities of GPT-Vision to comprehend the application UI and fulfill the user's request.

  2. Quary: Quary connects to your data warehouse and lets your team transform raw data into valuable insights in seconds, right from your browser! https://www.ycombinator.com/launches/KKW-quary-transform-data-together

  3. Machined: Machined creates content that will help you rank on search engines. Simply put, Machined will generate a large number of informational articles on any topic you like; structured, written and interlinked in a way that search engines love. Ready

😍 Enjoying so far, TWEET NOW to share with your friends!

Hot Takes 🔥

  1. My oversimplified view of how AI companies see (and are building) the near future of AI: Google: You get a personalized assistant that knows you, your email, etc. and helps you with it Microsoft: You get an intern to help with work OpenAI: Autonomous agents execute on your goals ~ Ethan Mollick

  2. Dream Scenario… Ilya and Karpathy start a company, raise $1T to build open source AGI!❤️❤️ ~ Bindu Reddy

Meme of the Day 🤡

When your code is a mess but it still somehow works

Image

That’s all for today!

See you tomorrow with more such AI-filled content. Don’t forget to subscribe and give your feedback below 👇

Real-time AI Updates 🚨

⚡️ Follow me on Twitter @Saboo_Shubham for lightning-fast AI updates and never miss what’s trending!!

PS: I curate this AI newsletter every day for FREE, your support is what keeps me going. If you find value in what you read, share it with your friends by clicking the share button below!

Reply

or to participate.