
Grok-3 to be Trained on 100,000 H100s

PLUS: AI DevOps Copilot, HybridRAG, GPT-Instagram

Today’s top AI Highlights:

  1. HybridRAG combines Knowledge graphs & Vector RAG for better insights

  2. Cut AI assistant costs by up to 77.8% by separating conversation from business logic

  3. xAI builds the world’s most powerful AI supercomputer with 100k H100s

  4. Lenovo leak shows cheaper Copilot Plus PCs coming this month

  5. AI DevOps copilot to supercharge your cloud stack

& so much more!

Read time: 3 mins

Latest Developments

Financial analysts are constantly searching for valuable insights in unstructured data like earnings calls and reports. Traditional methods struggle with this, but LLMs offer a solution. However, these models face challenges with domain-specific terminology and complex document formats. This is where HybridRAG, a novel approach, steps in.

HybridRAG integrates knowledge graphs and vector retrieval to improve information extraction from financial documents and provide more accurate and contextually relevant answers.

Key Highlights:

  1. Enhanced retrieval - HybridRAG combines information from both a vector database and a knowledge graph, offering a broader and more comprehensive understanding of the financial data.

  2. Improved accuracy - It achieves 96% retrieval accuracy, surpassing both traditional VectorRAG and GraphRAG. It also outperforms VectorRAG and GraphRAG in faithfulness and answer relevancy.

  3. Flexible and adaptable - It excels in both extractive and abstractive questions, leveraging the strengths of each RAG technique to handle different types of queries and complexities in financial data.

  4. Wider application - This technique holds promise for improving information extraction in fields like legal, healthcare, and research, where complex and diverse data needs to be analyzed.
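
The combination described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the corpus, the triples, the bag-of-words similarity (standing in for real embeddings), and every function name are assumptions made up for the example. The only idea taken from the text is that context from a vector store and context from a knowledge graph are both retrieved and then merged before being handed to the LLM.

```python
from collections import Counter
from math import sqrt

# Toy corpus standing in for chunks of an earnings-call transcript (made-up data).
CHUNKS = [
    "Revenue grew 12 percent year over year driven by cloud services.",
    "Operating margin declined due to higher research spending.",
    "The company announced a new share buyback program.",
]

# Toy knowledge graph as (subject, relation, object) triples (made-up data).
TRIPLES = [
    ("Revenue", "grew_by", "12 percent"),
    ("Revenue", "driven_by", "cloud services"),
    ("Operating margin", "declined_due_to", "research spending"),
]


def bag_of_words(text: str) -> Counter:
    """Lowercased word counts; a crude stand-in for a real embedding."""
    return Counter(word.strip(".,?").lower() for word in text.split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def vector_context(query: str, k: int = 2) -> list:
    """Top-k chunks by similarity to the query (the vector-retrieval side)."""
    q = bag_of_words(query)
    ranked = sorted(CHUNKS, key=lambda c: cosine(q, bag_of_words(c)), reverse=True)
    return ranked[:k]


def graph_context(query: str) -> list:
    """Triples whose subject is mentioned in the query (the knowledge-graph side)."""
    q = query.lower()
    return [f"{s} {r} {o}" for s, r, o in TRIPLES if s.lower() in q]


def hybrid_context(query: str) -> str:
    """Merge both contexts; this string would be placed in the LLM prompt."""
    return "\n".join(vector_context(query) + graph_context(query))


if __name__ == "__main__":
    print(hybrid_context("What drove revenue growth?"))
```

In a real pipeline the two retrievers would be a vector database and a graph store, and the merged context would be followed by the user question in the prompt.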

LLM-powered chatbots seem useful for enterprise applications but are often unreliable in production environments. This is where a new approach shines: separating conversation ability from business logic execution.

Two semi-structured approaches were analyzed: CALM, which isolates conversational tasks from business logic execution, and LangGraph, which integrates them. The study strongly favors CALM, which delivered more consistent, reliable, and cost-effective AI interactions.

Key Highlights:

  1. Cost efficiency - Using CALM reduced costs per user message by up to 77.8%. This is a game-changer for businesses looking to deploy AI assistants at scale, especially in areas like customer support, where costs can quickly add up.

  2. Lower latency - The CALM approach significantly outperforms LangGraph in terms of latency. On average, CALM responds in 2.08 seconds, whereas LangGraph takes 7.4 seconds. Further optimizations reduce CALM’s response time to 1.58 seconds.

  3. Consistency and reliability - CALM ensures that the AI assistant follows a predefined set of rules and processes, reducing errors and inconsistencies. LangGraph’s integrated approach showed frequent inconsistencies, such as booking incorrect flights and offering impossible dates, which are critical failures in enterprise settings.
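
To make the separation concrete, here is a minimal Python sketch of the general pattern, assuming a flight-booking assistant. It does not use the CALM or LangGraph APIs: `understand`, `run_flow`, `BookingState`, and the flight data are all hypothetical names invented for illustration, and a simple keyword matcher stands in for the LLM. The point is only that the language step produces structured slot updates, while a deterministic flow decides what actually happens.

```python
from dataclasses import dataclass
from typing import Optional

# Made-up inventory; a real assistant would query a booking backend.
AVAILABLE_FLIGHTS = {("Berlin", "2024-09-10"), ("Paris", "2024-09-12")}


@dataclass
class BookingState:
    """Slots the flow must fill before it is allowed to act."""
    destination: Optional[str] = None
    date: Optional[str] = None


def understand(message: str) -> dict:
    """Stand-in for the LLM: turns text into slot updates and nothing else."""
    updates = {}
    for city in ("Berlin", "Paris"):
        if city.lower() in message.lower():
            updates["destination"] = city
    for token in message.split():
        token = token.strip(".,")
        # Accept only tokens shaped like YYYY-MM-DD.
        if len(token) == 10 and token[4] == "-" and token[7] == "-":
            updates["date"] = token
    return updates


def run_flow(state: BookingState, updates: dict) -> str:
    """Deterministic business logic: the language step never books anything."""
    state.destination = updates.get("destination", state.destination)
    state.date = updates.get("date", state.date)
    if state.destination is None:
        return "Where would you like to fly?"
    if state.date is None:
        return "On which date?"
    if (state.destination, state.date) not in AVAILABLE_FLIGHTS:
        return f"No flight to {state.destination} on {state.date}."
    return f"Booked flight to {state.destination} on {state.date}."


def handle(state: BookingState, message: str) -> str:
    """One conversational turn: understand first, then execute the flow."""
    return run_flow(state, understand(message))
```

Because the flow checks every request against the inventory before confirming, a misread date leads to a refusal rather than a wrong booking, which is the kind of failure the integrated approach was reported to produce.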

Quick Bites

  1. The xAI team has brought its Colossus supercomputer, built with 100,000 H100s, online in just 122 days, making it the most powerful AI training system in the world. Elon Musk has announced that it will double in size to 200,000 GPUs, including 50,000 H200s, in a few months.

  2. Amazon has hired the founders and many employees of AI robotics startup Covariant. It has also signed a non-exclusive license to use Covariant’s robotic foundation models. This allows Amazon to further develop its robotics capabilities without a full acquisition.

  3. LAION has released a new dataset, Re-LAION-5B, after removing links to suspected child sexual abuse material (CSAM), based on recommendations from various human rights organizations. Re-LAION-5B is available in two versions: Re-LAION-5B Research, which has been cleaned of known links to CSAM, and Re-LAION-5B Research-Safe, which also removes additional NSFW content.

  4. Lenovo is releasing more affordable Copilot Plus PCs in September, according to a leak from Evan Blass, with prices starting at €899. The new PCs feature a Qualcomm Snapdragon X Plus chip and will be priced below existing models.

Tools of the Trade

  1. Kura: AI DevOps copilot for software teams to manage and optimize their cloud infrastructure. It can answer questions about your cloud systems, deploy resources, and proactively surface issues by integrating directly with services like AWS, GCP, and Azure.

  2. GPT-Instagram: A GPT-based autonomous multi-agent AI app that recommends Instagram posts based on user queries and on personality traits extracted from the user’s historical Instagram data.

  3. Meter Command: AI tool that lets IT and Networking teams manage their networks using natural language, dynamically creating custom dashboards and software interfaces on the fly. It simplifies network management for all skill levels with real-time updates and personalized controls.

  4. Awesome LLM Apps: Build awesome LLM apps using RAG to interact with data sources like GitHub, Gmail, PDFs, and YouTube videos through simple text. These apps will let you retrieve information, engage in chat, and extract insights directly from content on these platforms.

Hot Takes

  1. Cope.
    Programming is no longer hard and it’s ok to accept that. AI can help you create anything you want, so long as you aren’t retarded.
    Data structures and “good software” were never difficult to understand, either - creating lines of “good code” was. It’s now simple.
    ~ System

  2. General Purpose Technologies, like AI, have effects that are complex; impossible to predict in advance; impact different groups & industries differently; and which play out over time.
    The sooner we get past AI is “all good” or “all bad” arguments & focus on context, the better.
    ~ Ethan Mollick

That’s all for today! See you tomorrow with more such AI-filled content.

Real-time AI Updates 🚨

⚡️ Follow me on Twitter @Saboo_Shubham for lightning-fast AI updates and never miss what’s trending!

Unwind AI - Twitter | LinkedIn | Instagram | Facebook

PS: We curate this AI newsletter every day for FREE, and your support is what keeps us going. If you find value in what you read, share it with at least one (or 20) of your friends!
