- unwind ai
- Posts
- OpenSource LLMs with 200k Context 🧠
OpenSource LLMs with 200k Context 🧠
PLUS: Real-life JARVIS Plays and Plans, Runway ML's Motion Magic
Today’s top AI Highlights:
JARVIS-1: The Next-Gen AI Agent that Performs and Plans
OpenAI Prepares for Building Superintelligence
The Fastest Inference Machine Yet
01.AI’s LLMs with Humongous Context Surpass Much Bigger LLMs
GEN-2 Brings Images to Life with a Stroke of Brush
Read time: 3 mins
Latest Developments 🌍
Real-life JARVIS is Here 🦸🏼♀️
Chinese researchers introduce JARVIS-1, an innovative open-world multi-task agent, designed for complex planning and control tasks in the dynamic Minecraft universe. This agent marks a leap forward in integrating multimodal inputs and memory-augmented language models for enhanced task execution and learning capabilities.
Key Highlights:
JARVIS-1 utilizes a combination of visual and textual inputs, employing a multimodal language model, enabling it to understand tasks and environmental feedback more effectively, more accurate and context-aware planning, particularly in long-horizon tasks.
A distinctive feature of JARVIS-1 is its multimodal memory mechanism. This memory stores scenarios and plans from successful past experiences, which the agent can retrieve for future planning.
JARVIS-1 features an autonomous learning capability. It can autonomously generate tasks for exploration and store these experiences in its multimodal memory. This self-instruct feature enables the agent to continuously refine its planning skills and adapt to new challenges.
OpenAI Seeks More Funds for AGI 💰
OpenAI is reportedly seeking further funding from Microsoft to pursue its artificial general intelligence ambitions. This follows a significant $10 billion investment by Microsoft earlier. The partnership with Microsoft extends beyond financial aspects, focusing on collaborative efforts to achieve breakthroughs in AI.
OpenAI has even revealed plans for the next generation of its AI model, GPT-5, although a specific timeline for its release was not committed. GPT-5 is expected to be more sophisticated than its predecessors, requiring more extensive data for training for which OpenAI has also called for large-scale datasets under the OpenAI Data Partnerships initiative to train GPT-5. The company's current focus is on creating more autonomous AI agents capable of performing complex tasks such as code execution, making payments, sending emails, or filing claims.
Hyper-Speed AI with the Fastest Inference Engine 🏃♂️
Together AI has introduced the Together Inference Engine, proclaimed as the world's fastest inference stack. It significantly outperforms existing services in speed and efficiency, marking a substantial advancement in the speed of generative AI.
Key Highlights:
The Together Inference Engine operates at speeds of 117 tokens per second for Llama-2-70B-Chat and 171 for Llama-2-13B-Chat, outperforming competitors like TGI, vLLM, and various serverless APIs. This enables faster and more efficient user experiences in demanding AI applications.
Built on CUDA with NVIDIA Tensor Core GPUs, it incorporates advanced techniques like FlashAttention-2 and Flash-Decoding. The engine ensures high-quality outputs, matching the accuracy benchmarks of the Hugging Face implementation without compromising model behavior.
The engine offers Serverless Endpoints for over 50 open-source models and customizable Dedicated Instances, both with auto-scaling. Over 100 models are available, continuously expanding. Improved performance translates to lower costs for users, with significant reductions in pricing for various services.
Open-Source LLMs with 200k Context Surpassing GPT-4’s 📝
01.AI, an emerging player in the field of AI, has open-sourced its Yi series models, including Yi-6B and Yi-34B. These bilingual (English/Chinese) models were developed from scratch and boast a huge context window of 200k tokens.
Key Highlights:
With 34 billion parameters and training on a 3 trillion token corpus, the Yi-34B model outperforms larger models like LLaMA2-70B and Falcon-180B in various language processing evaluations. Its balance of size and efficiency offers a cost-effective solution for complex AI projects.
Both Yi-6B and Yi-34B models stand out with their 200K context window, far surpassing even GPT-4’s context window of 128k tokens, allowing for deeper understanding of lengthy text, and enhancing their performance in detailed language processing tasks.
The Yi series is fully open for academic research, with free commercial licenses available.
Tools of the Trade ⚒️
Motion Brush in Gen-2: Runway ML has introduced Motion Brush into GEN-2 which lets you add controlled movement to specific parts of still images by just swiping a brush over it.
Superpowered AI: Provides an API for building LLM applications with access to external knowledge, enhancing reliability and performance for a variety of use cases, addressing common issues in RAG like out-of-context search results and hallucinations.
Market Analyst GPT: Paste a snapshot of any stock or crypto chart with technical indicators and geta comprehensive analysis to understand and predict market trends.
Paigo 2.0: A comprehensive billing management platform to provide fully automated, flexible solutions tailored for any pricing strategy, particularly aimed at startups and businesses seeking to optimize their billing operations.
AITable.ai: No-code platform to create custom AI chatbots from spreadsheet-like tables, featuring easy integration with websites and social media, versatile training options, and API support for various applications.
😍 Enjoying so far, TWEET NOW to share with your friends!
Hot Takes 🔥
How about e/acc-as-a-service? ~ Bojan Tunguz
Google is an ever shifting web of goals and efforts. ~ Source
Meme of the Day 🤡
That’s all for today!
See you tomorrow with more such AI-filled content. Don’t forget to subscribe and give your feedback below 👇
Real-time AI Updates 🚨
⚡️ Follow me on Twitter @Saboo_Shubham for lightning-fast AI updates and never miss what’s trending!!
PS: I curate this AI newsletter every day for FREE, your support is what keeps me going. If you find value in what you read, share it with your friends by clicking the share button below!
Reply