• unwind ai
  • Posts
  • Generative AI Funding at an All time High 📈

Generative AI Funding at an All time High 📈

Plus: OpenAI Function Calling and API Updates, Google's Virtual Try-On Tools, Pirated GPT-4 and more.

Hey there 👋

We’re starting this week’s edition with Salesforce jumping in the generative AI bandwagon to join the flag-bearers, with the below announcements:

Also, this week was brimming with a palpable sense of “rivalry” as Meta released and open-sourced an AI music generator MusicGen in a bout to Google’s guarded MusicLM, and AMD launched a new AI chip which is apparently the “world’s most advanced AI accelerator”, contesting Nvidia’s dominance. Adding to the spice was GPT-4 that has surpassed human capabilities in crafting pitch decks across multiple industries to become a favoured choice for securing funding.

If this made you curious, keep scrolling for more juicy details because we’ve covered it all!

This issue covers:

  • Latest Developments 🌍

  • News from the Industry 🧑‍🏫

  • Tools of the Trade ⚒️

  • AI Meme of the Week 🤡

Latest Developments 🌍

Our Pick 👌

Tracking Everything Everywhere All at Once: OmniMotion, a globally consistent motion representation, that allows for accurate, full-length motion estimation of every pixel in a video.

[resize output image]
  • Weakly supervised information extraction from inscrutable handwritten document images: Addressing the limitations of existing information extraction methods when dealing with handwritten documents.

  • Video-ChatGPT: A multimodal model that combines visual and language understanding to generate human-like conversations about videos.

  • FasterViT: Combines the benefits of CNNs and ViT for high image throughput in computer vision applications, using a Hierarchical Attention approach.

  • Transformers learn through gradual rank increase: Transformers exhibit incremental learning dynamics with increasing rank difference between trained and initial weights.

  • Face0: Enables instant conditioning of a text-to-image model on a face, for prompt-based image generation and control.

  • STUDY: Socially-aware recommender system that utilizes a modified transformer decoder network for joint inference over user groups in a social network.

  • Judging LLM-as-a-judge with MT-Bench and Chatbot Arena: GPT-4 as a judge for evaluating chat assistants shows over 80% agreement with human preferences.

  • Scalable 3D Captioning with Pretrained Models: Cap3D, an automatic approach to generating descriptive text for 3D objects, that leverages pretrained models to consolidate captions from multiple views of a 3D asset.

  • Image Captioners Are Scalable Vision Learners Too: Plain image captioning is a more powerful pretraining strategy for vision encoders than contrastive pretraining on image-text pairs.

  • Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding: Significantly improves dialog state tracking performance and reduced word error rate in automatic speech recognition.

  • Galactic: A high-speed simulation and reinforcement learning framework for training robotic mobile manipulation skills.

  • ChatGPT is fun, but it is not funny!: ChatGPT struggles with generating diverse and original jokes, repeating the same 25 Jokes over 90% times.

  • Retrieval-Enhanced Contrastive Vision-Text Models: Utilize external memory to improve fine-grained knowledge retrieval and boost CLIP performance.

  • SayTap: A method to control quadrupedal robots using natural language commands and foot contact patterns.

  • TART: A plug-and-play transformer module that enhances reasoning abilities in LLMs, improving performance across various tasks, models, and modalities.

  • Mind2Web: A dataset for developing and evaluating language-based generalist agents to perform complex tasks on real-world websites.

  • WebGLM: An efficient web-enhanced Q&A system that augments pre-trained LLMs with web search and retrieval capabilities, improves upon WebGPT.

  • GPT-Calls: Using GPT model for efficient and accurate call segmentation and topic extraction without the need for labeled data.

News from the Industry 🧑‍🏫

Our Pick 👌

  • OpenAI has announced the following Function calling and API updates:

    • Developers can now describe functions to GPT-4 and GPT-3.5-turbo, allowing the model to output a JSON object containing arguments to call those functions.

    • The existing GPT-4 and GPT-3.5-turbo models have been improved, and new models have been introduced with extended context length.

    • The cost of text-embedding-ada-002 is being reduced by 75% to $0.0001 per 1K tokens.

    • The cost of GPT-3.5-turbo’s input tokens is reduced by 25%, to $0.0015 per 1K input tokens and $0.002 per 1K output tokens.

    • Deprecation timelines have been announced for older versions of the models.

Tools of the Trade ⚒️

Our Pick 👌

Framer AI: Create and publish a website in seconds using simple text prompts, provides many editing options and an in-built copywriter.

  • Airplane Autopilot: Develop internal tools and dashboard to simplify engineering operations on the Airplane platform, using text prompts, without coding.

  • Juri Flow: Get instant legal assistance from expert AI lawyer well-versed in various legal domains.

  • Greenifs AI: Ensures compliance with green marketing guidelines, detects greenwashing errors, and helps improve marketing communications.

  • Composer: Build trading algorithms using AI, backtest strategies, and execute trades, all without coding.

  • JobWizard: AI-powered job hunting tool that automates job applications, provides personalized answers and tracks applications in real-time.

  • Credal: Secure AI solution for enterprises that integrates with existing data sources, provides secure chat UI and APIs, enforces access policies, generates audit logs and redacts sensitive data.

  • AIAgent: An intelligent web app that empowers users to automate workflows, runs multiple AI Agents concurrently, powered by GPT-4, no API keys.

  • Bothatch: Transform your data into conversations, create and train AI-powered chatbots that engage in personalized interactions and automate tasks.

  • RestoGPT: AI that generates free online ordering storefront with integrated POS and delivery, enables autopilot order acceptance and fulfillment without fees.

  • Deeto: Connect your prospects with top customers to provide trustworthy insights, facilitate dialogue, and close deals faster.

  • Whisper Web: Offers ML-powered speech recognition directly in your browser, enabling audio-to-text conversion in real-time.

  • Perplexity AI Profile: Create your own AI profile by setting your bio, choosing language and location, and get answers tailored to your preferences.

  • Sentelo: Efficient learning through paraphrasing, code explanations, summaries, control questions, expanded content and more.

AI Meme of the Week 🤡

That’s all for this week!

Will see you next Saturday with more such content. Don’t forget to subscribe and give your feedback below.

BONUS 🎉

Share this newsletter with three other friends and stand a chance to win my book GPT-3: The Ultimate Guide to build NLP Products with OpenAI API. Winners will be selected on a monthly basis.

Reply

or to participate.