• unwind ai
  • Posts
  • TikTok's Billion-Dollar AI Dance πŸ‘―β€β™€οΈ

TikTok's Billion-Dollar AI Dance πŸ‘―β€β™€οΈ

PLUS: Google's RoboCat, Meta's Voicebox, Opera's AI browser and Musk-Zuckerberg catfight πŸ₯Š

Hey there πŸ‘‹

This week we saw quite a few interesting developments in robotics. From affordable home robots that can navigate homes effortlessly to robotics agents that can learn themselves to operate different arms, these advancements are really pushing the boundaries of what robots can do.

Speaking of arms, Stability AI’s new image generation models have caught the limelight this week. People are buzzing about the stunningly beautiful arms these models can create, showcasing a remarkable improvement from the days of blurry and blotchy hands.

Besides these, there were many other intriguing events! With Sam Altman surrounded by controversies again to TikTok's impressive purchase of GPUs worth a staggering $1 billion from NVIDIA, there's been no shortage of news to grab our attention. Want to know them all? Just keep scrolling!

This issue covers:

  • Latest Developments 🌍

  • News from the Industry πŸ§‘β€πŸ«

  • Tools of the Trade βš’οΈ

  • Hot Takes πŸ”₯

  • AI Meme of the Week 🀑

Latest Developments 🌍

Our Pick πŸ‘Œ

RoboCat: A self-improving robotic agent: Learns to operate different robotic arms, solves tasks from a small number of demonstrations, and enhances its performance through self-generated data.

  • Rerender a Video: Zero-shot text-guided video-to-video translation, utilizing an adapted diffusion mode, achieving temporal coherence without re-training or optimization.

  • Language to Rewards for Robotic Skill Synthesis: Defining reward parameters to optimize control policies and enable robots to accomplish diverse tasks using LLMs.

  • DreamHuman: Generates realistic and animatable 3D human avatars from textual descriptions, surpassing previous approaches in visual quality and diversity.

  • DecodingTrust: Analyzing trustworthiness in GPT models to expose biases, toxicity, privacy breaches, and manipulation potential.

  • Seeing the World through Your Eyes: Reconstructing 3D scenes from eye reflections in portraits, enhancing eye poses, scene radiance, and iris texture.

  • Textbooks Are All You Need: Phi-1, a compact language model trained on top-quality textual data, exhibits exceptional accuracy.

  • MagicBrush: A manually annotated dataset for instruction-guided image editing enabling the training of large-scale text-guided image editing models.

  • LLMs can label data as well as humans, but faster: SOTA LLMs can label text datasets at the same or better quality but ~20x faster and ~7x cheaper, GPT-4 being the top performer.

  • Explore, Establish, Exploit: A red teaming framework for evaluating and mitigating harmful outputs of LLMs.

  • Full Parameter Fine-tuning for LLMs with Limited Resources: A new optimizer called LOMO, which combines gradient computation and parameter update to reduce memory usage.

  • Demystifying GPT Self-Repair for Code Generation: Examining GPT's self-repair capability in code generation, notably improved with feedback.

  • VidEdit: Zero-shot text-based video editing that achieves strong temporal smoothness and preservation of the original video structure.

  • Scaling Open-Vocabulary Object Detection: Achieved using pretrained vision-language models, self-training with pseudo-box annotations showing significant improvements.

  • Macaw-LLM: A multi-modal language model integrating diverse data types to handle complex real-world scenarios.

  • Perceiver TF: A deep neural network framework that improves multitrack music transcription by accurately transcribing multiple instruments and vocals together.

  • KoLA: A benchmark to assess LLMs' knowledge-related abilities, using fair data comparisons and contrastive evaluation criteria, resulting in intriguing findings.

  • HomeRobot: An economical home robot capable of performing tasks using open-vocabulary mobile manipulation.

  • Language-Guided Music Recommendation for Video via Prompt Analogies: Using natural language prompts and a trimodal model to retrieve music samples that match the video style and user's language query.

  • Robots Learning from Visual Affordances: Enabling robots to learn tasks from videos for flexible performance in varied environments.

  • MPT-30B by MosaicML: A new, more powerful open-sourced LLM licensed for commercial use that offers 8k context window, outperforms GPT-3.

  • OpenLLaMA: Open-source reproductions of Meta's LLaMA models demonstrating comparable performance to original LLaMA and GPT-J models.

News from the Industry πŸ§‘β€πŸ«

Our Pick πŸ‘Œ

Opera has made its AI-powered browser Opera One available for download that has Aria (AI chat assistant) and an intuitive interface with Modular Design, Tab Islands, Multithread Compositor and more!

Tools of the Trade βš’οΈ

Our Pick πŸ‘Œ

Dropbox AI: New AI-powered tools released by Dropbox which include universal search that connects all tools, content and apps in a single search bar, smart suggestions, summarizing, collaborative document editor, and more.

  • Giskard: Testing and debugging solution to detect risks of performance issues, biases, and errors in your model before production.

  • Warp: Modern, Rust-based terminal with built-in AI that speeds up software development.

  • Nonoisy: Online audio editing platform that uses AI to remove background noise, master audio, and level volume.

  • Leet Resumes: Free resume writing service, uses a combination of AI and human expertise to create resumes tailored to individual needs.

  • NoteGenie: AI-driven note-taking platform with features like identifying key topics, extracting important information, and categorizing notes.

  • Danelfin: Stock analytics platform that uses AI to analyse over 10,000 features per day per stock and assigns a rating out of 10.

  • Parallel Domain Data Lab: A synthetic data platform that generates high-fidelity, customisable synthetic data for training computer vision and perception models.

  • Parrot: AI-powered platform for remote depositions, offering digital reporting, transcription, video conferencing, and live chat solutions.

  • Factiverse AI - Editor: Find factual mistakes in your text (whether human written or AI generated) and get links to credible sources to verify the information.

  • EssayGrader: Uses AI to provide feedback on the quality of long essays, summarize, and detect AI-generated text.

  • MyShell: Create personalized chatbots called Shells, customize your Shell's appearance, voice, and personality, and train it to perform specific tasks.

  • GPT Engineer: Open-source tool to automatically generate entire well-formatted and functional codebases from simple text prompts.

  • AI Signature Generator: Create a personalized, professional handwritten signature with AI enhancement.

  • Upword: AI-powered research tool that generates notes, extracts key ideas, summarizes, simplifies and translates, text and generates audio summaries.

  • Virtual CMO: AI marketing co-pilot for solopreneurs, just state the business and problems and get effective marketing solutions.

Hot Takes πŸ”₯

Image

Meme of the Week 🀑

Image

That’s all for this week!

Will see you next Saturday with more such content. Don’t forget to subscribe and give your feedback below.

BONUS πŸŽ‰

Share this newsletter with three other friends and stand a chance to win my book GPT-3: The Ultimate Guide to build NLP Products with OpenAI API. Winners will be selected on a monthly basis.

Reply

or to participate.