TLDR AI 2024-04-24

Ray-Ban smart glasses with AI πŸ•ΆοΈ, OpenAI enterprise πŸ’Ό, Phi 3 models 🌐

πŸš€
Headlines & Launches

OpenAI Announces New Enterprise AI Features (4 minute read)

OpenAI has announced new enterprise-grade features for its API customers, including enhanced security measures, an upgraded Assistants API, a new Projects feature for granular access control, and cost management tools. These updates demonstrate OpenAI's focus on offering a more "plug and play" experience for enterprises, countering the rise of competitors like Meta's Llama 3 and open models from Mistral.

The Ray-Ban Meta Smart Glasses Have Multimodal AI Now (5 minute read)

Meta has rolled out multimodal AI to its Ray-Ban Meta Smart Glasses, allowing users to process photos, audio, and text through voice commands for tasks like identification and translation, although the AI's capabilities are limited and sometimes inaccurate.

Apple Working on On-Device LLM for Generative AI Features (2 minute read)

Apple's LLM will reportedly run entirely on-device rather than via the cloud like most existing AI services.
🧠
Research & Innovation

Phi 3 (17 minute read)

Phi 3 is a series of models, 3B-14B in size, which perform exceptionally well on modern benchmarks. The 3B model claims to outperform the original ChatGPT model. Weights have been released. There is a variant available with a 128k context length.

Instruction hierarchy (17 minute read)

OpenAI published research on giving system prompts stronger weighting, which dramatically improves model robustness to jailbreaks and adversarial attacks.

Introducing SEED-X: Multimodal Foundation Models (16 minute read)

SEED-X advances multimodal foundation models by tackling real-world application challenges. It can understand images of any size and aspect ratio and produce images with varying levels of detail.
πŸ‘¨β€πŸ’»
Engineering & Resources

Image Generation with MultiBooth (3 minute read)

MultiBooth introduces a two-phase process to enhance multi-concept image generation, overcoming the challenges of concept fidelity and high costs found in other methods.

Exploring LLaMA3's Performance in Low-Bit Quantization (GitHub Repo)

Meta's LLaMA3, a leading large language model, is being tested for its efficiency in low-bit scenarios, often essential in systems with limited resources. This study, available on GitHub and Hugging Face, aims to refine and improve quantization strategies for future large language models.

Instructor (GitHub Repo)

Instructor is a Python library that makes it easy to work with structured outputs from large language models.
🎁
Miscellaneous

How does ChatGPT work? As explained by the ChatGPT team (6 minute read)

In this article, OpenAI's Evan Morikawa provides insights into ChatGPT's inner workings, covering input text processing and tokenization to prediction sampling using large language models. ChatGPT operates by turning tokens into numerical vectors (embeddings), multiplying them by a weight matrix of billions, and selecting the most probable next word. The tech is grounded in extensive pretraining to predict text based on vast internet data.

Self-Reasoning Tokens, teaching models to think ahead (4 minute read)

Recent experiments introduced "Reasoning Tokens" to improve the thinking process of language models like GPT-2, encouraging them to make calculations for future tokens. Early results show a 35% decrease in loss, indicating the models can indeed learn to anticipate future information. This approach could enhance the ability of language models to plan and reason in a self-supervised manner, potentially reducing the need for step-by-step explanations.

Los Angeles is using AI in a pilot program to try to predict homelessness and allocate aid (7 minute read)

The Los Angeles County Department of Health Services is using predictive AI to prevent homelessness, identifying at-risk individuals to provide aid and successfully keeping 86% of participants housed. The program, initiated in 2021, has assisted nearly 800 households with over $4,000 in support each. Despite concerns over privacy and ethics, the AI initiative shows promise in addressing California's climbing homelessness crisis.
⚑️
Quick Links

Startup Uses AI To Edit Human Data (1 minute read)

A Berkeley-based startup, Profluent, claims to have used generative AI to create a new gene editor called OpenCRISPR-1, which it has used to edit human DNA.

Generating 3D Scenes from Six Images (4 minute read)

6Img-to-3D is a novel method that uses transformers to create 3D-consistent images from just six input photos.

Panjaya (Product)

Localize or tailor video content for any audience in any language while preserving the speakers' natural voice and lip movements.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for