Welcome to M5B: Machine 5-Minute Briefing

Your centralized dashboard for the generative AI revolution. Track the latest models, secure exclusive offers, and master the prompt.

Research • May 28, 2026

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

Apple is presenting new research at the annual IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), which takes place in person in Denver at the Colorado Convention Center from June 3 to June 7. We are proud to sponsor the conference, which brings together the scientific and indust...

#Apple #On-device AI

Research • May 22, 2026

VSAS-Bench: Real-Time Evaluation of Visual Streaming Assistant Models

Streaming vision-language models (VLMs) continuously generate responses given an instruction prompt and an online stream of input frames. This is a core mechanism for real-time visual assistants. Existing VLM frameworks predominantly assess models in offline settings. In contrast, the performance of...

#Apple #On-device AI

Research • May 8, 2026

Apple Workshop on Privacy-Preserving Machine Learning & AI 2026

At Apple, we believe privacy is a fundamental human right. As AI capabilities increase and become more integrated into people’s daily lives, advancing research in privacy-preserving techniques is increasingly important to ensure privacy is protected while users enjoy innovative AI experiences. Appl...

#Apple #On-device AI

Research • May 8, 2026

Large-Scale High-Quality 3D Gaussian Head Reconstruction from Multi-View Captures

We propose HeadsUp, a scalable feed-forward method for reconstructing high-quality 3D Gaussian heads from large-scale multi-camera setups. Our method employs an efficient encoder-decoder architecture that compresses input views into a compact latent representation. This latent representation is then...

#Apple #On-device AI

Research • May 7, 2026

Text-Conditional JEPA for Learning Semantically Rich Visual Representations

Image-based Joint-Embedding Predictive Architecture (I-JEPA) offers a promising approach to visual self-supervised learning through masked feature prediction. However, with the inherent visual uncertainty at masked positions, feature prediction remains challenging and may fail to learn semantic repre...

#Apple #On-device AI

Research • May 7, 2026

What Matters in Practical Learned Image Compression

One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be optimized directly to appeal to the human visual system. Despite this potential, a perceptual yet practical image codec is yet to be proposed. In this work, we aim...

#Apple #On-device AI

Research • May 6, 2026

SpecMD: A Comprehensive Study on Speculative Expert Prefetching

Mixture-of-Experts (MoE) models enable sparse expert activation, meaning that only a subset of the model’s parameters is used during each inference. However, to translate this sparsity into practical performance, an expert caching mechanism is required. Previous works have proposed hardware-centric ...

#Apple #On-device AI

Research • Apr 30, 2026

International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2026

Apple is presenting new research at the annual International Conference on Acoustics, Speech and Signal Processing (ICASSP), which takes place in person in Barcelona, Spain, from May 4 to 8. We are proud to again sponsor the conference, which brings together the scientific and industrial research co...

#Apple #On-device AI

Research • Apr 22, 2026

Apple Machine Learning Research at ICLR 2026

Apple is advancing AI and ML with fundamental research, much of which is shared through publications and engagement at conferences in order to accelerate progress in this important field and support the broader community. This week, the Fourteenth International Conference on Learning Representations...

#Apple #On-device AI

Research • Apr 17, 2026

International Conference on Learning Representations (ICLR) 2026

Apple is presenting new research at the annual International Conference on Learning Representations (ICLR), which takes place in person in Rio de Janeiro, Brazil, from April 23 to 27. We are proud to again sponsor the conference, which brings together the scientific and industrial research communiti...

#Apple #On-device AI

Research • Apr 10, 2026

ACM Human-Computer Interaction Conference (CHI) 2026

Apple is presenting new research at the annual ACM (Association for Computing Machinery) CHI Conference on Human Factors in Computing Systems, which takes place in person in Barcelona, Spain, from April 13 to 17. We are proud to again sponsor the conference, which brings together the scientific and i...

#Apple #On-device AI

Research • Mar 31, 2026

ProText: A Benchmark Dataset for Measuring (Mis)gendering in Long-Form Texts

We introduce ProText, a dataset for measuring gendering and misgendering in stylistically diverse long-form English texts. ProText spans three dimensions: Theme nouns (names, occupations, titles, kinship terms), Theme category (stereotypically male, stereotypically female, gender-neutral/non-gendere...

#Apple #On-device AI

Research • Mar 28, 2026

Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting

Existing feed-forward 3D Gaussian Splatting methods predict pixel-aligned primitives, leading to a quadratic growth in primitive count as resolution increases. This fundamentally limits their scalability, making high-resolution synthesis such as 4K intractable. We introduce LGTM (Less Gaussians, Tex...

#Apple #On-device AI

Research • Mar 26, 2026

Drop-In Perceptual Optimization for 3D Gaussian Splatting

Despite their output being ultimately consumed by human viewers, 3D Gaussian Splatting (3DGS) methods often rely on ad-hoc combinations of pixel-level losses, resulting in blurry renderings. To address this, we systematically explore perceptual optimization strategies for 3DGS by searching over a di...

#Apple #On-device AI

Research • Mar 24, 2026

SafetyPairs: Isolating Safety Critical Image Features with Counterfactual Image Generation

This paper was accepted at the Principled Design for Trustworthy AI — Interpretability, Robustness, and Safety across Modalities Workshop at ICLR 2026. What exactly makes a particular image unsafe? Systematically differentiating between benign and problematic images is a challenging problem, as subt...

#Apple #On-device AI

Research • Mar 17, 2026

AMES: Approximate Multi-modal Enterprise Search via Late Interaction Retrieval

We present AMES (Approximate Multimodal Enterprise Search), a unified multimodal late interaction retrieval architecture which is backend agnostic. AMES demonstrates that fine-grained multimodal late interaction retrieval can be deployed within a production grade enterprise search engine without arc...

#Apple #On-device AI

Research • Mar 16, 2026

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning

Dense image captioning is critical for cross-modal alignment in vision-language pretraining and text-to-image generation, but scaling expert-quality annotations is prohibitively expensive. While synthetic captioning via strong vision-language models (VLMs) is a practical alternative, supervised dist...

#Apple #On-device AI

Research • Mar 3, 2026

EMBridge: Enhancing Gesture Generalization from EMG Signals through Cross-Modal Representation Learning

Hand gesture classification using high-quality structured data such as videos, images, and hand skeletons is a well-explored problem in computer vision. Alternatively, leveraging low-power, cost-effective bio-signals, e.g., surface electromyography (sEMG), allows for continuous gesture predict...

#Apple #On-device AI

Research • Feb 24, 2026

depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers

PyTorch 2.x introduces a compiler designed to accelerate deep learning programs. However, for machine learning researchers, adapting to the PyTorch compiler to full potential can be challenging. The compiler operates at the Python bytecode level, making it appear as an opaque box. To addres...

#Apple #On-device AI

Research • Feb 23, 2026

Apple Workshop on Reasoning and Planning 2025

Reasoning and planning are the bedrock of intelligent AI systems, enabling them to plan, interact, adapt, and ultimately, operate independently. At Apple, understanding and advancing reasoning capabilities in AI systems has long been an area of active research, and has resulted in numerous publicat...

#Apple #On-device AI

Research • Feb 13, 2026

Faster Rates For Federated Variational Inequalities

In this paper, we study federated optimization for solving stochastic variational inequalities (VIs), a problem that has attracted growing attention in recent years. Despite substantial progress, a significant gap remains between existing convergence rates and the state-of-the-art bounds known for f...

#Apple #On-device AI

Research • Feb 6, 2026

How PARTs Assemble into Wholes: Learning the Relative Composition of Images

The composition of objects and their parts, along with object-object positional relationships, provides a rich source of information for representation learning. Hence, spatial-aware pretext tasks have been actively explored in self-supervised learning. Existing works commonly start from a grid stru...

#Apple #On-device AI

Research • Feb 6, 2026

VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning

Video-conditioned sound and speech generation, encompassing video-to-sound (V2S) and visual text-to-speech (VisualTTS) tasks, are conventionally addressed as separate tasks, with limited exploration to unify them within a single framework. Recent attempts to unify V2S and VisualTTS face challenges i...

#Apple #On-device AI

Research • Jan 9, 2026

Inferring Optical Tissue Properties from Photoplethysmography using Hybrid Amortized Inference

Smart wearables enable continuous tracking of established biomarkers such as heart rate, heart rate variability, and blood oxygen saturation via photoplethysmography (PPG). Beyond these metrics, PPG waveforms contain richer physiological information, as recent deep learning (DL) studies demonstrate....

#Apple #On-device AI
