Google Releases Gemini 3.1 Flash Live: A Real-Time Multimodal Voice Model for Low-Latency Audio, Video, and Tool Use for AI Agents
Google has released Gemini 3.1 Flash Live in preview for developers through the Gemini Live API in Google AI Studio. This model targets low-latency, more natural, and more reliable real-time voice interactions, serving as Google’s ‘highest-quality audio and speech model to date.’ By natively process...
A Coding Implementation to Run Qwen3.5 Reasoning Models Distilled with Claude-Style Thinking Using GGUF and 4-Bit Quantization
In this tutorial, we work directly with Qwen3.5 models distilled with Claude-style reasoning and set up a Colab pipeline that lets us switch between a 27B GGUF variant and a lightweight 2B 4-bit version with a single flag. We start by validating GPU availability, then conditionally install either ll...
Transform your headphones into a live personal translator on iOS.
Google Translate’s Live translate with headphones is officially arriving on iOS! And we're expanding the capability for both iOS and Android users to even more countries…
Cohere AI Releases Cohere Transcribe: A SOTA Automatic Speech Recognition (ASR) Model Powering Enterprise Speech Intelligence
In the landscape of enterprise AI, the bridge between unstructured audio and actionable text has often been a bottleneck of proprietary APIs and complex cascaded pipelines. Today, Cohere—a company traditionally known for its text-generation and embedding models—has officially stepped into the Automa...
The results from 2025 are intriguing. Screens are a part of everyday life, and many of us use them for work, communication and entertainment. However, there are also signs that people are limiting their screen time. The total number of hours we spend on screens has not really changed, but digging deeper, ther...
Your Job Isn’t Going Away… But It’s Definitely Evolving
When AI comes to your workplace it doesn’t have to be with a dramatic flourish. There don’t have to be redundancies. There don’t have to be robots marching through the door. One tool. Then another. Then one day your work will simply look different. AI is not so much taking jobs, it is transforming t...
Apple Is Finally Rebuilding Siri From the Ground Up. But Will It Be Any Good This Time?
Ok, I’m going to ask this question, even though I already know the answer. When was the last time you used Siri for something critical? I thought so. It’s been around for a while, but it hasn’t necessarily been useful. That may change soon. Apparently, Apple is building a new version of Siri from sc...
Val Kilmer’s digital resurrection is jolting the entertainment industry, and raising some uncomfortable dilemmas
Val Kilmer is returning to the screen. But not exactly. Not in some retro montage. Not in a long-gone flashback. No, I’m talking about the real deal. Well, sort of. This time, he’ll be brought to life via AI. I can’t blame you if you’re both amazed and a bit disturbed by this news. The basic gist is...
Cohere launches an open source voice model specifically for transcription
Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It currently supports 14 languages.
Tencent AI Open Sources Covo-Audio: A 7B Speech Language Model and Inference Pipeline for Real-Time Audio Conversations and Reasoning
Tencent AI Lab has released Covo-Audio, a 7B-parameter end-to-end Large Audio Language Model (LALM). The model is designed to unify speech processing and language intelligence by directly processing continuous audio inputs and generating audio outputs within a single architecture. System Architectur...
Google launches Lyria 3 Pro music generation model
Google is launching Lyria 3 Pro, an upgraded music model that generates longer, more customizable tracks, as it expands AI music tools across Gemini, enterprise products, and other services.
Granola raises $125M, hits $1.5B valuation as it expands from meeting notetaker to enterprise AI app
Granola's valuation jumped from $250 million to $1.5 billion with this round, and it has added more support for AI agents after users previously complained.
Meta launches new initiative to support entrepreneurship, drive AI adoption
Meta CEO Mark Zuckerberg said in a memo to staff that small businesses have always been a big part of the company's business model, and that while tens of millions of entrepreneurs already use its platforms to grow and connect with customers, the company wants to do more in the space.
With the new Browser Shield for AI and Data Lineage products, security teams can now stop data exposure at the prompt and see exactly what AI is doing with their sensitive files. Cyera, the AI Security Platform built for the age of agents, today announced a new set of capabilities:...