AI
AI News

Latest AI News

Bosch’s fancy coffee machine is getting Alexa Plus

Bosch’s fancy coffee machine is getting Alexa Plus

A countertop coffee maker has launched a hands-free integration with Amazon's voice assistant, allowing users to brew coffee, adjust settings, and access recipes via voice commands. This feature aims to enhance user convenience and streamline the coffee-making process, appealing to tech-savvy consumers looking for smarter kitchen appliances.

The Verge
Shut the fridge door!

Shut the fridge door!

Samsung has introduced voice control features for the opening and closing of its Family Hub smart refrigerators. This upgrade allows users to operate the fridge hands-free via voice commands, enhancing convenience, especially while cooking. The feature is compatible with Samsung's Bixby assistant, and it aims to streamline kitchen tasks. Existing Family Hub owners will receive the update through a software upgrade, while new buyers will benefit from the functionality out of the box. This development underscores Samsung's commitment to integrating smart technology into everyday appliances, potentially influencing consumer decisions in the smart home market.

The Verge
This robot vacuum at CES 2026 can find lost items on your floor and alert you of them

This robot vacuum at CES 2026 can find lost items on your floor and alert you of them

The Narwal Flow 2 robot vacuum, recently unveiled at CES, features advanced cleaning capabilities with a self-cleaning system that allows it to wash its own mop pads. It boasts improved navigation using LiDAR technology, allowing for more efficient mapping of spaces. The Flow 2 can also be controlled via a smartphone app, enhancing user convenience.

ZDNet
How DeepSeek's new way to train advanced AI models could disrupt everything - again

How DeepSeek's new way to train advanced AI models could disrupt everything - again

DeepSeek has launched Manifold-Constrained Hyper-Connections (mHCs), a new technology designed to enhance data connections in complex systems. This innovation aims to improve efficiency in data processing and analytics. The specific applications include better performance in machine learning and AI models, potentially revolutionizing how organizations handle large datasets. Further details on implementation and industry impact are anticipated.

ZDNet
Two Deep Learning Approaches for Automated Segmentation of Left Ventricle in Cine Cardiac MRI

Two Deep Learning Approaches for Automated Segmentation of Left Ventricle in Cine Cardiac MRI

Researchers have developed two deep learning models, LNU-Net and IBU-Net, for left ventricle segmentation in short-axis cine MRI images. LNU-Net enhances the U-Net with layer normalization, while IBU-Net combines instance and batch normalization. Tested on a dataset of 805 MRI images from 45 patients, both models significantly improved segmentation accuracy, outperforming existing methods in terms of the dice coefficient and average perpendicular distance. This advancement could enhance clinical diagnostics and quantification in cardiology.

arXiv
Categorical Reparameterization with Denoising Diffusion models

Categorical Reparameterization with Denoising Diffusion models

A new paper introduces a diffusion-based soft reparameterization for optimizing categorical variables, enhancing existing continuous relaxations. This method utilizes a Gaussian noising process with an efficient closed-form denoiser, allowing for backpropagation without prior training. Experiments indicate that this approach offers competitive or improved performance on various benchmarks, addressing the challenges of noise and bias in traditional optimization methods.

arXiv
Investigating the Viability of Employing Multi-modal Large Language Models in the Context of Audio Deepfake Detection

Investigating the Viability of Employing Multi-modal Large Language Models in the Context of Audio Deepfake Detection

A study investigates the use of Multimodal Large Language Models (MLLMs) for audio deepfake detection, an area previously underexplored. By combining audio inputs with text prompts, researchers evaluated two models, Qwen2-Audio-7B-Instruct and SALMONN, in zero-shot and fine-tuned modes. Results indicate that while performance on out-of-domain data is lacking, the models excel on in-domain tasks with minimal supervision, suggesting a promising direction for enhancing audio deepfake detection.

arXiv
Nvidia's AI empire: A look at its top startup investments | TechCrunch

Nvidia's AI empire: A look at its top startup investments | TechCrunch

Nvidia's financial performance has surged following the AI boom, especially post-ChatGPT launch. The company reported record revenues and profits, with cash reserves significantly increasing. This growth underscores Nvidia's pivotal role in AI hardware, particularly in supplying GPUs for AI applications, positioning it as a leader in the tech landscape.

TechCrunch
In 2026, AI will move from hype to pragmatism | TechCrunch

In 2026, AI will move from hype to pragmatism | TechCrunch

In 2026, the AI landscape is expected to prioritize practical applications over the development of increasingly large language models. The industry will shift towards refining AI for real-world use cases, emphasizing efficiency, integration, and user-friendly designs. This focus aims to enhance AI's utility across various sectors, ensuring it meets tangible needs.

TechCrunch
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

Recent advancements in 3D reconstruction using GaMO (Geometry-aware Multi-view Outpainter) tackle the limitations of existing methods that struggle with limited input views. By expanding the field of view from current camera poses, GaMO maintains geometric consistency and enhances scene coverage. In tests on Replica and ScanNet++, it achieved superior reconstruction quality and a $25\times$ speedup over leading diffusion methods, processing within 10 minutes. For more details, visit the project page: https://yichuanh.github.io/GaMO/.

arXiv
Many Minds from One Model: Bayesian Transformers for Population Intelligence

Many Minds from One Model: Bayesian Transformers for Population Intelligence

Researchers have introduced Population Bayesian Transformers (B-Trans), a novel approach that enables diverse model behaviors from a single set of pre-trained weights in large language models. By treating normalization layer offsets as stochastic variables, B-Trans maintains coherence while allowing for varied outputs. Experiments show that it improves semantic diversity and task performance in zero-shot generation and reinforcement learning scenarios, outperforming traditional deterministic models. This method enhances collaborative decision-making by aggregating predictions from multiple model instances.

arXiv