Tag: allm.link | en

AI Search: Lies and Fabrications

A Columbia Journalism Review (CJR) investigation reveals that AI-powered search engines are increasingly providing fabricated information and phantom citations, prioritizing speed over accuracy. This trend undermines the credibility of online sources, reduces website traffic, and poses a significant threat to the future of reliable information and informed public discourse. The situation is getting worse.

AI Search: Lies and Fabrications

AI Video's Physics Problem

Generative video models, like Tencent's Hunyuan and Alibaba's Wan 2.1, have made strides in temporal consistency. However, they often struggle with realistic physics, producing scenes where objects defy gravity or move unnaturally. A new benchmark, VideoPhy-2, aims to address this, highlighting the next major challenge for video AI: understanding physical commonsense.

AI Video's Physics Problem

Alibaba's Quark: AI Super Assistant

Alibaba upgrades Quark, its AI-powered information platform, into a super assistant. Driven by the Qwen model, Quark offers enhanced search, AI chat, deep thinking, and task execution. It's a significant step in Alibaba's AI strategy, aiming to integrate AI across its businesses and improve user experiences in various scenarios, from research to travel planning.

Alibaba's Quark: AI Super Assistant

Alibaba's R1-Omni: AI That Sees Emotions

Alibaba unveils R1-Omni, an open-source AI model that detects emotions through visual cues like facial expressions and body language. This marks a significant step beyond text-based emotion analysis, challenging competitors like OpenAI. The move is part of Alibaba's broader AI strategy, amidst growing competition and ethical considerations in the rapidly evolving field.

Alibaba's R1-Omni: AI That Sees Emotions

Claude AI to Get Voice Chat and Memory

Anthropic's Claude AI chatbot is set to receive major upgrades, including two-way voice interactions and memory capabilities. These enhancements aim to create more natural, personalized, and contextually relevant user experiences, positioning Claude as a versatile and adaptive assistant in the competitive AI landscape. The focus is on responsible implementation and ongoing refinement.

Claude AI to Get Voice Chat and Memory

Cohere's Command A: Speed & Efficiency

Cohere unveils Command A, a new large language model (LLM) designed for enterprise use. It boasts superior speed and efficiency, requiring fewer GPUs and offering twice the context length of competitors. Command A excels in inference and retrieval-augmented generation (RAG) tasks, making it a cost-effective and powerful solution for businesses seeking to enhance productivity.

Cohere's Command A: Speed & Efficiency

Gemma 3: Google's Efficient LLM

Google's Gemma 3 is a powerful and efficient open-source LLM. It surpasses competitors in performance while using fewer resources. Gemma 3 boasts multilingual capabilities, advanced functionalities like function calling, and optimized quantum versions. It's a significant step forward in accessible and sustainable AI, built upon the advancements of Gemini 2.0.

Gemma 3: Google's Efficient LLM

Grok AI Chatbot Adds Automatic URL Detection

Elon Musk's xAI chatbot, Grok, now automatically detects and reads URLs shared in user messages. This feature, found in the 'Behavior' settings, enhances context, research, and information retrieval, making Grok a more powerful and versatile AI assistant. It signifies a major step in integrating chatbots with the wider internet.

Grok AI Chatbot Adds Automatic URL Detection

Meta & SG Gov Launch Llama AI Incubator

Meta partners with the Singapore Government to launch the Llama Incubator Program, fostering open-source AI innovation. This initiative empowers startups, SMEs, and public sector agencies to develop AI solutions using Meta's Llama model, driving economic growth and societal progress. The program offers mentorship, resources, and prioritizes AI safety, culminating in a Demo Day in October 2025.

Meta & SG Gov Launch Llama AI Incubator

Mistral AI Launches Advanced OCR API

Mistral AI unveils Mistral OCR, a new API designed to digitize documents with superior accuracy and speed. It excels in multilingual support, complex layouts, and structured data extraction, surpassing existing solutions. Mistral OCR integrates with LLMs, offers high-speed processing, and prioritizes security, making it ideal for enterprises seeking to enhance efficiency and unlock insights.

Mistral AI Launches Advanced OCR API