Multimodal Presentation Models

NVIDIA Unveils New Open AI Models at CES 2026 & New AI Platform with 5x Speed

NVIDIA’s new AI releases debut at CES 2026, including thirteen models and a supercomputer 5x faster than Blackwell, helping ...

EurekAlert!

Beyond bigger models: How efficient multimodal AI is redefining the future of intelligence

Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...

VentureBeat

Companies are commercializing multimodal AI models to analyze videos and more

Earlier this month, researchers at the Allen Institute for AI, a nonprofit founded by the late Microsoft cofounder Paul Allen, released an interactive demo of a system they describe as part of a “new ...

Forbes

Beyond Large Language Models: How Multimodal AI Is Unlocking Human-Like Intelligence

The AI industry has long been dominated by text-based large language models (LLMs), but the future lies beyond the written word. Multimodal AI represents the next major wave in artificial intelligence ...

SiliconANGLE

H2O.ai releases small language models for multimodal processing tasks

H2O.ai Inc. on Thursday introduced two small language models, Mississippi 2B and Mississippi 0.8B, that are optimized for multimodal tasks such as extracting text from scanned documents. The models ...

SiliconANGLE

Microsoft releases new Phi models optimized for multimodal processing, efficiency

Microsoft Corp. today expanded its Phi line of open-source language models with two new algorithms optimized for multimodal processing and hardware efficiency. The first addition is the text-only ...

VentureBeat

Multimodal models are fast becoming a reality — consequences be damned

Roughly a year ago, VentureBeat wrote about progress in the AI and ...

TechCrunch

Meet two open source challengers to OpenAI’s ‘multimodal’ GPT-4V

OpenAI’s GPT-4V is being hailed as the next big thing in AI: a “multimodal” model that can understand both text and images. This has obvious utility, which is why a pair of open source projects have ...

Campus Technology

WHO Paper Raises Concerns about Multimodal Gen AI Models

Unless developers and governments adjust their practices around generative AI, large multimodal models may be adopted faster than they can be made safe for use, warns a new paper by the World Health ...
