B, an open-weight multimodal vision AI model designed to deliver strong math, science, document and UI reasoning with far ...
Berlin Coyotiv and OpenServ Labs published a research paper introducing BRAID (Bounded Reasoning for Autonomous ...
Scientists warn that current AI tests reward polite responses rather than real moral reasoning in large language models.
DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and ...
The company mainly trained Phi-4-reasoning-vision-15B on open-source data. The data included images and text-based descriptions of the objects depicted in those images. Before it started training the ...
Open-sourcing a model allows researchers, developers, and companies to access and use the model’s weights and architecture, ...
The Naglieri Nonverbal Ability Test (NNAT) is a nonverbal assessment designed to measure general reasoning ability in K-12 students, helping schools identify students with strong problem-solving ...
Metilience unveils a hybrid AI reasoning engine for high-stakes exams, leveraging structured cognitive error analysis ...
Microsoft releases Phi-4 Reasoning Vision 15B, a multimodal AI model that activates its own thinking mode and handles ...
OpenAI’s next GPT model is coming, and soon, according to a person with knowledge of it. Among the highlights, the new model, ...
OpenAI has launched its new ChatGPT 5.4 with Extreme Reasoning mode for long-duration task focus. As well as a 1M-token context window ...