Tag: tasks

As AI agents take on more tasks, governance becomes a priority

AI systems are beginning to move beyond simple responses. In many organisations, AI agents currently

By saad

GPT-5.2 first impressions: a powerful update, especially for business tasks and workflows

OpenAI has officially launched GPT-5.2, and the reactions from early testers, among whom OpenAI seeded the model

By saad

AI agents are taking over complex enterprise tasks

New adoption data from Perplexity reveals how AI agents are driving workflow efficiency gains by taking over

By saad

Beyond math and coding: New RL framework helps train LLM agents for complex, real-world tasks

Researchers at the University of Science and Technology of China have developed a new reinforcement learning (RL)

By saad

Alibaba's AgentEvolver lifts model performance in tool use by ~30% using synthetic, auto-generated tasks

Researchers at Alibaba’s Tongyi Lab have developed a new framework for self-evolving agents that create their own

By saad

MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

By saad

Salesforce’s new CoAct-1 writes its own code to accomplish tasks

By saad

New vision model from Cohere runs on two GPUs, beats top-tier VLMs on visual tasks

By saad

Fine-tuning vs. in-context learning: New research guides better LLM customization for real-world tasks

By saad

How the A-MEM framework supports powerful long-context memory so LLMs can take on more complicated tasks

By saad

Researchers find you don’t need a ton of data to train LLMs for reasoning tasks

By saad

Beyond benchmarks: How DeepSeek-R1 and o1 perform on real-world tasks

By saad