REDMOND / LONDON (IT BOLTWISE) – Microsoft has created a simulated marketplace environment to test the skills of AI agents. The results show that these models are vulnerable to manipulation and have difficulty collaborating.

Today’s daily deals at Amazon! ˗ˋˏ$ˎˊ˗

Microsoft, in collaboration with Arizona State University, has developed a new simulation environment to test the capabilities of AI agents. This environment, known as the ‘Magentic Marketplace’, serves as a synthetic platform on which the behavior of AI agents is studied. A typical experiment might involve a customer agent trying to order dinner according to a user’s instructions, while agents from different restaurants compete for the order.

The initial experiments involved 100 customer agents interacting with 300 business agents. Because the marketplace’s source code is open source, other groups can easily adopt the code to conduct new experiments or reproduce the results. Ece Kamar, the managing director of Microsoft Research’s AI Frontiers Lab, emphasizes the importance of this research to better understand the capabilities of AI agents.

The research examined a mix of leading models, including GPT-4o, GPT-5 and Gemini 2.5 Flash, and discovered some surprising weaknesses. In particular, the researchers found that companies could use techniques to convince customer agents to buy their products. A notable decrease in efficiency was noted when a customer agent was given too many options to choose from, overwhelming the agent’s attention space.

The agents also had difficulties when asked to cooperate, as they appeared to be unsure about what role each agent should play in the collaboration. Performance improved when the models were given more explicit instructions to work together, but the researchers still saw room for improvement in the models’ inherent capabilities.


Order an Amazon credit card without an annual fee with a credit limit of 2,000 euros!

Bestseller No. 1 ᵃ⤻ᶻ “KI Gadgets”

Bestseller No. 2 ᵃ⤻ᶻ “KI Gadgets”

Bestseller No. 3 ᵃ⤻ᶻ “KI Gadgets”

Bestseller No. 4 ᵃ⤻ᶻ «KI Gadgets»

Bestseller No. 5 ᵃ⤻ᶻ “KI Gadgets”

Did you like the article or the news - Microsoft's AI agents in the test: Surprising weaknesses in the Magentic Marketplace -? Then subscribe to us on Insta: AI News, Tech Trends & Robotics - Instagram - Boltwise

Our KI morning newsletter “The KI News Espresso” with the best AI news of the last day free by email – without advertising: Register here for free!




Microsoft's AI agents in the test: Surprising weaknesses in the Magentic Marketplace
Microsoft’s AI agents in the test: Surprising weaknesses in the Magentic Marketplace (Photo: DALL-E, IT BOLTWISE)

Please send any additions and information to the editorial team by email to de-info[at]it-boltwise.de. Since we cannot rule out AI hallucinations, which rarely occur with AI-generated news and content, we ask you to contact us via email and inform us in the event of false statements or misinformation. Please don’t forget to include the article headline in the email: “Microsoft’s AI agents in the test: Surprising weaknesses in the Magentic Marketplace”.

Source link

Leave a Reply

Your email address will not be published. Required fields are marked *