PANews, February 27: According to Cointelegraph, the open-source AI laboratory Sentient has announced the launch of Arena, a production-grade testing environment for evaluating how AI agents perform in enterprise workflows. The digital asset departments of Pantera Capital and Franklin Templeton have joined Arena’s initial testing group.
Sentient stated that Arena is not a static model test. Instead, it simulates enterprise conditions, including long documents, incomplete information, and conflicting sources, and runs AI agents through standardized tasks. The platform tracks failure categories such as hallucinations, missing evidence, citation errors, and reasoning flaws to help developers diagnose issues. Arena plans to publish comparative performance metrics on a public leaderboard and to release test reports summarizing common failure modes and their solutions.