Popular AI model performance benchmark may be flawed, Meta researchers warn

  • Posted on September 9, 2025
  • By South China Morning Post
  • 2 Views
Popular AI model performance benchmark may be flawed, Meta researchers warn

‘We’ve identified multiple loopholes with SWE-bench Verified,’ the manager at Meta Platforms’ AI research lab Fair says.
continue reading...

Author
South China Morning Post

You May Also Like