首页 News 正文

The UK is the first to release AI model security detection tools to identify and evaluate related risks

阿豆学长长ov
339 0 0

Caixin News, May 12th (Editor Niu Zhanlin) - The UK Artificial Intelligence (AI) Security Research Institute released a new testing platform on Friday aimed at strengthening the monitoring of security risks in advanced AI models.
It is reported that the toolbox is called Inspect and can be used to evaluate AI models in a range of fields, including their core knowledge, reasoning ability, and autonomy. By releasing under an open source license, this means that Inspect can be used for free by the global AI community.
Last October, the UK announced the establishment of the Artificial Intelligence Security Research Institute, which will research and test new AI models; In February of this year, the UK also announced that it will invest over £ 100 million to launch nine new research centers and provide technical training to AI regulatory agencies.
At a press conference, the UK Institute for Artificial Intelligence Security stated that Inspect is a software library that allows testers to evaluate the specific capabilities of individual AI models and then give a score based on the results.
Inspect can be used starting from Friday, which is also the AI security testing platform first launched by nationally supported institutions.
Under the current wave of AI competition, more and more AI models will be launched this year, making it more urgent than ever to promote the development of AI security.
However, it is still quite difficult to benchmark AI models at present, as the most complex AI models today are basically "black boxes", and their infrastructure, training data, and other key details are usually kept confidential by the companies that created them and not disclosed to the public.
So, how did Inspect address this challenge? Mainly through its scalability, it can adapt and accept new testing technologies. The built-in components of Inspect can be enhanced or extended through third-party software packages written in Python.
Inspect consists of three basic components: dataset, solver, and scorer. The dataset is used to evaluate the sample set for testing, the solver is a component that performs actual testing work, and the scorer is responsible for evaluating the work results of the solver, ultimately generating a comprehensive evaluation of the performance of the AI model. This design allows Inspect to flexibly adapt to different testing needs and evaluation standards.
As part of the UK's ongoing leadership in the field of AI security, I have approved the open source Inspect, which demonstrates the UK's unique talent and creativity in innovation and technological development, and consolidates our position as a world leader in this field, according to UK Minister of Science Michelle Doneland.
Ian Hogarth, Chairman of the Institute of Artificial Intelligence Security, claims that successful AI security testing collaboration means having a shared and accessible evaluation method, and we hope that Inspect can become the cornerstone of AI security research, research organizations, and academia.
LogoMoney.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   美股市场:纽约股市三大股指4月30日涨跌不一。截至当天收盘,道琼斯工业平均指数比前一交易日上涨141.74点,收于40669.36点,涨幅为0.35%;标准普尔500种股票指数上涨8.23点,收于5569.06点,涨幅为0.15%;纳斯 ...
    joey791216
    2 小时前
    支持
    反对
    回复
    收藏
  •   美国总统特朗普近日在接受媒体采访时表示,他第二个任期不仅治理美国,也治理全世界。   特朗普于4月24日接受了《大西洋》(The Atlantic)月刊采访,这段专访于4月28日发布。   “第一次当总统时,我要做两 ...
    lfancn
    昨天 12:10
    支持
    反对
    回复
    收藏
  •   东风有限回应武汉工厂关停事宜   据第一财经,4月29日,东风汽车有限公司证实,该公司武汉工厂目前正常运行,后续也不会关停。东风有限称,该公司将在东风与日产母公司的支持下平稳有序发展,持续加速向新能源 ...
    king19831101
    昨天 09:56
    支持
    反对
    回复
    收藏
  •   4月29日凌晨,阿里巴巴开源新一代通义千问模型Qwen3(千问3),参数量为DeepSeek-R1的三分之一,成本大幅下降。据称,该模型性能全面超越R1、OpenAI-o1等领先模型,登顶全球最强开源模型。   千问3是国内首个“ ...
    风雨中行走
    前天 10:32
    支持
    反对
    回复
    收藏
阿豆学长长ov 注册会员
  • 粉丝

    0

  • 关注

    0

  • 主题

    27