首页 News 正文

Alibaba takes action! "Breaking through the global bottom price"

Le174
1264 0 0

On May 21st, the reporter learned from Alibaba Cloud that the API input price of the Qwen Long, the main model of the Tongyi Qianwen GPT-4, has decreased from 0.02 yuan/thousand tokens to 0.0005 yuan/thousand tokens, a direct decrease of 97%. This means that 1 yuan can buy 2 million tokens, which is equivalent to the amount of text in 5 New China Dictionary books. This model supports up to 10 million tokens of long text input, and after a price reduction, it is approximately 1/400 of the GPT-4 price.
At the Wuhan AI Leaders Summit held on the 21st, Liu Weiguang, Senior Vice President of Alibaba Cloud Intelligent Group and President of Public Cloud Business Unit, said, "As China's largest cloud computing company, Alibaba Cloud has significantly reduced the price of large model inference this time in order to accelerate the explosion of AI applications. We expect the number of calls to large model APIs to increase by thousands of times in the future."
Liu Weiguang described the new changes in Alibaba's Tongyi Qianwen as "breaking through the global bottom price and accelerating the AI outbreak".
The price reduction covers a total of 9 commercial and open-source series models
It is reported that the price reduction of Tongyi Qianwen this time covers a total of 9 commercial and open source series models, including Qwen Long, Qwen Max, Qwen 1.5-72B, etc. Among them, the main model of Tongyi Qianwen is Qwen Long, with a maximum context length of tens of millions. The API input price has decreased from 0.02 yuan/thousand tokens to 0.0005 yuan/thousand tokens, a decrease of 97%; The flagship model Qwen Max, just released, has caught up with GPT-4 Turbo in terms of performance on the authoritative benchmark OpenCompass. Its API input price has dropped to 0.04 yuan/thousand tokens, a decrease of 67%.
Among them, the main model Qwen Long performs against GPT-4, can handle ultra long contextual scenarios, supports input in different languages such as Chinese and English, and supports ultra long contextual conversations up to 10 million tokens (approximately 15 million words or 15000 pages of documents). The document service launched synchronously with the Alibaba Cloud Bailian platform supports parsing and dialogue in various document formats such as Word, PDF, Markdown, EPUB, and Mobi.
Public cloud+API will become the mainstream way for enterprises to use large models
As the performance of large models gradually improves, AI application innovation is entering a period of intensive exploration, but high inference costs remain a key factor restricting the large-scale application of large models.
Unlike private deployment, cloud based invocation provides greater space for cost reduction and efficiency enhancement of large models. In general, private deployment of open-source models requires self built clusters, taking into account multiple cost factors such as hardware procurement, software deployment, network costs, electricity costs, hardware depreciation, and manpower. If computing resources are idle or overloaded, additional costs need to be paid; Calling the big model API on the cloud truly achieves on-demand and on-demand use.
Liu Weiguang described the new changes in Alibaba's Tongyi Qianwen as "breaking through the global bottom price and accelerating the AI outbreak".
He stated that whether it is an open source model or a commercial model, public cloud+API will become the mainstream way for enterprises to use large models, mainly for three reasons:
Firstly, the technological dividends and economies of scale of public clouds bring enormous cost and performance advantages. Alibaba Cloud can continuously optimize from both the model itself and the AI infrastructure, pursuing the ultimate inference cost and performance. Alibaba Cloud has built an extremely elastic AI computing power scheduling system based on self-developed core technologies and products such as heterogeneous chip interconnection, high-performance network HPN7.0, high-performance storage CPFS, and artificial intelligence platform PAI. Combined with the Bailian distributed inference acceleration engine, it significantly reduces model inference costs and accelerates model inference speed.
That is to say, even for the same open-source model, the call price on public clouds is far lower than that of private deployment. Taking the Qwen-72B open-source model and a monthly usage of 100 million tokens as an example, directly calling the API on Alibaba Cloud Bailian only costs 600 yuan per month, and the average monthly cost of private deployment exceeds 10000 yuan.
The second is that the cloud is more convenient for multiple model calls and provides enterprise level data security protection. Alibaba Cloud can provide a dedicated VPC environment for each enterprise, achieving computation isolation, storage isolation, network isolation, and data encryption, fully ensuring data security. At present, Alibaba Cloud has led or deeply participated in the formulation of more than ten international and domestic technical standards related to large model security.
The third is the natural openness of cloud vendors, which can provide developers with the richest models and toolchains. On the Alibaba Cloud Bailian platform, hundreds of high-quality models from both domestic and international markets, such as Tongyi, Baichuan, ChatGLM, and Llama series, are gathered. The platform is equipped with a large model customization and application development toolchain, allowing developers to easily test and compare different models, develop exclusive large models, and easily build RAG and other applications. From selecting models, adjusting models, building applications, to providing external services, it's a one-stop solution.
According to the latest data, the Tongyi Big Model has served over 90000 enterprises through Alibaba Cloud and over 2.2 million enterprises through DingTalk services. It has been applied in fields such as PC, mobile phones, automobiles, aviation, astronomy, mining, education, healthcare, catering, gaming, and cultural tourism.
On May 9th, Xiaomi's artificial intelligence assistant "Xiaoai Classmate" reached a cooperation agreement with Alibaba Cloud Tongyi Big Model to strengthen its multimodal AI generation capabilities in image generation, image understanding, and other aspects, and has been implemented on various types of devices such as Xiaomi cars and mobile phones. In addition, companies such as Weibo, Zhongan Insurance, and Perfect World Games have also announced the integration of the Tongyi Big Model and its application in social media, insurance, gaming, and other fields.
LogoMoney.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   美股市场:纽约股市三大股指4月30日涨跌不一。截至当天收盘,道琼斯工业平均指数比前一交易日上涨141.74点,收于40669.36点,涨幅为0.35%;标准普尔500种股票指数上涨8.23点,收于5569.06点,涨幅为0.15%;纳斯 ...
    joey791216
    昨天 11:57
    支持
    反对
    回复
    收藏
  •   美国总统特朗普近日在接受媒体采访时表示,他第二个任期不仅治理美国,也治理全世界。   特朗普于4月24日接受了《大西洋》(The Atlantic)月刊采访,这段专访于4月28日发布。   “第一次当总统时,我要做两 ...
    lfancn
    前天 12:10
    支持
    反对
    回复
    收藏
  •   东风有限回应武汉工厂关停事宜   据第一财经,4月29日,东风汽车有限公司证实,该公司武汉工厂目前正常运行,后续也不会关停。东风有限称,该公司将在东风与日产母公司的支持下平稳有序发展,持续加速向新能源 ...
    king19831101
    前天 09:56
    支持
    反对
    回复
    收藏
  •   当地时间周四,美股三大股指集体收涨,其中道指和标普500指数实现“八连涨”。不过,三大股指均在尾盘出现小幅跳水。   苹果、亚马逊于周四美股盘后公布了最新业绩,尽管业绩有所超出预期,但仍有令市场不满 ...
    jiangu12
    11 小时前
    支持
    反对
    回复
    收藏
Le174 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    3