首页 News 正文

What is the mysterious new product of Google OpenAI? Latest Speculation: Multimodal AI Assistant

男人的余味偷
1302 0 0

OpenAI is determined to launch a live broadcast and launch new products the day before Google I/O Conference, demonstrating the magical updates of ChatGPT and GPT-4.
What is this mysterious new product? The speculation about GPT-5 and search engines has been personally overturned by OpenAI CEO Altman.
According to the latest reports, the AI assistant built into the phone may be a product that OpenAI is about to release.
Technology media The Information cited insiders as saying that OpenAI plans to launch a multimodal AI model that has visual and auditory functions, can communicate with you, recognize objects, and has better logical reasoning ability than current chatbots. OpenAI has already demonstrated this model to some clients.
OpenAI has developed models that can transcribe audio and text to speech. The report states that the new model is equivalent to a combination of these models, but is more accurate and responsive faster. The new model can help AI assistants distinguish mood, better understand semantics, and theoretically, it can help students learn mathematics or translate real-world gestures.
However, although the new model can surpass the GPT-4 Turbo in answering certain types of questions, there are still hallucinations.
According to developer Anany Arora, OpenAI may launch a service with built-in ChatGPT function on mobile phones for making phone calls. Arora posted screenshots of the above call related code on social media and also found evidence that OpenAI has been configured as a server for real-time audio and video communication.
Using artificial intelligence to make phone calls can save users waiting time, and this service can be seen as one of the functions of AI assistants.
The AI assistant is also a feature that Google has been developing. It is reported that Google Pixel 9 series phones will have a brand new exclusive AI assistant "Pixie" built-in, which can view items through the device's camera and perform operations such as indicating the place of purchase or providing instructions for using the items.
Altman previously revealed in an interview with Salesforce CEO Marc Benioff that his favorite AI movie is "She," a story about a man falling in love with his AI virtual assistant. "The idea of a dialogue language interface has incredible foresight."
The Information reported that Altman hopes to ultimately develop a virtual assistant that can respond quickly, similar to the AI assistant in the movie, and support existing voice assistants such as Apple Siri with this technology.
It is worth noting that according to insiders, Apple is about to reach an agreement with OpenAI to introduce the latter's technology on the new generation iOS operating system. Both parties have been finalizing the terms of an agreement to use the ChatGPT feature in Apple's next-generation iPhone operating system iOS 18.
The new model relies on the cloud for operation and is expected to be included in the free version of ChatGPT in the future
OpenAI believes that AI assistants with visual and auditory capabilities may bring about changes like smartphones. It can observe the environmental information of users, provide suggestions, and potential use cases such as acting as a tutor, translating traffic signs, repairing cars, and so on.
But similar technologies currently require too high hardware barriers to run on personal devices. Media analysis points out that the new model depends on the cloud to run and requires Internet connection to work. It may take several months or even years to make complex artificial intelligence conversations with visual and auditory functions small enough to run on personal devices such as smartphones.
It is currently unclear when OpenAI will provide these new features to paying customers, but according to people who have tried the voice assistant, OpenAI's ultimate plan is to include these features in the free version of ChatGPT, with the goal of lower operating costs than its state-of-the-art model GPT-4 Turbo.
OpenAI did not respond to the above speculation.
What will OpenAI ultimately launch? The answer will be revealed next week, and OpenAI has announced that it will live stream on its official website at 10am Pacific Time on May 13th (1am Beijing Time on May 14th), showcasing some updates to ChatGPT and GPT-4.
LogoMoney.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   美股市场:纽约股市三大股指4月30日涨跌不一。截至当天收盘,道琼斯工业平均指数比前一交易日上涨141.74点,收于40669.36点,涨幅为0.35%;标准普尔500种股票指数上涨8.23点,收于5569.06点,涨幅为0.15%;纳斯 ...
    joey791216
    3 天前
    支持
    反对
    回复
    收藏
  •   当地时间周四,美股三大股指集体收涨,其中道指和标普500指数实现“八连涨”。不过,三大股指均在尾盘出现小幅跳水。   苹果、亚马逊于周四美股盘后公布了最新业绩,尽管业绩有所超出预期,但仍有令市场不满 ...
    jiangu12
    前天 10:28
    支持
    反对
    回复
    收藏
  •   5月2日,全球电商巨头亚马逊公布了2025年第一季度财报。亚马逊第一季度净销售额为1556.67亿美元,较2024年第一季度同比增长9%;净利润为171.27亿美元,较2024年第一季度增长64%;每股摊薄收益1.59美元,较上年同 ...
    独品金莲芳
    昨天 10:16
    支持
    反对
    回复
    收藏
  •   周三热门中概股涨跌不一。纳斯达克中国金龙指数(HXC)收跌0.95%。   上涨股当中(按市值从高到低),台积电涨1.34%,阿里巴巴涨0.46%,拼多多涨1.36%,网易涨0.66%,中华电信涨1.33%,理想汽车涨0.91%,日月 ...
    蓝蓝的彩
    3 天前
    支持
    反对
    回复
    收藏
男人的余味偷 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    2