首页 News 正文

What is the mysterious new product of Google OpenAI? Latest Speculation: Multimodal AI Assistant

男人的余味偷
1304 0 0

OpenAI is determined to launch a live broadcast and launch new products the day before Google I/O Conference, demonstrating the magical updates of ChatGPT and GPT-4.
What is this mysterious new product? The speculation about GPT-5 and search engines has been personally overturned by OpenAI CEO Altman.
According to the latest reports, the AI assistant built into the phone may be a product that OpenAI is about to release.
Technology media The Information cited insiders as saying that OpenAI plans to launch a multimodal AI model that has visual and auditory functions, can communicate with you, recognize objects, and has better logical reasoning ability than current chatbots. OpenAI has already demonstrated this model to some clients.
OpenAI has developed models that can transcribe audio and text to speech. The report states that the new model is equivalent to a combination of these models, but is more accurate and responsive faster. The new model can help AI assistants distinguish mood, better understand semantics, and theoretically, it can help students learn mathematics or translate real-world gestures.
However, although the new model can surpass the GPT-4 Turbo in answering certain types of questions, there are still hallucinations.
According to developer Anany Arora, OpenAI may launch a service with built-in ChatGPT function on mobile phones for making phone calls. Arora posted screenshots of the above call related code on social media and also found evidence that OpenAI has been configured as a server for real-time audio and video communication.
Using artificial intelligence to make phone calls can save users waiting time, and this service can be seen as one of the functions of AI assistants.
The AI assistant is also a feature that Google has been developing. It is reported that Google Pixel 9 series phones will have a brand new exclusive AI assistant "Pixie" built-in, which can view items through the device's camera and perform operations such as indicating the place of purchase or providing instructions for using the items.
Altman previously revealed in an interview with Salesforce CEO Marc Benioff that his favorite AI movie is "She," a story about a man falling in love with his AI virtual assistant. "The idea of a dialogue language interface has incredible foresight."
The Information reported that Altman hopes to ultimately develop a virtual assistant that can respond quickly, similar to the AI assistant in the movie, and support existing voice assistants such as Apple Siri with this technology.
It is worth noting that according to insiders, Apple is about to reach an agreement with OpenAI to introduce the latter's technology on the new generation iOS operating system. Both parties have been finalizing the terms of an agreement to use the ChatGPT feature in Apple's next-generation iPhone operating system iOS 18.
The new model relies on the cloud for operation and is expected to be included in the free version of ChatGPT in the future
OpenAI believes that AI assistants with visual and auditory capabilities may bring about changes like smartphones. It can observe the environmental information of users, provide suggestions, and potential use cases such as acting as a tutor, translating traffic signs, repairing cars, and so on.
But similar technologies currently require too high hardware barriers to run on personal devices. Media analysis points out that the new model depends on the cloud to run and requires Internet connection to work. It may take several months or even years to make complex artificial intelligence conversations with visual and auditory functions small enough to run on personal devices such as smartphones.
It is currently unclear when OpenAI will provide these new features to paying customers, but according to people who have tried the voice assistant, OpenAI's ultimate plan is to include these features in the free version of ChatGPT, with the goal of lower operating costs than its state-of-the-art model GPT-4 Turbo.
OpenAI did not respond to the above speculation.
What will OpenAI ultimately launch? The answer will be revealed next week, and OpenAI has announced that it will live stream on its official website at 10am Pacific Time on May 13th (1am Beijing Time on May 14th), showcasing some updates to ChatGPT and GPT-4.
LogoMoney.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   困扰开发者多年的“苹果税”迎来松动的契机。   日前,美国地方法院作出裁决,要求苹果支持移动开发者将用户引导至第三方支付平台消费,这意味着此后iOS开发者将不再受“苹果税”的制约,可以直接推广并将用户 ...
    abc691001
    昨天 10:58
    支持
    反对
    回复
    收藏
  •   美股市场:美股三大指数5日集体下跌,道指、标普终结9连涨。截至当天收盘,道琼斯工业平均指数比前一交易日下跌98.60点,收于41218.83点,跌幅为0.24%;标准普尔500种股票指数下跌36.29点,收于5650.38点,跌幅 ...
    jgserver
    昨天 11:04
    支持
    反对
    回复
    收藏
  •   据媒体援引消息人士报道,如果当前的贸易谈判未能带来令人满意的结果,欧盟计划对约1000亿欧元(约合1130亿美元)的美国产品加征额外关税。   知情人士称,拟议的报复措施最早将于周三与成员国分享,并将进行 ...
    benhao
    26 分钟前
    支持
    反对
    回复
    收藏
  •   北京时间5月3日晚,“股神”沃伦·巴菲特(Warren Buffett)旗下伯克希尔·哈撒韦公司(Berkshire Hathaway)2025年年度股东大会,在美国内布拉斯加州的奥马哈市举办。   在这场一年一度的“投资界春晚”上, ...
    romuvic
    前天 12:15
    支持
    反对
    回复
    收藏
男人的余味偷 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    2