Google's' Her 'rushes to land OpenAI voice AI still' holds on '
六月清晨搅
发表于 2024-8-14 20:39:29
275
0
0
On the early morning of August 14th Beijing time, Google officially released its intelligent voice assistant Gemini Live at the "Made by Google" conference. This feature directly challenges OpenAI's GPT-4o voice mode and marks another step towards more natural, universal, and user-friendly AI interaction.
According to Google, users can have free and smooth conversations with Gemini Live instead of relying on traditional input and output settings.
During the conversation, users can interrupt to inquire about more details or pause for a period of time before resuming.
In order to make conversations more natural, Google also offers ten voices for users to choose from. Google said, "It's like having a companion in your pocket that you can talk to about new ideas or practice important conversations with
The GPT-4o advanced voice mode previously released by Open AI also allows users to interrupt during conversations and perceive and respond to emotional fluctuations. In terms of voice settings, Open AI offers four types of voices, all produced in collaboration with professional voice actors.
In addition, Google will also connect Gemini Live with other applications and tools. Google has announced that it will launch extension features such as Keep, Tasks, Utilities, Calendar, YouTube Music, etc. in the coming weeks.
Google described the specific application scenarios of these features. For example, if a user needs to host a dinner party, Gemini Live can find specific recipes and add ingredients to the Keep shopping list, as well as customize a playlist that "reminds people of the late 1990s"; For example, by taking a photo of a concert poster, Gemini Live can answer whether the user is available on the day and remind them to buy tickets.
However, during the live demonstration of Gemini Live features at the "Made by Google" conference, there was a small incident. Google executive Dave Citron asked Gemini Live if there were any events on his schedule, but he tried Gemini Live twice in a row without any response until he changed his device for the third time before successfully demonstrating.
Currently, Google has provided an English version to Gemini premium subscribers on Android phones and will expand to iOS in the coming weeks, offering more language modes. The latest Pixel 9 series phones released by Google also feature Gemini Live functionality.
Industry insiders believe that the release of Gemini Live is an important milestone in the development of artificial intelligence interaction. By introducing voice interruption and selection functions, Google is not only competing with OpenAI, but also promoting human-computer interaction, thereby changing the competitive landscape of the artificial intelligence chatbot market and forcing other companies to create more natural, practical, and attractive artificial intelligence assistants.
At the same time, the innovative development of human-computer interaction has also brought new problems and challenges. For example, how will artificial intelligence quickly handle topic changes while maintaining contextual unity and relevance? How to handle interference information without losing important clues? More importantly, with the deepening development of artificial intelligence, where is its boundary with real life?
However, GPT-4o, which OpenAI publicly introduced three months ago, has not yet been fully implemented. On August 9th, OpenAI released a blog post about security, detailing the company's security efforts in developing GPT-4o and exploring the potential risks these technologies may pose to society.
OpenAI pointed out in the report the risks that artificial intelligence's humanoid social model may pose. OpenAI believes that users may establish social relationships with artificial intelligence and reduce the need for human interaction. This is beneficial for lonely individuals, but it can affect healthy interpersonal relationships.
OpenAI revealed that during the early testing of GPT-4o, they observed subtle changes in the interaction language between users and models, such as "This is our last day together" and so on. This seemingly harmless expression may hide bigger problems behind it.
In addition, OpenAI also mentioned that GPT-4o sometimes unintentionally generates outputs that mimic user voices, which means that AI speech engines may be used for fraud.
And these security issues are also one of the reasons why OpenAI controls the landing pace of GPT-4o. As for whether Google Gemini Live has addressed similar security risks, it has not been disclosed.
All security related risks, whether we are aware of them or the additional possibilities attached to Pandora's Box, are issues that need to be further addressed in the field of artificial intelligence to ensure that technological progress serves humanity.
LogoMoney.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- OpenAI, 코인베이스 전 임원 최고마케팅책임자로 영입
- OpenAI宣布!12天12场新品发布会
- 谷歌狙击OpenAI 集中火力猛攻AI智能体
- Google Snipes OpenAI, Concentrates Fire on Attacking AI Agents
- グーグル、OpenAI集中火力を狙撃しAIエージェントを猛攻
- 구글, OpenAI 저격, AI 지능체 맹공 화력 집중
- 阿根廷三季度摆脱严重经济衰退
- Google launches a reasoning model similar to OpenAI o1
- グーグル、OpenAI o 1に似た推理モデルを発売
- 구글, OpenAI o1과 비슷한 추리 모델 출시
-
美股市场:纽约股市三大股指4月30日涨跌不一。截至当天收盘,道琼斯工业平均指数比前一交易日上涨141.74点,收于40669.36点,涨幅为0.35%;标准普尔500种股票指数上涨8.23点,收于5569.06点,涨幅为0.15%;纳斯 ...
- joey791216
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
当地时间周四,美股三大股指集体收涨,其中道指和标普500指数实现“八连涨”。不过,三大股指均在尾盘出现小幅跳水。 苹果、亚马逊于周四美股盘后公布了最新业绩,尽管业绩有所超出预期,但仍有令市场不满 ...
- jiangu12
- 前天 10:28
- 支持
- 反对
- 回复
- 收藏
-
5月2日,全球电商巨头亚马逊公布了2025年第一季度财报。亚马逊第一季度净销售额为1556.67亿美元,较2024年第一季度同比增长9%;净利润为171.27亿美元,较2024年第一季度增长64%;每股摊薄收益1.59美元,较上年同 ...
- 独品金莲芳
- 昨天 10:16
- 支持
- 反对
- 回复
- 收藏
-
周三热门中概股涨跌不一。纳斯达克中国金龙指数(HXC)收跌0.95%。 上涨股当中(按市值从高到低),台积电涨1.34%,阿里巴巴涨0.46%,拼多多涨1.36%,网易涨0.66%,中华电信涨1.33%,理想汽车涨0.91%,日月 ...
- 蓝蓝的彩
- 3 天前
- 支持
- 反对
- 回复
- 收藏