The Enhancement of Bixby

The Korean company Samsung Electronics announced new updates to its voice assistant Bixby that are designed to improve user experience, performance, and capabilities of the intelligent assistant and platform. One of the most interesting innovations concerns the voice of the users. According to Samsung, users “can personalize their Bixby Text Call voice”. “Using the new Bixby Custom Voice Creator, users can record different sentences for Bixby to analyze and create an AI generated copy of their voice and tone. Currently available in Korean, this generated voice is planned to be compatible with other Samsung apps beyond phone calls” (Samsung, 22 February 2023). As early as 2017, Oliver Bendel wrote with respect to Adobe VoCo: “Today, just a few minutes of samples are enough to be able to imitate a speaker convincingly in all kinds of statements.” In his article “The synthetization of human voices”, published in AI & Society, he also discussed ethical considerations. Now there seems to be a recognized market for such applications and they are being rolled out more widely.

The Old, New Neons

The company Neon picks up an old concept with its Neons, namely that of avatars. Twenty years ago, Oliver Bendel distinguished between two types of avatar in the Lexikon der Wirtschaftsinformatik. With reference to the second, he wrote: “Avatars, on the other hand, can represent any figure with certain functions. Such avatars appear on the Internet – for example as customer advisors and newsreaders – or populate the adventure worlds of computer games as game partners and opponents. They often have an anthropomorphic appearance and independent behaviour or even real characters …” (Lexikon der Wirtschaftsinformatik, 2001, own translation) It is precisely this type that the company, which is part of the Samsung Group and was founded by Pranav Mistry, is now adapting, taking advantage of today’s possibilities. “These are virtual figures that are generated entirely on the computer and are supposed to react autonomously in real time; Mistry spoke of a latency of less than 20 milliseconds.” (Heise Online, 8 January 2019, own translation) The Neons are supposed to show emotions (as do some social robots that are conquering the market) and thus facilitate and strengthen bonds. “The AI-driven character is neither a language assistant a la Bixby nor an interface to the Internet. Instead, it is a friend who can speak several languages, learn new skills and connect to other services, Mistry explained at CES.” (Heise Online, 8 January 2019, own translation)