Chatbots for dead, endangered, and extinct languages are being developed at the FHNW School of Business. One well-known example is @llegra, a chatbot for Vallader. Oliver Bendel recently tested the reach of GPTs for endangered languages such as Irish (Irish Gaelic), Maori, and Basque. According to ChatGPT, there is a relatively large amount of training material for them. On May 12, 2024 – after Irish Girl and Maori Girl – a first version of Adelina, a chatbot for Basque, was created. It was later improved in a second version. As part of the kAIxo project (the Basque “kaixo” corresponds to the english “hello”), the chatbot or voice assistant kAIxo is to be developed that speaks Basque. The purpose is to keep users practicing written or spoken language or to develop the desire to learn the endangered language. The chatbot should be based on a Large Language Model (LLM). Both prompt engineering and fine-tuning are conceivable for customization. Retrieval Augmented Generation (RAG) can play a central role. The result will be a functioning prototype. Nicolas Lluis Araya, a student of business informatics, has been recruited to implement the project. The kick-off meeting will take place on September 3, 2024.
Teaching and Learning with GPTs
In the spring semester of 2024, Prof Dr Oliver Bendel integrated virtual tutors into his teaching. These were ‘custom versions of ChatGPT’, so-called GPTs. Social Robotics Girl was available for the elective modules on social robotics, created in November 2023, and Digital Ethics Girl from February 2024 for the compulsory modules “Ethik und Recht” and ‘Ethics and Law’ within the Wirtschaftsinformatik and Business Information Technology degree programmes (FHNW School of Business) and “Recht und Ethik” within Geomatics (FHNW School of Architecture, Construction and Geomatics). The virtual tutors have the “world knowledge” of GPT-4, but also the specific expertise of the technology philosopher and business information scientist from Zurich. It has been shown that the GPTs can provide certain impulses and loosen up the lessons. They show their particular strength in group work, where students no longer have to consult their lecturer’s books – which is hardly useful when there is a lot of time pressure – but can ask them specific questions. Last but not least, there are opportunities for self-regulated learning. The paper “How Can GenAI Foster Well-being in Self-regulated Learning?” by Stefanie Hauske and Oliver Bendel was published in May 2024 – it was submitted to the AAAI Spring Symposia in December 2023 and presented at Stanford University at the end of March 2024.
Maori Girl Can Speak and Write Maori
Conversational agents have been the subject of Prof. Dr. Oliver Bendel’s research for a quarter of a century. He dedicated his doctoral thesis at the University of St. Gallen from the end of 1999 to the end of 2022 to them – or more precisely to pedagogical agents, which would probably be called virtual learning companions today. He has been a professor at the FHNW School of Business since 2009. From 2012, he mainly developed chatbots and voice assistants in the context of machine ethics, including GOODBOT, LIEBOT, BESTBOT, and SPACE THEA. In 2022, the information systems specialist and philosopher of technology then turned his attention to dead and endangered languages. Under his supervision, Karim N’diaye developed the chatbot @ve for Latin and Dalil Jabou the chatbot @llegra for Vallader, an idiom of Rhaeto-Romanic, enhanced with voice output. He is currently testing the range of GPTs – “customized versions of ChatGPT”, as OpenAI calls them – for endangered languages such as Irish (Irish Gaelic), Maori, and Basque. According to ChatGPT, there is a relatively large amount of training material for them. On May 9, 2024 – one week after Irish Girl – a first version of Maori Girl was created. At first glance, it seems to have a good grasp of the Polynesian language of the indigenous people of New Zealand. You can have the answers translated into English or German. Maori Girl is available in the GPT Store and will be further improved over the next few weeks
Saving Languages with Language Models
On February 19, 2024, the article “@llegra: a chatbot for Vallader” by Oliver Bendel and Dalil Jabou was published in the International Journal of Information Technology. From the abstract: “Extinct and endangered languages have been preserved primarily through audio conservation and the collection and digitization of scripts and have been promoted through targeted language acquisition efforts. Another possibility would be to build conversational agents like chatbots or voice assistants that can master these languages. This would provide an artificial, active conversational partner which has knowledge of the vocabulary and grammar and allows one to learn with it in a different way. The chatbot, @llegra, with which one can communicate in the Rhaeto-Romanic idiom Vallader was developed in 2023 based on GPT-4. It can process and output text and has voice output. It was additionally equipped with a manually created knowledge base. After laying the conceptual groundwork, this paper presents the preparation and implementation of the project. In addition, it summarizes the tests that native speakers conducted with the chatbot. A critical discussion elaborates advantages and disadvantages. @llegra could be a new tool for teaching and learning Vallader in a memorable and entertaining way through dialog. It not only masters the idiom, but also has extensive knowledge about the Lower Engadine, that is, the area where Vallader is spoken. In conclusion, it is argued that conversational agents are an innovative approach to promoting and preserving languages.” Oliver Bendel has been increasingly focusing on dead, extinct and endangered languages for some time. He believes that conversational agents can help to strengthen and save them.
A Conversational Agent as a Superhero
Researchers at the University of Washington have developed a web app to help children develop skills such as self-awareness and emotional management. They have published their findings in their paper “Self-Talk with Superhero Zip: Supporting Children’s Socioemotional Learning with Conversational Agents”. From the abstract: “Here, we examine whether children can learn to use a socioemotional strategy known as ‘self-talk’ from a conversational agent (CA). To investigate this question, we designed and built ‘Self-Talk with Superhero Zip,’ an interactive CA experience, and deployed it for one week in ten family homes to pairs of siblings between the ages of five and ten … We found that children could recall and accurately describe the lessons taught by the intervention, and we saw indications of children applying self-talk in daily life.” (Fu et al. 2023) The paper can be downloaded at dl.acm.org/doi/abs/10.1145/3585088.3589376 (Image: DALL-E 3).
Talking with Social Robotics Girl
On November 6, 2023, OpenAI made so-called GPTs available to ChatGPT Plus users. According to the US company, anyone can easily create his or her own GPT without any programming knowledge. Initial tests have shown the performance of the new function. ChatGPT suggests a name for the chatbot, creates the profile picture and accepts documents with text and reference lists to expand its knowledge of the topic. The function is ideal for creating your own learning companions, modern educational agents so to speak. But you can also benefit from chatbots from other users and providers. A GPT called Social Robotics Girl, which provides information about social robotics, has been available since November 12, 2023. It was created by Prof. Dr. Oliver Bendel and is based on a collection of his articles on this topic. It can therefore give his definition of social robots and make classifications based on his five-dimension model. ChatGPT Plus users can access Social Robotics Girl via chat.openai.com/g/g-TbhZSZaer-social-robotics-girl (Image: DALL-E 3).
@llegra, a Chatbot for Vallader
Conversational agents have been a research subject of Prof. Dr. Oliver Bendel for a quarter of a century. He dedicated his doctoral thesis at the University of St. Gallen to them. At the School of Business FHNW, he developed them with his changing teams from 2012 to 2022, primarily in the context of machine ethics and social robotics. The philosopher of technology now devotes himself increasingly to dead, extinct, and endangered languages. After @ve (2022), a chatbot for Latin based on GPT-3, another project started in March 2023. The chatbot @llegra is developed by Dalil Jabou for the Rhaeto-Romanic idiom Vallader, which occurs in the Lower Engadine between Martina in the northeast and Zernez in the southwest, as well as in Val Müstair. The user can type text and gets text output. In addition, @llegra speaks with the help of a text-to-speech system from the company SlowSoft, which supports the project. The GPT-3 speech model produced rather unsatisfactory results. The breakthrough then came with the use of GPT-4. The knowledge base was supplemented with the help of four children’s books on Vallader. The project will be completed in August 2023. The results will be published thereafter.
The @ve Project
On January 19, 2023, the final presentation was held for the @ve project, which started in September 2022. The chatbot runs on the website www.ave-bot.ch and on Telegram. Like ChatGPT, it is based on GPT-3 from OpenAI (@ve is not GPT-3.5, but GPT-3.0). The project was initiated by Prof. Dr. Oliver Bendel, who wants to devote more time to dead, extinct, and endangered languages. @ve was developed by Karim N’diaye, who studied business informatics at the Hochschule für Wirtschaft FHNW. You can talk to her in Latin, i.e. in a dead language that thus comes alive in a way, and ask her questions about grammar. It was tested by a relevant expert. One benefit, according to Karim N’diaye, is that you can communicate in Latin around the clock, thinking about what and how to write. One danger, he says, is that there are repeated errors in the answers. For example, sometimes the word order is not correct. In addition, it is possible that the meaning is twisted. This can happen with a human teacher, and the learner should always be alert and look for errors. Without a doubt, @ve is a tool that can be profitably integrated into Latin classes. There, students can report what they have experienced with it at home, and they can have a chat with it on the spot, alone or in a group, accompanied by the teacher. A follow-up project on an endangered language has already been announced (Illustration: Karim N’diaye/Unsplash).
Ethics of Conversational Agents
The Ethics of Conversational User Interfaces workshop at the ACM CHI 2022 conference “will consolidate ethics-related research of the past and set the agenda for future CUI research on ethics going forward”. “This builds on previous CUI workshops exploring theories and methods, grand challenges and future design perspectives, and collaborative interactions.” (CfP CUI) From the Call for Papers: “In what ways can we advance our research on conversational user interfaces (CUIs) by including considerations on ethics? As CUIs, like Amazon Alexa or chatbots, become commonplace, discussions on how they can be designed in an ethical manner or how they change our views on ethics of technology should be topics we engage with as a community.” (CfP CUI) Paper submission deadline is 24 February 2022. The workshop is scheduled to take place in New Orleans on 21 April 2022. More information is available via www.conversationaluserinterfaces.org/workshops/CHI2022/.
Conversational Agent as Trustworthy Autonomous System
The Dagstuhl seminar “Conversational Agent as Trustworthy Autonomous System (Trust-CA)” will take place from September 19 – 24, 2021. According to the website, Schloss Dagstuhl – Leibniz-Zentrum für Informatik “pursues its mission of furthering world class research in computer science by facilitating communication and interaction between researchers”. Organizers of this event are Asbjørn Følstad (SINTEF – Oslo), Jonathan Grudin (Microsoft – Redmond), Effie Lai-Chong Law (University of Leicester) and Björn Schuller (University of Augsburg). They outline the background as follows: “CA, like many other AI/ML-infused autonomous systems, need to gain the trust of their users in order to be deployed effectively. Nevertheless, in the first place, we need to ensure that such systems are trustworthy. Persuading users to trust a non-trustworthy CA is grossly unethical. Conversely, failing to convince users to trust a trustworthy CA that is beneficial to their wellbeing can be detrimental, given that a lack of trust leads to low adoption or total rejection of a system. A deep understanding of how trust is initially built and evolved in human-human interaction (HHI) can shed light on the trust journey in human-automation interaction (HAI). This calls forth a multidisciplinary analytical framework, which is lacking but much needed for informing the design of trustworthy autonomous systems like CA.” (Website Dagstuhl) Regarding the goal of the workshop, the organizers write: “The overall goal of this Dagstuhl Seminar is to bring together researchers and practitioners, who are currently engaged in diverse communities related to Conversational Agent (CA) to explore the three main challenges on maximising the trustworthiness of and trust in CA as AI/ML-driven autonomous systems – an issue deemed increasingly significant given the widespread uses of CA in every sector of life – and to chart a roadmap for the future research on CA.” (Website Dagstuhl) Oliver Bendel (School of Business FHNW) will talk about his chatbot and voice assistant projects. These emerge since 2013 from machine ethics and social robotics. Further information is available here (photo: Schloss Dagstuhl).