In October 2023, we argued that the future of AI would be in “focused large language models” (FLLMs). These are purpose-built language models that target a specific industry, set of languages, or task and that are correspondingly smaller than the large language models (LLMs) being created by OpenAI, Google, Meta, and others. Those massive models – GPT-4o has over 175 billion parameters – are like Swiss Army knives: They are prepared to handle almost any task, from creating a haiku to drawing...
Read More
A common worry about generative AI (GenAI) is that the content that it creates may be subject to copyright claims. Our recent survey of freelance linguists reflected this concern: Copyright issues are their second most important concern with GenAI, with 74% viewing the technology negatively or strongly negatively in this regard. However, an examination of claims about copyright and how GenAI works reveals a different picture. This blog post covers details about how the technology works with its ...
Read More
It is incredible to think that, less than eight years after the first publicly available neural machine translation (NMT) systems appeared on the scene, some media coverage already sees NMT as so 2015. As generative AI (GenAI) really exploded into public view in 2022, it wasn’t surprising that an overactive tech press’ imagination would see it as the be-all and end-all of technology. Our recent survey with freelance linguists certainly reflects this view, with many language workers expressing ...
Read More
National variants of multinational languages such as English, French, and Spanish underscore the challenges to a government or business of providing readable content to any citizen or customer, prospect, or employee. They face what sociolinguists call varieties – that is, dialects, registers, styles, lexicons, and gender conventions – in both written and spoken language. Furthermore, usage and comprehension issues extend deep into any interaction, further affected by payment systems, regulatio...
Read More
The explosion of interest in generative AI technology like ChatGPT has led to general calls for the regulation of artificial intelligence – these will also affect the language industry. Although today it is unclear who would be responsible for a critical mistranslation stemming from MT on a company’s website, CSA Research predicts that within a year some unlucky company will become the textbook case that decides this matter. This post explores how current and proposed regulation of artificial ...
Read More
The word No seems to have been going around a lot in conversations lately. From discussions (welfare, personal space, the right to say No) to a conversation with my local taxi driver about the use of the word “Non” as a simple statement in France, when he asked about purchasing something, and where he perceived not just the word used as a negative, but as the expression of a cultural difference. “Non” with a Gallic shrug can mean so much more than a simple “No, we don’t have any”; it refl...
Read More
Multimedia, transcribed audio, and AI-generated content in all the world’s digital languages join more traditional content types in filling up data centers. Together they create challenges and opportunities across organizations, raise the alarm for more oversight of content, and further the case for aligning enterprise content strategies, investment, and operations.
Read More
January 03, 2023
|
Donald A. DePalma | Artificial intelligence,
Business climate,
Buyer strategic planning,
Content technology,
Digital transformation,
Global content,
Machine translation,
Translation market size,
Translation technology,
Corporate social responsibility | For LSPs,
For Buyers,
For Technology Vendors |
Share
|
Large language models have been in the news a lot in November and December and the coverage has been mixed, to put it mildly. Meta posted its Galactica model on November 15 but took it down just three days later in the face of intense criticism. By contrast, when OpenAI released ChatGPT two weeks later, on November 30, the response was much more positive. Examining why the reactions were so different provides insight into the potential and limitations of machine translation (MT) as well as cauti...
Read More
Technology developments tend to follow a typical pattern of improvement over time, known as an S-curve. Although it is a familiar pattern, it is worth unpacking its five phases and considering how they apply to language technology and forecasts about it. Examining how they have played out with successive generations of machine translation points to a future in which other advanced natural language processing technologies have tremendous potential to deliver useful and innovative capabilities.
Read More
Last week, the Washington Post published an article about Blake Lemoine’s claim that his employer Google’s LaMDA language model/chatbot system had achieved sentience and had a “soul.” Lemoine, an engineer in the company’s responsible AI group, based his assertion on a dialogue in which LaMDA expressed human-seeming sentiments and concepts. Google placed Lemoine on leave, thereby sparking renewed discussion about what machine sentience is and what it means. What can the experience of the lan...
Read More