Connect with us

Anthropic

Why Does AI Like Goblins And Japan So Much?

Published

on

why-does-ai-like-goblins-and-japan-so-much?

“But here is the annoying little cave goblin,” and ”brutal little goblin of a dynamic” are two responses ChatGPT gave to a Reddit user in February. “Since 5.3 and 5.4, it’s started comparing anything negative to being a goblin,” said the Reddit user.

And they were not the only person to experience this. “After the 5.4 update, ChatGPT uses ‘goblin’ in almost every conversation. Sometimes it’s ‘gremlin.’ A recent chat of mine used goblin three times in four messages,” said another user on the popular tech forum Hacker News.

Indeed, the appearances of all these goblins led OpenAI — the creators of ChatGPT — to investigate and publish an article on their blog, titled “Where the goblins came from.”

The short answer is: it was an accident. Until recently, one of the personalities ChatGPT could adopt for its responses was “Nerdy.” In training this personality, they encouraged the model to use metaphors of fantastical creatures. “We unknowingly gave particularly high rewards for metaphors with creatures. From there, the goblins spread,” says the OpenAI article.

These unusual or unexpected reactions from AI models are more common than they seem. A group of Spanish researchers recently published a scientific article with another surprising finding: AI chatbots love to talk about Japan. “It was a surprise to see how Japan began to stand out in the models’ responses,” says Carla Pérez Almendros, a professor at Cardiff University and co-author of the study.

It’s already known that the models are biased towards Western values, but the passion for Japan went even further: “In English, Japan is the most frequently mentioned country, because we exclude the U.S. and the U.K., but even more interesting was seeing that the same thing happened in Spanish and Chinese, because that’s where we would have expected the U.S., for example, to be the preferred choice. But no, there was Japan,” explains Pérez Almendros.

OpenAI employees had an easier time seeing how goblins and gremlins had creeped into ChatGPT responses: they observed growth of 175% and 52%, respectively, since the launch of ChatGPT 5.1. “If the behavior were simply a broad internet trend, we would expect it to spread more evenly,” OpenAI explains in the blog. In contrast, mentions of fantastical creatures were concentrated in the “Nerdy” personality. That personality accounted for only 2.5% of all responses ChatGPT gave its users, but it was responsible for 66.7% of the “goblin” mentions. Goblins were therefore vastly overrepresented when the “Nerdy” personality was activated.

To prevent their “Nerdy” Codex programming model from being overrun with gremlins, the programmers had to request that the model remove them. For fans of fantastical creatures, OpenAI has published five lines of code that suppress the anti-goblin instructions.

And what about Japan? “Our unconfirmed hypothesis is that all the models have ‘security training,’ and there’s a bias toward Western countries like the U.S., which they try to mitigate,” says José Camacho Collados, also a professor at Cardiff University and co-author of the study. “At the same time, there are ‘problematic’ countries, perhaps Russia, Israel, the Middle East, and quite a few more, so Japan is in a good position because it’s a culture that people like, it’s mentioned a lot, and it’s also ‘neutral,’ so it’s a perfect combination for the models to use as an example. In fact, after Japan, there’s India, which might be similar,” he adds.

This overrepresentation of goblins and Japan is yet another example of the biases in these models, and of why questions must always be asked carefully and their answers treated with skepticism. “They are all biased,” says Pérez Almendros. “Sometimes intentionally, to avoid offensive or more representative responses, and other times it’s the training data that’s biased. The risk is that we believe they are objective, that they represent reality, because they don’t.”

At OpenAI, they offer a similar response, though more sugar‑coated: the goblin fixation is a “powerful example of how reward signals can shape model behavior in unexpected ways, and how models can learn to generalize rewards in certain situations to unrelated ones.”

These influences, at least, we can understand. But there are others we can’t. A few months ago, Anthropic — the creators of Claude — published the strange language that two models from the same family can share to exchange information. They discovered that if you tell one chatbot that owls are its favorite animal and then ask it to write lists of random numbers (like 285, 574, 384), another model learns from those numbers that it also loves owls. How is this possible? The researchers believe they are unwittingly hiding small, secret clues. It’s a much more dangerous way of contaminating biases.

No one knows for sure what’s really going on in these cases. “I’m interested in how the models ‘contaminate’ each other,” says Joseba Fernández de Landa, a postdoctoral researcher at the HiTZ Center of the University of the Basque Country (EHU) and co-author of the article on Japan. “The fact that different models respond with similar biases could indicate some kind of contamination and that they tend to homogenize with each other.”

He continues: “But this largely occurs due to human interference: we are the ones who, for now, choose the training strategies and data. And by using the models, we can audit their flaws and alert the developers, like with goblins. From there, the developers can decide whether to correct them or not, just as we can choose whether or not to use them.”

Sign up for our weekly newsletter to get more English-language news coverage from EL PAÍS USA Edition

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published.

Alphabet

Nvidia Vuelve A Romper Su Techo: Bate Su Máximo Histórico En Bolsa Y Se Refuerza Como Empresa Más Valiosa Del Mundo

Published

on

nvidia-vuelve-a-romper-su-techo:-bate-su-maximo-historico-en-bolsa-y-se-refuerza-como-empresa-mas-valiosa-del-mundo

Nvidia refuerza su posición como la compañía más grande del mundo por capitalización bursátil. Sus acciones se dispararon este lunes un 4% en el Nasdaq, hasta 216,61 dólares, nuevo máximo histórico, superando los 212 dólares, establecidos en otoño del pasado año. Con este renovado avance, el valor de mercado del gigante de los chips de inteligencia artificial (IA) se ha situado por encima de los 5,26 billones de dólares (unos 4,49 billones de euros), ampliando la ventaja sobre sus más inmediatos seguidores.

En este momento, Alphabet, matriz de Google, es la segunda mayor empresa del mundo con una capitalización bursátil de 4,21 billones de dólares, un billón menos que Nvidia. A continuación se sitúan Apple, con 3,92 billones; Microsoft, con 3,15 billones; Amazon, con 2,80 billones; TSMC, con 2,1 billones; Broadcom, con 1,98 billones; la petrolera saudí Aramco, con 1,75 billones; Meta, con 1,72 billones; y Tesla, con 1,42 billones. Del Top-10 de compañías más capitalizadas, nueve pertenecen a la industria tecnológica.

En las últimas jornadas, Nvidia se ha visto favorecida por el fuerte tirón de las empresas de chips, que a su vez está derivado de las fuertes previsiones de aumento de la inversión en centros de datos y otras infraestructuras digitales para la IA. En el último mes, Broadcom acumula una revalorización del 39%, por un 47% de Micron Technology, un 65% de AMD, un 41% de Texas Instruments, y un 97% de Intel, que deja atrás su crisis bursátil tras duplicar valor desde finales de marzo y dispararse un 323% en el último año.

El avance de Intel, ayudado por acuerdos como el Terafab, firmado con Tesla y SpaceX, las empresas de Elon Musk, ha disparado las plusvalías latentes de Nvidia, SoftBank y la propia Administración Trump, que compraron acciones del histórico fabricante norteamericano de chips en su reestructuración financiera en verano de 2025.

Además, Nvidia sigue teniendo el respaldo de los analistas, algunos de los cuales apuestan por un fuerte aumento de la remuneración a los accionistas, una vez que la compañía finalice su actual plan de inversiones en su ecosistema industrial de la IA. al que ha destinado más de 80.000 millones de dólares en los últimos meses, con apuestas como la citada de Intel, Nokia, OpenAI, Anthropic, xAI, Synopsys, Revolut, Wayve, Nscale, Lumentum Holdings y Coherent, entre otras muchas.

En este sentido, según Bank of America, la compañía dirigida por Jensen Huang puede generar más de 400.000 millones de dólares de flujo de caja libre (FCF) entre 2026 y 2027.

Durante su conferencia anual de desarrolladores GTC, celebrada a mediados de marzo, Nvidia anunció que generará al menos un billón de dólares en ingresos hasta 2027, gracias a los nuevos chips Blackwell y Vera Rubin, por encima de la estimación lanzada por la propia empresa el año anterior, que apuntaba a 500.000 millones. La compañía aprovechó el evento para prometer nuevas remuneraciones a los accionistas, señalando que prevé un cambio hacia la recompra de acciones y el pago de dividendos en el segundo semestre del año, una vez que la empresa haya hecho frente a los compromisos de inversión ya contraídos. Nvidia planea destinar en torno al 50% de su flujo de caja libre a retribuir a los accionistas, tanto a través de la recompra de títulos como del pago de dividendos en efectivo.

Durante su último ejercicio fiscal, cerrado a finales de enero, la compañía devolvió 41.100 millones de dólares a sus accionistas en forma de recompra de acciones y dividendos en efectivo, en torno al 43% de su flujo de caja libre.

En términos generales, los analistas mantienen una recomendación de compra fuerte para las acciones de Nvidia, basada en 40 recomendaciones de compra, una de mantener y una de venta en los últimos tres meses. Además, el precio objetivo medio de su acción es de 274,38 dólares por acción, ñp que todavía implica un potencial de revalorización cercano al 26%.

Este nuevo repunte ha coincidido con otra oleada de inversiones en start-ups vinculadas a la IA, entre las que destacan las multimillonarias inversiones anunciadas por Alphabet y Amazon en Anthropic, o la posible compra de Cursor por parte de SpaceX, por cerca de 60.000 millones de dólares.

Dentro de esta euforia, precisamente, y con el Nasdaq en máximos históricos, SpaceX está ultimando el proceso para su salida a Bolsa, que puede ser la mayor de la historia, con un objetivo de recaudación cercano a 75.000 millones de dólares y una valoración de dos billones. La empresa controlada por Elon Musk, pasaría a formar parte del Top-10 de compañías más capitalizadas.

Continue Reading

Trending

Copyright © 2017 Spanish Property & News