People have noticed for a while now that Google's Bard/Gemini has inserted random hindi/bengali words often. [0]
I just caught this in an o3-pro thought process: "and customizing for low difficulty. কাজ করছে!"
That last set of chars is apparently Bengali for "working!".
I just find it curious that similar "errors" are appearing from multiple different models... what is the training method or reasoning that these alternate languages can creep in, does anyone know?
[0] https://www.reddit.com/r/Bard/comments/18zk2tb/bard_speaking_random_languages/
Comments URL: https://news.ycombinator.com/item?id=44264325
Points: 7
# Comments: 9
Login to add comment
Other posts in this group

Hey HN, Henry and Roman here - we've been building a cross-platform framework for deploying LLMs, VLMs, Embedding Models and TTS models locally on smartphones.
Ollama enables deploying LLMs mode
Article URL: https://openfront.io/
Comments URL: https://news.ycombinator.com/item?id=44528943

Article URL: https://www.nber.org/papers/w33989
Comments URL: https://news.ycombinat

Article URL: https://simonwillison.net/2025/Jul/11/grok-musk/

Article URL: https://bentoml.com/llm/
Comments URL: https://news.ycombinator.com/item?id=44527
