Linear Algebra Explains Why Some Words Are Effectively Untranslatable

(aethermug.com)

76 points | by mrcgnc 6 hours ago

55 comments

yubblegum 42 minutes ago
Interestingly enough for this morning's walk I was musing over the tension between the hypotheses that: 'LLMs can map between languages in the vector space' (thus languages are ~equivalent); and 'Language affects thoughts' (as in German is good for Philosophy and English for getting things done).
If both these thoughts are true, then it would appear that languages have topological characteristics. We can (topologically) map from one to another, 'thoughts' (that is a complex of words) form 'paths on the language manifold' and certain paths may be more 'natural' in one topological form than the other.
pjsg 3 hours ago
The article seems to think that a word is untranslateable if there is no single word in the target language. If I'm not misreading the article, then this is completely obvious -- just consider the number of words in English and the number of words in almost any other language, and you will find that there are more English words than the other language. It is now clear that there exist English words that don't correspond to a single word in the other language.
[-]
- ErroneousBosh 5 minutes ago
  > It is now clear that there exist English words that don't correspond to a single word in the other language.
  But that's true of any language. Not only that, but English uses loanwords heavily which are often Anglicisations of words from other languages, which may not in themselves be just one word.
  "Ho ho ho", the flag-waving Little Englander types say, "Gaelic is such a stupid language, they don't even have a word for 'television', they just say 'television' in a stupid accent!"
  But English also has no word for "television". Worse, the word "television" isn't even just a loanword, it's two words from two different languages, "tele" from Greek and "vision" from Latin. What a bodge job! Imagine letting something like that slip through to production use!
  The hypothetical Catalan-Hungarian inventor of it in another leg of the trousers of time may have called it llunylátás, and then where would we be?
  Well, most languages would have some variant of that word to mean "television", as they do now, I expect.
  The English word "galore" (meaning "sufficient" shading towards "more than enough") comes from the Gaelic words "gu leòr", (goo lyaawr, the grave accent above the o makes the vowel sound longer). What a silly language English is, doesn't have a word that means "more than you're ever likely to need", has to steal one from Gaelic and then spell it wrong.
  Oh, they use this word "whisky". You know what that means? It means "uisge beatha" but they only say the first word, in a silly accent because they can't pronounce it properly.
  Quite often there's no single word for a thing you're trying to translate but that doesn't mean it's untranslateable. English has only one single word for rain, for example, but Gaelic has about half a dozen of which the only ones I can reproduce here are "uisge" (that word again) which just means "water", and "fras" which is more like a gentle shower. The rest of the words in the Gaelic of the North-West of Scotland that refer to rainy weather are, of course, profane in the extreme.
- drivebyhooting 3 hours ago
  That isn’t a proof. Synonyms can bolster the enumeration sans augmenting novelty.
  [-]
  - nyeah 21 minutes ago
    It kind of is a proof if we assume that single words can be translated at all. Translate a single word from Language X (more words) to language Y (fewer words) and back. I can't uniquely recover all the words in Language X that way.
  - sjducb an hour ago
    Synonyms rarely have identical meanings for example:
    Happy: Joyful, cheerful, merry, delighted
    Or
    Beautiful: Lovely, pretty, attractive
    The only truly identical synonym I can think of is flammable and inflammable
    [-]
    - drivebyhooting 42 minutes ago
      Perhaps perhaps.
      But what joyful means to you likely differs from what it means to me, simply because we haven’t read the exact same literature and had the same conversations.
      [-]
      - dvfjsdhgfv 6 minutes ago
        True true.
  - manwe150 3 hours ago
    That is the crux of the article premise: each synonym conveys similar denotations (principle component is I think what the article called it), but usually with some difference in connotations (the off axis contributions). You can nudge the languages vectors towards each other by adding enough synonyms and modifiers together, but they are always a little bit off even still
  - mannykannot 3 hours ago
    True, but many languages now have words that were absent from their earlier vocabularies. Shakespeare did not have the option to use 'telephone', 'semiconductor' or 'entropy'.
  - James_K 3 hours ago
    I think the reasonable reader will conclude it's unlikely for any two languages to share exactly the same vocabulary, accounting for synonyms.
- seanhunter 3 hours ago
  Not sure this approach really accounts for the difference between a language like German where you have one compound word for a concept that would require multiple words in English. For one good example, the German "Nomenkompositum" is "compound noun" in English.
  [-]
  - bloppe an hour ago
    Some giant portion of English vocabulary actually are compound words. English loves using compound words but only if the roots are sourced from Latin or Greek: words like electrocardiogram ("electronic heart picture", sourced from Greek), agriculture ("field nurturing", from Latin), and telecommunication ("far sharing", a hybrid of Latin and Greek roots). Probably the overwhelming majority of the words in an English dictionary will be compound words, and people regularly coin neologisms ("new words") using this formula.
    An English speaker might be willing to accept componoma ("names placed together", Latin) or synthetonoma (also "names placed together", Greek) without breaking stride.
    [-]
    - seanhunter 27 minutes ago
      I wasn’t saying there are no compound nouns in English at all. If you count portmanteau words like “Brexit” and jargon there are a massive abundance of them. All I was saying is the approach would count certain concepts as untranslatable when they clearly aren’t, simply because in one language you have a compound word and in the other language you use several words to express the same concept. It’s definitely not untranslatable but the translation function isn’t one to one.
    - aitchnyu an hour ago
      A couple of ape cubs who learned sign language saw a duck and invented "waterbird". We have to know two dead languages to know if aquaplaning or hydroplaning is the right word.
    - swsieber an hour ago
      What sticks out to me is that the first word in these ends with a vowel so they don't sound like compound words.
  - z500 3 hours ago
    If you ignore the spaces, the only real difference between German and English compound nouns are the infixes between elements to show bracketing. Case in point: Nomenkompositum
  - suddenlybananas 3 hours ago
    That's just a difference in orthography. English could easily have had an orthographic standard where we write "compoundnoun" for compounds. This is in contrast with a language like French, where compound nouns are relatively rare. Compare English "Olive oil" and German "Olivenöl" with French "huile d'olive". In French you need to have a preposition to combine the two nouns, whereas English and German do noun-noun composition.
    [-]
    - adrian_b 3 hours ago
      You are right but neither yours nor those of the previous posters are good examples of compound nouns.
      These examples have just the meanings of a noun + adjective or of a noun + noun in genitive case, where some languages are lazier than others and omit the markers of case or of adjectival derivation from noun, which are needed in more strict languages.
      There are also other kinds of compound nouns, where the compound noun does not have the meaning of its component words, but only some related meaning (usually either a pars pro toto meaning or a metaphorical meaning). Those are true compound nouns, not just abbreviated sequences of words from which the grammatical markers have been omitted.
      Such compound words were very frequent in Ancient Greek, from where they have been inherited in the scientific and technical language, where they have been used to create names for new things and concepts, e.g. arthropod, television, phonograph, basketball, "bullet train" and so on.
      This kind of compound words are almost never translatable, but they are frequently borrowed from one language to another and during the borrowing process sometimes the component words are translated, but the result is not a translated word, it is a new word that is added to the destination language.
      [-]
      - seanhunter 22 minutes ago
        > There are also other kinds of compound nouns, where the compound noun does not have the meaning of its component words, but only some related meaning (usually either a pars pro toto meaning or a metaphorical meaning)
        The example that people often quote from German is “kummerspeck” which would literally translate as “grief bacon”, but means weight you put on through comfort eating having gone through a bereavement or other trauma.
- godelski an hour ago
  There's a real irony that the examples are coming from Japanese since it is an agglutinative language.
  I think people don't realize how weird language is. Like you could look at Chinese and call each sentence a "word" as there are no spaces. What's the difference between that and a compound word like "nighttime" or the whole German language where you got words like Krankenwagen ("patient" + "car").
  Now this doesn't mean there aren't words or phrases that aren't translatable. But the thing is we can always translate the words themselves. What we can't always translate is the meaning behind them. I think the best example of this comes from Star Trek and the Tamarian Language[0,1]. "Sokath, his eyes open!" The problem with communication is not that the words don't translate, it is that the meaning behind them doesn't. Just as people struggle with idioms when learning American English or why someone might be confused about why someone "shit in the milk" or "fucked the dog". Words are an embedding. A compression.
  The thing people are constantly forgetting, but is more important than ever in a globally connected world, is that words are not perfect representations of thoughts. We compress our thoughts into them and hope the person on the other side can decompress them. It is why you can more easily communicate with your close friends who have better context than you can with another person that natively speaks your language and is why someone that learns a new language can speak perfectly well but still struggle to communicate. Language is not just words, it is culture[2]. So in a much more connected world today we have these disconnects in culture and thus interpretation of what people say. I know every one of you has been told to "speak to your audience" but how do you speak to your audience when your audience is everybody and when you don't know who your audience is? The new paradigm requires us to be much better interpreters than we were before. Least everyone is going to sound crazy, other than those you frequently talk to and have that shared understanding.
  [0] https://memory-alpha.fandom.com/wiki/Tamarian_language
  [1] https://www.youtube.com/watch?v=3-wzr74d7TI
  [2] This is, btw, why people argue for embodied AI being so critical. Not because LLMs can't appear to grasp the language, but because we as humans have embodied our language so deeply you probably didn't even realize that I used the word "grasp" to refer to an abstract concept and not something you can actually touch with your hand.
- cortesoft 3 hours ago
  Yeah, I was interpreting 'untranslatable' to mean what it says, but they meant 'untranslatable with only a couple words', which is a very different claim.
- bpt3 an hour ago
  You're correct.
  In another blog post where he uses "shibui" as an example of an untranslatable word, he says, "Saying shibui like that, in a mere second, conveys what would otherwise make a clunky and unnecessarily long digression."
  At the root of nearly all the blog posts like this one (basically explaining why they don't agree with a widely held belief) is a redefinition of a term or word into something very specific that contradicts the common definition.
epistasis 3 hours ago
> If the mere sight of the above is like a punch in the face for you, don't worry. I'm not going to math you to death in what follows. I will only remind you of a tiny basic part of it that I think relates to languages.
Yes, that mathematical expression is like a punch in my face, but not for the reason you think. I am offended that the rank of the matrix does not match the dimension of the matrix, not that I'm seeing a matrix.
[-]
- tptacek 2 hours ago
  It's a 3x3 matrix with 3 independent rows. The rank matches the dimension.
- trostaft 2 hours ago
  You probably mean that the size of the matrix is incompatible with the size of the vector?
proteal 3 hours ago
I think a succinct way to describe my thoughts on linear algebra/language is that language has high dimensionality (ie many different basis vectors that may not necessarily be orthogonal) and that individual languages use a unique coordinate system to express thought. Each language is a lossy approximation of all conceivable thought and some languages can more efficiently represent the “all thoughts” vector space because they have basis vectors that point in more uncommon directions (like the go to japan example). So while you can more or less point to any thought in any language, some thoughts are easier to express in certain languages, which the post (and me) agree to be untranslatable words.
I tried to find the really interesting article about language and color that describes how some cultures use different naming schemes for colors but couldn’t find it. It talked about how back in the day we don’t know orange as a color, we just thought it was red-yellow and only after the fruit was distributed did the word for the color catch on. Here’s the best article I can find that talks about this phenomena https://burnaway.org/magazine/blue-language-visual-perceptio...
[-]
- AlotOfReading an hour ago
```
    Each language is a lossy approximation of all conceivable thought...
```
  This ultimately boils down to the private language discussion started by Wittgenstein. If you admit public language is a lossy approximation of meaning, you're taking a position on the existence of private languages.
- MangoToupe 3 hours ago
  > Each language is a lossy approximation of all conceivable thought
  I'm not quite sure I understand this—I do have mental sensations/processes sans language, but I would not characterize them as "thoughts". To me, a thought is inherently linguistic, even if they relate to non-linguistic mental processes. So to me, learning a new language is very literally learning how to think differently.
  [-]
  - proteal an hour ago
    I think we’re in agreement, but I’m afraid I don’t have the philosophical language to precisely pin my mental model into words (what a meta conundrum lol). I’ll try my best here, but I may come back in a few days with an edit if I can more coherently write my ideas.
    I take a slightly more narrow definition of “thoughts” that may be more akin to “expressions” - ideas that can be communicated, so excluding non-linguistic mental processes. I think that may be where we disconnect. A lot of my idea about thoughts comes from the Borges story, Funes the memorius (short story about a dude who could not forget - interesting read and really clarifies my feelings on my definition of “all possible thought”). In the story he talks about tree leaves, but instead imagine needing a unique linguistic scheme for every single unique snowflake you ever see. It would be a linguistic nightmare! Therefore language must generalize otherwise it becomes noncommunicable and that generalization to me induces the “lossy approximation” I attribute to language in my prior comment.
    So, in my head Funes’s mind represent the abstract space of all possible thoughts. When we use language, we are stacking words/sentences/paragraphs/etc together almost like vector addition trying to reach a particular point in the thought vector space. Some languages have really clean ways of getting to certain thoughts while others take a mouthful and still don’t get you exactly there (物の哀れ example from link).
    I agree with your statement on new languages being different thinking. As you follow that vector addition process to get to the “thought,” different languages will take you on different paths to get to your destination thought because languages encode those vectors differently, even if the destination thought is the same. In my mental model, the act of thinking is putting those language vectors together and tracing their path to get to your thought.
    And if my comment still makes no sense - I might have to incubate this thought a bit more :) but I do recommend the story- it’s a quick, thought provoking read.
voxleone 2 hours ago
My personal analogy, useful in my early days: Translating is like finding a vector in another space that points in the same direction or carries a similar magnitude of meaning.
In other words:
The source sentence is a vector in “language A space.”
The target sentence is a vector in “language B space.”
A good translation finds a vector that has the same direction (same meaning, intent, tone) even though it lies in a different coordinate system (the new language).
[-]
- aitchnyu an hour ago
  when did you develop this analogy? Is it well before 2015, when Google demoed a vector model that solved Man:Woman,King:_____ ?
KolenCh 27 minutes ago
Big claim but not much substance. They should try to really understand linear algebra first, and also linguistics a bit. Semantic domain (from linguistics) is a better way to describe it, where using sets (from math) might better convey what they want to say.
nyeah 18 minutes ago
Why is the first linear algebra example a 3x3 matrix times a 2x1 vector? What am I supposed to do with that? If it's some oddball thing like the Kronecker product, could he like tell us that?
d-lisp 4 hours ago
Are they multiplying a 3x3 matrix by a 2 component vector ?
[-]
- tptacek 3 hours ago
  In that one case, yeah; I don't think they're going for anything more than general illustration here.
  [-]
  - magicalhippo 3 hours ago
    The text that follows does take on a new meaning though, for those that know linear algebra:
    If the mere sight of the above is like a punch in the face for you, don't worry.
    Almost makes me wonder if it was intentional.
  - nyeah 11 minutes ago
    But what does that illustrate?
- moron4hire an hour ago
  Everyone knows you need a 4x4 matrix to do translation, anyway. Now, scale, rotation, and skew...
- seanhunter 3 hours ago
  Yeah that made me twitch also.
- wjholden 3 hours ago
  It also has mixed square brackets and curved parentheses. I stopped reading the article when I saw this.
midtake an hour ago
What a trendy article, in tune with our recently linear-algebraic turn in how we see language thanks to LLM's.
But I think this exposes an even greater problem, where words thought to be direct translations will always drift in vector value as they are weighted for attention within their respective corpora. Are we on the brink of translation-nihilism?
This isn't even limited to complex phenomena or shades of snow. Even "I like" is a different construction in many languages, in an unexpected way to new language learners.
ralferoo 2 hours ago
There's one aspect that I think the article starts to hint at, but doesn't quite make the jump to is that words in a language just map to a subset of concepts that don't necessarily have the same subset boundaries in other languages.
If you think back to the meme from a decade or two ago about how men and women perceive colour [1], where e.g. "pink" to a man covers a whole range of colours to a woman, then that kind of hints at the idea.
One example back in the realm of vocabulary is the English word "happy". This embodies a range of meanings from joy, willingness, pleased, contentment, satiation, etc. There might be some overlap in some of these meanings with other words like "joy" or "excited", that don't have the same overlaps in other languages. E.g. "happy" might be translated to French as "heureuse" for the senses of pleased or content, but not for willingness sense.
Similarly, the French word "dommage" can be translated into a whole bunch of English words that aren't normally synonyms of each other - pity, damage, shame, harm.
This kind of nuance can lead to two opposite problems when translating - when the meaning is limited to a subset of possible meanings by context, and the wrong one is chosen in the foreign language, and when the author's meaning embodied multiple meanings and the chosen translation doesn't cover all of them.
Some of these features can lead to the humour in subtle jokes being lost in translation, e.g. "he'd be late to his own funeral".
[1] e.g. https://www.psychologytoday.com/us/blog/brain-babble/201504/... or https://digitalsynopsis.com/design/male-vs-female-color-perc...
raincom 3 hours ago
There is a better thesis coming from the late philosopher W.V.O Quine: indeterminacy of translation [1]
[1] https://plato.stanford.edu/entries/quine/#IndeTran
rich_sasha 3 hours ago
It's a tenuous analogy, but if you along with it, you can take it further.
You could consider the "cost" of expressing a word as some kind of metric or norm on the vector. What in one language/basis is a simple Kronecker delta, in another is a very complex vector (of course if it were the same vector in two bases, it would have the same length, but we could rather think of translation as an affine transformation, say).
And finally, with two bases, they need not span the same vector space. You can have a three-coordinate vector space all you like, if you have only two basis vectors you ain't spanning it. At best you can hope for an orthogonal projection from one to the other, and lose some nuance.
Eventually, with bilinguality, you learn not to translate words. Concepts live in different languages and describe a reality. Usually you can describe that reality in two different languages, but sometimes not.
sigbottle an hour ago
It could be the case that it's not even "effectively", "in practice", etc.
N^{any constant} is not bijective with a single R.
nph278 3 hours ago
Just read Wittgenstein (The Blue+Brown Books / Philosophical investigations), and this confusion will go away. The difference between translation, definition, and explanation needs to be understood.
zvmaz 4 hours ago
> I hope these rather unorthodox leaps between linguistics and mathematics helped make it almost obvious that some words and ideas are untranslatable in practice. I also hope you don't take the analogy too seriously, because it won't go much further than this.
Phew! Thanks for clarifying.
behnamoh 3 hours ago
Tangent: I really like vornoi diagrams and part of me thinks there's a hidden, precious concept they represent. I didn't get their relation to the article but was wondering if they have applications in engineering/sciences.
[-]
- HanClinto an hour ago
  A Voronoi diagram is created when you color every point on an image according to which discrete point it is closest to.
  So in this case, I see the diagrams as representing the boundaries drawn when projecting / quantizing complex ideas into a set of central points that are insufficient for catching all of the nuance of the original. How well can you adapt a nuanced idea to a different space?
  If Language A has an idea that exists at one point in space, which is the closest word in Language B that might be used to represent it? A Voronoi diagram is one possible way of illustrating it.
  Tangent on your tangent: this GDC presentation from 2016 is probably my favorite real-world application of Voronoi Diagrams, and uses them for N-player split-screen camera control: https://www.youtube.com/watch?v=tu-Qe66AvtY&t=1594s
  I have a lingering dream in the back of my mind to make a single-couch Liero-style casual game for N-players with good dynamic camera support using this technique.
mannyv 4 hours ago
Communication/language depends on shared context. The more context you share the shorter the trigger for evoking that thing and that context. And if you share no context communication becomes very difficult.
I wasn't aware that that idea was in dispute.
[-]
- pixl97 3 hours ago
  And honestly, without a lot more communication even with a person that speaks your language you have no idea if you actually have a shared context. While an American from NYC and one from some backwater town in Kansas share a lot of context but there is a lot of context they don't, so as communication becomes more detailed between them it's very likely that 'translation' between each other will be somewhat incorrect.
  This is also why lawyer speak is so particular. Language is fuzzy in most cases. Only language that relates to discrete physical objects gets closer to the binary state of exactness described in the article.
triclops200 3 hours ago
This article assumes that concepts are somehow precise coordinates within a single language; that's not the case, at best, speakers of a language mutually approximate a relatively consistent representation, but like, look at a word like yeet or whatever: we decided as a society on its meaning while it was being developed, as it were. Furthermore, it never rigorously defines what it means by translation. It claims 上京 is a single basis meaning moving to Tokyo, for example, but that isn't even an accurate translation: the individual components represent superior/greater/above and Tokyo and as an idiomatic phrase it represents the concept of moving to the capital for a better life. Something like "moving on up" or the like in some vernaculars of English, and idioms translating to idioms is a form of translation. It's disingenuous to represent the first concept as a single basis but not the second. Similarly, it claims mono no aware (物の哀れ) is unable to be translated, but, again, more literally "translated" is saying "the sorrow within things" character by character, and, only as an idiom has the full contextual understanding. It's not really a single point even if it's rather accurately located in a hypothetical embedding space by Japanese speakers. Imo, an English translation of the concept is "everything is dust in the wind", only 2 more individual conceptual units than the original Japanese phrase, and 3 of them are mainly just connecting words, but it's understood as a similar idiom/concept, here.
Concepts are only usefully distinguished by context and use.
By the author's own argumentation: nothing is translatable (or, generally, even communicatable) unless it has a fixed relative configuration to all other concepts that is precisely equivalent. In practice, we handle the fuzziness as part of communication and its useless to try and define a concept as untranslatable unless you're also of the camp that nothing is ever communicated (in which case, this response to the author's post is completely useless as nobody could possibly understand it enough internally for it to be useful. If you've read this far, congrats on squaring the circle somehow)
[-]
- DFHippie an hour ago
  This. Two speakers of the same language only have approximately the same understanding of the meanings of the words they both use. Communication succeeds because we are constantly seeking and correcting misunderstandings that arise due to no two people speaking exactly the same language.
  The same process that allows two speakers of the same language to communicate adequately allows one to translate from one language to another. If it were truly impossible to translate from one language to another, we would be unable to perceive this and argue about it. The recognition and correction of errors is part of the process of translation just as it is part of the process of communication in a single language.
miltonlost 31 minutes ago
Reading any poem that makes use of extensive wordplay within a language shows why there will always be some untranslatable aspect. You can't create all the exact shades of a single pun if all those shades aren't in a different language.
Go translate an ee cummings poem and make sure to retain all its meanings.