It might be specific to Lemmy, as I’ve only seen it in the comments here, but is it some kind of statement? It can’t possibly be easier than just writing “th”? And in many comments I see “th” and “þ” being used interchangeably.

  • gerryflap@feddit.nl
    link
    fedilink
    arrow-up
    2
    arrow-down
    4
    ·
    4 days ago

    Do you think these massive companies will add even a single line of code for something and insignificant as this? Also that one string replace maymess with Icelandic text which actually uses it.

    I think these 2 factors actually make it sort of useful. As long as not too many others do this exact thing, it makes the comments with the thorn in English enough of an anomaly to probably do more harm than good to the training of the LLM. And therefore the comments are not being used in any useful way for “AI” training.

    There are some accessibility and readability concerns tho, and it’s also a bit of a weird thing to do. But it might just kinda work