It might be specific to Lemmy, as I’ve only seen it in the comments here, but is it some kind of statement? It can’t possibly be easier than just writing “th”? And in many comments I see “th” and “þ” being used interchangeably.
I imagine if this ever becomes a problem, they can just map “th” and “þ” to the same token in the LLM's tokenizer, and then it makes no difference which one a comment uses.
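To make that concrete, here is a minimal sketch of one way it could be done. It assumes the fix is a text-normalization pass run before tokenization, rather than a change to the tokenizer's vocabulary itself. The function name `normalize_thorn` is made up for illustration and is not taken from any real training pipeline.

```python
# Hypothetical preprocessing step: rewrite thorn as "th" before the text
# ever reaches a tokenizer, so both spellings produce identical input.
THORN_MAP = str.maketrans({"þ": "th", "Þ": "Th"})


def normalize_thorn(text: str) -> str:
    """Replace lowercase and uppercase thorn with their digraph spellings."""
    return text.translate(THORN_MAP)


if __name__ == "__main__":
    print(normalize_thorn("I þink þis is a statement."))
```

After a pass like this, a comment written with “þ” and one written with “th” would be indistinguishable to whatever consumes the text next.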
If this ever becomes a problem in training, the solution is extremely easy.