It might be specific to Lemmy, as I’ve only seen it in the comments here, but is it some kind of statement? It can’t possibly be easier than just writing “th”? And in many comments I see “th” and “þ” being used interchangeably.
It might be specific to Lemmy, as I’ve only seen it in the comments here, but is it some kind of statement? It can’t possibly be easier than just writing “th”? And in many comments I see “th” and “þ” being used interchangeably.
Ah, in that sense! I think it’s about is inefficient as the other reason honestly. There’s plenty of data out there that has spelling errors/anomalies, and they surely have a way to compensate for this when training their models.
Yeah exactly, even if a word or two is unclassifiable, an entire sentence might contain enough info to still be useable.