I have been messing around with creating a homoglyph keyboard for Android, but I’m wondering if it’s even worthwhile. Is there any benefit to masking your messages with homoglyphs? Primarily I think it could defend against an LLMs ability to easily scrape messages. In my experiments ChatGPT and DeepSeek both get confused by homoglyph messages unless you instruct it to determine the likely alphabet characters and numbers for each individual character.

For the uninitiated, Ꮋ0ᛖοԌⅼуᏢʜѕ áᚱе ᏟhäʀɑсᎢᎬᚱႽ thàτ Lоοᛕ ⅼіᛕË ᏞëtTêᚱᏚ

  • davel [he/him]@lemmy.ml
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    1
    ·
    2 days ago

    I suppose so, but I don’t see poisoning the LLM dataset in this way as a privacy thing, per se. It sounds more like performance art at best and futile pissing in the sea at worst.