
De-emojification does not preserve raw text & ignores non-ascii emojis #3

Description

@Thf772

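When de-emojification is applied to an emojified message, any letter that was left as raw text (not converted to an emoji) is dropped from the output. In addition, emojis whose names contain non-ASCII characters are not parsed.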

Example:

Raw: &&abcdefghijklmnopqrstuvwxyz
Emojified: :ahmadinejad: :berlusconois: :challengedenied: :DDDD: :excellent: :facepalm: :grabspopcorn: :horror: :iykwim: :johncenahorse: :killme: :Le_peuple_ne_veut_que_son_dû.: :miam: :noraj: :oubliez_pas_de_nettoyer_le_chan_général: :pasconvaincu: :quoi: :rimshot: :showme: :trounoir: :uhhh: :vroum: :windaube: x :ytho: :zoggiplz:
Translated: abcDefghijkmnpqrstuvwyz

In the translated output above, the following letters are missing:

  • L (:Le_peuple_ne_veut_que_son_dû.:)
  • O (:oubliez_pas_de_nettoyer_le_chan_général:)
  • X (x)
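
For illustration only, here is a minimal Python sketch of the expected behaviour. It is not the project's actual code: the function name `deemojify`, the `EMOJI_TOKEN` pattern, and the rule "an emoji translates to the first character of its name" are all assumptions inferred from the example above. The two points it tries to show are that the name pattern accepts non-ASCII characters (and `.`), and that tokens which are not emojis are passed through unchanged.

```python
import re

# Hypothetical pattern: an emoji token is ":name:", where the name may contain
# any character except ":" and whitespace (so "û", "é" and "." are accepted).
EMOJI_TOKEN = re.compile(r":([^:\s]+):")


def deemojify(message: str) -> str:
    """Translate emoji tokens to the first character of their name.

    Tokens that are not of the form ":name:" are kept as raw text.
    """
    parts = []
    for token in message.split():
        match = EMOJI_TOKEN.fullmatch(token)
        # Emoji token: take the first character of its name.
        # Anything else: preserve the raw token instead of dropping it.
        parts.append(match.group(1)[0] if match else token)
    return "".join(parts)


sample = ":killme: :Le_peuple_ne_veut_que_son_dû.: :miam: x :ytho:"
print(deemojify(sample))  # kLmxy
```

With this approach the `L` and `x` from the sample survive, whereas a pattern restricted to ASCII word characters (for example `:\w+:` with the `re.ASCII` flag) would fail to match the `dû.` and `général` names.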
