Skip to content

Include option to additionally retrieve external IDs for data #59

@wkyoshida

Description

@wkyoshida

Terms

Languages

ALL

Description

This issue is to discuss an option (i.e. a flag perhaps) to also retrieve external IDs for data when running the data process (this is optional, as I'm thinking this should probably be something to opt-in, i.e. not the default behavior). On the Scribe-Server side, this information could be later useful for tracking when specific data points are new or have been updated in the external sources Scribe references, e.g. Wikidata. For those interested, it could also potentially be useful to see the IDs.

  • For nouns, verbs, and prepositions, this is likely the Wikidata lexemes.

  • For translations, autosuggestions, and emoji keywords - sources for these data points are from elsewhere - e.g. Wikipedia, Unicode CLDR, translation models. I believe these wouldn't really have IDs tied to them..
    Considerations for Scribe-Server:

    • I wonder if it could make sense to attempt to tie them to a matching Wikidata lexeme, but I'm still unsure as this likely could get messy.
    • Is there anything else we could use that makes sense?
  • Also, would doing this even make sense?

Open for discussion! 😊👀

Metadata

Metadata

Assignees

No one assigned

    Labels

    dataRelates to data or WikidataquestionFurther information is requested

    Projects

    Status

    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions