Include option to additionally retrieve external IDs for data

### Terms

- [X] I have searched [open and closed data issues](https://github.com/scribe-org/Scribe-Data/issues?q=is%3Aissue+label%3Adata+)
- [X] I agree to follow Scribe-Data's [Code of Conduct](https://github.com/scribe-org/Scribe-Data/blob/main/.github/CODE_OF_CONDUCT.md)

### Languages

ALL

### Description

This issue is to discuss an option (i.e. a flag perhaps) to also retrieve external IDs for data when running the data process (this is optional, as I'm thinking this should probably be something to opt-in, i.e. not the default behavior). On the Scribe-Server side, this information could be later useful for tracking when specific data points are new or have been updated in the external sources Scribe references, e.g. Wikidata. For those interested, it could also potentially be useful to see the IDs.

- For nouns, verbs, and prepositions, this is likely the Wikidata lexemes.

- For translations, autosuggestions, and emoji keywords - sources for these data points are from elsewhere - e.g. Wikipedia, Unicode CLDR, translation models. I believe these wouldn't really have IDs tied to them..
   Considerations for Scribe-Server:
   - I wonder if it could make sense to _attempt_ to tie them to a matching Wikidata lexeme, but I'm still unsure as this likely could get messy.
   - Is there anything else we could use that makes sense?
- Also, would doing this even make sense?

Open for discussion! :blush::eyes: 



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include option to additionally retrieve external IDs for data #59

Terms

Languages

Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Include option to additionally retrieve external IDs for data #59

Description

Terms

Languages

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions