Review extraction of authors from codemeta.json #957

@juanjemdIos

The codemeta parser currently extracts authors from referencePublication and then consolidates them during the codemeta export to reconcile duplicates or missing information.
The parser takes the author key exactly as it appears in the JSON, without any transformation. As a result, the whole "author" value is copied as-is, which is not entirely correct: elsewhere, such as in the citation section, authors use family_name and given_name, while in referencePublication they use familyName and givenName.

"citation": [
    {
        "result": {
            "title": "SOMEF: Software metadata extraction framework",
            "authors": [
                {
                    "type": "Agent",
                    "name": "Daniel Garijo",
                    "family_name": "Garijo",
                    "given_name": "Daniel",
                    "url": "https://orcid.org/0000-0003-0454-7145"
                },

"referencePublication": [
    {
        "identifier": "10.1162/qss_a_00167",
        "name": "A Framework for Creating Knowledge Graphs of Scientific Software Metadata",
        "url": "https://doi.org/10.1162/qss_a_00167",
        "@type": "ScholarlyArticle",
        "author": [
            {
                "@type": "Person",
                "givenName": "Aidan",
                "familyName": "Kelley"
            },
            {
                "@type": "Person",
                "givenName": "Daniel",
                "familyName": "Garijo",
                "@id": "https://orcid.org/0000-0003-0454-7145"
            }
        ]
    }
]
We should apply a small transformation to standardize the property names.
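
A minimal sketch of what that transformation could look like, assuming the authors arrive as plain dicts. The names KEY_MAP, normalize_author and normalize_reference_publication are hypothetical and not part of the current parser, and mapping "@id" to "url" is an assumption based on how the citation section stores the ORCID.

```python
# Hypothetical mapping from codemeta (schema.org) keys to the snake_case keys
# used in the citation section. The "@id" -> "url" entry is an assumption.
KEY_MAP = {
    "givenName": "given_name",
    "familyName": "family_name",
    "@id": "url",
}


def normalize_author(author):
    """Return a copy of a codemeta author dict with standardized key names."""
    normalized = {KEY_MAP.get(key, key): value for key, value in author.items()}
    # Build a display name when codemeta only provides the name parts.
    if "name" not in normalized:
        parts = [normalized.get("given_name"), normalized.get("family_name")]
        name = " ".join(part for part in parts if part)
        if name:
            normalized["name"] = name
    return normalized


def normalize_reference_publication(publication):
    """Return a copy of a referencePublication entry with normalized authors."""
    result = dict(publication)
    if "author" in publication:
        authors = publication["author"]
        # JSON-LD allows a single object where a list is expected.
        if isinstance(authors, dict):
            authors = [authors]
        result["author"] = [
            normalize_author(a) for a in authors if isinstance(a, dict)
        ]
    return result
```

Keys that are not in the mapping (for example "@type") are passed through unchanged, so the consolidation step during export would still see everything the parser extracted.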
