rlitschk/clef-dataloaders

CLEF dataloader

Setup

Dataloaders expect the following structure after downloading and unzipping CLEF:

clef/
├── clef-low-resource
│   └── long_paper
├── DocumentData
│   ├── dutch
│   ├── english
│   ├── finnish
│   ├── french
│   ├── german
│   ├── italian
│   └── russian
├── RelAssess
│   ├── 2001
│   ├── 2002
│   └── 2003
└── Topics
    ├── 2001
    ├── 2002
    └── 2003
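
Before running the dataloaders, it can help to confirm that the unzipped data matches this layout. The following is a minimal sketch, not part of this repository: the names `EXPECTED_DIRS` and `missing_clef_dirs` are hypothetical, and the check only covers the directories listed in the tree above, not the files inside them.

```python
from pathlib import Path

# Sub-directories expected under the CLEF root, taken from the tree above.
LANGUAGES = ["dutch", "english", "finnish", "french", "german", "italian", "russian"]
YEARS = ["2001", "2002", "2003"]

EXPECTED_DIRS = (
    ["clef-low-resource/long_paper"]
    + [f"DocumentData/{lang}" for lang in LANGUAGES]
    + [f"RelAssess/{year}" for year in YEARS]
    + [f"Topics/{year}" for year in YEARS]
)


def missing_clef_dirs(root):
    """Return the expected sub-directories that do not exist under `root`."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    # Assumes the data was unzipped into ./clef; adjust the path as needed.
    missing = missing_clef_dirs("clef")
    if missing:
        print("Missing directories:")
        for d in missing:
            print(f"  {d}")
    else:
        print("CLEF directory layout looks complete.")
```

An empty result from `missing_clef_dirs` means every directory in the tree was found; any returned entries are paths relative to the CLEF root that still need to be downloaded or unzipped.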

References

@inproceedings{Bonab2019swahiliclef,
    author = {Bonab, Hamed and Allan, James and Sitaraman, Ramesh},
    title = {Simulating CLIR Translation Resource Scarcity Using High-Resource Languages},
    year = {2019},
    url = {https://doi.org/10.1145/3341981.3344236},
    booktitle = {Proceedings of ICTIR},
    pages = {129--136},
}
@inproceedings{braschler2003clef,
    title={{CLEF 2003--Overview of results}},
    author={Braschler, Martin},
    booktitle={Workshop of the Cross-Language Evaluation Forum for European Languages},
    pages={44--63},
    year={2003},
    organization={Springer}
}
