This is a table of references used for possible keywords when generating description text. The table includes the source itself, a description, the author, the prepared input file, and the date retrieved.
Minor changes were made to the original data as needed for the program's use case. For example, JSON files were converted to plain-text lists.
| Name | Source | Description | Author | Input File | Date |
|---|---|---|---|---|---|
| Baby Names | Kaggle | List of Baby Names from 1910 to 2021. | Evan Zhang | first-names.csv | 17 June 2021 |
| Personal - Language | Mockaroo | List of languages from the Mockaroo random data generator. Used for spoken accents. | Mark - Mockaroo developer. | accents.txt | 17 June 2021 |
| Proverbs | Corpora | A list of proverbs used for 'About' text. | @dairusk | quotes.txt | 17 June 2021 |
| Encouraging Words | Corpora | A list of encouraging words to tell someone about something they created. Used as keywords for appearance description. | @dairusk | encouraging-words.txt | 17 June 2021 |
| -ing Forms | Textstelle | A list of verbs with the '-ing' suffix. Used for skill descriptions. | @gambolputty | ing-forms.txt | 17 June 2021 |
| Descriptions | Corpora | A list of adjectives for describing people. Used for 'otherPersonality' description keywords. | @dairusk | descriptions.txt | 17 June 2021 |
| Hobbies | Kaggle | List of hobbies for 'otherHobbies' description keywords. | Mohamed Adel | hobbies.txt | 17 June 2021 |
| Video Games Rating By 'ESRB' | Kaggle | List of video games for 'otherGaming' description keywords. | Mohammed Alhamad | video-games.txt | 17 June 2021 |
| Common Animals | Corpora | List of common animals for 'otherPets' description keywords. | @dairusk | animals.txt | 17 June 2021 |
| New Technologies | Corpora | List of new or emerging technologies. Used for 'otherTechnology' description keywords. | @dairusk | technology.txt | 17 June 2021 |
| Academic Subjects | Corpora | Classification of Instructional Programs. Used for 'otherQualifications' description keywords. | @dairusk | academic-subjects.txt | 17 June 2021 |
| Industries | Corpora | A list of all industries on LinkedIn, as of May 21, 2013. Used for 'otherExperienceAreas' description keywords. | @dairusk | industries.txt | 17 June 2021 |
| Fortune 500 | Corpora | The 2014 Fortune 500 list. List of companies for 'PreviousExperience' employers. | @dairusk | employers.txt | 17 June 2021 |
| Occupations | Corpora | A list of occupations, or jobs that people might have. Used for 'PreviousExperience' job titles. | @dairusk | occupations.txt | 17 June 2021 |
| List of allergens | Wikipedia | List of allergies from Wikipedia. Used for 'otherAllergies' description keywords. | Wikipedia contributors | allergies.txt | 18 June 2021 |
| Monsters | Corpora | A list of monsters and other mythic creatures. Used for 'otherFears' description keywords. | @dairusk | monsters.txt | 18 June 2021 |
| Units of Time | Corpora | A list of units of time ordered by magnitude, both formal and colloquial. Used for 'otherAvailability' description keywords. | @dairusk | time.txt | 27 June 2021 |
| Unit of Time | Wikipedia | A list of units of time from Wikipedia. Used for 'otherAvailability' description keywords. | Wikipedia contributors | time.txt | 27 June 2021 |
| Basic - Frequency | Mockaroo | List of frequencies from the Mockaroo random data generator. Used for 'otherAvailability' description keywords. | Mark - Mockaroo developer. | time.txt | 6 July 2021 |
Originally Written: 1 July 2021
Last Updated: 23 July 2021