Skip to content

Add Sentiment Github Dataset Documentation Notebooks#3

Open
splimon wants to merge 1 commit intomainfrom
1-sentiment-github-dataset-notebook-documentation
Open

Add Sentiment Github Dataset Documentation Notebooks#3
splimon wants to merge 1 commit intomainfrom
1-sentiment-github-dataset-notebook-documentation

Conversation

@splimon
Copy link
Copy Markdown
Collaborator

@splimon splimon commented Apr 4, 2026

Adds five notebooks that build a pipeline to contextualize the GitHub Gold Standard sentiment dataset (7,122 comments) with GHTorrent project context and re-download comment data via Kaiaulu:

  • Notebook 1: Load the GitHub Gold Standard sentiment CSV into a GHTorrent MySQL database
  • Notebook 2: Explore GHTorrent tables to map sentiment comments to main project repos
  • Notebook 3: Auto-generate Kaiaulu .yml config files for 82 main project repos
  • Notebook 4: Download and parse commit comments via Kaiaulu
  • Notebook 5: Download and parse PR inline comments via Kaiaulu

…t and download via Kaiaulu

Adds five notebooks that build a pipeline to contextualize the GitHub Gold Standard
sentiment dataset (7,122 comments) with GHTorrent project context and
re-download comment data via Kaiaulu:

- Notebook 1: Load the GitHub Gold Standard sentiment CSV into a GHTorrent MySQL database
- Notebook 2: Explore GHTorrent tables to map sentiment comments to main project repos
- Notebook 3: Auto-generate Kaiaulu .yml config files for 82 main project repos
- Notebook 4: Download and parse commit comments via Kaiaulu
- Notebook 5: Download and parse PR inline comments via Kaiaulu
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant