Skip to content

Apo.clean up#18

Open
innazh wants to merge 18 commits intomainfrom
apo.clean_up
Open

Apo.clean up#18
innazh wants to merge 18 commits intomainfrom
apo.clean_up

Conversation

@innazh
Copy link
Collaborator

@innazh innazh commented Mar 12, 2026

  • Master readme with intro, navigation, setup instructions, and licensing
  • Grouped examples into learn and projects
  • Introducing a new section into per-project readmes for /learn: “what you learned”
  • Switched the use of two separate commands (branch create and checkout) to just one - bauplan checkout -b branch-name
  • 02 readme added a section on how to fix the pipeline
  • 03 readme added cache_policy=NONE for prefect because otherwise it seems to try to serialize bauplan’s client and this errors out in the console (looks ugly but still passes)
  • 04 readme incorporating the missing concepts from docs/guides/secrets (that is to be deprecated)
  • 04 add the instructions to provide your own s3 bucket, was creds install instructions
  • 05 fix the bucket name, improve readme’s description
  • Projects examples: group all query syntax change, some cmd changes, improved description

jacopotagliabue and others added 18 commits February 27, 2026 21:09
…d narration

- 02-data-quality-expectations: move files into pipeline/ to encapsulate the pipeline logic
- 03-safe-ingestion-on-a-schedule: rename from wap-on-a-schedule, replace WAP
  lexicon with "safe ingestion", auto-generate branch name from username,
  add default args, make uv-compatible
- 06-data-engineering-with-claude-code: move narration content from existing
  repo into this subproject
- community/marimo: move pipeline files into pipeline/ subfolder

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Replace pandas with polars in the Streamlit app
- Remove --bauplan_user_name flag, auto-infer from authenticated client
- Rename bpln_pipeline/ to pipeline/, update README paths
- Remove public. prefix from spotify_playlists table references
- Switch README setup from pip/venv to uv sync
- Update streamlit command to use uv run python -m streamlit
- Fix unused variable in mongo_utils.py

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…deps

- Move bauplan files and marimo notebook into pipeline/ subfolder
- Update models.py imports to use local pipeline.taxi_notebook
- Update polars to 1.38.1 and marimo to 0.20.4 in bauplan decorators
- Update README with correct paths and uv run python -m marimo command
- Add bauplan run --project-dir pipeline instruction

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…username

- Move src/app/ and src/bpln_pipeline/ to top-level app/ and pipeline/
- Remove src/setup_dataset/ (tables now loaded via standalone script)
- Replace pandas with polars in Streamlit app
- Remove --bauplan_username flag, auto-infer from authenticated client
- Remove boto3 dependency, add polars and pyyaml
- Fix yaml path reference from bpln_pipeline to pipeline
- Update README with uv-based setup and correct paths
- Fix unused variable in pinecone_utils.py

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…app/

- Replace scattered case_study_telemetry/, pipelines/, docker/, scripts/
  with a single pipeline/ folder (models.py, expectations.py, bauplan_project.yml)
- Add gold layer: signal_summary with hourly per-sensor aggregates
- Add Streamlit dashboard (app/viz_app.py) querying the gold table
- Update source to bauplan.telemetry_bronze (now in the lakehouse)
- Update pyproject.toml: simplify deps, fix description
- Bump polars to 1.38.1 in 02-data-quality-expectations

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@innazh innazh self-assigned this Mar 12, 2026
@innazh innazh added the documentation Improvements or additions to documentation label Mar 12, 2026
@ciro-greco
Copy link
Collaborator

@innazh I noticed that in many occasions we write bauplan instead of Bauplan. We standardized to the second. Let's make sure the docs are uniform.

@@ -0,0 +1,49 @@
# Safe Ingestion with Bauplan and Prefect

Implement the Write-Audit-Publish (WAP) pattern using [Prefect](https://www.prefect.io/) and bauplan to safely ingest data into the lakehouse on a recurring schedule.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would add a line that says This example is implemented using Prefect, for other orchestration tools (e.g. Airflow, Temporal) see our integration guide https://docs.bauplanlabs.com/integrations/orchestrators/

# Git for Data

This example showcases the power of Bauplan's git-for-data model through a series of SDK commands that go beyond basic pipeline runs.
A deep dive into bauplan's git-for-data model through a series of SDK commands that go beyond basic pipeline runs.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For an exhaustive introduction to the core concepts of git for data see https://docs.bauplanlabs.com/concepts/git_for_data

@@ -1 +1 @@
# Data Valentine Challenge
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we change this title which is utterly unintelligible?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants