Noise-aware ML pipeline for large-scale agricultural yield prediction using PySpark and LightGBM, with feature and label noise simulation, mitigation, and distributed training.
-
Updated
Mar 13, 2026 - Python
Noise-aware ML pipeline for large-scale agricultural yield prediction using PySpark and LightGBM, with feature and label noise simulation, mitigation, and distributed training.
An interactive Q&A system that extracts actionable insights from agricultural data (1997–2014). It combines efficient EDA, a rule-based query engine, and a Gradio interface to answer questions on crop yields, rainfall, and production. Built in Jupyter Notebook with a modular, extensible pipeline.
Add a description, image, and links to the crop-yield-analysis topic page so that developers can more easily learn about it.
To associate your repository with the crop-yield-analysis topic, visit your repo's landing page and select "manage topics."